Discrimination Mitigator

Summary: Discrimination mitigation algorithms are used to address unwanted bias throughout the entire lifecycle of AI systems.

Type of pattern: Product pattern

Type of objective: Trustworthiness

Target users: Data scientists

Impacted stakeholders: RAI governors, AI users, AI consumers

Lifecycle stages: Design

Relevant AI ethics principles: Fairness

Context: In the context of AI systems, discrimination is the illegal or unfavorable treatment of an individual based on their membership in a protected group, such as one defined by gender, race, or age. Two types of discrimination can occur in AI systems: direct discrimination, which happens when protected attributes are used explicitly in the decision-making algorithm, and indirect discrimination, which occurs when seemingly neutral attributes used by the algorithm are correlated with protected attributes.

Problem: How can we mitigate algorithmic discrimination in the decision-making of AI systems?

Solution: Algorithmic discrimination can be mitigated in three ways:

  • Pre-processing techniques (such as the data pre-processing technique in [1], the optimized data pre-processing technique in [2], or data perturbation for word embeddings [3]) remove the underlying discrimination before training by removing or adjusting data points that are biased or not representative of the population the AI system will be used on.
  • In-processing techniques (such as adding fairness constraints [4] or a prejudice remover regularizer [5] for fair classification, or adding the minimization of an adversary's ability to predict the protected attribute as one of the learning objectives [6]) can be applied during training to modify the learning algorithm and remove discrimination from the decision-making process, provided that modifying the learning algorithm is allowed.
  • Post-processing techniques (such as deriving an equalized-odds or equal-opportunity predictor from a learned predictor [7], or removing gender associations from word embeddings [8]) are applied after the AI model has been trained to adjust its output and remove any unwanted bias.
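To make the pre-processing option concrete, the reweighing scheme of Kamiran and Calders [1] assigns each (group, label) pair a weight equal to the ratio between its expected frequency if group and label were independent and its observed frequency, so that the protected attribute and the label become statistically independent in the weighted training data. The sketch below is a minimal pure-Python illustration; the function and variable names are ours, not any toolkit's API:

```python
from collections import Counter

def reweighing(samples):
    """Compute instance weights that remove the statistical dependence
    between the protected attribute and the label (Kamiran & Calders [1]).

    samples: list of (protected_group, label) pairs.
    Returns a dict mapping (group, label) -> weight, where
    weight = P(group) * P(label) / P(group, label).
    """
    n = len(samples)
    group_counts = Counter(g for g, _ in samples)
    label_counts = Counter(y for _, y in samples)
    joint_counts = Counter(samples)
    weights = {}
    for (g, y), joint in joint_counts.items():
        expected = (group_counts[g] / n) * (label_counts[y] / n)
        observed = joint / n
        weights[(g, y)] = expected / observed
    return weights

# Toy data: group "a" receives the positive label 3 times out of 4,
# group "b" only 1 time out of 4 -- a biased labelling.
data = [("a", 1), ("a", 1), ("a", 1), ("a", 0),
        ("b", 1), ("b", 0), ("b", 0), ("b", 0)]
w = reweighing(data)
# Under-represented pairs such as ("b", 1) get weights > 1,
# over-represented pairs such as ("a", 1) get weights < 1.
```

Training on the weighted samples then down-weights the biased associations without altering the learning algorithm itself.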
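In-processing can likewise be sketched in a few lines. The toy one-feature logistic-regression trainer below adds a penalty on the squared covariance between the sensitive attribute and the model's raw score, in the spirit of the fairness constraints of [4] and the prejudice remover of [5]; the code and all names in it are illustrative, not taken from those papers:

```python
import math

def train_fair_logreg(xs, ys, ss, lam=0.0, lr=0.01, epochs=2000):
    """1-D logistic regression whose objective adds lam * cov^2, where
    cov is the covariance between the sensitive attribute ss and the
    raw score w*x + b.  Illustrative in-processing sketch only."""
    n = len(xs)
    s_mean = sum(ss) / n
    w, b = 0.0, 0.0
    for _ in range(epochs):
        # Covariance between the sensitive attribute and the raw scores.
        cov = sum((s - s_mean) * (w * x + b) for s, x in zip(ss, xs)) / n
        gw = gb = 0.0
        for x, y, s in zip(xs, ys, ss):
            p = 1.0 / (1.0 + math.exp(-(w * x + b)))    # predicted probability
            gw += (p - y) * x / n                       # logistic-loss gradient
            gb += (p - y) / n
            gw += lam * 2 * cov * (s - s_mean) * x / n  # gradient of lam * cov^2
            # (the penalty's gradient w.r.t. b is zero: s - s_mean sums to zero)
        w -= lr * gw
        b -= lr * gb
    return w, b

# Group s=1 clusters at x around +1, group s=0 at x around -1, so an
# unconstrained model's scores correlate strongly with the sensitive attribute.
xs = [1.0, 1.2, 0.8, 1.1, -1.0, -1.2, -0.8, -1.1]
ys = [1, 1, 1, 1, 0, 0, 0, 0]
ss = [1, 1, 1, 1, 0, 0, 0, 0]
w_plain, _ = train_fair_logreg(xs, ys, ss, lam=0.0)
w_fair, _ = train_fair_logreg(xs, ys, ss, lam=50.0)
# The penalty shrinks the coefficient that carries the correlation with
# the sensitive attribute, trading some accuracy for fairness.
```

With `lam=0` the learned coefficient is large and the scores track the sensitive attribute; with a large `lam` the coefficient is driven toward zero, which is exactly the fairness-accuracy trade-off noted in the drawbacks.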


Benefits:

  • Improved model performance: removing discrimination can improve the model's performance on under-represented groups and makes its output more trustworthy.
  • Better balance between fairness and accuracy: mitigation techniques make the trade-off between fairness and accuracy explicit, so it can be tuned rather than left implicit.


Drawbacks:

  • Lack of generality: AI models are often designed for specific tasks, which makes it difficult to extend and adapt discrimination mitigation techniques to new tasks or domains.
  • Reduced model performance: introducing discrimination mitigation techniques can degrade the overall performance of the model.
  • Compromised accuracy: discrimination mitigation techniques may sacrifice some predictive accuracy in order to achieve fairness.

Related Patterns:

  • Fairness Assessor: Fairness measurement metrics are used as objectives of mitigation techniques.

Known Uses:

  • The LinkedIn Fairness Toolkit (LiFT) is a library for measuring fairness and mitigating bias in large-scale AI workflows. It includes a post-processing method that transforms scores to achieve equality of opportunity in rankings.
  • Microsoft Fairlearn is a fairness toolkit that provides assessment metrics and mitigation algorithms with which AI developers can assess their AI system's fairness and mitigate the detected bias.
  • IBM AI Fairness 360 is a Python library that contains a comprehensive set of fairness metrics and bias mitigation algorithms (such as optimized pre-processing and equalized-odds post-processing) for datasets and AI models.
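The equality-of-opportunity post-processing mentioned above can be sketched as choosing a separate decision threshold for each protected group so that every group's true positive rate matches a common target [7]. The function below is an illustrative pure-Python sketch under that framing; its names are ours, not any toolkit's API, and it assumes each group has at least one positive example:

```python
def equal_opportunity_thresholds(scores, labels, groups, target_tpr=0.8):
    """Pick a per-group score threshold so each group accepts roughly
    target_tpr of its true positives (equal opportunity, Hardt et al. [7]).
    Illustrative sketch; assumes every group has at least one positive."""
    thresholds = {}
    for g in set(groups):
        # Scores of this group's actual positives, best first.
        pos = sorted((sc for sc, y, gg in zip(scores, labels, groups)
                      if gg == g and y == 1), reverse=True)
        # Accept the top-k positives needed to reach the target TPR.
        k = max(1, round(target_tpr * len(pos)))
        thresholds[g] = pos[k - 1]
    return thresholds

# Group "a" systematically receives higher scores than group "b";
# a single global threshold would give "b" a much lower true positive rate.
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.5, 0.4, 0.3, 0.2, 0.1]
labels = [1] * 10
groups = ["a"] * 5 + ["b"] * 5
t = equal_opportunity_thresholds(scores, labels, groups)
```

Because only the decision thresholds change, this kind of mitigation needs no access to the training data or the learning algorithm, which is why it suits already-deployed models.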


References:

[1] Kamiran, F. and T. Calders, Data preprocessing techniques for classification without discrimination. Knowledge and information systems, 2012. 33(1): p. 1-33.

[2] du Pin Calmon, F., et al., Optimized pre-processing for discrimination prevention. Advances in neural information processing systems, 2017. 30.

[3] Brunet, M.-E., et al. Understanding the origins of bias in word embeddings. in International conference on machine learning. 2019. PMLR.

[4] Zafar, M.B., et al. Fairness constraints: Mechanisms for fair classification. in Artificial intelligence and statistics. 2017. PMLR.

[5] Kamishima, T., et al. Fairness-aware classifier with prejudice remover regularizer. in Joint European conference on machine learning and knowledge discovery in databases. 2012. Springer.

[6] Zhang, B.H., B. Lemoine, and M. Mitchell. Mitigating unwanted biases with adversarial learning. in Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society. 2018.

[7] Hardt, M., E. Price, and N. Srebro, Equality of opportunity in supervised learning. Advances in neural information processing systems, 2016. 29.

[8] Bolukbasi, T., et al., Man is to computer programmer as woman is to homemaker? Debiasing word embeddings. Advances in neural information processing systems, 2016. 29.