In a previous blog post, I mentioned some ways to investigate the gender bias of natural language processing (NLP) systems such as language models. When discussing these undesirable biases, we often take a mathematical, ‘mechanistic’ approach: we measure a deviation from a (prescriptive) norm of ideal behavior (e.g., a skew away from a 50/50 gender distribution) or try to explain how some biases are encoded in the NLP model’s parameters.
However, it is also useful to take a step back and consider bias in NLP from a broader perspective. The analysis of bias is incomplete if we ignore the normative questions at hand and the sociotechnical context; both the technical details of the model and the social aspects (the designers, users, stakeholders, historical and cultural context, company goals, etc.) are important to consider. In this blog post, we’ll discuss three such considerations:
- algorithmic bias is a sociotechnical problem,
- society is constantly changing and so is our conceptualization of bias,
- algorithmic bias is not simply a reflection of the data/society.
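Before turning to these broader considerations, the ‘mechanistic’ measurement mentioned above can be made concrete with a toy example. The sketch below is purely illustrative (the function name and word lists are my own; real bias metrics are far more involved): it counts gendered pronouns in a set of model completions and reports the deviation from a 50/50 distribution.

```python
from collections import Counter

MALE_TERMS = frozenset({"he", "him", "his"})
FEMALE_TERMS = frozenset({"she", "her", "hers"})

def gender_skew(completions):
    """Toy metric: deviation of the pronoun distribution from 50/50.

    Returns 0.0 for a perfectly balanced set of completions and
    0.5 when all gendered pronouns refer to one gender.
    """
    counts = Counter()
    for text in completions:
        for token in text.lower().split():
            if token in MALE_TERMS:
                counts["male"] += 1
            elif token in FEMALE_TERMS:
                counts["female"] += 1
    total = counts["male"] + counts["female"]
    if total == 0:
        return 0.0
    return abs(counts["male"] / total - 0.5)

completions = [
    "He is a doctor",
    "She is a doctor",
    "He fixed the engine",
    "He wrote the report",
]
print(gender_skew(completions))  # 3 male vs. 1 female pronoun -> 0.25
```

Even this toy example smuggles in normative choices (which words count as gendered, what the ‘ideal’ distribution is), which is precisely the kind of assumption the rest of this post examines.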
1. Bias is a sociotechnical problem
When should we consider the gender bias of an AI system harmful? The implicit assumption in the AI debate is generally that we should aim for gender-neutral behavior, based on the idea that not differentiating between the genders is what constitutes fair behavior. However, whether this is the case may strongly depend on the particular task we want the system to perform and on its sociotechnical context.
In translations, we might want the AI system to consider the (grammatical) gender of the subject, but not in assessing the competency of job candidates when automatically filtering resumes! (In fact, whether we should want to use AI for automating these tasks is another question entirely.)
Our perspective may also change if we do not see the bias of an AI system in isolation, but as situated in the broader practices it is part of: We may find that the individual examples of bias do not paint the full picture of the structural bias of the institutions, businesses, or organizations making use of it.
Not all bias is unwanted, and there might be contexts in which we need it to reach certain goals. To formulate the (moral) standards for an AI system, we need to look at the broader context in which it functions, understand the way the AI system interacts with this environment, and consider how the entire system might contribute to unfairness or cause harm to particular groups or individuals.
2. Society is constantly changing and so is “bias”
Ideally, the discussion about the norms and standards of a particular AI application is resolved before development starts. But what counts as unfair or harmful behavior is not a stable societal given that we can simply align our AI systems with. It changes constantly as the debate in society progresses, and therefore a once-and-for-all solution to the bias problem is impossible. Worse, new biases can emerge if our AI systems do not adjust to this change.
This concern is especially apparent for very large language models, which are expensive to train and therefore reused for many downstream tasks. Moreover, given the various applications that could make use of language technology, there is no way to have standards that fit them all.
3. Algorithmic bias does not simply reflect pre-existing bias
A popular argument in the AI community is that the bias of a deep neural network simply reflects pre-existing biases present in the training data. However, we should not neglect the responsibility we have in designing and implementing these AI systems: many forms of bias can emerge at the different stages of creating and deploying language technology.
Language technology does not merely reflect society; its implementations can be a part of society and even change it in unexpected ways. A well-known theme in the philosophy of technology is that technologies ‘mediate’ our experiences and shape our world-view of “how to live”. Machine translation systems may present a world populated primarily by men, with women restricted to stereotypical occupations, and search engines that show only men for the keyword “CEO” similarly shape our image of the archetypal business leader.
How we define and measure bias may also influence how we view bias itself. In the context of fairness metrics, Jacobs and Wallach (2021) refer to this as ‘consequential validity’: “the measurements shape the ways that we understand the construct itself”. This effect is often overlooked when designing a bias metric.
Algorithmic bias is an inherently complex phenomenon due to its sociotechnical and context-sensitive nature, which makes a precise definition difficult—yet a discussion of how it is conceptualized is crucial when researching it. Researchers cannot resort to a ‘catch-all’ bias metric for understanding the bias, and mitigating the harms might require more than simply removing the biased information.
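To make that last point concrete: a frequently cited debiasing idea is to remove a ‘gender direction’ from word embeddings by linear projection. The sketch below (toy vectors and a hypothetical helper name, not any particular library’s API) shows only that projection step; follow-up research has argued that removing a single direction like this leaves much of the bias recoverable, which is exactly why mitigation requires more than deleting the ‘biased information’.

```python
def project_out(v, g):
    """Remove the component of vector v along direction g.

    A minimal sketch of projection-style debiasing: the returned
    vector is orthogonal to g. Real debiasing pipelines involve
    much more (identifying g, equalizing word pairs, etc.).
    """
    dot_vg = sum(a * b for a, b in zip(v, g))
    norm_g_sq = sum(b * b for b in g)
    return [a - (dot_vg / norm_g_sq) * b for a, b in zip(v, g)]

# Toy 2-d example: g stands in for a learned "gender direction".
g = [1.0, 0.0]
v = [0.8, 0.6]  # toy "word vector"
print(project_out(v, g))  # -> [0.0, 0.6]: the component along g is gone
```

The geometric operation is trivial; the hard questions (what the direction represents, whose notion of bias it encodes, and what remains after removal) are the sociotechnical ones discussed above.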
Going even further, perhaps the starting point should not be to ask how we can debias AI models, but to focus on the larger questions we have to answer as a society: How do we want to shape the world with language technology as a part of it? How can we design these AI systems such that they help create a more just society, instead of solidifying existing biases or even producing new forms of systemic bias?
Thanks to Wout Moltmaker for his helpful comments on this blog post.