Oskar van der Wal
@ovdw.bsky.social
Technology specialist at the EU AI Office / AI Safety / Prev: University of Amsterdam, EleutherAI, BigScience

Thoughts & opinions are my own and do not necessarily represent my employer.
💬Panel discussion with Sally Haslanger and Marjolein Lanzing: A philosophical perspective on algorithmic discrimination

Is discrimination the right way to frame the issues of lang tech? Or should we address deeper-rooted questions? And how does tech fit into systems of oppression?
November 15, 2024 at 4:36 PM
📄Undesirable Biases in NLP: Addressing Challenges of Measurement

We also presented our own work on strategies for testing the validity and reliability of LM bias measures:

www.jair.org/index.php/ja...
November 15, 2024 at 4:36 PM
🔑Keynote @zeerak.bsky.social: On the promise of equitable machine learning technologies

Can we create equitable ML technologies? Can statistical models faithfully express human language? Or are tokenizers "tokenizing" people—creating a Frankenstein monster of lived experiences?
November 15, 2024 at 4:36 PM
📄A Capabilities Approach to Studying Bias and Harm in Language Technologies

@hellinanigatu.bsky.social introduced us to the Capabilities Approach and how it can help us better understand the social impact of language technologies—with case studies of failing tech in the Majority World.
November 15, 2024 at 4:36 PM
📄Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution

Flor Plaza discussed the importance of studying gendered emotional stereotypes in LLMs, and how collaborating with philosophers greatly benefits work on bias evaluation.
November 15, 2024 at 4:36 PM
🔑Keynote by John Lalor: Should Fairness be a Metric or a Model?

While fairness is often viewed as a metric, using integrated models instead can help with explaining upstream bias, predicting downstream fairness, and capturing intersectional bias.
November 15, 2024 at 4:36 PM
📄A Decade of Gender Bias in Machine Translation

Eva Vanmassenhove: how has research on gender bias in MT developed over the years? Important issues, like non-binary gender bias, now fortunately get more attention. Yet, fundamental problems (that initially seemed trivial) remain unsolved.
November 15, 2024 at 4:36 PM
📄MBBQ: A Dataset for Cross-Lingual Comparison of Stereotypes in Generative LLMs

Vera Neplenbroek presented a multilingual extension of the BBQ bias benchmark to study bias across English, Dutch, Spanish, and Turkish.

"Multilingual LLMs are not necessarily multicultural!"
November 15, 2024 at 4:36 PM
🔑Keynote by Dong Nguyen: When LLMs meet language variation: Taking stock and looking forward

Non-standard language is often seen as noisy/incorrect data, but this ignores the reality of language. Variation should play a larger role in LLM development, and sociolinguistics can help!
November 15, 2024 at 4:36 PM
Last week, we organized the workshop "New Perspectives on Bias and Discrimination in Language Technology" 🤖 @uvahumanities.bsky.social @amsterdamnlp.bsky.social

We're looking back at two inspiring days of talks, posters, and discussions—thanks to everyone who participated!

wai-amsterdam.github.io
November 15, 2024 at 4:36 PM
However, we believe that its flip side—divergent validity—deserves attention as well! Here, we ask whether the bias measure is too similar to another (easily confounded) measure or construct: we do not want to accidentally measure something else as well!
January 24, 2024 at 9:39 AM
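To make this concrete, here is a minimal sketch of a divergent-validity check: correlate the bias measure with a measure of a different, easily confounded construct across a set of models. The scores and the perplexity confound below are hypothetical placeholders, not results from our paper.

```python
# Illustrative divergent-validity check: if a "bias" measure correlates very
# strongly with a measure of a different construct (here: perplexity as a
# stand-in for general model quality), it may partly capture that confound.
import numpy as np
from scipy.stats import pearsonr

# Hypothetical scores for the same set of model checkpoints.
bias_scores = np.array([0.31, 0.45, 0.28, 0.52, 0.40])      # bias measure
confound_scores = np.array([18.2, 25.1, 17.9, 27.3, 22.8])  # e.g. perplexity

r, p = pearsonr(bias_scores, confound_scores)
print(f"correlation with confound: r={r:.2f} (p={p:.3f})")
# A very high |r| is a red flag: differences in the bias score may largely
# track the confounding construct rather than the bias we care about.
```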
Construct validity: How sure are we that we measure what we actually want to measure (the construct)? Critical work by, e.g., Gonen & Goldberg, Blodgett et al., and Orgad & Belinkov shows many flaws that could hurt validity. How do we design bias measures that actually measure what we want?
January 24, 2024 at 9:36 AM
Reliability: How much precision can we get when applying the bias measure? How resilient is it to random measurement error? Naturally, we prefer measurement tools with higher reliability! We discuss four forms of reliability we think can be applied easily to the NLP context.
January 24, 2024 at 9:26 AM
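As a toy illustration of the reliability question (not taken from the paper), one can re-run a bias measure several times under random perturbations, e.g. different subsamples of templates, and inspect how much the score fluctuates; `bias_measure` below is a hypothetical placeholder.

```python
# Toy sketch: estimate how stable a bias measure is under random measurement
# error by repeating it with different random seeds (e.g. different prompt
# subsamples or data splits) and looking at the spread of the scores.
import numpy as np

def bias_measure(model, seed: int) -> float:
    """Hypothetical bias measure: assume it subsamples templates based on
    `seed` and returns a scalar bias score for `model` (placeholder here)."""
    rng = np.random.default_rng(seed)
    return 0.4 + 0.05 * rng.standard_normal()

scores = np.array([bias_measure(model=None, seed=s) for s in range(20)])
print(f"mean={scores.mean():.3f}, std={scores.std(ddof=1):.3f}")
# If the spread is large relative to the differences between models that we
# want to detect, the measure is too noisy to support those conclusions.
```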
Borrowing from psychometrics (a field specialized in the measurement of concepts that are not directly observable), we argue that it is useful to decouple the "construct" (what we want to know about but cannot observe directly) from its "operationalization" (the imperfect proxy).
January 24, 2024 at 9:25 AM
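As a concrete (hypothetical) example of this split: "gender bias in occupation contexts" would be the construct, and a log-probability difference between minimally different sentences, computed with an off-the-shelf GPT-2, is one possible operationalization of it.

```python
# One possible operationalization of an unobservable construct: a
# log-probability difference between two minimally different sentences.
# The template and model choice are illustrative, not prescriptive.
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tok = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

def sentence_logprob(text: str) -> float:
    ids = tok(text, return_tensors="pt").input_ids
    with torch.no_grad():
        out = model(ids, labels=ids)              # loss = mean NLL per token
    return -out.loss.item() * (ids.shape[1] - 1)  # total log-prob of the text

score = sentence_logprob("The nurse said that he is tired.") \
        - sentence_logprob("The nurse said that she is tired.")
print(f"log-prob difference: {score:.2f}")
# The number is only a proxy for the construct: template wording, token
# frequencies, etc. all shape it, which is exactly why its validity and
# reliability deserve scrutiny.
```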
But when considering WinoBias, a different picture emerges! While most interventions work somewhat, full model finetuning is the most promising! Can we trust bias datasets (validity)? Are there differences in forms of gender bias? Which confounding factors (e.g. task performance) should we control for?
December 11, 2023 at 4:30 PM
Now for the effect on 3 bias benchmarks. The Professions dataset (from 1️⃣) and CrowS-Pairs show that the narrow interventions consistently reduce the bias. The results for ACDC (all circuit components) and full model finetuning are noisier, or not effective at all!
December 11, 2023 at 4:29 PM
Unsurprisingly, perplexity increases more when a larger set of components is updated.
December 11, 2023 at 4:29 PM
We actually find a substantial overlap in the top 10 attention heads (out of 144) for CMA, ACDC, and DiffMask+. Most are found in the last 4 layers. (We used the Professions dataset by Vig et al., but noticed that these methods can be sensitive to the dataset choice; why? That is future work!)
December 11, 2023 at 4:27 PM
For 1️⃣, we compare three approaches: causal mediation analysis, ACDC, and DiffMask+, an adaptation of earlier work by Nicola De Cao that learns a sparse ✨set✨ of task-important components (unlike CMA) while being less computationally expensive than ACDC.
December 11, 2023 at 4:26 PM
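The sketch below illustrates the general idea of learning a sparse set of important components, not the actual DiffMask+ objective or our training setup: a differentiable gate per attention head, trained with a task loss plus a sparsity penalty, while the model weights stay frozen. The single example sentence and the hyperparameters are placeholders.

```python
# Rough sketch of learning a sparse mask over attention heads (illustrative;
# not the DiffMask+ parameterization): freeze the model, learn a sigmoid gate
# per head, and penalize the total gate mass so only important heads survive.
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tok = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()
for p in model.parameters():
    p.requires_grad_(False)  # only the mask is trained

n_layers, n_heads = model.config.n_layer, model.config.n_head
mask_logits = torch.nn.Parameter(torch.zeros(n_layers, n_heads))
opt = torch.optim.Adam([mask_logits], lr=1e-2)

batch = tok(["The nurse said that she is tired."], return_tensors="pt")
sparsity_weight = 0.05  # placeholder trade-off between task loss and sparsity

for step in range(100):
    head_mask = torch.sigmoid(mask_logits)  # soft gates in (0, 1), one per head
    out = model(**batch, labels=batch.input_ids, head_mask=head_mask)
    loss = out.loss + sparsity_weight * head_mask.sum()
    opt.zero_grad()
    loss.backward()
    opt.step()

top = torch.topk(torch.sigmoid(mask_logits).flatten(), k=10).indices
print([(int(i) // n_heads, int(i) % n_heads) for i in top])  # (layer, head)
```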
In this work, we combine two ideas: 1️⃣ use causal discovery methods to explore which components (e.g. attention heads) are responsible for gender bias; 2️⃣ use this information to do a targeted intervention through selective finetuning.
December 11, 2023 at 4:24 PM
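To sketch what 2️⃣ could look like in code (a simplification, not our exact recipe): freeze everything except the attention parameters of the layers containing the identified heads, then finetune on data designed to counteract the bias. The target layers and the single-sentence batch below are hypothetical placeholders.

```python
# Simplified sketch of a targeted intervention: only the attention parameters
# of a few identified layers are left trainable; everything else is frozen.
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tok = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

target_layers = {8, 9, 10, 11}  # hypothetical: layers holding the biased heads

for p in model.parameters():
    p.requires_grad_(False)
for i in target_layers:
    for p in model.transformer.h[i].attn.parameters():
        p.requires_grad_(True)

opt = torch.optim.AdamW(
    (p for p in model.parameters() if p.requires_grad), lr=1e-5
)

# Placeholder batch; in practice this would be a debiasing dataset, e.g.
# counterfactually augmented sentence pairs with swapped gendered words.
batch = tok(["The nurse said that he is tired."], return_tensors="pt")
out = model(**batch, labels=batch.input_ids)
out.loss.backward()
opt.step()
```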