Dialect prejudice predicts AI decisions about people's character, employability, and criminality

Valentin Hofmann, Pratyusha Ria Kalluri, Dan Jurafsky, Sharese King

Publication date: 1 March 2024

The content of this page has not been updated for more than 7 months. It may no longer be current.

Summaries

Hundreds of millions of people now interact with language models, with uses ranging from serving as a writing aid to informing hiring decisions. Yet these language models are known to perpetuate systematic racial prejudices, making their judgments biased in problematic ways about groups like African Americans. While prior research has focused on overt racism in language models, social scientists have argued that racism with a more subtle character has developed over time. It is unknown whether this covert racism manifests in language models. Here, we demonstrate that language models embody covert racism in the form of dialect prejudice: we extend research showing that Americans hold raciolinguistic stereotypes about speakers of African American English and find that language models have the same prejudice, exhibiting covert stereotypes that are more negative than any human stereotypes about African Americans ever experimentally recorded, although closest to the ones from before the civil rights movement. By contrast, the language models' overt stereotypes about African Americans are much more positive. We demonstrate that dialect prejudice has the potential for harmful consequences by asking language models to make hypothetical decisions about people, based only on how they speak. Language models are more likely to suggest that speakers of African American English be assigned less prestigious jobs, be convicted of crimes, and be sentenced to death. Finally, we show that existing methods for alleviating racial bias in language models such as human feedback training do not mitigate the dialect prejudice, but can exacerbate the discrepancy between covert and overt stereotypes, by teaching language models to superficially conceal the racism that they maintain on a deeper level. Our findings have far-reaching implications for the fair and safe employment of language technology.

By Valentin Hofmann, Pratyusha Ria Kalluri, Dan Jurafsky, Sharese King in the text Dialect prejudice predicts AI decisions about people's character, employability, and criminality (2024)

This text mentions ...

People

Sandhini Agarwal, Dario Amodei, Amanda Askell, Maria Bannert, Emily M. Bender, Christopher Berner, Tom B. Brown, Caitlin Ring Carlson, Mark Chen, Benjamin Chess, Rewon Child, Jack Clark, Daryna Dementieva, Kewal Dhariwal, Prafulla Dhariwal, Frank Fischer, Urs Gasser, Timnit Gebru, Scott Gray, Georg Groh, Stephan Günnemann, Tom Henighan, Ariel Herbert-Voss, Christopher Hesse, Eyke Hüllermeier, Jared Kaplan, Gjergji Kasneci, Enkelejda Kasneci, Gretchen Krueger, Stephan Krusche, Stefan Küchemann, Jochen Kuhn, Gitta Kutyniok, Mateusz Litwin, Benjamin Mann, Sam McCandlish, Angelina McMillan-Major, Tilman Michaeli, Arvind Neelakantan, Claudia Nerdel, OpenAI, Jürgen Pfeffer, Oleksandra Poquet, Alec Radford, Aditya Ramesh, Nick Ryder, Michael Sailer, Girish Sastry, Albrecht Schmidt, Tina Seidel, Kathrin Sessler, Shmargaret Shmitchell, Pranav Shyam, Eric Sigler, Matthias Stadler, Melanie Subbiah, Ilya Sutskever, Jochen Weller, Clemens Winter, Jeffrey Wu, Daniel M. Ziegler

Statements

Machine learning can reinforce or perpetuate existing prejudices and injustices

Terms

bias, generative machine learning systems (GMLS; computer-generated text), racism, language

Books

Year		Cover	Title	Views	IB	OB	KB	LB
2020			Language Models are Few-Shot Learners (Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Kewal Dhariwal, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, Dario Amodei)	6, 8, 7, 6, 3, 2, 8, 2, 8, 5, 6, 4	40	5	4	147
2021			Hate Speech (Caitlin Ring Carlson)	3, 6, 8, 8, 2, 1, 5, 2, 2, 5, 4, 1	1	8	1	185

Texts

Year	Title	Views	IB	OB	KB	LB
2021	On the Dangers of Stochastic Parrots (Emily M. Bender, Timnit Gebru, Angelina McMillan-Major, Shmargaret Shmitchell)	2, 7, 12, 8, 3, 2, 9, 5, 9, 7, 5, 4	43	25	4	158
2023	ChatGPT for Good? (Enkelejda Kasneci, Kathrin Sessler, Stefan Küchemann, Maria Bannert, Daryna Dementieva, Frank Fischer, Urs Gasser, Georg Groh, Stephan Günnemann, Eyke Hüllermeier, Stephan Krusche, Gitta Kutyniok, Tilman Michaeli, Claudia Nerdel, Jürgen Pfeffer, Oleksandra Poquet, Michael Sailer, Albrecht Schmidt, Tina Seidel, Matthias Stadler, Jochen Weller, Jochen Kuhn, Gjergji Kasneci)	13, 9, 2, 1, 3, 1, 7, 3, 7, 4, 4, 1	15	18	1	123
2023	GPT-4 Technical Report (OpenAI)		25	17	0	0

This text probably does not mention ...

Terms not mentioned

Chat-GPT, GMLS & education


Full text of this document

Dialect prejudice predicts AI decisions about people's character, employability, and criminality: article as full text (local, 8142 kByte; WWW)


Beat and this text

Beat added this text to the Biblionetz during his time at the Institut für Medien und Schule (IMS). He recorded it once and has not edited it since. Beat does not own a physical copy, but does own a digital one. A digital version is available on the internet (see above). So far, only a few objects in the Biblionetz cite this work.
