Knowledge-Enhanced Language Models Are Not Bias-Proof: Situated Knowledge and Epistemic Injustice in AI

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

The factual inaccuracies ("hallucinations") of large language models have recently inspired more research on knowledge-enhanced language modeling approaches. These are often assumed to enhance the overall trustworthiness and objectivity of language models. Meanwhile, the issue of bias is usually only mentioned as a limitation of statistical representations. This dissociation of knowledge-enhancement and bias is in line with previous research on AI engineers' assumptions about knowledge, which indicate that knowledge is commonly understood as objective and value-neutral by this community. We argue that claims and practices by actors of the field still reflect this underlying conception of knowledge. We contrast this assumption with literature from social and, in particular, feminist epistemology, which argues that the idea of a universal disembodied knower is blind to the reality of knowledge practices and seriously challenges claims of "objective"or "neutral"knowledge. Knowledge enhancement techniques commonly use Wikidata and Wikipedia as their sources for knowledge, due to their large scales, public accessibility, and assumed trustworthiness. In this work, they serve as a case study for the influence of the social setting and the identity of knowers on epistemic processes. Indeed, the communities behind Wikidata and Wikipedia are known to be male-dominated and many instances of hostile behavior have been reported in the past decade. In effect, the contents of these knowledge bases are highly biased. It is therefore doubtful that these knowledge bases would contribute to bias reduction. In fact, our empirical evaluations of RoBERTa, KEPLER, and CoLAKE, demonstrate that knowledge enhancement may not live up to the hopes of increased objectivity. In our study, the average probability for stereotypical associations was preserved on two out of three metrics and performance-related gender gaps on knowledge-driven task were also preserved. We build on these results and critical literature to argue that the label of "knowledge"and the commonly held beliefs about it can obscure the harm that is still done to marginalized groups. Knowledge enhancement is at risk of perpetuating epistemic injustice, and AI engineers' understanding of knowledge as objective per se conceals this injustice. Finally, to get closer to trustworthy language models, we need to rethink knowledge in AI and aim for an agenda of diversification and scrutiny from outgroup members.

Original languageEnglish
Title of host publication2024 ACM Conference on Fairness, Accountability, and Transparency, FAccT 2024
Number of pages13
PublisherAssociation for Computing Machinery, Inc
Publication date03.06.2024
Pages1433-1445
ISBN (print)9798400704505
ISBN (electronic)979-8-4007-0450-5
DOIs
Publication statusPublished - 03.06.2024
EventACM Conference on Fairness, Accountability, and Transparency - FAccT 2024 - Rio de Janeiro, Brazil
Duration: 03.06.202406.06.2024
https://facctconference.org/2024/

Bibliographical note

Publisher Copyright:
© 2024 Owner/Author.

    Research areas

  • bias, epistemology, fairness, feminism, knowledge enhancement, knowledge graphs, language models, natural language processing, representation
  • Informatics

DOI

Recently viewed

Activities

  1. Users’ Handedness and Performance when Controlling Integrated Input Devices - Implications for Automotive HMI
  2. The role of different forms of cohesion and readers' expectations towards different types of text
  3. Trajectory-based Lagrangian approaches for the extraction and characterization of coherent structures in turbulent convection
  4. Plasma shock wave simulation for laser shock processing
  5. Workshop Medzin I
  6. From Magic to Systemics. Heinz von Foerster and the Reenchantment of Science
  7. Travelling Codes
  8. Eine Podiumsdiskussion zu Fracking
  9. Robotic Mobile Fulfillment Systems
  10. Time and Organizational Development
  11. The global classroom: Introduction, presentation and workshops
  12. It's how, not what we use that matters - Communications Modes in the Internet
  13. Keeping drivers engaged in automated driving through maneuver control- effects on perceived control and responsibility
  14. "Information-Oriented Communicative Acting in the Internet: Communication Modes between Mass- and Interpersonal Communication"
  15. Plenary lecture eintitled: "Mathematical insights for advanced ice-clamping control in the context of Industry 4.0"
  16. Adaptive Modeling
  17. Changing learning environments at university? Comparing the learning strategies of non-traditional European students engaged in lifelong learning.
  18. Activating an Integrative Mindset Improves the Subjective Outcomes of Value-Driven Conflicts
  19. Towards a Techno-Ecology of Participation - 2017
  20. Teams are changing! Going into the wild to expand theory on dynamics in modern teamwork settings
  21. Where To Start? Exploring 1-Year-Students’ Preconceptions of Sustainable Development
  22. Knowledge Spaces
  23. Lena Meyer-Bergner’s conception of modernism between graphics and weaving, between folk art and technology

Publications

  1. THE PARALLAX OF INDIVIDUATION
  2. From entity to process
  3. Control versus Complexity
  4. Predicting the Individual Mood Level based on Diary Data
  5. Machine Learning and Knowledge Discovery in Databases
  6. Understanding the properties of isospectral points and pairs in graphs
  7. Improvements in Flexibility depend on Stretching Duration
  8. Machine Learning and Knowledge Discovery in Databases
  9. Efficacy of a Web-Based Intervention With Mobile Phone Support in Treating Depressive Symptoms in Adults With Type 1 and Type 2 Diabetes
  10. Speed of processing and stimulus complexity in low-frequency and high-frequency channels
  11. Serendipity as a Mechanism of Change and its Potential for Explaining Change Processes
  12. Determination of 10 particle-associated multiclass polar and semi-polar pesticides from small streams using accelerated solvent extraction
  13. Biodiversity in space and time - towards a grid mapping for Mongolia
  14. A Lyapunov Approach to Set the Parameters of a PI-Controller to Minimise Velocity Oscillations in a Permanent Magnet Synchronous Motor Using Chopper Control for Electrical Vehicles
  15. A Genetic Algorithm for the Dynamic Management of Cellular Reconfigurable Manufacturing Systems
  16. How to attract visitors with strategic, value-based experience design
  17. Semi-Supervised Generative Models for Multi-Agent Trajectories
  18. Binary Random Nets II
  19. A Process Perspective on Organizational Failure
  20. Developing robust field survey protocols in landscape ecology
  21. MICSIM: Concept, Developments, and Applications of a PC Microsimulation Model for Research and Teaching
  22. CHANGING RECREATIONAL ACTIVITIES FOR REDUCING INSOMNIA SEVERITY? RESULTS FROM A SERIAL MEDIATION ANALYSIS ON THE IMPACT OF RECREATIONAL BEHAVIOR AS A MECHANISM OF CHANGE IN DIGITAL INTERVENTIONS FOR INSOMNIA
  23. Design of an Information-Based Distributed Production Planning System