Knowledge-Enhanced Language Models Are Not Bias-Proof: Situated Knowledge and Epistemic Injustice in AI

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Authors

The factual inaccuracies ("hallucinations") of large language models have recently inspired more research on knowledge-enhanced language modeling approaches. These are often assumed to enhance the overall trustworthiness and objectivity of language models. Meanwhile, the issue of bias is usually only mentioned as a limitation of statistical representations. This dissociation of knowledge enhancement and bias is in line with previous research on AI engineers' assumptions about knowledge, which indicates that knowledge is commonly understood as objective and value-neutral by this community. We argue that claims and practices by actors in the field still reflect this underlying conception of knowledge. We contrast this assumption with literature from social and, in particular, feminist epistemology, which argues that the idea of a universal disembodied knower is blind to the reality of knowledge practices and seriously challenges claims of "objective" or "neutral" knowledge. Knowledge enhancement techniques commonly use Wikidata and Wikipedia as their sources for knowledge, due to their large scale, public accessibility, and assumed trustworthiness. In this work, they serve as a case study for the influence of the social setting and the identity of knowers on epistemic processes. Indeed, the communities behind Wikidata and Wikipedia are known to be male-dominated, and many instances of hostile behavior have been reported in the past decade. In effect, the contents of these knowledge bases are highly biased. It is therefore doubtful that these knowledge bases would contribute to bias reduction. In fact, our empirical evaluations of RoBERTa, KEPLER, and CoLAKE demonstrate that knowledge enhancement may not live up to the hopes of increased objectivity. In our study, the average probability for stereotypical associations was preserved on two out of three metrics, and performance-related gender gaps on knowledge-driven tasks were also preserved.
We build on these results and critical literature to argue that the label of "knowledge" and the commonly held beliefs about it can obscure the harm that is still done to marginalized groups. Knowledge enhancement is at risk of perpetuating epistemic injustice, and AI engineers' understanding of knowledge as objective per se conceals this injustice. Finally, to get closer to trustworthy language models, we need to rethink knowledge in AI and aim for an agenda of diversification and scrutiny from outgroup members.

Original language: English
Title of host publication: 2024 ACM Conference on Fairness, Accountability, and Transparency, FAccT 2024
Number of pages: 13
Publisher: Association for Computing Machinery, Inc
Publication date: 03.06.2024
Pages: 1433-1445
ISBN (print): 9798400704505
ISBN (electronic): 979-8-4007-0450-5
Publication status: Published - 03.06.2024
Event: ACM Conference on Fairness, Accountability, and Transparency - FAccT 2024 - Rio de Janeiro, Brazil
Duration: 03.06.2024 - 06.06.2024
https://facctconference.org/2024/

Bibliographical note

Publisher Copyright:
© 2024 Owner/Author.

Research areas

  • bias, epistemology, fairness, feminism, knowledge enhancement, knowledge graphs, language models, natural language processing, representation
  • Informatics
