Knowledge-Enhanced Language Models Are Not Bias-Proof: Situated Knowledge and Epistemic Injustice in AI

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

The factual inaccuracies ("hallucinations") of large language models have recently inspired more research on knowledge-enhanced language modeling approaches. These are often assumed to enhance the overall trustworthiness and objectivity of language models. Meanwhile, the issue of bias is usually only mentioned as a limitation of statistical representations. This dissociation of knowledge enhancement and bias is in line with previous research on AI engineers' assumptions about knowledge, which indicates that knowledge is commonly understood as objective and value-neutral by this community. We argue that claims and practices by actors of the field still reflect this underlying conception of knowledge. We contrast this assumption with literature from social and, in particular, feminist epistemology, which argues that the idea of a universal disembodied knower is blind to the reality of knowledge practices and seriously challenges claims of "objective" or "neutral" knowledge. Knowledge enhancement techniques commonly use Wikidata and Wikipedia as their sources for knowledge, due to their large scale, public accessibility, and assumed trustworthiness. In this work, they serve as a case study for the influence of the social setting and the identity of knowers on epistemic processes. Indeed, the communities behind Wikidata and Wikipedia are known to be male-dominated and many instances of hostile behavior have been reported in the past decade. In effect, the contents of these knowledge bases are highly biased. It is therefore doubtful that these knowledge bases would contribute to bias reduction. In fact, our empirical evaluations of RoBERTa, KEPLER, and CoLAKE demonstrate that knowledge enhancement may not live up to the hopes of increased objectivity. In our study, the average probability for stereotypical associations was preserved on two out of three metrics, and performance-related gender gaps on knowledge-driven tasks were also preserved.
We build on these results and critical literature to argue that the label of "knowledge" and the commonly held beliefs about it can obscure the harm that is still done to marginalized groups. Knowledge enhancement is at risk of perpetuating epistemic injustice, and AI engineers' understanding of knowledge as objective per se conceals this injustice. Finally, to get closer to trustworthy language models, we need to rethink knowledge in AI and aim for an agenda of diversification and scrutiny from outgroup members.

Original language: English
Title of host publication: 2024 ACM Conference on Fairness, Accountability, and Transparency, FAccT 2024
Number of pages: 13
Publisher: Association for Computing Machinery, Inc
Publication date: 03.06.2024
Pages: 1433-1445
ISBN (print): 9798400704505
ISBN (electronic): 979-8-4007-0450-5
Publication status: Published - 03.06.2024
Event: ACM Conference on Fairness, Accountability, and Transparency - FAccT 2024 - Rio de Janeiro, Brazil
Duration: 03.06.2024 - 06.06.2024
https://facctconference.org/2024/

Bibliographical note

Publisher Copyright:
© 2024 Owner/Author.

Research areas

  • bias, epistemology, fairness, feminism, knowledge enhancement, knowledge graphs, language models, natural language processing, representation
  • Informatics
