Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Entity Linking is crucial for numerous downstream tasks, such as question answering, knowledge graph population, and general knowledge extraction. A frequently overlooked aspect of entity linking is the potential encounter with entities not yet present in a target knowledge graph. Although some recent studies have addressed this issue, they primarily utilize full-text knowledge bases or depend on external information such as crawled webpages. Full-text knowledge bases are not available in all domains and using external information is connected to increased effort. However, these resources are not available in most use cases. In this work, we solely rely on the information within a knowledge graph and assume no external information is accessible.

To investigate the challenge of identifying and disambiguating entities absent from the knowledge graph, we introduce a comprehensive silver-standard benchmark dataset that covers texts from 1999 to 2022. Based on our novel dataset, we develop an approach using pre-trained language models and knowledge graph embeddings without the need for a parallel full-text corpus. Moreover, by assessing the influence of knowledge graph embeddings on the given task, we show that implementing a sequential entity linking approach, which considers the whole sentence, can outperform clustering techniques that handle each mention separately in specific instances.
Original languageEnglish
Title of host publicationKnowledge Graphs in the Age of Language Models and Neuro-Symbolic AI- : Proceedings of the 20th International Conference on Semantic Systems, 17-19 September 2024, Amsterdam, The Netherlands
EditorsAngelo A. Salatino, Mehwish Alam, Femke Ongenae, Sahar Vahdati, Anna Lisa Gentile, Tassilo Pellegrini, Shufan Jiang
Number of pages18
Place of PublicationAmsterdam
PublisherIOS Press BV
Publication date11.09.2024
Pages88-105
ISBN (electronic)978-1-64368-537-3
DOIs
Publication statusPublished - 11.09.2024
Event20th International Conference on Semantic Systems - SEMANTiCS 2024: Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI - Universität Amsterdam, Amsterdam, Netherlands
Duration: 17.09.202419.09.2024
Conference number: 20
https://2024-eu.semantics.cc/

Bibliographical note

© 2024 The Authors

DOI

Recently viewed

Publications

  1. Inversion of Fuzzy Neural Networks for the Reduction of Noise in the Control Loop for Automotive Applications
  2. A simple nonlinear PD control for faster and high-precision positioning of servomechanisms with actuator saturation
  3. Applications of the Simultaneous Modular Approach in the Field of Material Flow Analysis
  4. Stability analysis of a linear model predictive control and its application in a water recovery process
  5. Robust feedback linearization using an adaptive PD regulator for a sensorless control of a throttle valve
  6. Using Language Learning Resources on YouTube
  7. Logistical Potentials of Load Balancing via the Build-up and Reduction of Stock
  8. Modeling self-determination theory motivation data by using unfolding IRT
  9. Complexity and Administrative Intensity
  10. E-privacy concerns
  11. Microstructure-based modeling of residual stresses in WC-12Co-sprayed coatings
  12. How generative drawing affects the learning process
  13. Orchestrating distributed data governance in open social innovation
  14. Achieving enhanced mechanical properties in Mg-Gd-Y-Zn-Mn alloy by altering dynamic recrystallization behavior via pre-ageing treatment
  15. Internet: Impact and Potential for Learning and Instruction
  16. Principled Interpolation in Normalizing Flows
  17. Combining mechanics and electrostatics
  18. Qualitätssicherung und Entwicklung in der Elementarpädagogik
  19. Integrated reporting with CSR practices
  20. Time and Income Poverty: An Interdependent Multidimensional Poverty Approach with German Time Use Diary Data
  21. Minimization of answer distortion in personality questionnaires
  22. The Impact of TV Ads on the Individual User's Purchasing Behavior
  23. Reconsidering adaptation as translation
  24. How real options and ecological resilience thinking can assist in environmental risk management