Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Entity Linking is crucial for numerous downstream tasks, such as question answering, knowledge graph population, and general knowledge extraction. A frequently overlooked aspect of entity linking is the potential encounter with entities not yet present in a target knowledge graph. Although some recent studies have addressed this issue, they primarily utilize full-text knowledge bases or depend on external information such as crawled webpages. Full-text knowledge bases are not available in all domains and using external information is connected to increased effort. However, these resources are not available in most use cases. In this work, we solely rely on the information within a knowledge graph and assume no external information is accessible.

To investigate the challenge of identifying and disambiguating entities absent from the knowledge graph, we introduce a comprehensive silver-standard benchmark dataset that covers texts from 1999 to 2022. Based on our novel dataset, we develop an approach using pre-trained language models and knowledge graph embeddings without the need for a parallel full-text corpus. Moreover, by assessing the influence of knowledge graph embeddings on the given task, we show that implementing a sequential entity linking approach, which considers the whole sentence, can outperform clustering techniques that handle each mention separately in specific instances.
Original languageEnglish
Title of host publicationKnowledge Graphs in the Age of Language Models and Neuro-Symbolic AI- : Proceedings of the 20th International Conference on Semantic Systems, 17-19 September 2024, Amsterdam, The Netherlands
EditorsAngelo A. Salatino, Mehwish Alam, Femke Ongenae, Sahar Vahdati, Anna Lisa Gentile, Tassilo Pellegrini, Shufan Jiang
Number of pages18
Place of PublicationAmsterdam
PublisherIOS Press BV
Publication date11.09.2024
Pages88-105
ISBN (electronic)978-1-64368-537-3
DOIs
Publication statusPublished - 11.09.2024
Event20th International Conference on Semantic Systems - SEMANTiCS 2024: Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI - Universität Amsterdam, Amsterdam, Netherlands
Duration: 17.09.202419.09.2024
Conference number: 20
https://2024-eu.semantics.cc/

Bibliographical note

© 2024 The Authors

DOI