Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs
Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review
Authors
Entity Linking is crucial for numerous downstream tasks, such as question answering, knowledge graph population, and general knowledge extraction. A frequently overlooked aspect of entity linking is the potential encounter with entities not yet present in a target knowledge graph. Although some recent studies have addressed this issue, they primarily utilize full-text knowledge bases or depend on external information such as crawled webpages. Full-text knowledge bases are not available in all domains and using external information is connected to increased effort. However, these resources are not available in most use cases. In this work, we solely rely on the information within a knowledge graph and assume no external information is accessible.
To investigate the challenge of identifying and disambiguating entities absent from the knowledge graph, we introduce a comprehensive silver-standard benchmark dataset that covers texts from 1999 to 2022. Based on our novel dataset, we develop an approach using pre-trained language models and knowledge graph embeddings without the need for a parallel full-text corpus. Moreover, by assessing the influence of knowledge graph embeddings on the given task, we show that implementing a sequential entity linking approach, which considers the whole sentence, can outperform clustering techniques that handle each mention separately in specific instances.
To investigate the challenge of identifying and disambiguating entities absent from the knowledge graph, we introduce a comprehensive silver-standard benchmark dataset that covers texts from 1999 to 2022. Based on our novel dataset, we develop an approach using pre-trained language models and knowledge graph embeddings without the need for a parallel full-text corpus. Moreover, by assessing the influence of knowledge graph embeddings on the given task, we show that implementing a sequential entity linking approach, which considers the whole sentence, can outperform clustering techniques that handle each mention separately in specific instances.
Original language | English |
---|---|
Title of host publication | Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI- : Proceedings of the 20th International Conference on Semantic Systems, 17-19 September 2024, Amsterdam, The Netherlands |
Editors | Angelo A. Salatino, Mehwish Alam, Femke Ongenae, Sahar Vahdati, Anna Lisa Gentile, Tassilo Pellegrini, Shufan Jiang |
Number of pages | 18 |
Place of Publication | Amsterdam |
Publisher | IOS Press BV |
Publication date | 11.09.2024 |
Pages | 88-105 |
ISBN (electronic) | 978-1-64368-537-3 |
DOIs | |
Publication status | Published - 11.09.2024 |
Event | 20th International Conference on Semantic Systems - SEMANTiCS 2024: Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI - Universität Amsterdam, Amsterdam, Netherlands Duration: 17.09.2024 → 19.09.2024 Conference number: 20 https://2024-eu.semantics.cc/ |
Bibliographical note
© 2024 The Authors
- Business informatics - Entity Linking, Entity Disambiguation, Out-of-KG Entities