Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Standard

Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs. / Möller, Cedric; Usbeck, Ricardo.
Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI: Proceedings of the 20th International Conference on Semantic Systems, 17-19 September 2024, Amsterdam, The Netherlands. ed. / Angelo A. Salatino; Mehwish Alam; Femke Ongenae; Sahar Vahdati; Anna Lisa Gentile; Tassilo Pellegrini; Shufan Jiang. Amsterdam: IOS Press BV, 2024. p. 88-105 (Studies on the Semantic Web; Vol. 60).

Harvard

Möller, C & Usbeck, R 2024, Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs. in AA Salatino, M Alam, F Ongenae, S Vahdati, AL Gentile, T Pellegrini & S Jiang (eds), Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI: Proceedings of the 20th International Conference on Semantic Systems, 17-19 September 2024, Amsterdam, The Netherlands. Studies on the Semantic Web, vol. 60, IOS Press BV, Amsterdam, pp. 88-105, 20th International Conference on Semantic Systems - SEMANTiCS 2024, Amsterdam, Netherlands, 17.09.24. https://doi.org/10.3233/SSW240009

APA

Möller, C., & Usbeck, R. (2024). Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs. In A. A. Salatino, M. Alam, F. Ongenae, S. Vahdati, A. L. Gentile, T. Pellegrini, & S. Jiang (Eds.), Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI: Proceedings of the 20th International Conference on Semantic Systems, 17-19 September 2024, Amsterdam, The Netherlands (pp. 88-105). (Studies on the Semantic Web; Vol. 60). IOS Press BV. https://doi.org/10.3233/SSW240009

Vancouver

Möller C, Usbeck R. Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs. In Salatino AA, Alam M, Ongenae F, Vahdati S, Gentile AL, Pellegrini T, Jiang S, editors, Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI: Proceedings of the 20th International Conference on Semantic Systems, 17-19 September 2024, Amsterdam, The Netherlands. Amsterdam: IOS Press BV. 2024. p. 88-105. (Studies on the Semantic Web). doi: 10.3233/SSW240009

Bibtex

@inbook{0e3e0f1feb394832955e1c1aafac7b6d,
title = "Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs",
abstract = "Entity Linking is crucial for numerous downstream tasks, such as question answering, knowledge graph population, and general knowledge extraction. A frequently overlooked aspect of entity linking is the potential encounter with entities not yet present in a target knowledge graph. Although some recent studies have addressed this issue, they primarily utilize full-text knowledge bases or depend on external information such as crawled webpages. Full-text knowledge bases are not available in all domains and using external information is connected to increased effort. However, these resources are not available in most use cases. In this work, we solely rely on the information within a knowledge graph and assume no external information is accessible. To investigate the challenge of identifying and disambiguating entities absent from the knowledge graph, we introduce a comprehensive silver-standard benchmark dataset that covers texts from 1999 to 2022. Based on our novel dataset, we develop an approach using pre-trained language models and knowledge graph embeddings without the need for a parallel full-text corpus. Moreover, by assessing the influence of knowledge graph embeddings on the given task, we show that implementing a sequential entity linking approach, which considers the whole sentence, can outperform clustering techniques that handle each mention separately in specific instances.",
keywords = "Business informatics, Entity Linking, Entity Disambiguation, Out-of-KG Entities",
author = "Cedric M{\"o}ller and Ricardo Usbeck",
note = "{\textcopyright} 2024 The Authors; 20th International Conference on Semantic Systems - SEMANTiCS 2024 : Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI, SEMANTiCS 2024 ; Conference date: 17-09-2024 Through 19-09-2024",
year = "2024",
month = sep,
day = "11",
doi = "10.3233/SSW240009",
language = "English",
series = "Studies on the Semantic Web",
publisher = "IOS Press BV",
pages = "88--105",
editor = "Salatino, {Angelo A.} and Mehwish Alam and Femke Ongenae and Sahar Vahdati and Gentile, {Anna Lisa} and Tassilo Pellegrini and Shufan Jiang",
booktitle = "Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI",
address = "Netherlands",
url = "https://2024-eu.semantics.cc/",
}

RIS

TY - CHAP
T1 - Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs
AU - Möller, Cedric
AU - Usbeck, Ricardo
N1 - Conference code: 20
PY - 2024/9/11
Y1 - 2024/9/11
N2 - Entity Linking is crucial for numerous downstream tasks, such as question answering, knowledge graph population, and general knowledge extraction. A frequently overlooked aspect of entity linking is the potential encounter with entities not yet present in a target knowledge graph. Although some recent studies have addressed this issue, they primarily utilize full-text knowledge bases or depend on external information such as crawled webpages. Full-text knowledge bases are not available in all domains and using external information is connected to increased effort. However, these resources are not available in most use cases. In this work, we solely rely on the information within a knowledge graph and assume no external information is accessible. To investigate the challenge of identifying and disambiguating entities absent from the knowledge graph, we introduce a comprehensive silver-standard benchmark dataset that covers texts from 1999 to 2022. Based on our novel dataset, we develop an approach using pre-trained language models and knowledge graph embeddings without the need for a parallel full-text corpus. Moreover, by assessing the influence of knowledge graph embeddings on the given task, we show that implementing a sequential entity linking approach, which considers the whole sentence, can outperform clustering techniques that handle each mention separately in specific instances.
AB - Entity Linking is crucial for numerous downstream tasks, such as question answering, knowledge graph population, and general knowledge extraction. A frequently overlooked aspect of entity linking is the potential encounter with entities not yet present in a target knowledge graph. Although some recent studies have addressed this issue, they primarily utilize full-text knowledge bases or depend on external information such as crawled webpages. Full-text knowledge bases are not available in all domains and using external information is connected to increased effort. However, these resources are not available in most use cases. In this work, we solely rely on the information within a knowledge graph and assume no external information is accessible. To investigate the challenge of identifying and disambiguating entities absent from the knowledge graph, we introduce a comprehensive silver-standard benchmark dataset that covers texts from 1999 to 2022. Based on our novel dataset, we develop an approach using pre-trained language models and knowledge graph embeddings without the need for a parallel full-text corpus. Moreover, by assessing the influence of knowledge graph embeddings on the given task, we show that implementing a sequential entity linking approach, which considers the whole sentence, can outperform clustering techniques that handle each mention separately in specific instances.
KW - Business informatics
KW - Entity Linking
KW - Entity Disambiguation
KW - Out-of-KG Entities
U2 - 10.3233/SSW240009
DO - 10.3233/SSW240009
M3 - Article in conference proceedings
T3 - Studies on the Semantic Web
SP - 88
EP - 105
BT - Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI
A2 - Salatino, Angelo A.
A2 - Alam, Mehwish
A2 - Ongenae, Femke
A2 - Vahdati, Sahar
A2 - Gentile, Anna Lisa
A2 - Pellegrini, Tassilo
A2 - Jiang, Shufan
PB - IOS Press BV
CY - Amsterdam
T2 - 20th International Conference on Semantic Systems - SEMANTiCS 2024
Y2 - 17 September 2024 through 19 September 2024
ER -
