Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Standard

Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs. / Möller, Cedric; Usbeck, Ricardo.
Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI- : Proceedings of the 20th International Conference on Semantic Systems, 17-19 September 2024, Amsterdam, The Netherlands. ed. / Angelo A. Salatino; Mehwish Alam; Femke Ongenae; Sahar Vahdati; Anna Lisa Gentile; Tassilo Pellegrini; Shufan Jiang. Amsterdam: IOS Press BV, 2024. p. 88-105 (Studies on the Semantic Web; Vol. 60).

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Harvard

Möller, C & Usbeck, R 2024, Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs. in AA Salatino, M Alam, F Ongenae, S Vahdati, AL Gentile, T Pellegrini & S Jiang (eds), Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI- : Proceedings of the 20th International Conference on Semantic Systems, 17-19 September 2024, Amsterdam, The Netherlands. Studies on the Semantic Web, vol. 60, IOS Press BV, Amsterdam, pp. 88-105, 20th International Conference on Semantic Systems - SEMANTiCS 2024, Amsterdam, Netherlands, 17.09.24. https://doi.org/10.3233/SSW240009

APA

Möller, C., & Usbeck, R. (2024). Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs. In A. A. Salatino, M. Alam, F. Ongenae, S. Vahdati, A. L. Gentile, T. Pellegrini, & S. Jiang (Eds.), Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI- : Proceedings of the 20th International Conference on Semantic Systems, 17-19 September 2024, Amsterdam, The Netherlands (pp. 88-105). (Studies on the Semantic Web; Vol. 60). IOS Press BV. https://doi.org/10.3233/SSW240009

Vancouver

Möller C, Usbeck R. Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs. In Salatino AA, Alam M, Ongenae F, Vahdati S, Gentile AL, Pellegrini T, Jiang S, editors, Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI- : Proceedings of the 20th International Conference on Semantic Systems, 17-19 September 2024, Amsterdam, The Netherlands. Amsterdam: IOS Press BV. 2024. p. 88-105. (Studies on the Semantic Web). doi: 10.3233/SSW240009

Bibtex

@inbook{0e3e0f1feb394832955e1c1aafac7b6d,
title = "Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs",
abstract = "Entity Linking is crucial for numerous downstream tasks, such as question answering, knowledge graph population, and general knowledge extraction. A frequently overlooked aspect of entity linking is the potential encounter with entities not yet present in a target knowledge graph. Although some recent studies have addressed this issue, they primarily utilize full-text knowledge bases or depend on external information such as crawled webpages. Full-text knowledge bases are not available in all domains and using external information is connected to increased effort. However, these resources are not available in most use cases. In this work, we solely rely on the information within a knowledge graph and assume no external information is accessible.To investigate the challenge of identifying and disambiguating entities absent from the knowledge graph, we introduce a comprehensive silver-standard benchmark dataset that covers texts from 1999 to 2022. Based on our novel dataset, we develop an approach using pre-trained language models and knowledge graph embeddings without the need for a parallel full-text corpus. Moreover, by assessing the influence of knowledge graph embeddings on the given task, we show that implementing a sequential entity linking approach, which considers the whole sentence, can outperform clustering techniques that handle each mention separately in specific instances.",
keywords = "Business informatics, Entity Linking, Entity Disambiguation, Out-of-KG Entities",
author = "Cedric M{\"o}ller and Ricardo Usbeck",
note = "{\textcopyright} 2024 The Authors; 20th International Conference on Semantic Systems - SEMANTiCS 2024 : Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI, SEMANTiCS 2024 ; Conference date: 17-09-2024 Through 19-09-2024",
year = "2024",
month = sep,
day = "11",
doi = "10.3233/SSW240009",
language = "English",
series = "Studies on the Semantic Web",
publisher = "IOS Press BV",
pages = "88--105",
editor = "Salatino, {Angelo A.} and Mehwish Alam and Femke Ongenae and Sahar Vahdati and Gentile, {Anna Lisa} and Tassilo Pellegrini and Shufan Jiang",
booktitle = "Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI-",
address = "Netherlands",
url = "https://2024-eu.semantics.cc/ ",

}

RIS

TY - CHAP

T1 - Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs

AU - Möller, Cedric

AU - Usbeck, Ricardo

N1 - Conference code: 20

PY - 2024/9/11

Y1 - 2024/9/11

N2 - Entity Linking is crucial for numerous downstream tasks, such as question answering, knowledge graph population, and general knowledge extraction. A frequently overlooked aspect of entity linking is the potential encounter with entities not yet present in a target knowledge graph. Although some recent studies have addressed this issue, they primarily utilize full-text knowledge bases or depend on external information such as crawled webpages. Full-text knowledge bases are not available in all domains and using external information is connected to increased effort. However, these resources are not available in most use cases. In this work, we solely rely on the information within a knowledge graph and assume no external information is accessible.To investigate the challenge of identifying and disambiguating entities absent from the knowledge graph, we introduce a comprehensive silver-standard benchmark dataset that covers texts from 1999 to 2022. Based on our novel dataset, we develop an approach using pre-trained language models and knowledge graph embeddings without the need for a parallel full-text corpus. Moreover, by assessing the influence of knowledge graph embeddings on the given task, we show that implementing a sequential entity linking approach, which considers the whole sentence, can outperform clustering techniques that handle each mention separately in specific instances.

AB - Entity Linking is crucial for numerous downstream tasks, such as question answering, knowledge graph population, and general knowledge extraction. A frequently overlooked aspect of entity linking is the potential encounter with entities not yet present in a target knowledge graph. Although some recent studies have addressed this issue, they primarily utilize full-text knowledge bases or depend on external information such as crawled webpages. Full-text knowledge bases are not available in all domains and using external information is connected to increased effort. However, these resources are not available in most use cases. In this work, we solely rely on the information within a knowledge graph and assume no external information is accessible.To investigate the challenge of identifying and disambiguating entities absent from the knowledge graph, we introduce a comprehensive silver-standard benchmark dataset that covers texts from 1999 to 2022. Based on our novel dataset, we develop an approach using pre-trained language models and knowledge graph embeddings without the need for a parallel full-text corpus. Moreover, by assessing the influence of knowledge graph embeddings on the given task, we show that implementing a sequential entity linking approach, which considers the whole sentence, can outperform clustering techniques that handle each mention separately in specific instances.

KW - Business informatics

KW - Entity Linking

KW - Entity Disambiguation

KW - Out-of-KG Entities

U2 - 10.3233/SSW240009

DO - 10.3233/SSW240009

M3 - Article in conference proceedings

T3 - Studies on the Semantic Web

SP - 88

EP - 105

BT - Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI-

A2 - Salatino, Angelo A.

A2 - Alam, Mehwish

A2 - Ongenae, Femke

A2 - Vahdati, Sahar

A2 - Gentile, Anna Lisa

A2 - Pellegrini, Tassilo

A2 - Jiang, Shufan

PB - IOS Press BV

CY - Amsterdam

T2 - 20th International Conference on Semantic Systems - SEMANTiCS 2024

Y2 - 17 September 2024 through 19 September 2024

ER -

DOI

Recently viewed

Publications

  1. Median based algorithm as an entropy function for noise detectionin wavelet trees for data reconciliation
  2. Advanced Neural Classifier-Based Effective Human Assistance Robots Using Comparable Interactive Input Assessment Technique
  3. A Wavelet Based Algorithm without a Priori Knowledge of Noise Level for Gross Errors Detection
  4. Fostering Circularity: Building a Local Community and Implementing Circular Processes
  5. Calculation of Average Mutual Information (AMI) and false-nearest neighbors (FNN) for the estimation of embedding parameters of multidimensional time series in matlab
  6. Modeling and Performance Analysis of a Node in Fault Tolerant Wireless Sensor Networks
  7. Discourse Analyses in Chat-based CSCL with Learning Protocols
  8. Database Publishing Without Databases
  9. A Lightweight Simulation Model for Soft Robot's Locomotion and its Application to Trajectory Optimization
  10. Transformer with Tree-order Encoding for Neural Program Generation
  11. Closed-loop control of product geometry by using an artificial neural network in incremental sheet forming with active medium
  12. Preventive Emergency Detection Based on the Probabilistic Evaluation of Distributed, Embedded Sensor Networks
  13. A transfer operator based computational study of mixing processes in open flow systems
  14. Automatic enumeration of all connected subgraphs.
  15. Methodologies for Noise and Gross Error Detection using Univariate Signal-Based Approaches in Industrial Application
  16. Enabling Road Condition Monitoring with an on-board Vehicle Sensor Setup
  17. Efficient and accurate ℓ p-norm multiple kernel learning
  18. Neural network-based adaptive fault-tolerant control for strict-feedback nonlinear systems with input dead zone and saturation
  19. Different complex word problems require different combinations of cognitive skills
  20. Semantic Parsing for Knowledge Graph Question Answering with Large Language Models
  21. Control of the inverse pendulum based on sliding mode and model predictive control
  22. Clustering Hydrological Homogeneous Regions and Neural Network Based Index Flood Estimation for Ungauged Catchments
  23. Latent structure perceptron with feature induction for unrestricted coreference resolution
  24. Selecting and Adapting Methods for Analysis and Design in Value-Sensitive Digital Social Innovation Projects: Toward Design Principles
  25. Modeling Effective and Ineffective Knowledge Communication and Learning Discourses in CSCL with Hidden Markov Models
  26. Problem structuring for transitions
  27. Using Decision Trees and Reinforcement Learning for the Dynamic Adjustment of Composite Sequencing Rules in a Flexible Manufacturing System
  28. Spatial mislocalization as a consequence of sequential coding of stimuli
  29. DialogueMaps: Supporting interactive transdisciplinary dialogues with a web-based tool for multi-layer knowledge maps
  30. Real-time RDF extraction from unstructured data streams
  31. A Multivariate Method for Dynamic System Analysis
  32. On the Decoupling and Output Functional Controllability of Robotic Manipulation
  33. Analysis of long-term statistical data of cobalt flows in the EU
  34. Supporting the Development and Implementation of a Digitalization Strategy in SMEs through a Lightweight Architecture-based Method
  35. FFTSMC with Optimal Reference Trajectory Generated by MPC in Robust Robotino Motion Planning with Saturating Inputs