Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Standard

Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs. / Möller, Cedric; Usbeck, Ricardo.
Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI: Proceedings of the 20th International Conference on Semantic Systems, 17-19 September 2024, Amsterdam, The Netherlands. ed. / Angelo A. Salatino; Mehwish Alam; Femke Ongenae; Sahar Vahdati; Anna Lisa Gentile; Tassilo Pellegrini; Shufan Jiang. Amsterdam: IOS Press BV, 2024. p. 88-105 (Studies on the Semantic Web; Vol. 60).

Harvard

Möller, C & Usbeck, R 2024, Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs. in AA Salatino, M Alam, F Ongenae, S Vahdati, AL Gentile, T Pellegrini & S Jiang (eds), Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI: Proceedings of the 20th International Conference on Semantic Systems, 17-19 September 2024, Amsterdam, The Netherlands. Studies on the Semantic Web, vol. 60, IOS Press BV, Amsterdam, pp. 88-105, 20th International Conference on Semantic Systems - SEMANTiCS 2024, Amsterdam, Netherlands, 17.09.24. https://doi.org/10.3233/SSW240009

APA

Möller, C., & Usbeck, R. (2024). Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs. In A. A. Salatino, M. Alam, F. Ongenae, S. Vahdati, A. L. Gentile, T. Pellegrini, & S. Jiang (Eds.), Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI: Proceedings of the 20th International Conference on Semantic Systems, 17-19 September 2024, Amsterdam, The Netherlands (pp. 88-105). (Studies on the Semantic Web; Vol. 60). IOS Press BV. https://doi.org/10.3233/SSW240009

Vancouver

Möller C, Usbeck R. Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs. In Salatino AA, Alam M, Ongenae F, Vahdati S, Gentile AL, Pellegrini T, Jiang S, editors, Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI: Proceedings of the 20th International Conference on Semantic Systems, 17-19 September 2024, Amsterdam, The Netherlands. Amsterdam: IOS Press BV. 2024. p. 88-105. (Studies on the Semantic Web). doi: 10.3233/SSW240009

Bibtex

@inbook{0e3e0f1feb394832955e1c1aafac7b6d,
title = "Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs",
abstract = "Entity Linking is crucial for numerous downstream tasks, such as question answering, knowledge graph population, and general knowledge extraction. A frequently overlooked aspect of entity linking is the potential encounter with entities not yet present in a target knowledge graph. Although some recent studies have addressed this issue, they primarily utilize full-text knowledge bases or depend on external information such as crawled webpages. Full-text knowledge bases are not available in all domains and using external information is connected to increased effort. However, these resources are not available in most use cases. In this work, we solely rely on the information within a knowledge graph and assume no external information is accessible. To investigate the challenge of identifying and disambiguating entities absent from the knowledge graph, we introduce a comprehensive silver-standard benchmark dataset that covers texts from 1999 to 2022. Based on our novel dataset, we develop an approach using pre-trained language models and knowledge graph embeddings without the need for a parallel full-text corpus. Moreover, by assessing the influence of knowledge graph embeddings on the given task, we show that implementing a sequential entity linking approach, which considers the whole sentence, can outperform clustering techniques that handle each mention separately in specific instances.",
keywords = "Business informatics, Entity Linking, Entity Disambiguation, Out-of-KG Entities",
author = "Cedric M{\"o}ller and Ricardo Usbeck",
note = "{\textcopyright} 2024 The Authors; 20th International Conference on Semantic Systems - SEMANTiCS 2024 : Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI, SEMANTiCS 2024 ; Conference date: 17-09-2024 Through 19-09-2024",
year = "2024",
month = sep,
day = "11",
doi = "10.3233/SSW240009",
language = "English",
series = "Studies on the Semantic Web",
publisher = "IOS Press BV",
pages = "88--105",
editor = "Salatino, {Angelo A.} and Mehwish Alam and Femke Ongenae and Sahar Vahdati and Gentile, {Anna Lisa} and Tassilo Pellegrini and Shufan Jiang",
booktitle = "Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI",
address = "Netherlands",
url = "https://2024-eu.semantics.cc/",
}

RIS

TY - CHAP

T1 - Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs

AU - Möller, Cedric

AU - Usbeck, Ricardo

N1 - Conference code: 20

PY - 2024/9/11

Y1 - 2024/9/11

N2 - Entity Linking is crucial for numerous downstream tasks, such as question answering, knowledge graph population, and general knowledge extraction. A frequently overlooked aspect of entity linking is the potential encounter with entities not yet present in a target knowledge graph. Although some recent studies have addressed this issue, they primarily utilize full-text knowledge bases or depend on external information such as crawled webpages. Full-text knowledge bases are not available in all domains and using external information is connected to increased effort. However, these resources are not available in most use cases. In this work, we solely rely on the information within a knowledge graph and assume no external information is accessible. To investigate the challenge of identifying and disambiguating entities absent from the knowledge graph, we introduce a comprehensive silver-standard benchmark dataset that covers texts from 1999 to 2022. Based on our novel dataset, we develop an approach using pre-trained language models and knowledge graph embeddings without the need for a parallel full-text corpus. Moreover, by assessing the influence of knowledge graph embeddings on the given task, we show that implementing a sequential entity linking approach, which considers the whole sentence, can outperform clustering techniques that handle each mention separately in specific instances.

AB - Entity Linking is crucial for numerous downstream tasks, such as question answering, knowledge graph population, and general knowledge extraction. A frequently overlooked aspect of entity linking is the potential encounter with entities not yet present in a target knowledge graph. Although some recent studies have addressed this issue, they primarily utilize full-text knowledge bases or depend on external information such as crawled webpages. Full-text knowledge bases are not available in all domains and using external information is connected to increased effort. However, these resources are not available in most use cases. In this work, we solely rely on the information within a knowledge graph and assume no external information is accessible. To investigate the challenge of identifying and disambiguating entities absent from the knowledge graph, we introduce a comprehensive silver-standard benchmark dataset that covers texts from 1999 to 2022. Based on our novel dataset, we develop an approach using pre-trained language models and knowledge graph embeddings without the need for a parallel full-text corpus. Moreover, by assessing the influence of knowledge graph embeddings on the given task, we show that implementing a sequential entity linking approach, which considers the whole sentence, can outperform clustering techniques that handle each mention separately in specific instances.

KW - Business informatics

KW - Entity Linking

KW - Entity Disambiguation

KW - Out-of-KG Entities

U2 - 10.3233/SSW240009

DO - 10.3233/SSW240009

M3 - Article in conference proceedings

T3 - Studies on the Semantic Web

SP - 88

EP - 105

BT - Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI

A2 - Salatino, Angelo A.

A2 - Alam, Mehwish

A2 - Ongenae, Femke

A2 - Vahdati, Sahar

A2 - Gentile, Anna Lisa

A2 - Pellegrini, Tassilo

A2 - Jiang, Shufan

PB - IOS Press BV

CY - Amsterdam

T2 - 20th International Conference on Semantic Systems - SEMANTiCS 2024

Y2 - 17 September 2024 through 19 September 2024

ER -
