Entity Linking with Out-of-Knowledge-Graph Entity Detection and Clustering Using Only Knowledge Graphs

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Authors

Entity linking is crucial for numerous downstream tasks, such as question answering, knowledge graph population, and general knowledge extraction. A frequently overlooked aspect of entity linking is the potential encounter with entities not yet present in a target knowledge graph. Although some recent studies have addressed this issue, they primarily rely on full-text knowledge bases or on external information such as crawled webpages. Full-text knowledge bases are not available in all domains, and gathering external information takes additional effort, so these resources are unavailable in most use cases. In this work, we rely solely on the information within a knowledge graph and assume that no external information is accessible.

To investigate the challenge of identifying and disambiguating entities absent from the knowledge graph, we introduce a comprehensive silver-standard benchmark dataset that covers texts from 1999 to 2022. Based on this novel dataset, we develop an approach that uses pre-trained language models and knowledge graph embeddings without the need for a parallel full-text corpus. Moreover, by assessing the influence of knowledge graph embeddings on the given task, we show that a sequential entity linking approach, which considers the whole sentence, can in specific instances outperform clustering techniques that handle each mention separately.
Original language: English
Title of host publication: Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI: Proceedings of the 20th International Conference on Semantic Systems, 17-19 September 2024, Amsterdam, The Netherlands
Editors: Angelo A. Salatino, Mehwish Alam, Femke Ongenae, Sahar Vahdati, Anna Lisa Gentile, Tassilo Pellegrini, Shufan Jiang
Number of pages: 18
Place of Publication: Amsterdam
Publisher: IOS Press BV
Publication date: 11.09.2024
Pages: 88-105
ISBN (electronic): 978-1-64368-537-3
Publication status: Published - 11.09.2024
Event: 20th International Conference on Semantic Systems - SEMANTiCS 2024: Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI - Universität Amsterdam, Amsterdam, Netherlands
Duration: 17.09.2024 → 19.09.2024
Conference number: 20
https://2024-eu.semantics.cc/

Bibliographical note

© 2024 The Authors
