Survey on English Entity Linking on Wikidata: Datasets and approaches

Research output: Journal contributions › Journal articles › Research › peer-review

Authors

Wikidata is a frequently updated, community-driven, and multilingual knowledge graph. Hence, Wikidata is an attractive basis for Entity Linking, as is evident from the recent increase in published papers. This survey focuses on four subjects: (1) Which Wikidata Entity Linking datasets exist, how widely used are they, and how are they constructed? (2) Do the characteristics of Wikidata matter for the design of Entity Linking datasets and, if so, how? (3) How do current Entity Linking approaches exploit the specific characteristics of Wikidata? (4) Which Wikidata characteristics are unexploited by existing Entity Linking approaches? This survey reveals that current Wikidata-specific Entity Linking datasets do not differ in their annotation scheme from schemes for other knowledge graphs like DBpedia. Thus, the potential for multilingual and time-dependent datasets, naturally suited for Wikidata, remains untapped. Furthermore, we show that most Entity Linking approaches use Wikidata in the same way as any other knowledge graph, missing the chance to leverage Wikidata-specific characteristics to increase quality. Almost all approaches employ specific properties like labels and sometimes descriptions but ignore characteristics such as the hyper-relational structure. Hence, there is still room for improvement, for example, by including hyper-relational graph embeddings or type information. Many approaches also include information from Wikipedia, which is easily combinable with Wikidata and provides valuable textual information that Wikidata lacks.

Original language: English
Journal: Semantic Web
Volume: 13
Issue number: 6
Pages (from-to): 925-966
Number of pages: 42
ISSN: 1570-0844
DOIs
Publication status: Published - 26.09.2022
Externally published: Yes

Bibliographical note

We acknowledge the support of the EU project TAILOR (GA 952215), the Federal Ministry for Economic Affairs and Energy (BMWi) project SPEAKER (FKZ 01MK20011A), the German Federal Ministry of Education and Research (BMBF) projects and excellence clusters ML2R (FKZ 01 15 18038 A/B/C), MLwin (01S18050 D/F), ScaDS.AI (01/S18026A) as well as the Fraunhofer Zukunftsstiftung project JOSEPH. The authors also acknowledge the financial support by the Federal Ministry for Economic Affairs and Energy of Germany in the project CoyPu (project number 01MK21007G).

Publisher Copyright:
© 2022 - The authors. Published by IOS Press.
