Survey on English Entity Linking on Wikidata: Datasets and approaches

Publication: Contributions to journals › Journal articles › Research › peer-reviewed

Authors

Wikidata is a frequently updated, community-driven, and multilingual knowledge graph. Hence, Wikidata is an attractive basis for Entity Linking, as evidenced by the recent increase in published papers. This survey focuses on four questions: (1) Which Wikidata Entity Linking datasets exist, how widely used are they, and how are they constructed? (2) Do the characteristics of Wikidata matter for the design of Entity Linking datasets and, if so, how? (3) How do current Entity Linking approaches exploit the specific characteristics of Wikidata? (4) Which Wikidata characteristics are unexploited by existing Entity Linking approaches? This survey reveals that current Wikidata-specific Entity Linking datasets do not differ in their annotation scheme from schemes for other knowledge graphs like DBpedia. Thus, the potential for multilingual and time-dependent datasets, which are naturally suited to Wikidata, remains untapped. Furthermore, we show that most Entity Linking approaches use Wikidata in the same way as any other knowledge graph, missing the chance to leverage Wikidata-specific characteristics to increase quality. Almost all approaches employ specific properties like labels and sometimes descriptions but ignore characteristics such as the hyper-relational structure. Hence, there is still room for improvement, for example, by including hyper-relational graph embeddings or type information. Many approaches also include information from Wikipedia, which is easily combined with Wikidata and provides valuable textual information that Wikidata lacks.

Original language: English
Journal: Semantic Web
Volume: 13
Issue number: 6
Pages (from - to): 925-966
Number of pages: 42
ISSN: 1570-0844
DOIs
Publication status: Published - 26.09.2022
Externally published: Yes

Bibliographical note

Funding Information:
We acknowledge the support of the EU project TAILOR (GA 952215), the Federal Ministry for Economic Affairs and Energy (BMWi) project SPEAKER (FKZ 01MK20011A), the German Federal Ministry of Education and Research (BMBF) projects and excellence clusters ML2R (FKZ 01 15 18038 A/B/C), MLwin (01S18050 D/F), ScaDS.AI (01/S18026A) as well as the Fraunhofer Zukunftsstiftung project JOSEPH. The authors also acknowledge the financial support by the Federal Ministry for Economic Affairs and Energy of Germany in the project CoyPu (project number 01MK21007G).

Publisher Copyright:
© 2022 - The authors. Published by IOS Press.

