Knowledge Graph Question Answering Leaderboard: A Community Resource to Prevent a Replication Crisis

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Authors

  • Aleksandr Perevalov
  • Xi Yan
  • Liubov Kovriguina
  • Longquan Jiang
  • Andreas Both
  • Ricardo Usbeck

Data-driven systems need to be evaluated to establish trust in the scientific approach and its applicability. In particular, this is true for Knowledge Graph (KG) Question Answering (QA), where complex data structures are made accessible via natural-language interfaces. Evaluating the capabilities of these systems has been a driver for the community for more than ten years and has led to the establishment of different KGQA benchmark datasets. However, comparing different approaches is cumbersome. The lack of curated leaderboards means there is no global view of the research field, which could inject mistrust into the results. In particular, the latest and most-used datasets in the KGQA community, LC-QuAD and QALD, lack central and up-to-date points of trust. In this paper, we survey and analyze a wide range of evaluation results with significant coverage of 100 publications and 98 systems from the last decade. We provide a new central and open leaderboard for any KGQA benchmark dataset as a focal point for the community: https://kgqa.github.io/leaderboard/. Our analysis highlights existing problems in the evaluation of KGQA systems. Thus, we point to possible improvements for future evaluations.

Original language: English
Title of host publication: 2022 Language Resources and Evaluation Conference, LREC 2022
Editors: Nicoletta Calzolari, Frederic Bechet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Helene Mazo, Jan Odijk, Stelios Piperidis
Number of pages: 10
Publisher: European Language Resources Association (ELRA)
Publication date: 2022
Pages: 2998-3007
ISBN (electronic): 9791095546726
Publication status: Published - 2022
Externally published: Yes
Event: 13th International Conference on Language Resources and Evaluation Conference - LREC 2022: Identify, Describe and Share your LRs! - Palais du Pharo, Marseille, France
Duration: 20.06.2022 - 25.06.2022
Conference number: 13
https://lrec2022.lrec-conf.org/en/

Bibliographical note

Publisher Copyright:
© European Language Resources Association (ELRA), licensed under CC-BY-NC-4.0.

Research areas

  • Evaluation Methodology, Knowledge Graph, Leaderboard, Question Answering, Replication Crisis
  • Business informatics
