GERBIL - General entity annotator benchmarking framework

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Authors

  • Michael Röder
  • Axel Cyrille Ngonga Ngomo
  • Ciro Baron
  • Andreas Both
  • Martin Brümmer
  • Diego Ceccarelli
  • Marco Cornolti
  • Didier Cherix
  • Bernd Eickmann
  • Paolo Ferragina
  • Christiane Lemke
  • Andrea Moro
  • Roberto Navigli
  • Francesco Piccinno
  • Giuseppe Rizzo
  • Harald Sack
  • René Speck
  • Raphaël Troncy
  • Jörg Waitelonis
  • Lars Wesemann

The need to bridge between the unstructured data on the Document Web and the structured data on the Web of Data has led to the development of a considerable number of annotation tools. However, these tools are currently still hard to compare since the published evaluation results are calculated on diverse datasets and evaluated based on different measures. We present GERBIL, an evaluation framework for semantic entity annotation. The rationale behind our framework is to provide developers, end users and researchers with easy-to-use interfaces that allow for the agile, fine-grained and uniform evaluation of annotation tools on multiple datasets. By these means, we aim to ensure that both tool developers and end users can derive meaningful insights pertaining to the extension, integration and use of annotation applications. In particular, GERBIL provides comparable results to tool developers so as to allow them to easily discover the strengths and weaknesses of their implementations with respect to the state of the art. With the permanent experiment URIs provided by our framework, we ensure the reproducibility and archiving of evaluation results. Moreover, the framework generates data in a machine-processable format, allowing for the efficient querying and post-processing of evaluation results. Finally, the tool diagnostics provided by GERBIL allow deriving insights pertaining to the areas in which tools should be further refined, thus allowing developers to create an informed agenda for extensions and end users to detect the right tools for their purposes. GERBIL aims to become a focal point for the state of the art, driving the research agenda of the community by presenting comparable objective evaluation results.

Original language: English
Title of host publication: WWW 2015 - Proceedings of the 24th International Conference on World Wide Web
Editors: Aldo Gangemi, Stefano Leonardi, Alessandro Panconesi
Number of pages: 11
Publisher: Association for Computing Machinery, Inc
Publication date: 18.05.2015
Pages: 1133-1143
ISBN (Print): 978-1-4503-3469-3
Publication status: Published - 18.05.2015
Externally published: Yes
Event: 24th International Conference on World Wide Web, WWW 2015 - Florence, Italy
Duration: 18.05.2015 - 22.05.2015
https://dl.acm.org/doi/proceedings/10.1145/2740908
https://dblp.org/db/conf/www/www2015.html
