N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

  • Michael Röder
  • Ricardo Usbeck
  • Sebastian Hellmann
  • Daniel Gerber
  • Andreas Both

Extracting Linked Data following the Semantic Web principle from unstructured sources has become a key challenge for scientific research. Named Entity Recognition and Disambiguation are two basic operations in this extraction process. One step towards the realization of the Semantic Web vision and the development of highly accurate tools is the availability of data for validating the quality of processes for Named Entity Recognition and Disambiguation as well as for algorithm tuning. This article presents three novel, manually curated and annotated corpora (N3). All of them are based on a free license and stored in the NLP Interchange Format to leverage the Linked Data character of our datasets.

Original languageEnglish
Title of host publicationProceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014
EditorsNicoletta Calzolari, Khalid Choukri, Sara Goggi, Thierry Declerck, Joseph Mariani, Bente Maegaard, Asuncion Moreno, Jan Odijk, Helene Mazo, Stelios Piperidis, Hrafn Loftsson
Number of pages5
Place of PublicationReykjavik, Iceland
PublisherEuropean Language Resources Association (ELRA)
Publication date05.2014
Pages3529-3533
ISBN (electronic)9782951740884
Publication statusPublished - 05.2014
Externally publishedYes
Event9th International Conference on Language Resources and Evaluation, LREC 2014 - Reykjavik, Iceland
Duration: 26.05.201431.05.2014
Conference number: 9
http://www.lrec-conf.org/proceedings/lrec2014/index.html

Bibliographical note

We thank Luise Erfurth and Didier Cherix for helping us creating annotations of
the datasets and Jens Lehmann for his feedback. A special thanks goes to news.de for allowing us to use their articles. Parts of this work were supported by the ESF and
the Free State of Saxony.

ACL materials are Copyright © 1963–2023

Links

Recently viewed

Researchers

  1. Goddert Oheimb

Publications

  1. An error management perspective on audit quality
  2. Individual Scans Fusion in Virtual Knowledge Base for Navigation of Mobile Robotic Group with 3D TVS
  3. Microstructural and mechanical aspects of reinforcement welds for lightweight components produced by friction hydro pillar processing
  4. Fusion of knowledge bases for better navigation of wheeled mobile robotic group with 3D TVS
  5. The polarity field concept
  6. FROM THE EDITORS ERRORS IN ORGANIZATIONS
  7. Quo Vadis, Umweltinformatik? 6. Workshop
  8. On-board pneumatic pressure generation methods for soft robotics applications
  9. Mythos
  10. Daniel Fiott (ed.), The csdp in 2020: The EU’s legacy and ambition in security and defence
  11. A plea for realistic pessimism
  12. Nuclear Power Worldwide
  13. Deciding whether to work after retirement
  14. Where is paradise? The EU's navigation system Galileo - Some comments on inherent risks (or paradise lost)
  15. How to Limit the Spillover from the 2021 Inflation Surge to Inflation Expectations?
  16. The effectiveness of interventions during and after residence in women’s shelters
  17. Testing a Calibration-Free Eye Tracker Prototype at the Kunsthistorisches Museum in Vienna
  18. Schools and their ‚culture of consumption‘: a context for consumer learning
  19. Wirksam führen auf Distanz
  20. Energy management for inductive power transmission
  21. Hindernisse überwinden
  22. Fides implicita
  23. Marktorientierte Planung des Produktsystems
  24. Polymorphic microsatellite loci in the endangered butterfly Lycaena helle (Lepidoptera: Lycaenidae)
  25. Förderkartei 3./4. Schuljahr