N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

  • Michael Röder
  • Ricardo Usbeck
  • Sebastian Hellmann
  • Daniel Gerber
  • Andreas Both

Extracting Linked Data following the Semantic Web principle from unstructured sources has become a key challenge for scientific research. Named Entity Recognition and Disambiguation are two basic operations in this extraction process. One step towards the realization of the Semantic Web vision and the development of highly accurate tools is the availability of data for validating the quality of processes for Named Entity Recognition and Disambiguation as well as for algorithm tuning. This article presents three novel, manually curated and annotated corpora (N3). All of them are based on a free license and stored in the NLP Interchange Format to leverage the Linked Data character of our datasets.

OriginalspracheEnglisch
TitelProceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014
HerausgeberNicoletta Calzolari, Khalid Choukri, Sara Goggi, Thierry Declerck, Joseph Mariani, Bente Maegaard, Asuncion Moreno, Jan Odijk, Helene Mazo, Stelios Piperidis, Hrafn Loftsson
Anzahl der Seiten5
ErscheinungsortReykjavik, Iceland
VerlagEuropean Language Resources Association (ELRA)
Erscheinungsdatum05.2014
Seiten3529-3533
ISBN (elektronisch)9782951740884
PublikationsstatusErschienen - 05.2014
Extern publiziertJa
Veranstaltung9th International Conference on Language Resources and Evaluation, LREC 2014 - Reykjavik, Island
Dauer: 26.05.201431.05.2014
Konferenznummer: 9
http://www.lrec-conf.org/proceedings/lrec2014/index.html

Bibliographische Notiz

Funding Information:
We thank Luise Erfurth and Didier Cherix for helping us creating annotations of the datasets and Jens Lehmann for his feedback. A special thanks goes to news.de for allowing us to use their articles. Parts of this work were supported by the ESF and the Free State of Saxony.

Links

Zuletzt angesehen

Publikationen

  1. Managing Business Process in Distributed Systems: Requirements, Models, and Implementation
  2. On the Functional Controllability Using a Geometric Approach together with a Decoupled MPC for Motion Control in Robotino
  3. Applied quality assurance methods under the open source development model
  4. Analysis of PI controllers with anti-windup techniques on level systems
  5. Implementation of a Blended-Learning Course as Part of Faculty Development
  6. Top-down contingent feature-specific orienting with and without awareness of the visual input
  7. Detecting Various Road Damage Types in Global Countries Utilizing Faster R-CNN
  8. Analytic reproducibility in articles receiving open data badges at the journal Psychological Science
  9. Individual Differences in Infants' Speech Segmentation Performance
  10. Networking the environment
  11. Emotional text design in multimedia learning
  12. Article 11: Formal validity
  13. When Testing Becomes Learning—Underscoring the Relevance of Habituation to Improve Internal Validity of Common Neurocognitive Tests
  14. Mobilität
  15. FaQuAD
  16. Pre-service mathematics teachers' modelling processes within model eliciting activity through digital technologies
  17. Question Answering Mediated by Visual Clues and Knowledge Graphs
  18. Advantages and difficulties of conducting thinking-aloud protocols in the school setting
  19. Battery as a mediating technology of organization
  20. Introduction
  21. Using Multi-Label Classification for Improved Question Answering
  22. Explaining Age and Gender Differences in Employment Rates
  23. Controlling the Time Synchronicity of Convergent Supply Processes
  24. Delivering community benefits through REDD plus : Lessons from Joint Forest Management in Zambia
  25. Comfort and Adaptive Cruise Control in Highly Automated Vehicles
  26. Der "getarnte" Arbeitnehmer-Geschäftsführer