N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

  • Michael Röder
  • Ricardo Usbeck
  • Sebastian Hellmann
  • Daniel Gerber
  • Andreas Both

Extracting Linked Data following the Semantic Web principle from unstructured sources has become a key challenge for scientific research. Named Entity Recognition and Disambiguation are two basic operations in this extraction process. One step towards the realization of the Semantic Web vision and the development of highly accurate tools is the availability of data for validating the quality of processes for Named Entity Recognition and Disambiguation as well as for algorithm tuning. This article presents three novel, manually curated and annotated corpora (N3). All of them are based on a free license and stored in the NLP Interchange Format to leverage the Linked Data character of our datasets.

OriginalspracheEnglisch
TitelProceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014
HerausgeberNicoletta Calzolari, Khalid Choukri, Sara Goggi, Thierry Declerck, Joseph Mariani, Bente Maegaard, Asuncion Moreno, Jan Odijk, Helene Mazo, Stelios Piperidis, Hrafn Loftsson
Anzahl der Seiten5
ErscheinungsortReykjavik, Iceland
VerlagEuropean Language Resources Association (ELRA)
Erscheinungsdatum05.2014
Seiten3529-3533
ISBN (elektronisch)9782951740884
PublikationsstatusErschienen - 05.2014
Extern publiziertJa
Veranstaltung9th International Conference on Language Resources and Evaluation, LREC 2014 - Reykjavik, Island
Dauer: 26.05.201431.05.2014
Konferenznummer: 9
http://www.lrec-conf.org/proceedings/lrec2014/index.html

Bibliographische Notiz

Funding Information:
We thank Luise Erfurth and Didier Cherix for helping us creating annotations of the datasets and Jens Lehmann for his feedback. A special thanks goes to news.de for allowing us to use their articles. Parts of this work were supported by the ESF and the Free State of Saxony.

Links

Zuletzt angesehen

Publikationen

  1. Species composition and forest structure explain the temperature sensitivity patterns of productivity in temperate forests
  2. Combining a PI Controller with an Adaptive Feedforward Control in PMSM
  3. Speed of processing and stimulus complexity in low-frequency and high-frequency channels
  4. Industrial applications using wavelet packets for gross error detection
  5. Life Cycle Assessment of Consumption Patterns – Understanding the links between changing social practices and environmental impacts
  6. Erratum to "Generic functions of railway stations-A conceptual basis for the development of common system understanding and assessment criteria" [Transp. Policy 18 (2010) 446-455]
  7. Computer Game Worlds
  8. The creation and analysis of employer-employee matched data, ed. by John C. Haltiwanger ...
  9. Development of a cell culture system for studying effects of native and photochemically transformed gaseous compounds using an air/liquid culture technique
  10. Ablation Study of a Multimodal Gat Network on Perfect Synthetic and Real-world Data to Investigate the Influence of Language Models in Invoice Recognition
  11. Magnesium recycling: State-of-the-Art developments, part II
  12. Ten essentials for action-oriented and second order energy transitions, transformations and climate change research
  13. Assessment of occupational exertion and strain in laboratory- and real occupational environments
  14. Data Practices
  15. Transparency in an Age of Digitalization and Responsibility
  16. Design of finger joint implants based on triply periodic minimal surfaces
  17. Facing complex crime
  18. Where Paintings Live
  19. Governance im Wandel