N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Standard

N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format. / Röder, Michael; Usbeck, Ricardo; Hellmann, Sebastian et al.
Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014. ed. / Nicoletta Calzolari; Khalid Choukri; Sara Goggi; Thierry Declerck; Joseph Mariani; Bente Maegaard; Asuncion Moreno; Jan Odijk; Helene Mazo; Stelios Piperidis; Hrafn Loftsson. Reykjavik, Iceland: European Language Resources Association (ELRA), 2014. p. 3529-3533 (Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014).

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Harvard

Röder, M, Usbeck, R, Hellmann, S, Gerber, D & Both, A 2014, N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format. in N Calzolari, K Choukri, S Goggi, T Declerck, J Mariani, B Maegaard, A Moreno, J Odijk, H Mazo, S Piperidis & H Loftsson (eds), Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014. Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014, European Language Resources Association (ELRA), Reykjavik, Iceland, pp. 3529-3533, 9th International Conference on Language Resources and Evaluation, LREC 2014, Reykjavik, Iceland, 26.05.14. <https://aclanthology.org/L14-1662/>

APA

Röder, M., Usbeck, R., Hellmann, S., Gerber, D., & Both, A. (2014). N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format. In N. Calzolari, K. Choukri, S. Goggi, T. Declerck, J. Mariani, B. Maegaard, A. Moreno, J. Odijk, H. Mazo, S. Piperidis, & H. Loftsson (Eds.), Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014 (pp. 3529-3533). (Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014). European Language Resources Association (ELRA). https://aclanthology.org/L14-1662/

Vancouver

Röder M, Usbeck R, Hellmann S, Gerber D, Both A. N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format. In Calzolari N, Choukri K, Goggi S, Declerck T, Mariani J, Maegaard B, Moreno A, Odijk J, Mazo H, Piperidis S, Loftsson H, editors, Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014. Reykjavik, Iceland: European Language Resources Association (ELRA). 2014. p. 3529-3533. (Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014).

Bibtex

@inbook{2a893794b7f64b678dfd8ff257522d90,
title = "N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format",
abstract = "Extracting Linked Data following the Semantic Web principle from unstructured sources has become a key challenge for scientific research. Named Entity Recognition and Disambiguation are two basic operations in this extraction process. One step towards the realization of the Semantic Web vision and the development of highly accurate tools is the availability of data for validating the quality of processes for Named Entity Recognition and Disambiguation as well as for algorithm tuning. This article presents three novel, manually curated and annotated corpora (N3). All of them are based on a free license and stored in the NLP Interchange Format to leverage the Linked Data character of our datasets.",
keywords = "Datasets, Named entity detection, Named entity disambiguation, NLP interchange format, Informatics, Business informatics",
author = "Michael R{\"o}der and Ricardo Usbeck and Sebastian Hellmann and Daniel Gerber and Andreas Both",
note = "We thank Luise Erfurth and Didier Cherix for helping us creating annotations of the datasets and Jens Lehmann for his feedback. A special thanks goes to news.de for allowing us to use their articles. Parts of this work were supported by the ESF and the Free State of Saxony. ACL materials are Copyright {\textcopyright} 1963–2023; 9th International Conference on Language Resources and Evaluation, LREC 2014, LREC 2014 ; Conference date: 26-05-2014 Through 31-05-2014",
year = "2014",
month = may,
language = "English",
series = "Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014",
publisher = "European Language Resources Association (ELRA)",
pages = "3529--3533",
editor = "Nicoletta Calzolari and Khalid Choukri and Sara Goggi and Thierry Declerck and Joseph Mariani and Bente Maegaard and Asuncion Moreno and Jan Odijk and Helene Mazo and Stelios Piperidis and Hrafn Loftsson",
booktitle = "Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014",
address = "Luxembourg",
url = "http://www.lrec-conf.org/proceedings/lrec2014/index.html",

}

RIS

TY - CHAP

T1 - N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format

AU - Röder, Michael

AU - Usbeck, Ricardo

AU - Hellmann, Sebastian

AU - Gerber, Daniel

AU - Both, Andreas

N1 - Conference code: 9

PY - 2014/5

Y1 - 2014/5

N2 - Extracting Linked Data following the Semantic Web principle from unstructured sources has become a key challenge for scientific research. Named Entity Recognition and Disambiguation are two basic operations in this extraction process. One step towards the realization of the Semantic Web vision and the development of highly accurate tools is the availability of data for validating the quality of processes for Named Entity Recognition and Disambiguation as well as for algorithm tuning. This article presents three novel, manually curated and annotated corpora (N3). All of them are based on a free license and stored in the NLP Interchange Format to leverage the Linked Data character of our datasets.

AB - Extracting Linked Data following the Semantic Web principle from unstructured sources has become a key challenge for scientific research. Named Entity Recognition and Disambiguation are two basic operations in this extraction process. One step towards the realization of the Semantic Web vision and the development of highly accurate tools is the availability of data for validating the quality of processes for Named Entity Recognition and Disambiguation as well as for algorithm tuning. This article presents three novel, manually curated and annotated corpora (N3). All of them are based on a free license and stored in the NLP Interchange Format to leverage the Linked Data character of our datasets.

KW - Datasets

KW - Named entity detection

KW - Named entity disambiguation

KW - NLP interchange format

KW - Informatics

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=85032871168&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/0861e4d8-9e27-347c-b695-bfba479f1be1/

M3 - Article in conference proceedings

AN - SCOPUS:85032871168

T3 - Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014

SP - 3529

EP - 3533

BT - Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014

A2 - Calzolari, Nicoletta

A2 - Choukri, Khalid

A2 - Goggi, Sara

A2 - Declerck, Thierry

A2 - Mariani, Joseph

A2 - Maegaard, Bente

A2 - Moreno, Asuncion

A2 - Odijk, Jan

A2 - Mazo, Helene

A2 - Piperidis, Stelios

A2 - Loftsson, Hrafn

PB - European Language Resources Association (ELRA)

CY - Reykjavik, Iceland

T2 - 9th International Conference on Language Resources and Evaluation, LREC 2014

Y2 - 26 May 2014 through 31 May 2014

ER -

Links

Recently viewed

Researchers

  1. Tim Dornis

Publications

  1. A Quadrant Approach of Camera Calibration Method for Depth Estimation Using a Stereo Vision System
  2. Dynamic Performance Analysis and Fault Ride-Through Enhancement by a Modified Fault Current Protection Scheme of a Grid-Connected Doubly Fed Induction Generator
  3. Inversion of Fuzzy Neural Networks for the Reduction of Noise in the Control Loop for Automotive Applications
  4. Enabling Road Condition Monitoring with an on-board Vehicle Sensor Setup
  5. Efficient and accurate ℓ p-norm multiple kernel learning
  6. Multi-view learning with dependent views
  7. Modelling the Complexity of Measurement Estimation Situations - A Theoretical Framework for the Estimation of Lengths
  8. Model inversion using fuzzy neural network with boosting of the solution
  9. Fixed-term Contracts and Wages Revisited Using Linked Employer-Employee Data from Germany
  10. Evaluating entity annotators using GERBIL
  11. Emergency detection based on probabilistic modeling in AAL environments
  12. Modern Baselines for SPARQL Semantic Parsing
  13. Qualitätssicherung und Entwicklung in der Elementarpädagogik
  14. Quantification of phototrophically grown Galdieria sulphuraria and other microalgae using diphenylamine
  15. Commitment Strategies for Sustainability
  16. Cyberpunk
  17. Sudoko mathematics for and done by younger students
  18. Credit Constraints and Margins of Import
  19. Circularity in Automotive Electronics Design
  20. Empirical research on mathematical modelling
  21. Part III: Motion and control of autonomous unmanned aerial systems as a challenge in Industry 4.0 process
  22. Systemprogrammierung I
  23. Is Calluna vulgaris a suitable bio-monitor of management-mediated nutrient pools in heathland ecosystems?
  24. Stability matters: A dynamic process view on self-efficacy in training transfer.
  25. Anticipated imitation of multiple agents
  26. Characteristics of comprehension processes in mathematical modelling
  27. Proposing a social-ecological framework for successful grassland restoration in Germany—an overview and insights from the Grassworks project
  28. Utilization of organic residues using heterotrophic microalgae and insects
  29. An automated, modular system for organic waste utilization using heterotrophic alga Galdieria sulphuraria
  30. Characterization of the Basic Types of Lunar Highland Breccias by Quantitative Textural Analysis
  31. Bright Spots for Local WFD Implementation Through Collaboration with Nature Conservation Authorities?
  32. Sustainable Development and Material Flows

Press / Media

  1. Duration