N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Standard

N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format. / Röder, Michael; Usbeck, Ricardo; Hellmann, Sebastian et al.
Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014. Hrsg. / Nicoletta Calzolari; Khalid Choukri; Sara Goggi; Thierry Declerck; Joseph Mariani; Bente Maegaard; Asuncion Moreno; Jan Odijk; Helene Mazo; Stelios Piperidis; Hrafn Loftsson. Reykjavik, Iceland: European Language Resources Association (ELRA), 2014. S. 3529-3533 (Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014).

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Harvard

Röder, M, Usbeck, R, Hellmann, S, Gerber, D & Both, A 2014, N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format. in N Calzolari, K Choukri, S Goggi, T Declerck, J Mariani, B Maegaard, A Moreno, J Odijk, H Mazo, S Piperidis & H Loftsson (Hrsg.), Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014. Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014, European Language Resources Association (ELRA), Reykjavik, Iceland, S. 3529-3533, 9th International Conference on Language Resources and Evaluation, LREC 2014, Reykjavik, Island, 26.05.14. <https://aclanthology.org/L14-1662/>

APA

Röder, M., Usbeck, R., Hellmann, S., Gerber, D., & Both, A. (2014). N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format. In N. Calzolari, K. Choukri, S. Goggi, T. Declerck, J. Mariani, B. Maegaard, A. Moreno, J. Odijk, H. Mazo, S. Piperidis, & H. Loftsson (Hrsg.), Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014 (S. 3529-3533). (Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014). European Language Resources Association (ELRA). https://aclanthology.org/L14-1662/

Vancouver

Röder M, Usbeck R, Hellmann S, Gerber D, Both A. N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format. in Calzolari N, Choukri K, Goggi S, Declerck T, Mariani J, Maegaard B, Moreno A, Odijk J, Mazo H, Piperidis S, Loftsson H, Hrsg., Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014. Reykjavik, Iceland: European Language Resources Association (ELRA). 2014. S. 3529-3533. (Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014).

Bibtex

@inbook{2a893794b7f64b678dfd8ff257522d90,
title = "N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format",
abstract = "Extracting Linked Data following the Semantic Web principle from unstructured sources has become a key challenge for scientific research. Named Entity Recognition and Disambiguation are two basic operations in this extraction process. One step towards the realization of the Semantic Web vision and the development of highly accurate tools is the availability of data for validating the quality of processes for Named Entity Recognition and Disambiguation as well as for algorithm tuning. This article presents three novel, manually curated and annotated corpora (N3). All of them are based on a free license and stored in the NLP Interchange Format to leverage the Linked Data character of our datasets.",
keywords = "Datasets, Named entity detection, Named entity disambiguation, NLP interchange format, Informatics, Business informatics",
author = "Michael R{\"o}der and Ricardo Usbeck and Sebastian Hellmann and Daniel Gerber and Andreas Both",
note = "We thank Luise Erfurth and Didier Cherix for helping us creating annotations of the datasets and Jens Lehmann for his feedback. A special thanks goes to news.de for allowing us to use their articles. Parts of this work were supported by the ESF and the Free State of Saxony. ACL materials are Copyright {\textcopyright} 1963–2023; 9th International Conference on Language Resources and Evaluation, LREC 2014, LREC 2014 ; Conference date: 26-05-2014 Through 31-05-2014",
year = "2014",
month = may,
language = "English",
series = "Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014",
publisher = "European Language Resources Association (ELRA)",
pages = "3529--3533",
editor = "Nicoletta Calzolari and Khalid Choukri and Sara Goggi and Thierry Declerck and Joseph Mariani and Bente Maegaard and Asuncion Moreno and Jan Odijk and Helene Mazo and Stelios Piperidis and Hrafn Loftsson",
booktitle = "Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014",
address = "Luxembourg",
url = "http://www.lrec-conf.org/proceedings/lrec2014/index.html",

}

RIS

TY - CHAP

T1 - N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format

AU - Röder, Michael

AU - Usbeck, Ricardo

AU - Hellmann, Sebastian

AU - Gerber, Daniel

AU - Both, Andreas

N1 - Conference code: 9

PY - 2014/5

Y1 - 2014/5

N2 - Extracting Linked Data following the Semantic Web principle from unstructured sources has become a key challenge for scientific research. Named Entity Recognition and Disambiguation are two basic operations in this extraction process. One step towards the realization of the Semantic Web vision and the development of highly accurate tools is the availability of data for validating the quality of processes for Named Entity Recognition and Disambiguation as well as for algorithm tuning. This article presents three novel, manually curated and annotated corpora (N3). All of them are based on a free license and stored in the NLP Interchange Format to leverage the Linked Data character of our datasets.

AB - Extracting Linked Data following the Semantic Web principle from unstructured sources has become a key challenge for scientific research. Named Entity Recognition and Disambiguation are two basic operations in this extraction process. One step towards the realization of the Semantic Web vision and the development of highly accurate tools is the availability of data for validating the quality of processes for Named Entity Recognition and Disambiguation as well as for algorithm tuning. This article presents three novel, manually curated and annotated corpora (N3). All of them are based on a free license and stored in the NLP Interchange Format to leverage the Linked Data character of our datasets.

KW - Datasets

KW - Named entity detection

KW - Named entity disambiguation

KW - NLP interchange format

KW - Informatics

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=85032871168&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/0861e4d8-9e27-347c-b695-bfba479f1be1/

M3 - Article in conference proceedings

AN - SCOPUS:85032871168

T3 - Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014

SP - 3529

EP - 3533

BT - Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014

A2 - Calzolari, Nicoletta

A2 - Choukri, Khalid

A2 - Goggi, Sara

A2 - Declerck, Thierry

A2 - Mariani, Joseph

A2 - Maegaard, Bente

A2 - Moreno, Asuncion

A2 - Odijk, Jan

A2 - Mazo, Helene

A2 - Piperidis, Stelios

A2 - Loftsson, Hrafn

PB - European Language Resources Association (ELRA)

CY - Reykjavik, Iceland

T2 - 9th International Conference on Language Resources and Evaluation, LREC 2014

Y2 - 26 May 2014 through 31 May 2014

ER -

Links

Zuletzt angesehen

Publikationen

  1. Changing the Administration from within:
  2. Using cross-recurrence quantification analysis to compute similarity measures for time series of unequal length with applications to sleep stage analysis
  3. Stepwise-based optimizing approaches for arrangements of loudspeaker in multi-zone sound field reproduction
  4. Contributions of declarative and procedural memory to accuracy and automatization during second language practice
  5. On the Functional Controllability Using a Geometric Approach together with a Decoupled MPC for Motion Control in Robotino
  6. On the Power and Performance of a Doubly Latent Residual Approach to Explain Latent Specific Factors in Multilevel-Bifactor-(S-1) Models
  7. Modeling and numerical simulation of multiscale behavior in polycrystals via extended crystal plasticity
  8. Using learning protocols for knowledge acquisition and problem solving with individual and group incentives
  9. An extended analytical approach to evaluating monotonic functions of fuzzy numbers
  10. FaST: A linear time stack trace alignment heuristic for crash report deduplication
  11. Age effects on controlling tools with sensorimotor transformations
  12. Age effects on controlling tools with sensorimotor transformations
  13. Predicting the Difficulty of Exercise Items for Dynamic Difficulty Adaptation in Adaptive Language Tutoring
  14. Distinguishing state variability from trait change in longitudinal data
  15. Return of Fibonacci random walks
  16. Knowledge Graph Question Answering Using Graph-Pattern Isomorphism
  17. Artificial Intelligence Algorithms for Collaborative Book Recommender Systems
  18. A discrete approximate solution for the asymptotic tracking problem in affine nonlinear systems
  19. A Switching Cascade Sliding PID-PID Controllers Combined with a Feedforward and an MPC for an Actuator in Camless Internal Combustion Engines
  20. Appendix A: Design, implementation, and analysis of the iGOES project
  21. Evaluation of Time/Phase Parameters in Frequency Measurements for Inertial Navigation Systems
  22. Modelling and implementation of an Order2Cash Process in distributed systems
  23. Investigation and modeling of the material behavior due to evolving dislocation microstructures in fcc and bcc metals
  24. The Scalable Question Answering Over Linked Data (SQA) Challenge 2018
  25. 7th open challenge on question answering over linked data (QALD-7)
  26. Effectiveness of a guided multicomponent internet and mobile gratitude training program - A pragmatic randomized controlled trial
  27. Graphism and Flatness. The Line as Mediator between Time and Space, Intuition and Concept
  28. An expert-based reference list of variables for characterizing and monitoring social-ecological systems
  29. Homogenization modeling of thin-layer-type microstructures
  30. Integration of laser scanning and projection speckle pattern for advanced pipeline monitoring
  31. Considerations on efficient touch interfaces - How display size influences the performance in an applied pointing task
  32. An Orthogonal Wavelet Denoising Algorithm for Surface Images of Atomic Force Microscopy
  33. Expertise in research integration and implementation for tackling complex problems
  34. For a return to the forgotten formula: 'Data 1 + Data 2 > Data 1'
  35. Efficient and accurate ℓ p-norm multiple kernel learning