N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format

Publication: Contributions to collected works > Articles in conference proceedings > Research > peer-reviewed

Authors

  • Michael Röder
  • Ricardo Usbeck
  • Sebastian Hellmann
  • Daniel Gerber
  • Andreas Both

Extracting Linked Data following the Semantic Web principle from unstructured sources has become a key challenge for scientific research. Named Entity Recognition and Disambiguation are two basic operations in this extraction process. One step towards the realization of the Semantic Web vision and the development of highly accurate tools is the availability of data for validating the quality of processes for Named Entity Recognition and Disambiguation as well as for algorithm tuning. This article presents three novel, manually curated and annotated corpora (N3). All of them are based on a free license and stored in the NLP Interchange Format to leverage the Linked Data character of our datasets.

Original language: English
Title: Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014
Editors: Nicoletta Calzolari, Khalid Choukri, Sara Goggi, Thierry Declerck, Joseph Mariani, Bente Maegaard, Asuncion Moreno, Jan Odijk, Helene Mazo, Stelios Piperidis, Hrafn Loftsson
Number of pages: 5
Place of publication: Reykjavik, Iceland
Publisher: European Language Resources Association (ELRA)
Publication date: 05.2014
Pages: 3529-3533
ISBN (electronic): 9782951740884
Publication status: Published - 05.2014
Externally published: Yes
Event: 9th International Conference on Language Resources and Evaluation, LREC 2014 - Reykjavik, Iceland
Duration: 26.05.2014 - 31.05.2014
Conference number: 9
http://www.lrec-conf.org/proceedings/lrec2014/index.html

Bibliographical note

Funding Information:
We thank Luise Erfurth and Didier Cherix for helping us create annotations for the datasets and Jens Lehmann for his feedback. Special thanks go to news.de for allowing us to use their articles. Parts of this work were supported by the ESF and the Free State of Saxony.
