N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

  • Michael Röder
  • Ricardo Usbeck
  • Sebastian Hellmann
  • Daniel Gerber
  • Andreas Both

Extracting Linked Data following the Semantic Web principle from unstructured sources has become a key challenge for scientific research. Named Entity Recognition and Disambiguation are two basic operations in this extraction process. One step towards the realization of the Semantic Web vision and the development of highly accurate tools is the availability of data for validating the quality of processes for Named Entity Recognition and Disambiguation as well as for algorithm tuning. This article presents three novel, manually curated and annotated corpora (N3). All of them are based on a free license and stored in the NLP Interchange Format to leverage the Linked Data character of our datasets.

OriginalspracheEnglisch
TitelProceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014
HerausgeberNicoletta Calzolari, Khalid Choukri, Sara Goggi, Thierry Declerck, Joseph Mariani, Bente Maegaard, Asuncion Moreno, Jan Odijk, Helene Mazo, Stelios Piperidis, Hrafn Loftsson
Anzahl der Seiten5
ErscheinungsortReykjavik, Iceland
VerlagEuropean Language Resources Association (ELRA)
Erscheinungsdatum05.2014
Seiten3529-3533
ISBN (elektronisch)9782951740884
PublikationsstatusErschienen - 05.2014
Extern publiziertJa
Veranstaltung9th International Conference on Language Resources and Evaluation, LREC 2014 - Reykjavik, Island
Dauer: 26.05.201431.05.2014
Konferenznummer: 9
http://www.lrec-conf.org/proceedings/lrec2014/index.html

Bibliographische Notiz

Funding Information:
We thank Luise Erfurth and Didier Cherix for helping us creating annotations of the datasets and Jens Lehmann for his feedback. A special thanks goes to news.de for allowing us to use their articles. Parts of this work were supported by the ESF and the Free State of Saxony.

Links

Zuletzt angesehen

Publikationen

  1. Comparing the Sensitivity of Social Networks, Web Graphs, and Random Graphs with Respect to Vertex Removal
  2. Optimal trajectory generation using MPC in robotino and its implementation with ROS system
  3. Multi-Parallel Sending Coils for Movable Receivers in Inductive Charging Systems
  4. On the Nonlinearity Compensation in Permanent Magnet Machine Using a Controller Based on a Controlled Invariant Subspace
  5. Paraphrasing Method for Controlling a Robotic Arm Using a Large Language Model
  6. Anomaly detection in formed sheet metals using convolutional autoencoders
  7. A Multilevel CFA-MTMM Model for Nested Structurally Different Methods
  8. Selection and Recognition of Statistically Defined Signals in Learning Systems
  9. Linux-based Embedded System for Wavelet Denoising and Monitoring of sEMG Signals using an Axiomatic Seminorm
  10. Neural Combinatorial Optimization on Heterogeneous Graphs
  11. Constructions and Reconstructions. The Architectural Image between Rendering and Photography
  12. Analyzing different types of moderated method effects in confirmatory factor models for structurally different methods
  13. Using the flatness of DC-Drives to emulate a generator for a decoupled MPC using a geometric approach for motion control in Robotino
  14. Dynamic Lot Size Optimization with Reinforcement Learning
  15. Latent structure perceptron with feature induction for unrestricted coreference resolution
  16. Intersection tests for the cointegrating rank in dependent panel data
  17. Dispatching rule selection with Gaussian processes
  18. Unidimensional and Multidimensional Methods for Recurrence Quantification Analysis with crqa
  19. Optimizing sampling of flying insects using a modified window trap
  20. Finding Similar Movements in Positional Data Streams
  21. Exploration strategies, performance, and error consequences when learning a complex computer task
  22. The Use of Genetic Algorithm for PID Controller Auto-Tuning in ARM CORTEX M4 Platform
  23. Lyapunov stability analysis to set up a PI controller for a mass flow system in case of a non-saturating input
  24. Empowering materials processing and performance from data and AI
  25. Multidimensional Cross-Recurrence Quantification Analysis (MdCRQA)–A Method for Quantifying Correlation between Multivariate Time-Series
  26. Changing the Administration from within:
  27. Using cross-recurrence quantification analysis to compute similarity measures for time series of unequal length with applications to sleep stage analysis
  28. Using Decision Trees and Reinforcement Learning for the Dynamic Adjustment of Composite Sequencing Rules in a Flexible Manufacturing System
  29. On the Functional Controllability Using a Geometric Approach together with a Decoupled MPC for Motion Control in Robotino
  30. On the Power and Performance of a Doubly Latent Residual Approach to Explain Latent Specific Factors in Multilevel-Bifactor-(S-1) Models