N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

  • Michael Röder
  • Ricardo Usbeck
  • Sebastian Hellmann
  • Daniel Gerber
  • Andreas Both

Extracting Linked Data following the Semantic Web principle from unstructured sources has become a key challenge for scientific research. Named Entity Recognition and Disambiguation are two basic operations in this extraction process. One step towards the realization of the Semantic Web vision and the development of highly accurate tools is the availability of data for validating the quality of processes for Named Entity Recognition and Disambiguation as well as for algorithm tuning. This article presents three novel, manually curated and annotated corpora (N3). All of them are based on a free license and stored in the NLP Interchange Format to leverage the Linked Data character of our datasets.

OriginalspracheEnglisch
TitelProceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014
HerausgeberNicoletta Calzolari, Khalid Choukri, Sara Goggi, Thierry Declerck, Joseph Mariani, Bente Maegaard, Asuncion Moreno, Jan Odijk, Helene Mazo, Stelios Piperidis, Hrafn Loftsson
Anzahl der Seiten5
ErscheinungsortReykjavik, Iceland
VerlagEuropean Language Resources Association (ELRA)
Erscheinungsdatum05.2014
Seiten3529-3533
ISBN (elektronisch)9782951740884
PublikationsstatusErschienen - 05.2014
Extern publiziertJa
Veranstaltung9th International Conference on Language Resources and Evaluation, LREC 2014 - Reykjavik, Island
Dauer: 26.05.201431.05.2014
Konferenznummer: 9
http://www.lrec-conf.org/proceedings/lrec2014/index.html

Bibliographische Notiz

Funding Information:
We thank Luise Erfurth and Didier Cherix for helping us creating annotations of the datasets and Jens Lehmann for his feedback. A special thanks goes to news.de for allowing us to use their articles. Parts of this work were supported by the ESF and the Free State of Saxony.

Links

Zuletzt angesehen

Publikationen

  1. Ambient Intelligence and Knowledge Processing in Distributed Autonomous AAL-Components
  2. Comparing the Sensitivity of Social Networks, Web Graphs, and Random Graphs with Respect to Vertex Removal
  3. Optimal trajectory generation using MPC in robotino and its implementation with ROS system
  4. Sequencing and fading worked examples and collaboration scripts to foster mathematical argumentation - working memory capacity matters for fading
  5. Enhancing Performance of Level System Modeling with Pseudo-Random Signals
  6. Neural Combinatorial Optimization on Heterogeneous Graphs
  7. Transformer with Tree-order Encoding for Neural Program Generation
  8. Using Complexity Metrics to Assess Silent Reading Fluency
  9. Continuous 3D scanning mode using servomotors instead of stepping motors in dynamic laser triangulation
  10. Development of a quality assurance framework for the open source development model
  11. Managing Business Process in Distributed Systems: Requirements, Models, and Implementation
  12. Entropy-guided feature generation for structured learning of Portuguese dependency parsing
  13. Constructions and Reconstructions. The Architectural Image between Rendering and Photography
  14. Analyzing different types of moderated method effects in confirmatory factor models for structurally different methods
  15. Evaluating OWL 2 reasoners in the context of checking entity-relationship diagrams during software development
  16. The elicitation process in developing of case library for Case-Based Reasoner system whilst consideration for validating electronic communication technologies