N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format

Publication: Contributions to collected editions, Article in conference proceedings, Research, peer-reviewed

Authors

  • Michael Röder
  • Ricardo Usbeck
  • Sebastian Hellmann
  • Daniel Gerber
  • Andreas Both

Extracting Linked Data following the Semantic Web principle from unstructured sources has become a key challenge for scientific research. Named Entity Recognition and Disambiguation are two basic operations in this extraction process. One step towards the realization of the Semantic Web vision and the development of highly accurate tools is the availability of data for validating the quality of processes for Named Entity Recognition and Disambiguation as well as for algorithm tuning. This article presents three novel, manually curated and annotated corpora (N3). All of them are based on a free license and stored in the NLP Interchange Format to leverage the Linked Data character of our datasets.

Original language: English
Title: Proceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014
Editors: Nicoletta Calzolari, Khalid Choukri, Sara Goggi, Thierry Declerck, Joseph Mariani, Bente Maegaard, Asuncion Moreno, Jan Odijk, Helene Mazo, Stelios Piperidis, Hrafn Loftsson
Number of pages: 5
Place of publication: Reykjavik, Iceland
Publisher: European Language Resources Association (ELRA)
Publication date: 05.2014
Pages: 3529-3533
ISBN (electronic): 9782951740884
Publication status: Published - 05.2014
Externally published: Yes
Event: 9th International Conference on Language Resources and Evaluation, LREC 2014 - Reykjavik, Iceland
Duration: 26.05.2014 to 31.05.2014
Conference number: 9
http://www.lrec-conf.org/proceedings/lrec2014/index.html

Bibliographical note

Funding Information:
We thank Luise Erfurth and Didier Cherix for helping us create annotations of the datasets and Jens Lehmann for his feedback. Special thanks go to news.de for allowing us to use their articles. Parts of this work were supported by the ESF and the Free State of Saxony.
