N3 - A collection of datasets for named entity recognition and disambiguation in the NLP interchange format

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

  • Michael Röder
  • Ricardo Usbeck
  • Sebastian Hellmann
  • Daniel Gerber
  • Andreas Both

Extracting Linked Data following the Semantic Web principle from unstructured sources has become a key challenge for scientific research. Named Entity Recognition and Disambiguation are two basic operations in this extraction process. One step towards the realization of the Semantic Web vision and the development of highly accurate tools is the availability of data for validating the quality of processes for Named Entity Recognition and Disambiguation as well as for algorithm tuning. This article presents three novel, manually curated and annotated corpora (N3). All of them are based on a free license and stored in the NLP Interchange Format to leverage the Linked Data character of our datasets.

OriginalspracheEnglisch
TitelProceedings of the 9th International Conference on Language Resources and Evaluation, LREC 2014
HerausgeberNicoletta Calzolari, Khalid Choukri, Sara Goggi, Thierry Declerck, Joseph Mariani, Bente Maegaard, Asuncion Moreno, Jan Odijk, Helene Mazo, Stelios Piperidis, Hrafn Loftsson
Anzahl der Seiten5
ErscheinungsortReykjavik, Iceland
VerlagEuropean Language Resources Association (ELRA)
Erscheinungsdatum05.2014
Seiten3529-3533
ISBN (elektronisch)9782951740884
PublikationsstatusErschienen - 05.2014
Extern publiziertJa
Veranstaltung9th International Conference on Language Resources and Evaluation, LREC 2014 - Reykjavik, Island
Dauer: 26.05.201431.05.2014
Konferenznummer: 9
http://www.lrec-conf.org/proceedings/lrec2014/index.html

Bibliographische Notiz

Funding Information:
We thank Luise Erfurth and Didier Cherix for helping us creating annotations of the datasets and Jens Lehmann for his feedback. A special thanks goes to news.de for allowing us to use their articles. Parts of this work were supported by the ESF and the Free State of Saxony.

Links

Zuletzt angesehen

Aktivitäten

  1. The Expert in the Loop: Developing a Provenance Linked Open Data Management Platform
  2. Discerning aspects of memory: The ethics of memory in (post)global and transnational contexts
  3. Temporary Organizing and Organizing Trmporality: On the Multilayered Architecture of Accelerators
  4. Is there a threshold effect of time headway on subjective variables for different velocities?
  5. Maximum-Likelihood-Based Panel Cointegration Testing
  6. A conceptual framework on users' digitalisation practices transforming their digital infrastructure for work
  7. Architecture of Computing Systems - ARCS2008
  8. Organizing temporality: A practice perspective on the multilayered architecture of accelerators
  9. Maximum-Likelihood-Based Panel Cointegration Test with Linear Time Trend and Fisher Hypothesis
  10. Learning Shortest Paths for Word Graphs
  11. Note-taking while Working on Mathematical Modelling Tasks
  12. Unit Root & Cointegration Testing Conference 2005
  13. Maximum-Likelihood-Based Panel Cointegration Test with Linear Time Trend
  14. Applied Econometrics with Stata for PhD Students
  15. Spas in the New Länder: A Transformation with an Uncertain Outcome
  16. Is there a threshold effect of time headway on subjective variables for different velocities?
  17. A Mixed Methods Longitudinal Design Study On Learning Results In An Innovative Study Model - First Qualitative Results In HESD
  18. Tilling the fields of knowledge in sustainability-oriented science
  19. Effects of enhanced visual feedback on postural control in static and dynamic conditions.
  20. Coauthoring an interorganizational collaboration: Exploring multi-voicedness and introducing spatiotemporal orientations
  21. Towards an Undercommons (Eco)Logistics?
  22. Navigating in the Digital Jungle: Articulating Combinatory Affordances of Digital Infrastructures for Collaboration

Publikationen

  1. A coding scheme to analyse global text processing in computer supported collaborative learning: What eye movements can tell us
  2. Optimal regulation for dynamic hybrid systems based on dynamic programming in the case of an intelligent vehicle drive assistant
  3. Cross-document coreference resolution using latent features
  4. The Scalable Question Answering Over Linked Data (SQA) Challenge 2018
  5. Emergency detection based on probabilistic modeling in AAL-environments
  6. Applied quality assurance methods under the open source development model
  7. Eliciting Learner Perceptions of Web 2.0 Tasks through Mixed-Methods Classroom Research
  8. Evaluating entity annotators using GERBIL
  9. Modeling of Logistic Processes in Assembly Areas
  10. The role of spatial ability in learning from instructional animations - Evidence for an ability-as-compensator hypothesis
  11. Measuring cognitive load with subjective rating scales during problem solving
  12. A Service-oriented Search framework for full text, geospatial and semantic search
  13. Real-time RDF extraction from unstructured data streams
  14. OKBQA framework towards an open collaboration for development of natural language question-answering systems over knowledge bases
  15. Simulation based optimization of lot sizes for opposing logistic objectives
  16. 7th open challenge on question answering over linked data (QALD-7)
  17. Evaluation of standard ERP software implementation approaches in terms of their capability for business process optimization
  18. Using transition management concepts for the evaluation of intersecting policy domains ('grand challenges')
  19. Dynamic environment modelling and prediction for autonomous systems
  20. Optimising business performance with standard software systems
  21. Learning how to request using textbooks
  22. Concepts
  23. A New Approach for Optimal Solving Cyclic and Non-Cyclic Bus Drvier Rostering Problems
  24. Value Structure and Dimensions
  25. Web-scale extension of RDF knowledge bases from templated websites
  26. Topic selection and development in learner-native speaker voice-based telecollaborative discourse