Cross-document coreference resolution using latent features

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Over the last years, entity detection approaches which combine named entity recognition and entity linking have been used to detect mentions of RDF resources from a given reference knowledge base in unstructured data. In this paper, we address the problem of assigning a single URI to named entities which stand for the same real-object across documents but are not yet available in the reference knowledge base. This task is known as cross-document co-reference resolution and has been addressed by manifold approaches in the past. We present a preliminary study of a novel take on the task based on the use of latent features derived from matrix factorizations combined with parameter-free graph clustering. We study the influence of different parameters (window size, rank, hardening) on our approach by comparing the F-measures we achieve on the N3 benchmark. Our results suggest that using latent features leads to higher F-measures with an increase of up to 20.5% on datasets of the N3 collection.

Original languageEnglish
Title of host publicationLinked Data for Information Extraction 2014. : Proceedings of the Second International Workshop on Linked Data for Information Extraction (LD4IE 2014), Riva del Garda, Italy, October 20, 2014.
EditorsAnna Lisa Gentile, Ziqi Zhang, Claudia d'Amato, Heiko Paulheim
Number of pages12
Volume1267
PublisherSun Site Central Europe (RWTH Aachen University)
Publication date15.10.2014
Pages33-44
Publication statusPublished - 15.10.2014
Externally publishedYes
Event2nd International Workshop on Linked Data for Information Extraction, LD4IE 2014, Co-located with the 13th International Semantic Web Conference, ISWC 2014 - Riva del Garda, Italy
Duration: 20.10.2014 → …
http://iswc2014.semanticweb.org/index.html

Bibliographical note

European Science Foundation

Recently viewed

Publications

  1. Spatial mislocalization as a consequence of sequential coding of stimuli
  2. An Improved Approach to the Semi-Process-Oriented Implementation of Standardised ERP-Systems
  3. Machine Learning and Knowledge Discovery in Databases
  4. Solving mathematical problems with dynamical sketches
  5. A Review of the Application of Machine Learning and Data Mining Approaches in Continuum Materials Mechanics
  6. Exact and approximate inference for annotating graphs with structural SVMs
  7. Simple saturated relay non-linear PD control for uncertain motion systems with friction and actuator constraint
  8. A fast sequential injection analysis system for the simultaneous determination of ammonia and phosphate
  9. What can conservation strategies learn from the ecosystem services approach?
  10. Cognitive load and instructionally supported learning with provided and learner-generated visualizations
  11. Quantum Computing and the Analog/Digital Distinction
  12. Towards an open question answering architecture
  13. Finding Datasets in Publications: The University of Paderborn Approach
  14. Performance Saga: Interview 01
  15. Dividing Apples and Pears: Towards a Taxonomy for Agile Transformation
  16. Using Conjoint Analysis to Elicit Preferences for Occupational Health Services in Small and Microenterprises
  17. Material flow analysis between dynamic modelling and life cycle assessment
  18. Understanding Low-Code Evolution, Adoption and Ecosystem for Software Development
  19. Application of feedforward artificial neural network in Muskingum flood routing
  20. On Software, or the Persistence of Visual Knowledge.
  21. Artificial Intelligence in Foreign Language Learning and Teaching
  22. Technical concept and evaluation design of the state subsidized project [Level-Q]
  23. Towards a dynamic value network perspective of sustainable business models
  24. Measurement in Machine Vision Editorial Paper
  25. Conceptions of problem solving mathematics teaching
  26. Learner characteristics and information processing in multimedia learning
  27. Modellieren in der Sekundarstufe
  28. The more severe the merrier: Severity of error consequences stimulates learning from error
  29. Plants, Androids and Operators
  30. A matter of connection
  31. The First 50 Contributions to the Data Observer Series - An Overview