Cross-document coreference resolution using latent features

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Over the last years, entity detection approaches which combine named entity recognition and entity linking have been used to detect mentions of RDF resources from a given reference knowledge base in unstructured data. In this paper, we address the problem of assigning a single URI to named entities which stand for the same real-object across documents but are not yet available in the reference knowledge base. This task is known as cross-document co-reference resolution and has been addressed by manifold approaches in the past. We present a preliminary study of a novel take on the task based on the use of latent features derived from matrix factorizations combined with parameter-free graph clustering. We study the influence of different parameters (window size, rank, hardening) on our approach by comparing the F-measures we achieve on the N3 benchmark. Our results suggest that using latent features leads to higher F-measures with an increase of up to 20.5% on datasets of the N3 collection.

Original languageEnglish
Title of host publicationLinked Data for Information Extraction 2014. : Proceedings of the Second International Workshop on Linked Data for Information Extraction (LD4IE 2014), Riva del Garda, Italy, October 20, 2014.
EditorsAnna Lisa Gentile, Ziqi Zhang, Claudia d'Amato, Heiko Paulheim
Number of pages12
Volume1267
PublisherSun Site Central Europe (RWTH Aachen University)
Publication date15.10.2014
Pages33-44
Publication statusPublished - 15.10.2014
Externally publishedYes
Event2nd International Workshop on Linked Data for Information Extraction, LD4IE 2014, Co-located with the 13th International Semantic Web Conference, ISWC 2014 - Riva del Garda, Italy
Duration: 20.10.2014 → …
http://iswc2014.semanticweb.org/index.html

Bibliographical note

European Science Foundation

Recently viewed

Publications

  1. Methodologies for Noise and Gross Error Detection using Univariate Signal-Based Approaches in Industrial Application
  2. A lyapunov approach in the derivative approximation using a dynamic system
  3. Scaffolding argumentation in mathematics with CSCL scripts
  4. Supporting the Development and Implementation of a Digitalization Strategy in SMEs through a Lightweight Architecture-based Method
  5. Impulsive Feedback Linearization for Decoupling of a Constant Disturbance with Low Relative Degree to Control Maglev Systems
  6. Control Allocation and Controller Tuning for an Over-Actuated Hexacopter Tilt-Rotor Applied for Precision Agriculture
  7. A Class of Simple Stochastic Online Bin Packing Algorithms
  8. Covert and overt automatic imitation are correlated
  9. Exploring large vegetation databases to detect temporal trends in species occurrences
  10. An integrative research framework for enabling transformative adaptation
  11. Trait-based approaches to analyze links between the drivers of change and ecosystem services
  12. Introduction to the special issue
  13. Spectral Early-Warning Signals for Sudden Changes in Time-Dependent Flow Patterns
  14. Introduction
  15. A latent state-trait analysis of current achievement motivation across different tasks of cognitive ability
  16. PI Control Applied to a Small-Scale Thermal System with Heating and Cooling Sources
  17. Big Data - Characterizing an Emerging Research Field using Topic Models
  18. The role of task meaning on output in groups
  19. Analysis of the relevance of models, influencing factors and the point in time of the forecast on the prediction quality in order-related delivery time determination using machine learning
  20. Quality Assurance of Specification - The Users Point of View
  21. BUSINESS MODELS IN BANKING: A CLUSTER ANALYSIS USING ARCHIVAL DATA
  22. A community of shared values? Dimensions and dynamics of cultural integration in the European Union
  23. Mapping industrial patterns in spatial agglomeration
  24. Framework for empirical research on science teaching and learning
  25. Improving the representation of smallholder farmers’ adaptive behaviour in agent-based models
  26. The Routledge Handbook of Pragmatics
  27. Creative Network Communities in the Translocal Space of Digital Networks
  28. Depression-specific Costs and their Factors based on SHI Routine data
  29. The Meaning of Higher-Order Factors in Reflective-Measurement Models
  30. Depoliticising EU migration policies
  31. Welcome to the Glitch and Make Some Noise: Understanding Media through Audio Hacking
  32. 3DMIN – Challenges and Interventions in Design, Development and Dissemination of New Musical Instruments.
  33. Toward Automatically Labeling Situations in Soccer
  34. The Legitimization of Ethically Questionable Business Practices via Self-Disclosure in Social Media
  35. A synthesis of atmospheric mercury depletion event chemistry in the atmosphere and snow
  36. Towards a Deconstruction of the Screen