Cross-document coreference resolution using latent features

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Over the last years, entity detection approaches which combine named entity recognition and entity linking have been used to detect mentions of RDF resources from a given reference knowledge base in unstructured data. In this paper, we address the problem of assigning a single URI to named entities which stand for the same real-object across documents but are not yet available in the reference knowledge base. This task is known as cross-document co-reference resolution and has been addressed by manifold approaches in the past. We present a preliminary study of a novel take on the task based on the use of latent features derived from matrix factorizations combined with parameter-free graph clustering. We study the influence of different parameters (window size, rank, hardening) on our approach by comparing the F-measures we achieve on the N3 benchmark. Our results suggest that using latent features leads to higher F-measures with an increase of up to 20.5% on datasets of the N3 collection.

Original languageEnglish
Title of host publicationLinked Data for Information Extraction 2014. : Proceedings of the Second International Workshop on Linked Data for Information Extraction (LD4IE 2014), Riva del Garda, Italy, October 20, 2014.
EditorsAnna Lisa Gentile, Ziqi Zhang, Claudia d'Amato, Heiko Paulheim
Number of pages12
Volume1267
PublisherSun Site Central Europe (RWTH Aachen University)
Publication date15.10.2014
Pages33-44
Publication statusPublished - 15.10.2014
Externally publishedYes
Event2nd International Workshop on Linked Data for Information Extraction, LD4IE 2014, Co-located with the 13th International Semantic Web Conference, ISWC 2014 - Riva del Garda, Italy
Duration: 20.10.2014 → …
http://iswc2014.semanticweb.org/index.html

Bibliographical note

European Science Foundation

Recently viewed

Publications

  1. A coding scheme to analyse global text processing in computer supported collaborative learning: What eye movements can tell us
  2. A Wavelet Packet Tree Denoising Algorithm for Images of Atomic-Force Microscopy
  3. A transfer operator based computational study of mixing processes in open flow systems
  4. Integrating Mobile Devices into AAL-Environments using Knowledge based Assistance Systems
  5. Analyzing different types of moderated method effects in confirmatory factor models for structurally different methods
  6. A Python toolbox for the numerical solution of the Maxey-Riley equation
  7. Automatic enumeration of all connected subgraphs.
  8. The Use of Genetic Algorithm for PID Controller Auto-Tuning in ARM CORTEX M4 Platform
  9. Methodologies for Noise and Gross Error Detection using Univariate Signal-Based Approaches in Industrial Application
  10. Spatial mislocalization as a consequence of sequential coding of stimuli
  11. Comparing Two Voltage Observers in a Sensorsystem using Repetitive Control
  12. Binary Random Nets I
  13. Modeling Effective and Ineffective Knowledge Communication and Learning Discourses in CSCL with Hidden Markov Models
  14. Evolutionary generation of dispatching rule sets for complex dynamic scheduling problems
  15. Using complexity metrics with R-R intervals and BPM heart rate measures
  16. Algebraic combinatorics in mathematical chemistry. Methods and algorithms. I. Permutation groups and coherent (cellular) algebras.
  17. Authenticity and authentication in language learning
  18. Ant colony optimization algorithm and artificial immune system applied to a robot route
  19. Detection and mapping of water pollution variation in the Nile Delta using multivariate clustering and GIS techniques
  20. Knowledge Graph Question Answering Using Graph-Pattern Isomorphism
  21. Supervised clustering of streaming data for email batch detection
  22. Data-Generating Mechanisms Versus Constructively Defined Latent Variables in Multitrait–Multimethod Analysis:
  23. Multidimensional Cross-Recurrence Quantification Analysis (MdCRQA)–A Method for Quantifying Correlation between Multivariate Time-Series
  24. Modified dynamic programming approach for offline segmentation of long hydrometeorological time series
  25. Development of a Didactic Graphical Simulation Interface on MATLAB for Systems Control
  26. Graph Conditional Variational Models: Too Complex for Multiagent Trajectories?
  27. A geometric algorithm for the output functional controllability in general manipulation systems and mechanisms
  28. Random measurement and prediction errors limit the practical relevance of two velocity sensors to estimate the 1RM back squat
  29. Contributions of declarative and procedural memory to accuracy and automatization during second language practice
  30. Using learning protocols for knowledge acquisition and problem solving with individual and group incentives
  31. Analysis of Complexity Reduction in Kalman Filters Through Decoupling Control With Chattered Inputs in PMSM
  32. Towards a Dynamic Interpretation of Subjective and Objective Values
  33. Discourse Analyses in Chat-based CSCL with Learning Protocols
  34. Modeling precipitation kinetics for multi-phase and multi-component systems using particle size distributions via a moving grid technique
  35. Substructure, subgraph, and walk counts as measures of the complexity of graphs and molecules.
  36. Homogenization modeling of thin-layer-type microstructures
  37. A Quadrant Approach of Camera Calibration Method for Depth Estimation Using a Stereo Vision System
  38. Multidimensional recurrence quantification analysis (MdRQA) for the analysis of multidimensional time-series