Cross-document coreference resolution using latent features

Publication: Contributions to collected editions › Articles in conference proceedings › Research › Peer-reviewed

Authors

Over the past few years, entity detection approaches that combine named entity recognition and entity linking have been used to detect mentions of RDF resources from a given reference knowledge base in unstructured data. In this paper, we address the problem of assigning a single URI to named entities that stand for the same real-world object across documents but are not yet available in the reference knowledge base. This task is known as cross-document coreference resolution and has been addressed by numerous approaches in the past. We present a preliminary study of a novel take on the task, based on latent features derived from matrix factorizations combined with parameter-free graph clustering. We study the influence of different parameters (window size, rank, hardening) on our approach by comparing the F-measures we achieve on the N3 benchmark. Our results suggest that using latent features leads to higher F-measures, with an increase of up to 20.5% on datasets of the N3 collection.
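The pipeline named in the abstract (context window, low-rank factorization, graph clustering) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: a truncated SVD stands in for the matrix factorization, a cosine-similarity threshold followed by connected components stands in for the parameter-free graph clustering, the hardening step is not modelled, and the function names `context_matrix`, `latent_features`, and `cluster` are hypothetical.

```python
import numpy as np


def context_matrix(docs, mentions, window):
    """Count the tokens within +/- `window` positions of each mention.

    docs     : list of token lists
    mentions : list of (doc_index, token_index) pairs
    Returns a (num_mentions x vocabulary_size) count matrix.
    """
    vocab = sorted({tok for doc in docs for tok in doc})
    col = {tok: j for j, tok in enumerate(vocab)}
    X = np.zeros((len(mentions), len(vocab)))
    for i, (d, pos) in enumerate(mentions):
        doc = docs[d]
        lo, hi = max(0, pos - window), min(len(doc), pos + window + 1)
        for k in range(lo, hi):
            if k != pos:  # skip the mention token itself
                X[i, col[doc[k]]] += 1.0
    return X


def latent_features(X, rank):
    """Project each mention onto the top `rank` singular directions (truncated SVD)."""
    U, s, _ = np.linalg.svd(X, full_matrices=False)
    return U[:, :rank] * s[:rank]


def cluster(features, threshold):
    """Link mentions whose cosine similarity is >= threshold, then return
    one connected-component label per mention (union-find)."""
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.where(norms == 0, 1.0, norms)  # avoid division by zero
    sim = unit @ unit.T
    n = len(features)
    parent = list(range(n))

    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]  # path halving
            a = parent[a]
        return a

    for i in range(n):
        for j in range(i + 1, n):
            if sim[i, j] >= threshold:
                parent[find(i)] = find(j)
    return [find(i) for i in range(n)]


if __name__ == "__main__":
    # Toy data: three documents, each starting with the surface form "paris".
    docs = [
        ["paris", "capital", "france", "seine"],
        ["paris", "france", "capital", "louvre"],
        ["paris", "hilton", "hotel", "heiress"],
    ]
    mentions = [(0, 0), (1, 0), (2, 0)]
    X = context_matrix(docs, mentions, window=2)
    print(cluster(latent_features(X, rank=2), threshold=0.8))
```

In this toy setting, the window size controls how much context each mention contributes and the rank controls how many latent dimensions survive the factorization, which mirrors two of the three parameters the abstract lists. The similarity threshold is an artifact of the stand-in clustering and has no counterpart in a parameter-free method.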

Original language: English
Title: Linked Data for Information Extraction 2014: Proceedings of the Second International Workshop on Linked Data for Information Extraction (LD4IE 2014), Riva del Garda, Italy, October 20, 2014
Editors: Anna Lisa Gentile, Ziqi Zhang, Claudia d'Amato, Heiko Paulheim
Number of pages: 12
Volume: 1267
Publisher: Sun Site Central Europe (RWTH Aachen University)
Publication date: 15.10.2014
Pages: 33-44
Publication status: Published - 15.10.2014
Externally published: Yes
Event: 2nd International Workshop on Linked Data for Information Extraction, LD4IE 2014, co-located with the 13th International Semantic Web Conference, ISWC 2014 - Riva del Garda, Italy
Duration: 20.10.2014 → …
http://iswc2014.semanticweb.org/index.html
