CETUS – a baseline approach to type extraction

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

The concurrent growth of the Document Web and the Data Web demands accurate information extraction tools to bridge the gap between the two. In particular, the extraction of knowledge on real-world entities is indispensable to populate knowledge bases on theWeb of Data. Here, we focus on the recognition of types for entities to populate knowledge bases and enable subsequent knowledge extraction steps.We present CETUS, a baseline approach to entity type extraction. CETUS is based on a three-step pipeline comprising (i) offline, knowledge-driven type pattern extraction from natural-language corpora based on grammar-rules,(ii) an analysis of input text to extract types and (iii) the mapping of the extracted type evidence to a subset of the DOLCE+DnS Ultra Lite ontology classes. We implement and compare two approaches for the third step using the YAGO ontology as well as the FOX entity recognition tool.

Original languageEnglish
Title of host publicationSemantic Web Evaluation Challenges - SemWebEval, ESWC 2015, Revised Selected Papers
EditorsMilan Stankovic, Fabien Gandon, Elena Cabrio, Antoine Zimmermann
Number of pages12
PublisherSpringer International Publishing
Publication date2015
Pages16-27
ISBN (print)978-3-319-25517-0
ISBN (electronic)978-3-319-25518-7
DOIs
Publication statusPublished - 2015
Externally publishedYes
Event12th European Semantic Web Conference - ESWC 2015 - Portoroz, Slovenia
Duration: 31.05.201504.06.2015
Conference number: 12
https://2015.eswc-conferences.org/index.html
https://2015.eswc-conferences.org/call-challenges.html

Bibliographical note

This work has been supported by the FP7 project GeoKnow (GA No. 318159) and the BMWI Project SAKE (Project No. 01MD15006E).

Publisher Copyright:
©Springer International Publishing Switzerland 2015

Recently viewed

Publications

  1. How to Do Materialistic Dialectics with Words?
  2. Foreign bias in institutional portfolio allocation
  3. How can problems be turned into something good? The role of entrepreneurial learning and error mastery orientation
  4. Transformation products in the water cycle and the unsolved problem of their proactive assessment
  5. The case of the composite Higgs
  6. Integration of laboratory experiments into introductory electrical engineering courses
  7. Conceptualizing community in energy systems
  8. Circular Scanning Resolution Improvement by its Velocity Close Loop Control
  9. Understanding and managing post-acquisition integration as change process
  10. Priority Rule-based Planning Approaches for Regeneration Processes
  11. Abjection and Formlessness
  12. Facilitating collaborative processes in transdisciplinary research using design prototyping
  13. A Multilab Replication of the Ego Depletion Effect
  14. Authority and Authorship
  15. Introduction
  16. Identification and Root Cause Mapping of Supply Chain Collaboration Resistors
  17. Self-perceived quality of life predicts mortality risk better than a multi-biomarker panel, but the combination of both does best
  18. Generalized self-efficacy as a mediator and moderator between control and complexity at work and personal initiative
  19. Complexity and Administrative Intensity
  20. Indicator model of students' writing skills (IMOSS)
  21. Shared mobility business models
  22. How cognitive issue bracketing affects interdependent decision-making in negotiations
  23. Subsistence and substitutability in consumer preferences
  24. Anonymized Firm Data under Test: Evidence from a Replication Study
  25. Learning to collaborate from diverse interactions in project-based sustainability courses
  26. Project-Mentoring in Engineering Education - a competence-oriented teaching and learning approach
  27. Digital identity building:
  28. An Off-the-shelf Approach to Authorship Attribution
  29. A group-level theory of helping and altruism within and across group boundaries
  30. Problems in Mathematizing Systems Biology
  31. Multitrait-Multimethod Analysis
  32. Guest editorial
  33. Sustainable Development
  34. SoilTemp: A global database of near-surface temperature