CETUS – a baseline approach to type extraction

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

The concurrent growth of the Document Web and the Data Web demands accurate information extraction tools to bridge the gap between the two. In particular, the extraction of knowledge on real-world entities is indispensable to populate knowledge bases on theWeb of Data. Here, we focus on the recognition of types for entities to populate knowledge bases and enable subsequent knowledge extraction steps.We present CETUS, a baseline approach to entity type extraction. CETUS is based on a three-step pipeline comprising (i) offline, knowledge-driven type pattern extraction from natural-language corpora based on grammar-rules,(ii) an analysis of input text to extract types and (iii) the mapping of the extracted type evidence to a subset of the DOLCE+DnS Ultra Lite ontology classes. We implement and compare two approaches for the third step using the YAGO ontology as well as the FOX entity recognition tool.

OriginalspracheEnglisch
TitelSemantic Web Evaluation Challenges - SemWebEval, ESWC 2015, Revised Selected Papers
HerausgeberMilan Stankovic, Fabien Gandon, Elena Cabrio, Antoine Zimmermann
Anzahl der Seiten12
VerlagSpringer International Publishing AG
Erscheinungsdatum2015
Seiten16-27
ISBN (Print)978-3-319-25517-0
ISBN (elektronisch)978-3-319-25518-7
DOIs
PublikationsstatusErschienen - 2015
Extern publiziertJa
Veranstaltung12th European Semantic Web Conference - ESWC 2015 - Portoroz, Slowenien
Dauer: 31.05.201504.06.2015
Konferenznummer: 12
https://2015.eswc-conferences.org/index.html
https://2015.eswc-conferences.org/call-challenges.html

Bibliographische Notiz

Funding Information:
This work has been supported by the FP7 project GeoKnow (GA No. 318159) and the BMWI Project SAKE (Project No. 01MD15006E).

Publisher Copyright:
©Springer International Publishing Switzerland 2015

DOI