Lessons learned — The case of CROCUS: Cluster-based ontology data cleansing
Publikation: Beiträge in Sammelwerken › Aufsätze in Konferenzbänden › Forschung › begutachtet
Authors
Over the past years, a vast number of datasets have been published based on Semantic Web standards, which provides an opportunity for creating novel industrial applications. However, industrial requirements on data quality are high while the time to market as well as the required costs for data preparation have to be kept low. Unfortunately, many Linked Data sources are error-prone which prevents their direct use in productive systems. Hence, (semi-)automatic quality assurance processes are needed as manual ontology repair procedures by domain experts are expensive and time consuming. In this article, we present CROCUS – a pipeline for cluster-based ontology data cleansing. Our system provides a semi-automatic approach for instance-level error detection in ontologies which is agnostic of the underlying Linked Data knowledge base and works at very low costs. CROCUS has been evaluated on two datasets. The experiments show that we are able to detect errors with high recall. Furthermore, we provide an exhaustive related work as well as a number of lessons learned.
Originalsprache | Englisch |
---|---|
Titel | The Semantic Web: ESWC 2014 Satellite Events : ESWC 2014 Satellite Events, Anissaras, Crete, Greece, May 25-29, 2014 |
Herausgeber | Anna Tordai, Eva Blomqvist, Harald Sack, Raphaël Troncy, Valentina Presutti, Ioannis Papadakis |
Anzahl der Seiten | 11 |
Verlag | Springer Nature Switzerland AG |
Erscheinungsdatum | 2014 |
Seiten | 14-24 |
ISBN (Print) | 978-3-319-11954-0 |
ISBN (elektronisch) | 978-3-319-11955-7 |
DOIs | |
Publikationsstatus | Erschienen - 2014 |
Extern publiziert | Ja |
Veranstaltung | 11th European Semantic Web Symposium on Satellite Events, ESWC 2014 - Ouro Preto, Brasilien Dauer: 20.10.2014 → 22.10.2014 https://2014.eswc-conferences.org/index.html |
Bibliographische Notiz
Funding Information:
This work has been partly supported by the ESF and the Free State of Saxony and by grants from the European Union’s 7th Framework Programme provided for the project GeoKnow (GA no. 318159). Sincere thanks to Christiane Lemke
Publisher Copyright:
© Springer International Publishing Switzerland 2014.
- Informatik
- Wirtschaftsinformatik