Lessons learned — The case of CROCUS: Cluster-based ontology data cleansing

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

Over the past years, a vast number of datasets have been published based on Semantic Web standards, which provides an opportunity for creating novel industrial applications. However, industrial requirements on data quality are high while the time to market as well as the required costs for data preparation have to be kept low. Unfortunately, many Linked Data sources are error-prone which prevents their direct use in productive systems. Hence, (semi-)automatic quality assurance processes are needed as manual ontology repair procedures by domain experts are expensive and time consuming. In this article, we present CROCUS – a pipeline for cluster-based ontology data cleansing. Our system provides a semi-automatic approach for instance-level error detection in ontologies which is agnostic of the underlying Linked Data knowledge base and works at very low costs. CROCUS has been evaluated on two datasets. The experiments show that we are able to detect errors with high recall. Furthermore, we provide an exhaustive related work as well as a number of lessons learned.

Original languageEnglish
Title of host publicationThe Semantic Web: ESWC 2014 Satellite Events : ESWC 2014 Satellite Events, Anissaras, Crete, Greece, May 25-29, 2014
EditorsAnna Tordai, Eva Blomqvist, Harald Sack, Raphaël Troncy, Valentina Presutti, Ioannis Papadakis
Number of pages11
PublisherSpringer Nature Switzerland AG
Publication date2014
Pages14-24
ISBN (print)978-3-319-11954-0
ISBN (electronic)978-3-319-11955-7
DOIs
Publication statusPublished - 2014
Externally publishedYes
Event11th European Semantic Web Symposium on Satellite Events, ESWC 2014 - Ouro Preto, Brazil
Duration: 20.10.201422.10.2014
https://2014.eswc-conferences.org/index.html

Bibliographical note

This work has been partly supported by the ESF and the Free State of Saxony and by grants from the European Union’s 7th Framework Programme provided for the project GeoKnow (GA no. 318159). Sincere thanks to Christiane Lemke

Publisher Copyright:
© Springer International Publishing Switzerland 2014.

Recently viewed

Publications

  1. Dialogic interactions in higher vocational learning environments in mainland China
  2. Science-Related Outcomes
  3. Does cognitive load moderate the seductive details effect? A multimedia study
  4. Mobilität
  5. The Measurement of Grip-Strength in Automobiles
  6. Determinants and consequences of clawback provisions in management compensation contracts
  7. Giving is a question of time: response times and contributions to an environmental public good
  8. Complex Trait-Treatment-Interaction analysis
  9. Datenkritik
  10. Do You Like What You (Can't) See? The Differential Effects of Hardware and Software Upgrades on High-Tech Product Evaluations
  11. Introduction to Kant's Anthropology
  12. What has gone wrong with application development? Who is the culprit?
  13. Teaching pragmatic competence with corpora: Intensification in expressions of gratitude across varieties
  14. Introduction
  15. “Smart is not smart enough!” Anticipating critical raw material use in smart city concepts
  16. Study Protocol
  17. Schreibt Ihr Unternehmen auch "grüne" Zahlen?
  18. Mindfulness as self-confirmation? An exploratory intervention study on potentials and limitations of mindfulness-based interventions in the context of environmental and sustainability education
  19. Multilevel Water Governance and Problems of Scale
  20. Resisting alignment
  21. Predicting the future performance of soccer players
  22. Measuring the diversity of what? And for what purpose?
  23. Repräsentative Wahlstatistik
  24. Demographic Transition in Rural Areas: The Relationship between Public Services and Tourism Development
  25. Affect, stress, and health
  26. Thermal analysis of wire-based direct energy deposition of Al-Mg using different laser irradiances
  27. Statement
  28. Information seeking about tool properties in great apes
  29. Energy transitions in small-scale regions – What we can learn from a regional innovation systems perspective.
  30. Assessment of model uncertainty during the river export modelling of pesticides and transformation products
  31. Stir bar sorptive extraction and high-performance liquid chromatography-fluorescence detection for the determination of polycyclic aromatic hydrocarbons in Mate teas
  32. Rezension von Jutta Ecarius
  33. Home and fear
  34. Le vertige des sens

Press / Media

  1. Von Nachbarschaften