Lessons learned — The case of CROCUS: Cluster-based ontology data cleansing

Publication: Contributions to collected works > Articles in conference proceedings > Research > peer-reviewed

Authors

Over the past years, a vast number of datasets have been published based on Semantic Web standards, which provides an opportunity for creating novel industrial applications. However, industrial requirements on data quality are high, while the time to market and the costs of data preparation have to be kept low. Unfortunately, many Linked Data sources are error-prone, which prevents their direct use in production systems. Hence, (semi-)automatic quality assurance processes are needed, as manual ontology repair by domain experts is expensive and time-consuming. In this article, we present CROCUS, a pipeline for cluster-based ontology data cleansing. Our system provides a semi-automatic approach for instance-level error detection in ontologies that is agnostic of the underlying Linked Data knowledge base and works at very low cost. CROCUS has been evaluated on two datasets. The experiments show that we are able to detect errors with high recall. Furthermore, we provide an exhaustive review of related work as well as a number of lessons learned.

Original language: English
Title of host publication: The Semantic Web: ESWC 2014 Satellite Events: ESWC 2014 Satellite Events, Anissaras, Crete, Greece, May 25-29, 2014
Editors: Anna Tordai, Eva Blomqvist, Harald Sack, Raphaël Troncy, Valentina Presutti, Ioannis Papadakis
Number of pages: 11
Publisher: Springer Nature Switzerland AG
Publication date: 2014
Pages: 14-24
ISBN (print): 978-3-319-11954-0
ISBN (electronic): 978-3-319-11955-7
Publication status: Published - 2014
Externally published: Yes
Event: 11th European Semantic Web Symposium on Satellite Events, ESWC 2014 - Ouro Preto, Brazil
Duration: 20.10.2014 - 22.10.2014
https://2014.eswc-conferences.org/index.html

Bibliographical note

Funding Information:
This work has been partly supported by the ESF and the Free State of Saxony and by grants from the European Union’s 7th Framework Programme provided for the project GeoKnow (GA no. 318159). Sincere thanks to Christiane Lemke

Publisher Copyright:
© Springer International Publishing Switzerland 2014.
