Lessons learned — The case of CROCUS: Cluster-based ontology data cleansing

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

Over the past years, a vast number of datasets have been published based on Semantic Web standards, which provides an opportunity for creating novel industrial applications. However, industrial requirements on data quality are high while the time to market as well as the required costs for data preparation have to be kept low. Unfortunately, many Linked Data sources are error-prone which prevents their direct use in productive systems. Hence, (semi-)automatic quality assurance processes are needed as manual ontology repair procedures by domain experts are expensive and time consuming. In this article, we present CROCUS – a pipeline for cluster-based ontology data cleansing. Our system provides a semi-automatic approach for instance-level error detection in ontologies which is agnostic of the underlying Linked Data knowledge base and works at very low costs. CROCUS has been evaluated on two datasets. The experiments show that we are able to detect errors with high recall. Furthermore, we provide an exhaustive related work as well as a number of lessons learned.

OriginalspracheEnglisch
TitelThe Semantic Web: ESWC 2014 Satellite Events : ESWC 2014 Satellite Events, Anissaras, Crete, Greece, May 25-29, 2014
HerausgeberAnna Tordai, Eva Blomqvist, Harald Sack, Raphaël Troncy, Valentina Presutti, Ioannis Papadakis
Anzahl der Seiten11
VerlagSpringer Nature Switzerland AG
Erscheinungsdatum2014
Seiten14-24
ISBN (Print)978-3-319-11954-0
ISBN (elektronisch)978-3-319-11955-7
DOIs
PublikationsstatusErschienen - 2014
Extern publiziertJa
Veranstaltung11th European Semantic Web Symposium on Satellite Events, ESWC 2014 - Ouro Preto, Brasilien
Dauer: 20.10.201422.10.2014
https://2014.eswc-conferences.org/index.html

Bibliographische Notiz

Funding Information:
This work has been partly supported by the ESF and the Free State of Saxony and by grants from the European Union’s 7th Framework Programme provided for the project GeoKnow (GA no. 318159). Sincere thanks to Christiane Lemke

Publisher Copyright:
© Springer International Publishing Switzerland 2014.

DOI

Zuletzt angesehen

Publikationen

  1. Separating Cognitive and Content Domains in Mathematical Competence
  2. Is a severe clinical profile an effect modifier in a web-based depression treatment for adults with type 1 or type 2 diabetes ?
  3. Schematism, Imagination, and Pure Intuition in Kant
  4. Degradation of β-blockers in hospital wastewater by means of ozonation and Fe2+/ozonation
  5. Simulation wilder Spekulationen. Oder: Wie einmal Paul Baran mit einem falschen Modell das Internet erfand
  6. Working time flexibility and work-life balance
  7. Discovering Workscapes
  8. A Systematic Review and Meta-analysis of the Impact of Mindfulness-Based Interventions on the Well-Being of Healthcare Professionals
  9. Crossing borders
  10. Symbole unserer Zeit und ihre formierenden Elemente
  11. Developing sustainable business experimentation capability – A case study
  12. Cyberspace Battleground
  13. Spurred Emulation
  14. Testing Cort-Fitness and Cort-Adaptation hypotheses in a habitat suitability gradient for roe deer
  15. Serum Bactericidal Activity of Piperacillin/Tazobactam against Staphylococcus Aureus, Piperacillin-Susceptible and Piperacillin-Resistant Escherichia Coli and Pseudomonas Aeruginosa
  16. Alltag des Weisen
  17. Managing Stress During Long-Term Internships
  18. Victor Man
  19. Nordic game subcultures
  20. Ludus non tollit abusum
  21. Arbeit am Bild
  22. Martin Arnold und die Fortschreibung der Avantgarden
  23. Innovative Popular Science Communication? Materiality, Aesthetics and Gender in Science Slams
  24. A Hybrid Extended Kalman Filter as an Observer for a Pot-Electro-Magnetic Actuator
  25. „Das scheint mir das Widersprüchliche in der Kybernetik zu sein: Im gleichen Atemzug mit einer De-Anthropologisierung wird ein neuer Humanismus gefeiert.“
  26. Biopolitik in diskursiven Designs