Web-scale extension of RDF knowledge bases from templated websites

Publication: Contributions to collected works > Articles in conference proceedings > Research > Peer-reviewed

Authors

  • Lorenz Bühmann
  • Ricardo Usbeck
  • Axel Cyrille Ngonga Ngomo
  • Muhammad Saleem
  • Andreas Both
  • Valter Crescenzi
  • Paolo Merialdo
  • Disheng Qiu

Only a small fraction of the information on the Web is represented as Linked Data. This lack of coverage is partly due to the paradigms followed so far to extract Linked Data. While converting structured data to RDF is well supported by tools, most approaches to extract RDF from semi-structured data rely on extraction methods based on ad-hoc solutions. In this paper, we present a holistic and open-source framework for the extraction of RDF from templated websites. We discuss the architecture of the framework and the initial implementation of each of its components. In particular, we present a novel wrapper induction technique that does not require any human supervision to detect wrappers for web sites. Our framework also includes a consistency layer with which the data extracted by the wrappers can be checked for logical consistency. We evaluate the initial version of REX on three different datasets. Our results clearly show the potential of using templated Web pages to extend the Linked Data Cloud. Moreover, our results indicate the weaknesses of our current implementations and how they can be extended.

Original language: English
Title: The Semantic Web - ISWC 2014 - 13th International Semantic Web Conference, Proceedings
Editors: Tania Tudorache, Craig Knoblock, Paul Groth, Carole Goble, Chris Welty, Abraham Bernstein, Peter Mika, Denny Vrandečić, Natasha Noy, Krzysztof Janowicz
Number of pages: 16
Publisher: Springer Nature Switzerland AG
Publication date: 2014
Pages: 66-81
ISBN (Print): 978-3-319-11963-2
ISBN (electronic): 978-3-319-11964-9
Publication status: Published - 2014
Externally published: Yes
Event: 13th International Semantic Web Conference, ISWC 2014 - Riva del Garda, Italy
Duration: 19.10.2014 - 23.10.2014
Conference number: 13
https://search.worldcat.org/de/title/semantic-web-iswc-2014-13th-international-semantic-web-conference-riva-del-garda-italy-october-19-23-2014-proceedings-part-i/oclc/941304230

Bibliographical note

Publisher Copyright:
© Springer International Publishing Switzerland 2014.
