Web-scale extension of RDF knowledge bases from templated websites

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

  • Lorenz Bühmann
  • Ricardo Usbeck
  • Axel Cyrille Ngonga Ngomo
  • Muhammad Saleem
  • Andreas Both
  • Valter Crescenzi
  • Paolo Merialdo
  • Disheng Qiu

Only a small fraction of the information on the Web is represented as Linked Data. This lack of coverage is partly due to the paradigms followed so far to extract Linked Data.While converting structured data to RDF is well supported by tools, most approaches to extract RDF from semi-structured data rely on extraction methods based on ad-hoc solutions. In this paper, we present a holistic and open-source framework for the extraction of RDF from templated websites. We discuss the architecture of the framework and the initial implementation of each of its components. In particular, we present a novel wrapper induction technique that does not require any human supervision to detect wrappers for web sites. Our framework also includes a consistency layer with which the data extracted by the wrappers can be checked for logical consistency. We evaluate the initial version of REX on three different datasets. Our results clearly show the potential of using templated Web pages to extend the Linked Data Cloud. Moreover, our results indicate the weaknesses of our current implementations and how they can be extended.

OriginalspracheEnglisch
TitelThe SemanticWeb - ISWC 2014 - 13th International SemanticWeb Conference, Proceedings
HerausgeberTania Tudorache, Craig Knoblock, Paul Groth, Carole Goble, Chris Welty, Abraham Bernstein, Peter Mika, Denny Vrandečić, Natasha Noy, Krzysztof Janowicz
Anzahl der Seiten16
VerlagSpringer Nature Switzerland AG
Erscheinungsdatum2014
Seiten66-81
ISBN (Print)978-3-319-11963-2
ISBN (elektronisch)978-3-319-11964-9
DOIs
PublikationsstatusErschienen - 2014
Extern publiziertJa
Veranstaltung13th International Semantic Web Conference, ISWC 2014 - Riva del Garda, Italien
Dauer: 19.10.201423.10.2014
Konferenznummer: 13
https://search.worldcat.org/de/title/semantic-web-iswc-2014-13th-international-semantic-web-conference-riva-del-garda-italy-october-19-23-2014-proceedings-part-i/oclc/941304230

Bibliographische Notiz

Publisher Copyright:
© Springer International Publishing Switzerland 2014.

DOI

Zuletzt angesehen

Publikationen

  1. Kalman Filter for Predictive Maintenance and Anomaly Detection
  2. Scale-dependent diversity patterns affect spider assemblages of two contrasting forest ecosystems
  3. The Creation of the Concept through the Interaction of Philosophy with Science and Art
  4. Understanding and Supporting Management Decision-Making
  5. Object-Oriented Construction Handbook
  6. A geometric approach for the design and control of an electromagnetic actuator to optimize its dynamic performance
  7. Effect of thermo-mechanical conditions during constrained friction processing on the particle refinement of AM50 Mg-alloy phases
  8. Bridging the Gap: Generating a Comprehensive Biomedical Knowledge Graph Question Answering Dataset
  9. Modelling biodegradability based on OECD 301D data for the design of mineralising ionic liquids
  10. Reading Comprehension as Embodied Action: Exploratory Findings on Nonlinear Eye Movement Dynamics and Comprehension of Scientific Texts
  11. Performance of methods to select landscape metrics for modelling species richness
  12. Determination of the construction and the material identity values of outside building components with the help of in-situ measuring procedures and FEM-simulation calculations
  13. DISKNET – A Platform for the Systematic Accumulation of Knowledge in IS Research
  14. A Sensitive Microsystem as Biosensor for Cell Growth Monitoring and Antibiotic Testing
  15. Experimental investigation of the fluid-structure interaction during deep drawing of fiber metal laminates in the in-situ hybridization process
  16. The Role of Output Vocabulary in T2T LMs for SPARQL Semantic Parsing
  17. Late developers and the inequity of "equitable utilization" and the harm of "do no harm"
  18. The impact of goal focus, task type and group size on synchronous net-based collaborative learning discourses
  19. Differences in adjustment flexibility between regular and temporary agency work
  20. Mechanical characterization of as-cast AA7075/6060 and CuSn6/Cu99.5 compounds using an experimental and numerical push-out test
  21. Integrating errors into the training process
  22. Dichotomy or continuum? A global review of the interaction between autonomous and planned adaptations
  23. Lost-customers approximation of semi-open queueing networks with backordering
  24. Quality System Development at the University of Graz
  25. Collaborative benchmarking of functional-structural root architecture models
  26. Unlocking knowledge-policy action gaps in disaster-recovery-risk governance cycle
  27. Variational pragmatics in the foreign language classroom