Real-time RDF extraction from unstructured data streams

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Authors

  • Daniel Gerber
  • Sebastian Hellmann
  • Lorenz Bühmann
  • Tommaso Soru
  • Ricardo Usbeck
  • Axel Cyrille Ngonga Ngomo

The vision behind the Web of Data is to extend the current document-oriented Web with machine-readable facts and structured data, thus creating a representation of general knowledge. However, most of the Web of Data is limited to being a large compendium of encyclopedic knowledge describing entities. A huge challenge, the timely and massive extraction of RDF facts from unstructured data, has remained open so far. The availability of such knowledge on the Web of Data would provide significant benefits to manifold applications including news retrieval, sentiment analysis and business intelligence. In this paper, we address the problem of the actuality of the Web of Data by presenting an approach that allows extracting RDF triples from unstructured data streams. We employ statistical methods in combination with deduplication, disambiguation and unsupervised as well as supervised machine learning techniques to create a knowledge base that reflects the content of the input streams. We evaluate a sample of the RDF we generate against a large corpus of news streams and show that we achieve a precision of more than 85%.

OriginalspracheEnglisch
TitelThe Semantic Web, ISWC 2013 : 12th International Semantic Web Conference, Proceedings
HerausgeberHarith Alani, Lalana Kagal, Achille Fokoue, Paul Groth, Chris Biemann, Josiane Xavier Parreira, Lora Aroyo, Natasha Noy, Chris Welty, Krzyztof Janowicz
Anzahl der Seiten16
VerlagSpringer Verlag
Erscheinungsdatum2013
Seiten135-150
ISBN (Print)9783642413346
DOIs
PublikationsstatusErschienen - 2013
Extern publiziertJa
Veranstaltung12th International Semantic Web Conference, ISWC 2013 - Sydney Convention Centre , Sydney, NSW, Australien
Dauer: 21.10.201325.10.2013
http://iswc2013.semanticweb.org

DOI

Zuletzt angesehen

Aktivitäten

  1. Material Migrations I Online Lecture Series
  2. Exploring Affective Human-Robot Interaction with Movie Scenes
  3. Implementing aspects of inquiry-based learning in secondary chemistry classes: a case study
  4. Maximum-Likelihood-Based Panel Cointegration Test with Linear Time Trend
  5. The Linguistic Complexity of Test Items: Differential Effects for Students With Low and High Language Proficiency
  6. Towards a fully-automated adaptive e-learning environment: A predictive model for difficulty generating factors in gap-filling activities that target English tense-aspect-mood
  7. Digital Abstraction at the Interface between Electronic Media Arts and Data Visualization
  8. Co-Supervisor for the Dissertation "The effects of forest structural element retention on insect communities"
  9. Presentation of the paper entitled "Soft Optimal Computing to Identify Surface Roughness in Manufacturing using a Monotonic Regressor"
  10. Co-supervisor of the dissertation "Diversity and functions of plant-insect interactions along a forest retention gradient"
  11. Uncertainty and Subjectivity in Provenance Linked Open Data
  12. Placemaking today: integrating place-oriented thinking into cultural policy frameworks
  13. From Archives to Activism: Using Data to Challenge Structures in Art Collections
  14. Explicit References in Chat-Based CSCL: Do They Faciliate Global Text Processing?
  15. International Symposium on Multiscale Computational Analysis of Complex Materials
  16. Explaining primary school teachers’ usage of digital learning data: A mixed method study
  17. Mediating Atmospheres: Apprehending the Intersections of Data, Memory and Space
  18. Experiences with applying for and managing large DFG projects
  19. Implementing Sustainability Strategies Through Accounting Controls: An Exploration of Practices in Seven Multinational Corporations
  20. LC-MS identification of the photo-transformation products of desipramine with studying the effect of different environmental variables on the kinetics of their formation

Publikationen

  1. Age effects on controlling tools with sensorimotor transformations
  2. Supporting the Development and Realization of Data-Driven Business Models with Enterprise Architecture Modeling and Management
  3. Computing regression statistics from grouped data
  4. A localized boundary element method for the floating body problem
  5. On the Decoupling and Output Functional Controllability of Robotic Manipulation
  6. Analysis of PI controllers with anti-windup techniques on level systems
  7. Image compression based on periodic principal components
  8. TRY plant trait database – enhanced coverage and open access
  9. A Review of Latent Variable Modeling Using R - A Step-by-Step-Guide
  10. Knowledge-Enhanced Language Models Are Not Bias-Proof
  11. An Orthogonal Wavelet Denoising Algorithm for Surface Images of Atomic Force Microscopy
  12. Data-driven and physics-based modelling of process behaviour and deposit geometry for friction surfacing
  13. Teaching methods for modelling problems and students’ task-specific enjoyment, value, interest and self-efficacy expectations
  14. Self-regulation in error management training: emotion control and metacognition as mediators of performance effects
  15. Spaces for challenging experiences, indeterminacy, and experimentation
  16. Teachers’ use of data from digital learning platforms for instructional design
  17. Second language learners' performance in mathematics
  18. More input, better output
  19. How Much Home Office is Ideal? A Multi-Perspective Algorithm
  20. Passive Peak Voltage Sensor for Multiple Sending Coils Inductive Power Transmission System
  21. Top-down contingent attentional capture during feed-forward visual processing
  22. Effectiveness of a Web-Based Cognitive Behavioural Intervention for Subthreshold Depression
  23. Primary Side Circuit Design of a Multi-coil Inductive System for Powering Wireless Sensors
  24. Biodegradation screening of chemicals in an artificial matrix simulating the water-sediment interface
  25. Promising practices for dealing with complexity in research for development
  26. A Framework for Applying Natural Language Processing in Digital Health Interventions
  27. Enhancing EFL classroom instruction via the FeedBook: effects on language development and communicative language use.