Real-time RDF extraction from unstructured data streams

Research output: Contributions to collected editions/worksArticle in conference proceedingsResearchpeer-review

Authors

  • Daniel Gerber
  • Sebastian Hellmann
  • Lorenz Bühmann
  • Tommaso Soru
  • Ricardo Usbeck
  • Axel Cyrille Ngonga Ngomo

The vision behind the Web of Data is to extend the current document-oriented Web with machine-readable facts and structured data, thus creating a representation of general knowledge. However, most of the Web of Data is limited to being a large compendium of encyclopedic knowledge describing entities. A huge challenge, the timely and massive extraction of RDF facts from unstructured data, has remained open so far. The availability of such knowledge on the Web of Data would provide significant benefits to manifold applications including news retrieval, sentiment analysis and business intelligence. In this paper, we address the problem of the actuality of the Web of Data by presenting an approach that allows extracting RDF triples from unstructured data streams. We employ statistical methods in combination with deduplication, disambiguation and unsupervised as well as supervised machine learning techniques to create a knowledge base that reflects the content of the input streams. We evaluate a sample of the RDF we generate against a large corpus of news streams and show that we achieve a precision of more than 85%.

Original languageEnglish
Title of host publicationThe Semantic Web, ISWC 2013 : 12th International Semantic Web Conference, Proceedings
EditorsHarith Alani, Lalana Kagal, Achille Fokoue, Paul Groth, Chris Biemann, Josiane Xavier Parreira, Lora Aroyo, Natasha Noy, Chris Welty, Krzyztof Janowicz
Number of pages16
PublisherSpringer Verlag
Publication date2013
Pages135-150
ISBN (print)9783642413346
DOIs
Publication statusPublished - 2013
Externally publishedYes
Event12th International Semantic Web Conference, ISWC 2013 - Sydney Convention Centre , Sydney, NSW, Australia
Duration: 21.10.201325.10.2013
http://iswc2013.semanticweb.org

Recently viewed

Publications

  1. A Quadrant Approach of Camera Calibration Method for Depth Estimation Using a Stereo Vision System
  2. DialogueMaps: Supporting interactive transdisciplinary dialogues with a web-based tool for multi-layer knowledge maps
  3. A sufficient asymptotic stability condition in generalised model predictive control to avoid input saturation
  4. Gaussian processes for dispatching rule selection in production scheduling
  5. Comments on "Tracking Control of Robotic Manipulators With Uncertain Kinematics and Dynamics"
  6. Authenticity and authentication in language learning
  7. A Switching Cascade Sliding PID-PID Controllers Combined with a Feedforward and an MPC for an Actuator in Camless Internal Combustion Engines
  8. Supporting the Development and Implementation of a Digitalization Strategy in SMEs through a Lightweight Architecture-based Method
  9. Appendix A: Design, implementation, and analysis of the iGOES project
  10. Effectiveness of a guided multicomponent internet and mobile gratitude training program - A pragmatic randomized controlled trial
  11. On the Nonlinearity Compensation in Permanent Magnet Machine Using a Controller Based on a Controlled Invariant Subspace
  12. The fuzzy relationship of intelligence and problem solving in computer simulations
  13. Modeling Conditional Dependencies in Multiagent Trajectories
  14. Enabling Road Condition Monitoring with an on-board Vehicle Sensor Setup
  15. Fixed-term Contracts and Wages Revisited Using Linked Employer-Employee Data from Germany
  16. Stability analysis of a linear model predictive control and its application in a water recovery process
  17. Probabilistic approach to modelling of recession curves
  18. Some model properties to control a permanent magnet machine using a controlled invariant subspace
  19. Gain Scheduling Controller for Improving Level Control Performance
  20. Understanding the socio-technical aspects of low-code adoption for software development
  21. Scholarly Question Answering Using Large Language Models in the NFDI4DataScience Gateway
  22. Study on the effects of tool design and process parameters on the robustness of deep drawing
  23. Neural correlates of the enactment effect in the brain
  24. A Structure and Content Prompt-based Method for Knowledge Graph Question Answering over Scholarly Data
  25. Children's use of spatial skills in solving two map-reading tasks in real space.
  26. Outperformed by a Computer? - Comparing Human Decisions to Reinforcement Learning Agents, Assigning Lot Sizes in a Learning Factory
  27. Modeling Grounding Processes in Chat-based CSCL
  28. Foreign bias in institutional portfolio allocation
  29. Cross-case knowledge transfer in transformative research: enabling learning in and across sustainability-oriented labs through case reporting
  30. Masked Autoencoder Pretraining for Event Classification in Elite Soccer
  31. Comparison of EKF and TSO for Health Monitoring of a Textile-Based Heater Structure and its Control
  32. Determinants and Outcomes of Dual Distribution:
  33. The identification of up-And downstream industries using input-output tables and a firm-level application to minority shareholdings
  34. Sliding mode and model predictive control for inverse pendulum