Real-time RDF extraction from unstructured data streams

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Standard

Real-time RDF extraction from unstructured data streams. / Gerber, Daniel; Hellmann, Sebastian; Bühmann, Lorenz et al.
The Semantic Web, ISWC 2013: 12th International Semantic Web Conference, Proceedings. Hrsg. / Harith Alani; Lalana Kagal; Achille Fokoue; Paul Groth; Chris Biemann; Josiane Xavier Parreira; Lora Aroyo; Natasha Noy; Chris Welty; Krzyztof Janowicz. Springer Verlag, 2013. S. 135-150 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 8218 LNCS, Nr. PART 1).

Publikation: Beiträge in SammelwerkenAufsätze in KonferenzbändenForschungbegutachtet

Harvard

Gerber, D, Hellmann, S, Bühmann, L, Soru, T, Usbeck, R & Ngonga Ngomo, AC 2013, Real-time RDF extraction from unstructured data streams. in H Alani, L Kagal, A Fokoue, P Groth, C Biemann, JX Parreira, L Aroyo, N Noy, C Welty & K Janowicz (Hrsg.), The Semantic Web, ISWC 2013: 12th International Semantic Web Conference, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Nr. PART 1, Bd. 8218 LNCS, Springer Verlag, S. 135-150, 12th International Semantic Web Conference, ISWC 2013, Sydney, NSW, New South Wales, Australien, 21.10.13. https://doi.org/10.1007/978-3-642-41335-3_9

APA

Gerber, D., Hellmann, S., Bühmann, L., Soru, T., Usbeck, R., & Ngonga Ngomo, A. C. (2013). Real-time RDF extraction from unstructured data streams. In H. Alani, L. Kagal, A. Fokoue, P. Groth, C. Biemann, J. X. Parreira, L. Aroyo, N. Noy, C. Welty, & K. Janowicz (Hrsg.), The Semantic Web, ISWC 2013: 12th International Semantic Web Conference, Proceedings (S. 135-150). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Band 8218 LNCS, Nr. PART 1). Springer Verlag. https://doi.org/10.1007/978-3-642-41335-3_9

Vancouver

Gerber D, Hellmann S, Bühmann L, Soru T, Usbeck R, Ngonga Ngomo AC. Real-time RDF extraction from unstructured data streams. in Alani H, Kagal L, Fokoue A, Groth P, Biemann C, Parreira JX, Aroyo L, Noy N, Welty C, Janowicz K, Hrsg., The Semantic Web, ISWC 2013: 12th International Semantic Web Conference, Proceedings. Springer Verlag. 2013. S. 135-150. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); PART 1). doi: 10.1007/978-3-642-41335-3_9

Bibtex

@inbook{bd4458c832904167a7a7c449e3f0beb6,
title = "Real-time RDF extraction from unstructured data streams",
abstract = "The vision behind the Web of Data is to extend the current document-oriented Web with machine-readable facts and structured data, thus creating a representation of general knowledge. However, most of the Web of Data is limited to being a large compendium of encyclopedic knowledge describing entities. A huge challenge, the timely and massive extraction of RDF facts from unstructured data, has remained open so far. The availability of such knowledge on the Web of Data would provide significant benefits to manifold applications including news retrieval, sentiment analysis and business intelligence. In this paper, we address the problem of the actuality of the Web of Data by presenting an approach that allows extracting RDF triples from unstructured data streams. We employ statistical methods in combination with deduplication, disambiguation and unsupervised as well as supervised machine learning techniques to create a knowledge base that reflects the content of the input streams. We evaluate a sample of the RDF we generate against a large corpus of news streams and show that we achieve a precision of more than 85%.",
keywords = "Informatics, Time Slice, Name Entry Recognition, Pattern Mapping, Link Open Data, String Similarity, Business informatics",
author = "Daniel Gerber and Sebastian Hellmann and Lorenz B{\"u}hmann and Tommaso Soru and Ricardo Usbeck and {Ngonga Ngomo}, {Axel Cyrille}",
year = "2013",
doi = "10.1007/978-3-642-41335-3_9",
language = "English",
isbn = "9783642413346",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
publisher = "Springer Verlag",
number = "PART 1",
pages = "135--150",
editor = "Harith Alani and Lalana Kagal and Achille Fokoue and Paul Groth and Chris Biemann and Parreira, {Josiane Xavier} and Lora Aroyo and Natasha Noy and Chris Welty and Krzyztof Janowicz",
booktitle = "The Semantic Web, ISWC 2013",
address = "Germany",
note = "12th International Semantic Web Conference, ISWC 2013 ; Conference date: 21-10-2013 Through 25-10-2013",
url = "http://iswc2013.semanticweb.org",

}

RIS

TY - CHAP

T1 - Real-time RDF extraction from unstructured data streams

AU - Gerber, Daniel

AU - Hellmann, Sebastian

AU - Bühmann, Lorenz

AU - Soru, Tommaso

AU - Usbeck, Ricardo

AU - Ngonga Ngomo, Axel Cyrille

PY - 2013

Y1 - 2013

N2 - The vision behind the Web of Data is to extend the current document-oriented Web with machine-readable facts and structured data, thus creating a representation of general knowledge. However, most of the Web of Data is limited to being a large compendium of encyclopedic knowledge describing entities. A huge challenge, the timely and massive extraction of RDF facts from unstructured data, has remained open so far. The availability of such knowledge on the Web of Data would provide significant benefits to manifold applications including news retrieval, sentiment analysis and business intelligence. In this paper, we address the problem of the actuality of the Web of Data by presenting an approach that allows extracting RDF triples from unstructured data streams. We employ statistical methods in combination with deduplication, disambiguation and unsupervised as well as supervised machine learning techniques to create a knowledge base that reflects the content of the input streams. We evaluate a sample of the RDF we generate against a large corpus of news streams and show that we achieve a precision of more than 85%.

AB - The vision behind the Web of Data is to extend the current document-oriented Web with machine-readable facts and structured data, thus creating a representation of general knowledge. However, most of the Web of Data is limited to being a large compendium of encyclopedic knowledge describing entities. A huge challenge, the timely and massive extraction of RDF facts from unstructured data, has remained open so far. The availability of such knowledge on the Web of Data would provide significant benefits to manifold applications including news retrieval, sentiment analysis and business intelligence. In this paper, we address the problem of the actuality of the Web of Data by presenting an approach that allows extracting RDF triples from unstructured data streams. We employ statistical methods in combination with deduplication, disambiguation and unsupervised as well as supervised machine learning techniques to create a knowledge base that reflects the content of the input streams. We evaluate a sample of the RDF we generate against a large corpus of news streams and show that we achieve a precision of more than 85%.

KW - Informatics

KW - Time Slice

KW - Name Entry Recognition

KW - Pattern Mapping

KW - Link Open Data

KW - String Similarity

KW - Business informatics

UR - http://www.scopus.com/inward/record.url?scp=84891950965&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/5d304550-be6f-361f-8bc5-05940fd2117e/

U2 - 10.1007/978-3-642-41335-3_9

DO - 10.1007/978-3-642-41335-3_9

M3 - Article in conference proceedings

AN - SCOPUS:84891950965

SN - 9783642413346

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 135

EP - 150

BT - The Semantic Web, ISWC 2013

A2 - Alani, Harith

A2 - Kagal, Lalana

A2 - Fokoue, Achille

A2 - Groth, Paul

A2 - Biemann, Chris

A2 - Parreira, Josiane Xavier

A2 - Aroyo, Lora

A2 - Noy, Natasha

A2 - Welty, Chris

A2 - Janowicz, Krzyztof

PB - Springer Verlag

T2 - 12th International Semantic Web Conference, ISWC 2013

Y2 - 21 October 2013 through 25 October 2013

ER -

DOI

Zuletzt angesehen

Publikationen

  1. A tutorial introduction to adaptive fractal analysis
  2. Early Detection of Faillure in Conveyor Chain Systems by Wireless Sensor Node
  3. Temperature control in Peltier cells comparing sliding mode control and PID controllers
  4. Assessment of university students’ understanding of abstract binary operations
  5. An Adaptive and Optimized Switching Observer for Sensorless Control of an Electromagnetic Valve Actuator in Camless Internal Combustion Engines
  6. What´s in a net? or: The end of the average
  7. Obstacle Coordinates Transformation from TVS Body-Frame to AGV Navigation-Frame
  8. Top-down contingent feature-specific orienting with and without awareness of the visual input
  9. A Sliding Mode Control with a Bang-Bang Observer for Detection of Particle Pollution
  10. Kalman Filter for Predictive Maintenance and Anomaly Detection
  11. Applications of the Simultaneous Modular Approach in the Field of Material Flow Analysis
  12. Active learning for network intrusion detection
  13. Explaining and controlling for the psychometric properties of computer-generated figural matrix items
  14. Transductive support vector machines for structured variables
  15. Object-Oriented Construction Handbook
  16. Aspect-oriented software development
  17. Machine Learning and Data Mining for Sports Analytics
  18. A geometric approach for the design and control of an electromagnetic actuator to optimize its dynamic performance
  19. The Lifecycle of "Facts'': A Survey of Social Bias in Knowledge Graphs
  20. Analyzing the Influence of Knowledge Graph Information on Relation Extraction
  21. Treating dialogue quality evaluation as an anomaly detection problem
  22. Internal forces in robotic manipulation and in general mechanisms using a geometric approach
  23. Early Edema Detection Based on the Examination of Multidimensional Ultra-Wide band Data
  24. Reducing the peaking phenomenon in Luenberger observers in presence of quasi-static disturbances for linear time invariant systems
  25. A geometric approach to the decoupling control and to speed up the dynamics of a general rigid body manipulation system
  26. Das John-Stuart-Mill-Problem
  27. Document assignment in multi-site search engines
  28. Is implicit Theory of Mind real but hard to detect?
  29. The Challenge of Democratic Representation in the European Union