Holistic and scalable ranking of RDF data

Publication: Contributions to collected editions › Article in conference proceedings › Research › peer-reviewed

The volume and number of data sources published using Semantic Web standards such as RDF grow continuously. The largest of these data sources now contain billions of facts and are updated periodically. Many applications driven by such data sources require the ranking of entities and facts contained in such knowledge graphs. Hence, there is a need for time-efficient approaches that can compute ranks for entities and facts simultaneously. In this paper, we present the first holistic ranking approach for RDF data. Our approach, dubbed HARE, allows the simultaneous computation of ranks for RDF triples, resources, properties and literals. To this end, HARE relies on the representation of RDF graphs as bipartite graphs. It then employs a time-efficient extension of the random walk paradigm to bipartite graphs. We show that by virtue of this extension, the worst-case complexity of HARE is O(n^5) while that of PageRank is O(n^6). In addition, we evaluate the practical efficiency of our approach by comparing it with PageRank on 6 real and 6 synthetic datasets with sizes up to 10^8 triples. Our results show that HARE is up to 2 orders of magnitude faster than PageRank. We also present a brief evaluation of HARE's ranking accuracy by comparing it with that of PageRank applied directly to RDF graphs. Our evaluation on 19 classes of DBpedia demonstrates that there is no statistically significant difference between HARE and PageRank. We hence conclude that our approach goes beyond the state of the art by allowing the ranking of all RDF entities and of RDF triples without being worse w.r.t. the ranking quality it achieves on resources. HARE is open-source and is available at http://github.com/dice-group/hare.
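
The idea of ranking triples and their terms together through a random walk on a bipartite graph can be illustrated with a minimal sketch. This is not HARE's actual implementation or its exact transition model: the function name `bipartite_ranks`, the damping factor, the fixed iteration count, the undirected triple-to-term edges, and the example triples are all assumptions made for illustration.

```python
from collections import defaultdict


def bipartite_ranks(triples, damping=0.85, iterations=50):
    """Damped random walk on a bipartite graph of triples and terms.

    One side of the graph holds the triples, the other side holds the
    terms (subjects, predicates, objects). Each triple is linked to the
    distinct terms it contains. Illustrative assumption, not HARE itself.
    """
    triples = list(dict.fromkeys(triples))  # de-duplicate, keep order
    terms = list(dict.fromkeys(t for triple in triples for t in triple))

    # Adjacency in both directions of the bipartite graph.
    triple_terms = {tr: list(dict.fromkeys(tr)) for tr in triples}
    term_triples = defaultdict(list)
    for tr, members in triple_terms.items():
        for term in members:
            term_triples[term].append(tr)

    term_rank = {t: 1.0 / len(terms) for t in terms}
    triple_rank = {tr: 0.0 for tr in triples}

    for _ in range(iterations):
        # Half-step 1: every term spreads its mass evenly over its triples.
        triple_rank = {tr: 0.0 for tr in triples}
        for term, rank in term_rank.items():
            share = rank / len(term_triples[term])
            for tr in term_triples[term]:
                triple_rank[tr] += share

        # Half-step 2: every triple spreads its mass evenly over its terms,
        # mixed with a uniform teleport (the damping step).
        new_rank = {t: (1.0 - damping) / len(terms) for t in terms}
        for tr, rank in triple_rank.items():
            share = damping * rank / len(triple_terms[tr])
            for term in triple_terms[tr]:
                new_rank[term] += share
        term_rank = new_rank

    return term_rank, triple_rank


if __name__ == "__main__":
    # Hypothetical example data, not taken from the paper.
    example = [
        ("ex:a", "ex:knows", "ex:b"),
        ("ex:b", "ex:knows", "ex:c"),
        ("ex:c", "ex:name", '"C"'),
    ]
    term_rank, triple_rank = bipartite_ranks(example)
    for term, rank in sorted(term_rank.items(), key=lambda kv: -kv[1]):
        print(f"{term}\t{rank:.4f}")
```

Because both half-steps conserve probability mass, the term ranks and the triple ranks each sum to 1, so a single pass yields scores for resources, properties, literals and triples at once.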

Original language: English
Title: Proceedings - 2017 IEEE International Conference on Big Data, Big Data 2017
Editors: Jian-Yun Nie, Zoran Obradovic, Toyotaro Suzumura, Rumi Ghosh, Raghunath Nambiar, Chonggang Wang, Hui Zang, Ricardo Baeza-Yates, Xiaohua Hu, Jeremy Kepner, Alfredo Cuzzocrea, Jian Tang, Masashi Toyoda
Number of pages: 10
Publisher: Institute of Electrical and Electronics Engineers Inc.
Publication date: 1 July 2017
Pages: 746-755
ISBN (print): 978-1-5386-2714-3, 978-1-5386-2716-7
ISBN (electronic): 978-1-5386-2715-0
Publication status: Published - 1 July 2017
Externally published: Yes
Event: 5th IEEE International Conference on Big Data, Big Data 2017 - Boston, United States
Duration: 11 December 2017 to 14 December 2017
Conference number: 5
https://cci.drexel.edu/bigdata/bigdata2017/

Bibliographical note

Funding Information:
This work was supported by the H2020 project HOBBIT (GA no. 688227), the EuroStars projects DIESEL (E!9367) and QAMEL (E!9725) as well as the BMVI projects LIMBO (project no. 19F2029C) and OPAL (project no. 19F20284).

Publisher Copyright:
© 2017 IEEE.
