Holistic and scalable ranking of RDF data

Research output: Contributions to collected editions/works › Article in conference proceedings › Research › peer-review

Authors

The volume and number of data sources published using Semantic Web standards such as RDF grow continuously. The largest of these data sources now contain billions of facts and are updated periodically. A large number of applications driven by such data sources require the ranking of entities and facts contained in such knowledge graphs. Hence, there is a need for time-efficient approaches that can compute ranks for entities and facts simultaneously. In this paper, we present the first holistic ranking approach for RDF data. Our approach, dubbed HARE, allows the simultaneous computation of ranks for RDF triples, resources, properties and literals. To this end, HARE relies on the representation of RDF graphs as bi-partite graphs. It then employs a time-efficient extension of the random walk paradigm to bi-partite graphs. We show that by virtue of this extension, the worst-case complexity of HARE is O(n^5) while that of PageRank is O(n^6). In addition, we evaluate the practical efficiency of our approach by comparing it with PageRank on 6 real and 6 synthetic datasets with sizes up to 10^8 triples. Our results show that HARE is up to 2 orders of magnitude faster than PageRank. We also present a brief evaluation of HARE's ranking accuracy by comparing it with that of PageRank applied directly to RDF graphs. Our evaluation on 19 classes of DBpedia demonstrates that there is no statistically significant difference between HARE and PageRank. We hence conclude that our approach goes beyond the state of the art by allowing the ranking of all RDF entities and of RDF triples without being worse w.r.t. the ranking quality it achieves on resources. HARE is open-source and is available at http://github.com/dice-group/hare.
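The bi-partite random walk mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the HARE implementation: the abstract does not specify the transition probabilities or the stopping rule, so the sketch assumes a uniform split of rank in both directions, a PageRank-style damping factor, and a fixed number of iterations. The function name `bipartite_ranks` and the example triples are hypothetical.

```python
from collections import defaultdict


def bipartite_ranks(triples, damping=0.85, iterations=50):
    """Rank entities and triples via a damped random walk on a bipartite graph.

    One side of the graph holds triples, the other side holds every term
    (resource, property or literal) that occurs in a triple.
    ASSUMPTION: uniform transition probabilities and fixed iteration count;
    the abstract does not define these details.
    """
    # Map each term to the indices of the triples it occurs in.
    occurs = defaultdict(set)
    for i, triple in enumerate(triples):
        for term in triple:
            occurs[term].add(i)

    entities = list(occurs)
    n = len(entities)
    rank = {e: 1.0 / n for e in entities}

    def spread_to_triples(entity_rank):
        # Each term passes its rank uniformly to the triples containing it.
        mass = [0.0] * len(triples)
        for e, idxs in occurs.items():
            share = entity_rank[e] / len(idxs)
            for i in idxs:
                mass[i] += share
        return mass

    for _ in range(iterations):
        triple_mass = spread_to_triples(rank)
        # Each triple passes its mass uniformly back to its distinct terms;
        # the remaining (1 - damping) is teleported uniformly.
        new_rank = {e: (1.0 - damping) / n for e in entities}
        for i, triple in enumerate(triples):
            terms = set(triple)
            share = damping * triple_mass[i] / len(terms)
            for t in terms:
                new_rank[t] += share
        rank = new_rank

    # Triple ranks follow from one more term-to-triple step.
    return rank, spread_to_triples(rank)


# Hypothetical example data, not taken from the paper.
triples = [
    ("ex:Alice", "ex:knows", "ex:Bob"),
    ("ex:Bob", "ex:knows", "ex:Carol"),
    ("ex:Alice", "ex:name", '"Alice"'),
    ("ex:Carol", "ex:knows", "ex:Bob"),
]
entity_rank, triple_rank = bipartite_ranks(triples)
for term, score in sorted(entity_rank.items(), key=lambda kv: -kv[1]):
    print(f"{term}\t{score:.4f}")
```

In this sketch a single walk yields scores for resources, properties, literals and triples together, which is the sense in which such a ranking is holistic. Both score vectors sum to 1, so they can be read as probability distributions.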

Original language: English
Title of host publication: Proceedings - 2017 IEEE International Conference on Big Data, Big Data 2017
Editors: Jian-Yun Nie, Zoran Obradovic, Toyotaro Suzumura, Rumi Ghosh, Raghunath Nambiar, Chonggang Wang, Hui Zang, Ricardo Baeza-Yates, Xiaohua Hu, Jeremy Kepner, Alfredo Cuzzocrea, Jian Tang, Masashi Toyoda
Number of pages: 10
Publisher: Institute of Electrical and Electronics Engineers Inc.
Publication date: 01.07.2017
Pages: 746-755
ISBN (print): 978-1-5386-2714-3, 978-1-5386-2716-7
ISBN (electronic): 978-1-5386-2715-0
DOIs
Publication status: Published - 01.07.2017
Externally published: Yes
Event: 5th IEEE International Conference on Big Data, Big Data 2017 - Boston, United States
Duration: 11.12.2017 - 14.12.2017
Conference number: 5
https://cci.drexel.edu/bigdata/bigdata2017/

Bibliographical note

Publisher Copyright:
© 2017 IEEE.
