Holistic and scalable ranking of RDF data

Publication: Contributions to collected editions > Articles in conference proceedings > Research > Peer-reviewed

Authors

The volume and number of data sources published using Semantic Web standards such as RDF grow continuously. The largest of these data sources now contain billions of facts and are updated periodically. A large number of applications driven by such data sources require the ranking of entities and facts contained in such knowledge graphs. Hence, there is a need for time-efficient approaches that can compute ranks for entities and facts simultaneously. In this paper, we present the first holistic ranking approach for RDF data. Our approach, dubbed HARE, allows the simultaneous computation of ranks for RDF triples, resources, properties and literals. To this end, HARE relies on the representation of RDF graphs as bi-partite graphs. It then employs a time-efficient extension of the random walk paradigm to bi-partite graphs. We show that by virtue of this extension, the worst-case complexity of HARE is O(n^5) while that of PageRank is O(n^6). In addition, we evaluate the practical efficiency of our approach by comparing it with PageRank on 6 real and 6 synthetic datasets with sizes up to 10^8 triples. Our results show that HARE is up to 2 orders of magnitude faster than PageRank. We also present a brief evaluation of HARE's ranking accuracy by comparing it with that of PageRank applied directly to RDF graphs. Our evaluation on 19 classes of DBpedia demonstrates that there is no statistically significant difference between HARE and PageRank. We hence conclude that our approach goes beyond the state of the art by allowing the ranking of all RDF entities and of RDF triples without being worse w.r.t. the ranking quality it achieves on resources. HARE is open-source and is available at http://github.com/dice-group/hare.
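The idea of ranking triples and their constituents together through a random walk on a bi-partite graph can be illustrated with a small sketch. This is not the HARE implementation from the repository above. It is a minimal illustration under assumptions that are not in the abstract: the function names `bipartite_rank` and `spread_to_triples`, the damping factor of 0.85, the fixed iteration count, and the uniform transition probabilities are all hypothetical choices. One side of the graph holds the triples, the other side holds the terms (resources, properties, literals) they mention, and the walk alternates between the two sides.

```python
from collections import defaultdict


def spread_to_triples(term_score, occurs, n_triples):
    """Each term hands its score, split evenly, to the triples that mention it."""
    triple_score = [0.0] * n_triples
    for term, idxs in occurs.items():
        share = term_score[term] / len(idxs)
        for i in idxs:
            triple_score[i] += share
    return triple_score


def bipartite_rank(triples, damping=0.85, iterations=50):
    """Illustrative alternating random walk between triples and their terms.

    damping and iterations are assumed defaults, not values from the paper.
    Returns (term_score, triple_score); each sums to 1.
    """
    # Bi-partite structure: triple index -> distinct terms, term -> triple indices.
    members = [sorted(set(t)) for t in triples]
    occurs = defaultdict(list)
    for i, terms in enumerate(members):
        for term in terms:
            occurs[term].append(i)

    n_terms = len(occurs)
    term_score = {term: 1.0 / n_terms for term in occurs}
    for _ in range(iterations):
        # Step from the term side to the triple side.
        triple_score = spread_to_triples(term_score, occurs, len(triples))
        # Step back to the term side, with a uniform restart (teleport) share.
        nxt = {term: (1.0 - damping) / n_terms for term in occurs}
        for i, terms in enumerate(members):
            share = damping * triple_score[i] / len(terms)
            for term in terms:
                nxt[term] += share
        term_score = nxt
    # Triple ranks are derived from the final term ranks in one extra step.
    return term_score, spread_to_triples(term_score, occurs, len(triples))


# Toy data with made-up identifiers.
triples = [
    ("ex:Berlin", "ex:capitalOf", "ex:Germany"),
    ("ex:Berlin", "rdf:type", "ex:City"),
    ("ex:Paris", "rdf:type", "ex:City"),
]
term_score, triple_score = bipartite_rank(triples)
for term in sorted(term_score, key=term_score.get, reverse=True):
    print(f"{term}\t{term_score[term]:.4f}")
```

In this sketch a single pass yields scores for every term and every triple, which is the sense in which such a ranking is "holistic"; how HARE itself defines the transition probabilities and achieves its complexity bound is described in the paper, not here.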

Original language: English
Title: Proceedings - 2017 IEEE International Conference on Big Data, Big Data 2017
Editors: Jian-Yun Nie, Zoran Obradovic, Toyotaro Suzumura, Rumi Ghosh, Raghunath Nambiar, Chonggang Wang, Hui Zang, Ricardo Baeza-Yates, Xiaohua Hu, Jeremy Kepner, Alfredo Cuzzocrea, Jian Tang, Masashi Toyoda
Number of pages: 10
Publisher: Institute of Electrical and Electronics Engineers Inc.
Publication date: 01.07.2017
Pages: 746-755
ISBN (print): 978-1-5386-2714-3, 978-1-5386-2716-7
ISBN (electronic): 978-1-5386-2715-0
Publication status: Published - 01.07.2017
Externally published: Yes
Event: 5th IEEE International Conference on Big Data, Big Data 2017 - Boston, United States
Duration: 11.12.2017 - 14.12.2017
Conference number: 5
https://cci.drexel.edu/bigdata/bigdata2017/

Bibliographical note

Funding Information:
This work was supported by the H2020 project HOBBIT (GA no. 688227), the EuroStars projects DIESEL (E!9367) and QAMEL (E!9725) as well as the BMVI projects LIMBO (project no. 19F2029C) and OPAL (project no. 19F20284).

Publisher Copyright:
© 2017 IEEE.
