HDTQ: Managing RDF Datasets in Compressed Space

Javier David Fernandez Garcia, Miguel A. Martínez-Prieto, Axel Polleres, Julian Reindorf

Publication: Chapter in book/Conference proceedingContribution to conference proceedings

54 Downloads (Pure)

Abstract

HDT (Header-Dictionary-Triples) is a compressed representation of RDF data that supports retrieval features without prior decompression. Yet, RDF datasets often contain additional graph information, such as the origin, version or validity time of a triple. Traditional HDT is not capable of handling this additional parameter(s). This work introduces HDTQ (HDT Quads), an extension of HDT that is able to represent quadruples (or quads) while still being highly compact and queryable. Two HDTQ-based approaches are introduced: Annotated Triples and Annotated Graphs, and their performance is compared to the leading open-source RDF stores on the market. Results show that HDTQ achieves the best compression rates and is a competitive alternative to well-established systems.
Original languageEnglish
Title of host publicationThe Semantic Web. Proceedings of ESWC 2018
Editors Springer International Publishing
Place of PublicationCham
PublisherSpringer International Publishing
Pages191-208
ISBN (Print)978-3-319-93417-4
DOIs
Publication statusPublished - 2018

Austrian Classification of Fields of Science and Technology (ÖFOS)

  • 102001 Artificial intelligence
  • 502050 Business informatics
  • 102015 Information systems

Cite this