Distributed Collaboration on Rdf Datasets Using Git

Abstract

Collaboration is one of the most important topics regarding the evolution of the World Wide Web and thus also for the Web of Data. In scenarios of distributed collaboration on datasets it is necessary to provide support for multiple dif- ferent versions of datasets to exist simultaneously, while also providing support for merging diverged datasets. In this pa- per we present an approach that uses SPARQL 1.1 in com- bination with the version control system Git, that creates commits for all changes applied to an RDF dataset contain- ing multiple named graphs. Further the operations provided by Git are used to distribute the commits among collabora- tors and merge diverged versions of the dataset. We show the advantages of (public) Git repositories for RDF datasets and how this represents a way to collaborate on RDF data and consume it. With SPARQL 1.1 and Git in combina- tion, users are given several opportunities to participate in the evolution of RDF data.


Backlinks