RiverBench

RiverBench is an open RDF streaming benchmark suite that can be used for a wide range of benchmarking tasks

<~  Go back


I created RiverBench to provide a common, high-quality base for benchmarking various RDF systems. It aims to solve some of the frequent woes of benchmarks, such as poorly described and buggy datasets, missing license information, messy distribution formats, no systematic way to report results, and lacking documentation.

RiverBench is fully open and community-driven – you can submit your own datasets, benchmark tasks, results, and more. It heavily relies on CI automation to make sure all datasets follow the same high-quality standards: rich RDF metadata, clear licensing, multiple distribution formats, detailed documentation, and more.

RiverBench uses the RDF Stream Taxonomy (RDF-STaX) to describe and validate the stream types of benchmark datasets. The datasets are distributed both in W3C-standard formats (N-Triples, Turtle), and in Jelly, a high-performance binary RDF format.

References

  1. arXiv
    riverbench.png
    RiverBench: an Open RDF Streaming Benchmark Suite
    Piotr Sowiński, Maria Ganzha, and Marcin Paprzycki
    arXiv preprint arXiv:2305.06226, 2023