RiverBench | Piotr Sowiński

I created RiverBench to provide a common, high-quality base for benchmarking various RDF systems. It aims to solve some of the frequent woes of benchmarks, such as poorly described and buggy datasets, missing license information, messy distribution formats, no systematic way to report results, and lacking documentation.

RiverBench is fully open and community-driven – you can submit your own datasets, benchmark tasks, results, and more. It heavily relies on CI automation to make sure all datasets follow the same high-quality standards: rich RDF metadata, clear licensing, multiple distribution formats, detailed documentation, and more.

RiverBench uses the RDF Stream Taxonomy (RDF-STaX) to describe and validate the stream types of benchmark datasets. The datasets are distributed both in W3C-standard formats (N-Triples, Turtle), and in Jelly, a high-performance binary RDF format.

Find out more on the RiverBench website: https://w3id.org/riverbench/
GitHub: https://github.com/RiverBench
Preprint: (Sowiński et al., 2023)

References