Dockerized BSBM

This is the Dockerized version of the Berlin SPARQL Benchmark.

Links

Original work : http://wbsg.informatik.uni-mannheim.de/bizer/berlinsparqlbenchmark/
Sources : https://github.com/VCityTeam/BSBM
Images published on Docker hub.

Usage

docker run -v "$PWD:/app/data" -e "DATA_DESTINATION=<folder>" vcity/bsbm generate [args]
docker run -v "$PWD:/app/data" -e "DATA_DESTINATION=<folder>" vcity/bsbm generate-n [args]
docker run -v "$PWD:/app/data" -e "DATA_DESTINATION=<folder>" vcity/bsbm qualification [args]
docker run -v "$PWD:/app/data" -e "DATA_DESTINATION=<folder>" vcity/bsbm testdriver [args]

generate-n options

The generate-n command accepts the following arguments:

--versions or -v (required): Number of dataset versions to generate
--products or -p (default: 100): Initial product count
--step or -s (default: 1000): Product increment per version
--format or -f (default: ttl): Output format (nt, ttl, trig, xml, sql, virt, monetdb)
--var (default: 0): Variability percentage (0-100). Controls the percentage of products that change between versions. When set to a value greater than 0, each version generates an update dataset containing the specified percentage of products as changes.

Diff output files

For each version >= 2, generate-n automatically computes the RDF diff between consecutive versions and outputs:

dataset-X_additions.nt: triples present in version X but not in version X-1
dataset-X_deletions.nt: triples present in version X-1 but not in version X

These files are always generated in N-Triples format regardless of the --format option, since they are computed by comparing sorted N-Triples representations of each version.

If you want more information about the different arguments, please refer to the original documentation.

docker run vcity/bsbm generate -help
docker run vcity/bsbm generate-n -help
docker run vcity/bsbm qualification -help
docker run vcity/bsbm testdriver -help

$PWD is the directory where the data will be stored. You can change it to any directory you want.

Modifications from source:

Dockerfile:

Added new authors
Dockerized the benchmark

entrypoint.sh

Added a new entrypoint script to run the benchmark

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github/workflows		.github/workflows
lib		lib
queries		queries
src/benchmark		src/benchmark
usecases		usecases
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
build.xml		build.xml
entrypoint.sh		entrypoint.sh
generate		generate
generate-n		generate-n
givennames.txt		givennames.txt
log4j.xml		log4j.xml
qualification		qualification
testdriver		testdriver
titlewords.txt		titlewords.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dockerized BSBM

Links

Usage

generate-n options

Diff output files

Modifications from source:

About

Uh oh!

Releases 8

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Dockerized BSBM

Links

Usage

generate-n options

Diff output files

Modifications from source:

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 8

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages