
project-construct-x/dataspace-simulator

Dataspace Simulator

The simulator is a standalone, interactive dataspace environment for research and teaching. It lets users experience end-to-end dataspace behavior visually, including participant discovery, catalog visibility, policy-based access control, semantic search, contract negotiation, and data transfer.

The focus is not protocol compliance, but conceptual correctness and reproducibility. This makes the simulator suitable as the primary artifact for a semantics-focused paper.

Purpose

  • Demonstrate dataspace fundamentals in an interactive and explainable way
  • Provide a controllable testbed for semantic discovery with DCAT and SPARQL
  • Show how credential-based policies affect visibility and access decisions
  • Offer a reproducible local environment with no external dependencies

Scope and non-goals

What it includes:

  • Interactive UI for participants, catalogs, semantic search, negotiation, and transfer
  • Policy engine that evaluates access constraints against participant claims
  • RDF/DCAT metadata indexing in Apache Fuseki with SPARQL querying
  • Persistent local state via SQLite and Docker volumes

What it does not include:

  • Full DSP/EDC interoperability
  • Production-grade trust infrastructure and identity federation
  • Real network-level data plane implementation

Quick start

```shell
docker compose up -d --build
```

On first startup, the backend seeds a ready-to-run demo scenario with preset participants and sample assets (including multiple product passports for NordBeton).

Open:

  • Simulator UI: http://localhost:4000
  • Backend API: http://localhost:4001
  • Fuseki UI: http://localhost:4030

To reset to a clean first-run state (and re-apply demo seed data):

```shell
docker compose down -v
docker compose up -d --build
```

Runtime architecture

Component      Role                                                     Port
sim-frontend   React-based interactive simulator UI                     3000
sim-backend    Node.js API, policy checks, state machine, persistence   3001
sim-fuseki     RDF store and SPARQL endpoint for semantic metadata      3030

Persistence:

  • SQLite database inside backend container (/data/simulator.db)
  • Fuseki dataset in Docker volume (fuseki-data)

Functional model

1) Participants and credentials

Participants (nodes) represent organizations in a dataspace. Each node can carry claims such as industry and orgRole, along with role capabilities (provider, consumer).

These claims are evaluated by policies during catalog and negotiation phases.
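As an illustration, a participant node and its claims might look like the following sketch. The field names (industry, orgRole, roles) are illustrative, not the simulator's actual schema:

```javascript
// Hypothetical shape of a participant node. Claims are evaluated by
// policies; roles determine which sides of an exchange the node can play.
const participant = {
  id: "nordbeton",
  name: "NordBeton",
  claims: { industry: "construction", orgRole: "manufacturer" },
  roles: ["provider"],
};

// A participant may act as provider, consumer, or both.
function canAct(node, role) {
  return node.roles.includes(role);
}
```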

2) Publish

A provider publishes an asset with:

  • Base fields: title, description, filename
  • Optional policy template
  • Optional DCAT metadata fields
  • File payload (JSON, CSV, or TXT)

On publish:

  • Asset and payload are persisted in SQLite
  • Semantic metadata is converted to RDF/DCAT and indexed in Fuseki
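A minimal sketch of the metadata-to-RDF step, assuming a simple URN scheme and plain N-Triples-style serialization (the prefixes and subject URIs are assumptions, not the simulator's exact code):

```javascript
// Convert an asset's base and DCAT fields into DCAT/DCT triples
// suitable for insertion into Fuseki. Literal values are JSON-escaped.
function assetToTriples(asset) {
  const s = `<urn:sim:dataset:${asset.id}>`;
  const triples = [
    `${s} a dcat:Dataset .`,
    `${s} dct:title ${JSON.stringify(asset.title)} .`,
    `${s} dct:identifier ${JSON.stringify(asset.id)} .`,
  ];
  if (asset.description)
    triples.push(`${s} dct:description ${JSON.stringify(asset.description)} .`);
  for (const kw of asset.keywords || [])
    triples.push(`${s} dcat:keyword ${JSON.stringify(kw)} .`);
  return triples.join("\n");
}
```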

3) Discovery and catalog access

The consumer discovers available providers and requests catalogs. Catalog responses are policy-filtered first, based on the consumer's claims.

This is intentional: semantic ranking is only applied within policy-visible assets.
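The policy-first ordering can be sketched as a plain filter over the provider's assets; the allowedIndustries policy shape here is hypothetical and stands in for the real constraint evaluation:

```javascript
// Policy-first catalog filtering: only assets whose policy admits the
// consumer's claims appear in the catalog response at all.
function filterCatalog(assets, consumerClaims) {
  return assets.filter(
    (a) =>
      !a.policy ||
      a.policy.allowedIndustries.includes(consumerClaims.industry)
  );
}
```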

4) Semantic search

Semantic search follows a catalog-first strategy:

  1. Determine visible assets using policy checks
  2. Restrict semantic query scope to those visible dataset IDs
  3. Run SPARQL in Fuseki for semantic refinement

Supported query styles:

  • Free-text search
  • Field-level filters over DCAT fields
  • Combined text + structured filters
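The catalog-first strategy can be sketched as a query builder: the policy-visible dataset IDs become a VALUES clause, and free text becomes CONTAINS filters. The variable names and predicate choices are illustrative assumptions:

```javascript
// Build a SPARQL query scoped to the policy-visible dataset IDs.
// Free-text input, when present, is matched against title/description.
function buildSearchQuery(visibleIds, text) {
  const values = visibleIds.map((id) => `"${id}"`).join(" ");
  const textFilter = text
    ? `FILTER(CONTAINS(LCASE(?title), LCASE("${text}")) ||
              CONTAINS(LCASE(?desc),  LCASE("${text}")))`
    : "";
  return `
PREFIX dcat: <http://www.w3.org/ns/dcat#>
PREFIX dct:  <http://purl.org/dc/terms/>
SELECT ?ds ?title WHERE {
  VALUES ?id { ${values} }
  ?ds a dcat:Dataset ;
      dct:identifier ?id ;
      dct:title ?title .
  OPTIONAL { ?ds dct:description ?desc . }
  ${textFilter}
}`;
}
```

Restricting the VALUES clause up front means Fuseki never ranks an asset the consumer was not allowed to see.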

5) Negotiation

The negotiation state machine models:

  • REQUESTED -> OFFERED -> AGREED
  • or TERMINATED on policy denial

Policy is checked again at negotiation time to model access enforcement at contract phase.
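A minimal sketch of these transitions, with the renewed policy check folded into each step (the state names match the README; the transition table is an illustrative simplification):

```javascript
// Happy-path successor for each negotiation state; terminal states
// have no successor. A policy denial terminates from any state.
const TRANSITIONS = {
  REQUESTED: "OFFERED",
  OFFERED: "AGREED",
};

function advance(state, policyAllows) {
  if (!policyAllows) return "TERMINATED";
  const next = TRANSITIONS[state];
  if (next === undefined) throw new Error(`No transition from ${state}`);
  return next;
}
```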

6) Transfer

After AGREED, transfer is initiated and completed through the backend state machine.

In this simulator, transfer means:

  • The provider's asset payload is copied to the consumer-side received store
  • The transfer lifecycle is persisted and visualized

This models functional transfer semantics, not low-level data-plane transport.
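A sketch of these functional transfer semantics over an in-memory store (the table and field names are assumptions; the real backend persists the same steps in SQLite):

```javascript
// Copy the provider's payload into the consumer-side received store
// and record the completed transfer in the lifecycle log.
function transfer(db, agreement) {
  const payload = db.payloads.get(agreement.assetId);
  db.received.set(`${agreement.consumerId}:${agreement.assetId}`, payload);
  db.transfers.push({ agreementId: agreement.id, state: "COMPLETED" });
  return "COMPLETED";
}
```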

Semantic implementation details

Data model

Assets are indexed as dcat:Dataset resources with common fields like:

  • dct:title, dct:description, dct:identifier
  • dcat:keyword, dcat:theme
  • dct:spatial, dct:temporal
  • optional additional DCAT/DCT predicates

Named graphs

RDF is stored in named graphs grouped by publisher and session context. This improves conceptual isolation and supports cleaner argumentation for multi-party dataspaces (even though the runtime is simulated in a single local deployment).
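The grouping might be implemented along these lines; the URI scheme is an assumption, only the per-publisher/per-session grouping comes from the text:

```javascript
// Derive a named-graph URI from publisher and session context.
function graphUri(publisherId, sessionId) {
  return `urn:sim:graph:${publisherId}:${sessionId}`;
}

// Wrapping an update in GRAPH keeps one publisher's triples isolated
// from another's, even inside a single Fuseki dataset.
function insertIntoGraph(publisherId, sessionId, triples) {
  return `INSERT DATA { GRAPH <${graphUri(publisherId, sessionId)}> { ${triples} } }`;
}
```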

Query behavior

SPARQL queries are generated by backend code and include:

  • catalog-derived dataset ID restrictions
  • optional text matching across title/description/keywords/themes
  • optional field-specific constraints over selected DCAT predicates

Policy model

Policies are represented as constraint sets and evaluated against consumer claims. The evaluator supports practical claim checks (for example, an In operator over values such as industry, role, or participant identifier).

Policy effects appear in two places:

  • Catalog visibility (what can be discovered)
  • Negotiation decision (what can be contracted)
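A minimal sketch of constraint evaluation with In semantics, assuming an ODRL-like constraint shape (leftOperand/operator/rightOperand); the field names are illustrative:

```javascript
// A policy passes when every constraint admits the consumer's claim:
// the claim named by leftOperand must be in the rightOperand value set.
function evaluatePolicy(constraints, claims) {
  return constraints.every((c) => {
    const value = claims[c.leftOperand];
    return c.operator === "in" && c.rightOperand.includes(value);
  });
}
```

Because the same evaluator runs at both enforcement points, an asset hidden from the catalog can never be contracted later, and a visible asset can still be denied at negotiation if its contract policy is stricter.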

Visual simulation model

The UI intentionally visualizes control flow:

  • discovery pulses
  • control plane request/response beams
  • negotiation states
  • data transfer states

This is used as explanatory instrumentation for teaching and for paper figures/demo videos.

Project structure

simulator/
  backend/
    server.js            API and orchestration
    db.js                SQLite schema and persistence access
    semantic.js          RDF mapping, Fuseki IO, SPARQL search
    policy.js            Policy evaluation
    state-machine.js     Negotiation and transfer lifecycle
  frontend/
    src/
      components/        visualization and interaction components
      pages/             application pages
  docker-compose.yml
  README.md
  PRESENTATION.md

Why this is paper-ready

The simulator is suitable as the central artifact for a semantics paper because it provides:

  • Controlled experimentation under reproducible local conditions
  • Clear separation of policy filtering and semantic ranking
  • Explainable UI traces for each phase of discovery and access
  • Explicit RDF/SPARQL implementation that can be inspected and replicated

Limitations (to state explicitly in the paper)

  • Single deployment simulates multiple participants; no real distributed trust fabric
  • No full connector protocol stack (DSP/EDC) in runtime path
  • No binary data-plane implementation beyond local transfer semantics

These limitations are acceptable for early-stage semantic workflow validation and interaction-centered evaluation.

Related documentation

  • simulator/PRESENTATION.md for a full talk/demo script and architecture narrative

License

Apache-2.0 for code (LICENSE) and CC-BY-4.0 for non-code content (LICENSE_non-code).
