Test your SPARQL queries and transformations with precision, using Given-When-Then principles.
# One spec. Given data. When query. Then result. :my_test must:given [ a must:FileDataset ; must:file "test/data/given.ttl" ] ; must:when [ a must:TextSparqlSource ; must:queryText "CONSTRUCT { ?s ?p ?o } WHERE { ?s a ex:Person ; ?p ?o }" ; must:queryType must:ConstructSparql ] ; must:then [ a must:FileDataset ; must:file "test/data/expected.ttl" ] .
Define your data, run your query, check the result. All in Turtle.
Add --mustrd to your pytest command. Specs become tests. Results show in your runner, CI, and VS Code.
Run specs against embedded RDFLib, GraphDB, or Anzo. Same specs, different engines. Swap with configuration, not code.
Inspired by Cucumber. Define concrete examples of what your query should do, not abstract assertions about what it might.
SHACL validates data structure. mustrd validates data transformations. Your query produces the right triples? mustrd checks.
Every spec follows the same shape. Load data, run a query, check the output.
The input dataset. A Turtle file, inline statements, an Anzo graphmart, or inherited state.
must:given [ a must:FileDataset ; must:file "data/input.ttl" ] ;
The SPARQL action. Inline text, a file reference, or a query builder source. SELECT, CONSTRUCT, or UPDATE.
must:when [ a must:FileSparqlSource ; must:file "sparql/enrich.rq" ; must:queryType must:UpdateSparql ] ;
The expected result. An RDF graph, a table of bindings, CSV/Excel, or an empty result.
must:then [ a must:FileDataset ; must:file "data/expected.ttl" ] .
mustrd validates across SPARQL's full query surface.
| Type | What it does | Result compared as |
|---|---|---|
| SELECT | Return variable bindings | Table (pandas DataFrame) |
| CONSTRUCT | Build an RDF graph | Graph isomorphism |
| UPDATE | INSERT/DELETE triples | Modified graph comparison |
| ASK | Boolean existence check | True/false |
| DESCRIBE | Describe a resource | Graph isomorphism |
Write specs once. Run them against any supported engine.
| Engine | Protocol | Notes |
|---|---|---|
| RDFLib | Embedded (in-memory) | Default. No server needed. Zero config. |
| GraphDB | HTTP SPARQL endpoint | Repository-based. Optional named graphs. |
| Anzo | HTTP REST API | Graphmart layers, query builders, AnzoGraph. |
Givens and thens support multiple data formats.
| Type | Description |
|---|---|
| FileDataset | RDF file — Turtle, NTriples, N3, RDF/XML, TriX |
| StatementsDataset | Inline reified RDF statements in the spec |
| TableDataset | Inline tabular bindings for SELECT results |
| OrderedTableDataset | Tabular data with row ordering (ORDER BY) |
| FolderDataset | File path passed as a runtime argument |
| EmptyTable / EmptyGraph | Expect no results |
| InheritedDataset | Keep existing graph state (chain specs) |
Specs appear as tests in the Test Explorer. Click to run, see diffs on failure.
// settings.json { "python.testing.pytestArgs": [ "--mustrd", "--md=junit/github_job_summary.md", "--config=test/test_config_local.ttl" ] }
pip install mustrd
pytest --mustrd --config=test/mustrd_configuration.ttl