Monorepo for managing and storing the data of the integration layer of Colonial Collections.
- Update the file `organizations.ttl` in the `data-registry` folder. Add the details of the data provider (e.g. name and address). The details will automatically be added to the Colonial Collections knowledge graph if you commit the changes using Git.
- Update the file `dataset-measurements.ttl` in the `data-registry` folder. Add the measurements for the new dataset. The measurements will automatically be added to the Colonial Collections knowledge graph if you commit the changes using Git.
- Create a folder in the root folder. By convention: a lower case name, consisting of the name of the data provider and the name of the dataset. For example: `wereldmuseum-collection-archives`
- Inside the folder, create a `queries` folder
- Create the file `iterate.rq` in the `queries` folder. Put a SPARQL query in this file (e.g. by copying one from the existing files). The query defines how the entities in the dataset can be retrieved from a SPARQL endpoint, e.g. the Colonial Collections knowledge graph
- Create the file `generate.rq` in the `queries` folder. Put a SPARQL query in this file (e.g. by copying one from the existing files). The query defines how the entities in the dataset must be transformed, e.g. to the data model of the Colonial Collections search graph
- Optionally, create the file `check.rq` in the `queries` folder. Put a SPARQL query in this file (e.g. by copying one from the existing files). The query detects if the dataset has been changed in the Colonial Collections knowledge graph. If that is the case, the `iterate.rq` and `generate.rq` queries will be executed. Be aware: a `check` query only makes sense if the date of last modification of the dataset can be retrieved from a SPARQL endpoint
- Inside the `.github/workflows` folder, create a YAML file (e.g. by copying one from the existing files). By convention: a lower case name, consisting of the name of the data provider, the name of the dataset and the `create-graph` suffix, e.g. `wereldmuseum-collection-archives-create-graph.yaml`. Put a GitHub Action workflow definition in this file. The definition describes which steps must be taken to execute the `iterate.rq`, `generate.rq` and, optionally, the `check.rq` queries. The results of the queries will automatically be added to the Colonial Collections search graph
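
The folder and file naming conventions in the steps above can be sketched as a small scaffolding script. This is only an illustration and not part of the repository: the helper names `slug` and `scaffold` are hypothetical, and the script creates empty placeholder files only. The SPARQL queries and the workflow definition still have to be filled in, e.g. by copying them from the existing files.

```python
from pathlib import Path


def slug(provider: str, dataset: str) -> str:
    """Build the conventional lower-case name: data provider, then dataset."""
    words = f"{provider} {dataset}".lower().split()
    return "-".join(words)


def scaffold(root: Path, provider: str, dataset: str,
             with_check: bool = False) -> list[Path]:
    """Create empty placeholder files for a new dataset (hypothetical helper).

    Assumes `root` is the root folder of the repository.
    """
    name = slug(provider, dataset)

    # <root>/<provider>-<dataset>/queries/
    queries = root / name / "queries"
    queries.mkdir(parents=True, exist_ok=True)

    files = [queries / "iterate.rq", queries / "generate.rq"]
    if with_check:
        # check.rq is optional; it only makes sense if the date of last
        # modification can be retrieved from a SPARQL endpoint.
        files.append(queries / "check.rq")

    # <root>/.github/workflows/<provider>-<dataset>-create-graph.yaml
    workflows = root / ".github" / "workflows"
    workflows.mkdir(parents=True, exist_ok=True)
    files.append(workflows / f"{name}-create-graph.yaml")

    for path in files:
        path.touch(exist_ok=True)  # leaves existing files untouched
    return files
```

For example, `scaffold(Path("."), "Wereldmuseum", "Collection Archives")` would create `wereldmuseum-collection-archives/queries/iterate.rq`, `wereldmuseum-collection-archives/queries/generate.rq` and `.github/workflows/wereldmuseum-collection-archives-create-graph.yaml`.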