Colonial Collections: integration data

Monorepo for managing and storing the data of the integration layer of Colonial Collections.

Steps for adding a new data provider

Update the file organizations.ttl in the data-registry folder. Add the details of the data provider (e.g. name and address). The details will automatically be added to the Colonial Collections knowledge graph if you commit the changes using Git

Steps for adding a new dataset

Update the file dataset-measurements.ttl in the data-registry folder. Add the measurements for the new dataset. The measurements will automatically be added to the Colonial Collections knowledge graph if you commit the changes using Git

Steps for adding a new dataset to the search engine

Create a folder in the root folder. By convention: a lower case name, consisting of the name of the data provider and the name of the dataset. For example: wereldmuseum-collection-archives
Inside the folder, create a queries folder
Create the file iterate.rq in the queries folder. Put a SPARQL query in this file (e.g. by copying one from the existing files). The query defines how the entities in the dataset can be retrieved from a SPARQL endpoint, e.g. the Colonial Collections knowledge graph
Create the file generate.rq in the queries folder. Put a SPARQL query in this file (e.g. by copying one from the existing files). The query defines how the entities in the dataset must be transformed, e.g. to the data model of the Colonial Collections search graph
Optionally, create the file check.rq in the queries folder. Put a SPARQL query in this file (e.g. by copying one from the existing files). The query detects if the dataset has been changed in the Colonial Collections knowledge graph. If that is the case, the iterate.rq and generate.rq queries will be executed. Be aware: a check query only makes sense if the date of last modification of the dataset can be retrieved from a SPARQL endpoint
Inside the .github/workflows folder, create a YAML file (e.g. by copying one from the existing files). By convention: a lower case name, consisting of the name of the data provider, the name of the dataset and the create-graph suffix, e.g. wereldmuseum-collection-archives-create-graph.yaml. Put a GitHub Action workflow definition in this file. The definition describes which steps must be taken to execute the iterate.rq, generate.rq and - optionally - the check.rq queries. The results of the queries will automatically be added to the Colonial Collections search graph

Name		Name	Last commit message	Last commit date
Latest commit History 6,041 Commits
.github/workflows		.github/workflows
.vscode		.vscode
aat		aat
bronbeek-stamboeken		bronbeek-stamboeken
data-registry		data-registry
datasets		datasets
geonames		geonames
nde-dataset-register		nde-dataset-register
rce-colonial-objects		rce-colonial-objects
rijksmuseum-objects		rijksmuseum-objects
wereldmuseum-collection-archives		wereldmuseum-collection-archives
wereldmuseum-thesaurus		wereldmuseum-thesaurus
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Colonial Collections: integration data

Steps for adding a new data provider

Steps for adding a new dataset

Steps for adding a new dataset to the search engine

About

Releases

Packages

Contributors 2

colonial-heritage/integration-data

Folders and files

Latest commit

History

Repository files navigation

Colonial Collections: integration data

Steps for adding a new data provider

Steps for adding a new dataset

Steps for adding a new dataset to the search engine

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Packages