InTaVia Milestone 3 Documentation - Basic Operational System
Individual components of the system operational with connectivity partially established and ready for evaluation by WP7.
The repository https://github.com/InTaVia/idm-rdf contains the IDM-RDF at its current state (including ongoing discussions in the issues).
The repository https://github.com/InTaVia/source-dataset-conversion contains the conversion scripts and the datasets in the IDM data model at their current state for the following prosopographical source datasets:
- Finland (Aalto / UH)
- Austria (OeAW)
- Netherlands Part1, Part2, Part3 (zipped in three parts for size) (VU)
The repository https://github.com/InTaVia/grlc contains the grlc software, that is used for providing the JSON API, at its current state.
The repository https://github.com/InTaVia/grlc_sparql contains the JSON API definitions at their current state, namely the following APIs:
- Generic entity search
- Person history
The test version of the API is available under https://grlc.acdh-dev.oeaw.ac.at/api/InTaVia/grlc_sparql#/
In addition to the resources mentioned above we set up a researchspace instance that is meant for internal work on and exploration of the knowledge graph. This service is currently for internal use only.
We deployed prefect within the ACDH-CH kubernetes cluster to run conversion, enrichment and ingestion scripts on the cluster. Given that the open-source version of this software solution doesnt come with authentication built in, this component is currently only reachable from within ACDH-CH subnet. Given some delay in our original planning the scripts are currently still executed locally instead of running within prefect.
We include the code to train an abbreviation identification classifier and to run an abbreviation expansion generator, both based on pre-trained transformer language models. The code can be found here. This is the preliminary version, where we test the concept on a small gold-standard slovene dataset (for now, a separate request for the dataset is required). More thorough evaluation and expansion of the experiments to Dutch and German coming soon.
The milestone 3 version of our interactive text mining environment can be found here. It consists of two connected components, Performancer and AnnoXplorer, providing an overview and detail view on the data respectively; brushing over texts in the former displays them in the latter.
The Milestone 3 Prototype (v0.1.0) of the InTaVia web client (frontend) is available as a permanent release: https://github.com/InTaVia/web/releases/tag/v0.1.0.
The current prototype is available online: https://intavia.acdh-dev.oeaw.ac.at/.
The data shown in the application are automatically generated mock data including person and place entities. As soon as the connection between the backend and the frontend is established, the mock data will be replaced. Note: if below linked visualizations do not show results, please first do a search on the search page and then switch back to the visualization.
The prototype implements aspects of the three top-level components (Data Curation Lab, DC lab; Visual Analytics Studio, VA studio; Visual Storytelling Suite, ST suite) in a single application on separate pages. The components implemented are:
- Keyword search and list view of search results: https://intavia.acdh-dev.oeaw.ac.at/search
- Detail view of entities: e.g., https://intavia.acdh-dev.oeaw.ac.at/person/876859d3-dee8-468d-9c61-a29e97ef478a
- Basic editing capability of person entities (name, description, event types & dates) - click edit on the detail view page.
- Shared global Redux store that can be accessed by all components
- Shared frontend network module using RTKQuery (handles deduplication of requests to data endpoints and caching of results)
Data views:
- Timeline view showing a set of person entities with selected life events (e.g. birth, death, lived): https://intavia.acdh-dev.oeaw.ac.at/timeline
- A geographic map view showing localized life events (i.e. birth and death) connected with a line: https://intavia.acdh-dev.oeaw.ac.at/geomap
Coordinated views:
- Multiple views integrated on a single page are shown here: https://intavia.acdh-dev.oeaw.ac.at/coordination
- The example coordinates an entity list view, a timeline and a map showing persons and their life events via mouseover highlighting (red colour).
- Currently, applies only to the map, a dropdown menu allows to filter the depicted events. If more than one event type is selected, the localized events are connected with lines in chronological order.
Visual querying:
- A workspace to visually query for persons based on attribute constraints including name and date of birth and death: https://intavia.acdh-dev.oeaw.ac.at/visual-querying
-
The ST creator implements an interactive user interface allowing to generate and import slide-based stories. The prototype can depict persons’ life-events on a map and provides annotation capabilities (i.e., images and text): https://intavia.acdh-dev.oeaw.ac.at/storycreator
- Stories Overview (create and delete stories)
- Story Flow (create, layout and delete slides per drag and drop)
- Slide Editor (create, edit, layout and delete map, images and text per drag and drop)
- Text Mode (edit or upload the whole story via textarea) (accessible through the clipboard icon in the top right corner)
-
The single event slides for the "Life of Paolo Vergerio" can be imported through the text editor using the content of json this file.
-
A clickable Adobe XD prototype of the ST viewer is available online: ST Viewer Prototype
- The prototype tells the story "Life of Paolo Vergerio" primarily based on a map/timeline visualization featuring annotations, images, quiz and timebased visualizations.
- All screens including a selection of desktop views are available as PNG: Screens ST viewer [.zip]