Commit 4150830

Merge pull request #296 from cidgoh/example-data-0-15-4
example data for 0.15.4 release
2 parents 1ff4bab + 0024808 commit 4150830

14 files changed: +51 -21 lines changed

README.md

+18
@@ -57,6 +57,24 @@ You can also automatically fill a column with a specified value, but only in row
For _more information_ on available application features, select the **Help** button followed by **Getting Started** from within the DataHarmonizer application.

## Example Data

Example data for testing a template's functionality can be found, when available, in the following folder structure:

```
. TOP LEVEL DIRECTORY
├── images
├── libraries
├── script
└── template
    ├── templateOfInterest
    │   └── exampleInput
    └── ...
```
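
As a rough illustration of the layout above, the following Python sketch lists every `exampleInput` folder under `template`. It assumes it is run from (or pointed at) the repository's top-level directory. The function name `find_example_inputs` is hypothetical and is not part of DataHarmonizer.

```python
from pathlib import Path


def find_example_inputs(repo_root="."):
    """Return sorted paths of template/<name>/exampleInput directories."""
    template_dir = Path(repo_root) / "template"
    # Match exactly one level of template folders, e.g. template/canada_covid19.
    return sorted(
        path for path in template_dir.glob("*/exampleInput") if path.is_dir()
    )


if __name__ == "__main__":
    for folder in find_example_inputs():
        print(folder)
```

Templates without an `exampleInput` folder are skipped, which matches the "when available" caveat.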

Alternatively, you can find links to all available test data below:
- [`canada_covid19`](https://github.com/cidgoh/DataHarmonizer/tree/master/template/canada_covid19/exampleInput) CanCOGeN Covid-19

## Additional Information

For more information about the DataHarmonizer, its templates, and how to use them, check out the [DataHarmonizer Wiki](https://github.com/Public-Health-Bioinformatics/DataHarmonizer/wiki).
@@ -0,0 +1,22 @@
# CanCOGeN Example Input Data
This directory contains example input/test data for the Canadian COVID Genomics Network (CanCOGeN) DataHarmonizer application template: `CanCOGeN Covid-19`. Each file is appropriate for testing up to the version appended to its file name, for example:

- `validTestData_0-15-4.csv` is _valid_ for version `0.15.4` of the DataHarmonizer.
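
The naming convention can be read mechanically. The sketch below is a minimal illustration, assuming the version is always the hyphen-separated numeric suffix after the last underscore; `version_from_filename` is a hypothetical helper and not part of DataHarmonizer.

```python
import re


def version_from_filename(filename):
    """Extract a dotted version such as '0.15.4' from a name like
    'validTestData_0-15-4.csv'; return None if no version suffix is found."""
    match = re.search(r"_(\d+(?:-\d+)*)\.csv$", filename)
    if match is None:
        return None
    # The file name uses hyphens where the release number uses dots.
    return match.group(1).replace("-", ".")


print(version_from_filename("validTestData_0-15-4.csv"))  # prints 0.15.4
```

A file name with no version suffix, such as `invalidTestData.csv`, yields `None`.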

## Valid Test Data

Demonstrates _valid_ example input, from controlled vocabulary and date formats to free text strings, for all minimal metadata fields and some recommended/enhanced metadata fields.

## Invalid Test Data

Demonstrates _invalid_ example input, including controlled vocabulary and date formats, for all minimal metadata fields and some recommended/enhanced metadata fields. After validation, empty cells that require input appear dark red, while cells with invalid contents appear light red.

There are special validation rules for some fields, for example:

- The database identifier field `specimen collector sample ID` cannot contain duplicates. When duplicates occur, the corresponding cells are flagged light red:

> ![invalidData_specimenCollectorSampleID.png](/images/invalid/invalidData_specimenCollectorSampleID.png?raw=true)

- Sample collection date cannot be before SARS-CoV-2 sampling began, even when formatted using the ISO 8601 standard format "YYYY-MM-DD":

> ![invalidData_sampleCollectionDate.png](/images/invalid/invalidData_sampleCollectionDate.png?raw=true)
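
The two special rules above can be approximated outside the application. The following Python sketch is illustrative only and is not DataHarmonizer's own validation code. The key `sample collection date`, the sample IDs, and the cutoff date passed as `earliest_date` are assumptions made for the example, since the actual threshold is not stated here.

```python
from collections import Counter
from datetime import date, datetime


def find_invalid_rows(rows, earliest_date,
                      id_field="specimen collector sample ID",
                      date_field="sample collection date"):
    """Return {row_index: [problem, ...]} for rows breaking either rule."""
    # Count every ID once up front so all copies of a duplicate are flagged.
    id_counts = Counter(row.get(id_field, "") for row in rows)
    problems = {}
    for index, row in enumerate(rows):
        issues = []
        sample_id = row.get(id_field, "")
        if sample_id and id_counts[sample_id] > 1:
            issues.append("duplicate sample ID")
        raw_date = row.get(date_field, "")
        if raw_date:
            try:
                collected = datetime.strptime(raw_date, "%Y-%m-%d").date()
            except ValueError:
                issues.append("date does not parse as YYYY-MM-DD")
            else:
                # A well-formed date can still be too early to be plausible.
                if collected < earliest_date:
                    issues.append("date precedes earliest allowed date")
        if issues:
            problems[index] = issues
    return problems


# Hypothetical rows; the cutoff below is a placeholder, not the real threshold.
rows = [
    {"specimen collector sample ID": "sample-1", "sample collection date": "2020-03-16"},
    {"specimen collector sample ID": "sample-1", "sample collection date": "2020-03-17"},
    {"specimen collector sample ID": "sample-2", "sample collection date": "2001-01-01"},
]
print(find_invalid_rows(rows, earliest_date=date(2019, 12, 1)))
```

Here rows 0 and 1 are reported for sharing an ID, and row 2 for a correctly formatted but implausibly early date. Empty required cells, which the application shows in dark red, are not covered by this sketch.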

template/canada_covid19/exampleInput/invalidTestData.csv

-7
This file was deleted.
