Skip to content

Latest commit

 

History

History

data_files

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
V1.4 FDA ARGOS Data Dictionary README		
		
Description: 		
List of controlled vocabulary terms for data.ARGOSdb.org datasets and data properties. 		
This data dictionary was created to aid in the integration of ARGOS data from many disparate sources.		
Each of the column headers in each of the respective data sheets displayed on data.ARGOSdb.org was recommended by project members and collaborators from the FDA.		
The resulting list was manually curated to combine similar terms, and provide a consistent representation across all datasets in ARGOSdb.		
The primary use case for the data dictionary is to ensure all data submitted to data.argosdb.org is following a consistent representation of the data properties.		
The draft dictionary is maintained as a excel file for project members to contribute to. 		
After a final review, all datasets listed here downloaded, QC'd with test scripts, converted to .tsv, published to data.ARGOSdb, and are saved in the respective versioned directory in GitHub.		
GitHub Directory: 	https://github.com/FDA-ARGOS/data.argosdb/tree/main/data_dictionary 	
		
Data Dictionary Contents:		
1) README.tsv => A summary of each sheet and the column headers		
2) release_notes.tsv => An itemized list of the changes implemented in the current version.		
		
3) property_definition.tsv => List of all controlled vocabulary terms for data.ARGOSdb		
BCO ID: ARGOS_000015		
column header	data	
property	Consensus name for data property described in row	
data_files	A `|` separated list of dataset names where this property is utilized	
recommended	Ther person or resource that suggested using the property	
description	A definition and additional information about the property.	
source_or_type_def	The data source for obtaining the property.	
		
4) core_property_list.tsv => List of all core dataset properties		
BCO ID: ARGOS_000016		
column header	data	
property	Consensus name for data property described in row	
data_object	The dataset this property is used 	
requirement	Indicates if the property is REQUIRED to hava a valid data row	
id	For JSON schema conversion	
title	Human readable name for property. Default is the same as property.	
data_type	Property type as defined by JSON types	
constraint	Set per a term to indicate an acceptable value range. Can be used as a QC tool.	
default	Default value for property	
examples	Example for the property	
pattern	The regular expression evaluation for this property. Can be used as a QC tool. 	
		
5) annotation_property_list.tsv => List of all annotation dataset properties		
BCO ID: ARGOS_000017		
column header	data	
property	Consensus name for data property described in row	
data_object	The dataset this property is used 	
requirement	Indicates if the property is REQUIRED to hava a valid data row	
id	For JSON schema conversion	
title	Human readable name for property. Default is the same as property.	
data_type	Property type as defined by JSON types	
constraint	Set per a term to indicate an acceptable value range. Can be used as a QC tool.	
default	Default value for property	
examples	Example for the property	
pattern	The regular expression evaluation for this property. Can be used as a QC tool. 	
		
V1.4 Data Sheet Statistics:		
Class	Sheet	Total Properties
Dictionary	property_definition.tsv	132
Dictionary 	core_property_list.tsv	254
Dictionary 	annotation_property_list.tsv	29
Core	assemblyQC*	33
Core	biosampleMeta*	29
Core	ngsQC*	41
Core	siteQC*	34
Core	ngs_ID_list.tsv	14
Annotation	DRM_all_orgs.tsv	29