Skip to content

Questions and answers about TOPmed, GTEx, and AGR resources.

Notifications You must be signed in to change notification settings

dcppc/data-stewards

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

36 Commits
 
 
 
 
 
 

Repository files navigation

Commons Data Stewards Github Repo

This document derived from March 5th, 2018 data steward conference call.

Introduction

  • Short introduction to the Data Commons
  • The KCs are described here. See, in particular, KC7, which is focused on indexing.
  • There are many dimensions of the commons and at different levels of maturity
  • It is recognized that much of this should be driven by usecases, and the DCPPC- developers should be sharing these with the Data Stewards as soon as possible. Our use-case situation is complicated - we need to push forward in the meantime

Purpose and approach

  • We want this to be a partnership wrt role definition, how things change over time, how to make best use of resources
  • We recognize that Push/pull models, protocols will change over time

Agreement: for each data source we will record:

  • Version
  • Short manifest, preferably computable
  • Point of contact (POC)
  • Schema version
  • Is this possible: Description for generation dataset in the manifest - how it would be re-generated and transferred to the full stacks if that POC turned into a pump

Data Stewards points of contact:

Agreement - Alliance / MODs

The Alliance manifest is [here]

  • The Alliance site (alliancegenome.org), cloud file access, GOC API will be used to obtain gene function (aka GO annotations)
  • Items such as these can be obtained from DC cloud file access, and soon Alliance API.
    • Disease associations (Disease Ontology)
    • Chromosomal features (Sequence Ontology)
    • Orthology/Conservation/Homologs schema
    • Basic Gene Information
    • Expression (soon) - available from select MODs now
    • Phenotypes (soon) - available from select MODs now
    • Physical and Genetic interactions (soon) - available from select MODs now
    • Others are in progress and well be release when available The Alliance is currently focusing on data types are that are shared between all member projects. If the data you require is only available from a subset of the MODs please file a ticket.

About

Questions and answers about TOPmed, GTEx, and AGR resources.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published