Skip to content

giacuong171/va-spark

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

va-spark

va-spark is a scalable and high performance toolkit for the analysis, annotation, and prioritization of genomic variants.

Introduction

va-spark was created by the software development team at Vinbigdata's Biomedical Information center, which leverages spark parallelism to speed up data processing times of genomic annotate tools like vep, annovar, snpeff, etc. With a simple architecture, making the integration of tools like vep, annovar, snpeff with spark easy and effective, the results of the integration is remarkable. The architecture of VEP is shown in the following figure:

va-spark integration flow

Table of contents

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Scala 56.4%
  • Python 41.1%
  • Dockerfile 2.5%