-
Notifications
You must be signed in to change notification settings - Fork 2
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
feat: add pindel artifact annotation and filter (#71)
* refactor: add pipeline version and restructure software versions * feat: hydra genetics 3.0.0 to include software versions into multiqc * refactor: remove missed comment on min_version() * style: make snakefmt happy * feat: add background tsv file from refereces module * feat: add background annotation to snv_indels vcf-file * test: add missing background in test config * chore: update cnv_sv module to include type in pindel vcf samplename * refactor: move all pindel processing rules into one snakefile * feat: create an artifact tsv-file in reference pipeline * fix: add samplename in dict, e.g. more than one del same pos * feat: add artifact annotation to pindel vcf * feat: add artifact filtering to pindel vcf * fix: broken svdb-merge-vcf when using priority flag * docs: slight rewording and formatting --------- Co-authored-by: Niklas Mähler <[email protected]>
- Loading branch information
Showing
22 changed files
with
661 additions
and
122 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,23 +1,112 @@ | ||
|
||
# Software used in Poppy | ||
Rules specifically for Poppy listed here. | ||
|
||
## [annotation_vep_pindel](https://www.ensembl.org/info/docs/tools/vep/index.html) | ||
Since pindel is run on limited region it does not always produce results, if an empty vcf-file is used with VEP it will fail and the entire pipeline will stop, therefor a specific rule is needed to ensure there are variants in the pindel vcf before annotating the vcf. If no variants are found the empty vcf file is just copied to the output. | ||
## pindel_processing.smk | ||
[Pindel](http://gmt.genome.wustl.edu/packages/pindel/) creates an older type of VCF and therefore has to be processed slightly different than more modern VCFs. Here we add the AF and DP fields to the VCF INFO column, annotate the calls using [vep](https://www.ensembl.org/info/docs/tools/vep/index.html) and add artifact annotation based an on artifact panel created with the reference pipeline. | ||
|
||
<!-- Since pindel is run on limited region it does not always produce results, if an empty vcf-file is used with VEP it will fail and the entire pipeline will stop, therefor a specific rule is needed to ensure there are variants in the pindel vcf before annotating the vcf. If no variants are found the empty vcf file is just copied to the output. --> | ||
|
||
### :snake: Rule | ||
|
||
#SNAKEMAKE_RULE_SOURCE__pindel_processing__pindel_processing_annotation_vep# | ||
|
||
#### :left_right_arrow: input / output files | ||
|
||
#SNAKEMAKE_RULE_TABLE__pindel_processing__pindel_processing_annotation_vep# | ||
|
||
### :wrench: Configuration | ||
|
||
#### Software settings (`config.yaml`) | ||
|
||
#CONFIGSCHEMA__pindel_processing_annotation_vep# | ||
|
||
#### Resources settings (`resources.yaml`) | ||
|
||
#RESOURCESSCHEMA__pindel_processing_annotation_vep# | ||
|
||
|
||
### :snake: Rule | ||
|
||
#SNAKEMAKE_RULE_SOURCE__pindel_processing__pindel_processing_fix_af# | ||
|
||
#### :left_right_arrow: input / output files | ||
|
||
#SNAKEMAKE_RULE_TABLE__pindel_processing__pindel_processing_fix_af# | ||
|
||
### :wrench: Configuration | ||
|
||
#### Software settings (`config.yaml`) | ||
|
||
#CONFIGSCHEMA__pindel_processing_fix_af# | ||
|
||
#### Resources settings (`resources.yaml`) | ||
|
||
#RESOURCESSCHEMA__pindel_processing_fix_af# | ||
|
||
|
||
### :snake: Rule | ||
|
||
#SNAKEMAKE_RULE_SOURCE__pindel_processing__pindel_processing_artifact_annotation# | ||
|
||
#### :left_right_arrow: input / output files | ||
|
||
#SNAKEMAKE_RULE_TABLE__pindel_processing__pindel_processing_artifact_annotation# | ||
|
||
### :wrench: Configuration | ||
|
||
#### Software settings (`config.yaml`) | ||
|
||
#CONFIGSCHEMA__pindel_processing_artifact_annotation# | ||
|
||
#### Resources settings (`resources.yaml`) | ||
|
||
#RESOURCESSCHEMA__pindel_processing_artifact_annotation# | ||
|
||
|
||
## [svdb](https://github.com/J35P312/SVDB).smk | ||
Since when running `svdb --merge` with the priority flag set, svdb cuts off the FORMAT column for cnvkit variants [git issue](). We use a non-Hydra Genetics rule for the `svdb --merge` command. | ||
|
||
### :snake: Rule | ||
|
||
#SNAKEMAKE_RULE_SOURCE__annotation_vep_pindel__annotation_vep_pindel# | ||
#SNAKEMAKE_RULE_SOURCE__svdb__svdb_merge_wo_priority# | ||
|
||
#### :left_right_arrow: input / output files | ||
|
||
#SNAKEMAKE_RULE_TABLE__annotation_vep_pindel__annotation_vep_pindel# | ||
#SNAKEMAKE_RULE_TABLE__svdb__svdb_merge_wo_priority# | ||
|
||
### :wrench: Configuration | ||
|
||
#### Software settings (`config.yaml`) | ||
|
||
#CONFIGSCHEMA__annotation_vep_pindel# | ||
#CONFIGSCHEMA__svdb_merge# | ||
|
||
#### Resources settings (`resources.yaml`) | ||
|
||
#RESOURCESSCHEMA__annotation_vep_pindel# | ||
#RESOURCESSCHEMA__svdb_merge# | ||
|
||
|
||
--- | ||
|
||
## reference_rules.smk | ||
Software used specifically to create the reference-files for Poppy. | ||
|
||
### :snake: Rule | ||
|
||
#SNAKEMAKE_RULE_SOURCE__reference_rules__reference_rules_create_artefact_file_pindel# | ||
|
||
#### :left_right_arrow: input / output files | ||
|
||
#SNAKEMAKE_RULE_TABLE__reference_rules__reference_rules_create_artifact_file_pindel# | ||
|
||
### :wrench: Configuration | ||
|
||
#### Software settings (`config.yaml`) | ||
|
||
#CONFIGSCHEMA__reference_rules_create_artifact_file_pindel# | ||
|
||
#### Resources settings (`resources.yaml`) | ||
|
||
#RESOURCESSCHEMA__reference_rules_create_artefact_file_pindel# | ||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -7,12 +7,14 @@ __email__ = "[email protected]" | |
__license__ = "GPL-3" | ||
|
||
|
||
ruleorder: annotation_vep_pindel > annotation_vep | ||
ruleorder: pindel_processing_annotation_vep > annotation_vep | ||
ruleorder: pindel_processing_artifact_annotation > annotation_artifact_annotation | ||
ruleorder: svdb_merge_wo_priority > cnv_sv_svdb_merge | ||
|
||
|
||
include: "rules/common.smk" | ||
include: "rules/fix_af_pindel.smk" | ||
include: "rules/annotation_vep_pindel.smk" | ||
include: "rules/svdb.smk" | ||
include: "rules/pindel_processing.smk" | ||
|
||
|
||
report: "report/workflow.rst" | ||
|
@@ -53,7 +55,7 @@ use rule * from annotation as annotation_* | |
|
||
module cnv_sv: | ||
snakefile: | ||
github("hydra-genetics/cnv_sv", path="workflow/Snakefile", tag="b549266") | ||
github("hydra-genetics/cnv_sv", path="workflow/Snakefile", tag="1aa9a68") | ||
config: | ||
config | ||
|
||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,5 @@ | ||
include: "rules/common_references.smk" | ||
include: "rules/reference_rules.smk" | ||
|
||
|
||
rule all: | ||
|
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file was deleted.
Oops, something went wrong.
Oops, something went wrong.