Skip to content

Commit

Permalink
Merge pull request #166 from nf-core/dev
Browse files Browse the repository at this point in the history
PR for release 2.0.6
  • Loading branch information
apeltzer authored Mar 6, 2019
2 parents 2786af3 + ae956e2 commit bc55df3
Show file tree
Hide file tree
Showing 11 changed files with 280 additions and 115 deletions.
8 changes: 7 additions & 1 deletion .travis.yml
Original file line number Diff line number Diff line change
Expand Up @@ -13,7 +13,7 @@ before_install:
# Pull the docker image first so the test doesn't wait for this
- docker pull nfcore/eager:dev
# Fake the tag locally so that the pipeline runs properly
- docker tag nfcore/eager:dev nfcore/eager:2.0.5
- docker tag nfcore/eager:dev nfcore/eager:2.0.6

install:
# Install Nextflow
Expand All @@ -40,6 +40,12 @@ script:
- nextflow run ${TRAVIS_BUILD_DIR} -profile test,docker --pairedEnd --saveReference
# Run the basic pipeline with single end data (pretending its single end actually)
- nextflow run ${TRAVIS_BUILD_DIR} -profile test,docker --singleEnd --bwa_index results/reference_genome/bwa_index/bwa_index/
# Run the basic pipeline with paired end data without collapsing
- nextflow run ${TRAVIS_BUILD_DIR} -profile test,docker --pairedEnd --skip_collapse --saveReference
# Run the basic pipeline with paired end data without trimming
- nextflow run ${TRAVIS_BUILD_DIR} -profile test,docker --pairedEnd --skip_trim --saveReference
# Run the basic pipeline with paired end data without adapterRemoval
- nextflow run ${TRAVIS_BUILD_DIR} -profile test,docker --pairedEnd --skip_adapterremoval --saveReference
# Run the same pipeline testing optional step: fastp, complexity
- nextflow run ${TRAVIS_BUILD_DIR} -profile test,docker --pairedEnd --complexity_filter --bwa_index results/reference_genome/bwa_index/bwa_index/
# Test BAM Trimming
Expand Down
23 changes: 20 additions & 3 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,24 @@ All notable changes to this project will be documented in this file.
The format is based on [Keep a Changelog](http://keepachangelog.com/en/1.0.0/)
and this project adheres to [Semantic Versioning](http://semver.org/spec/v2.0.0.html).

## [Unpublished / Dev Branch]
## [2.0.6] - 2019-03-05

### `Added`

* [#152](https://github.com/nf-core/eager/pull/152) - Clarified `--complexity_filter` flag to be specifically for poly G trimming.
* [#155](https://github.com/nf-core/eager/pull/155) - Added [Dedup log to output folders](https://github.com/nf-core/eager/issues/154)
* [#159](https://github.com/nf-core/eager/pull/159) - Added Possibility to skip AdapterRemoval, skip merging, skip trimming fixing [#64](https://github.com/nf-core/eager/issues/64),[#137](https://github.com/nf-core/eager/issues/137) - thanks to @maxibor, @jfy133

### `Fixed`

* [#151](https://github.com/nf-core/eager/pull/151) - Fixed [post-deduplication step errors](https://github.com/nf-core/eager/issues/128
* [#147](https://github.com/nf-core/eager/pull/147) - Fix Samtools Index for [large references](https://github.com/nf-core/eager/issues/146)
* [#145](https://github.com/nf-core/eager/pull/145) - Added Picard Memory Handling [fix](https://github.com/nf-core/eager/issues/144)

### `Dependencies`
* Picard Tools 2.18.23 -> 2.18.27
* GATK 4.0.12.0 -> 4.1.0.0
* FastP 0.19.6 -> 0.19.7

## [2.0.5] - 2019-01-28

Expand All @@ -18,8 +35,8 @@ and this project adheres to [Semantic Versioning](http://semver.org/spec/v2.0.0.

### `Dependencies`
* Picard Tools 2.18.21 -> 2.18.23
* R-Markdown 1.10 -> 1.11
* FastP 0.19.5 -> 0.19.6
* R-Markdown 1.10 -> 1.11
* FastP 0.19.5 -> 0.19.6

## [2.0.4] - 2019-01-09

Expand Down
2 changes: 1 addition & 1 deletion Dockerfile
Original file line number Diff line number Diff line change
Expand Up @@ -3,4 +3,4 @@ FROM nfcore/base
LABEL description="Docker image containing all requirements for nf-core/eager pipeline"
COPY environment.yml /
RUN conda env create -f /environment.yml && conda clean -a
ENV PATH /opt/conda/envs/nf-core-eager-2.0.5/bin:$PATH
ENV PATH /opt/conda/envs/nf-core-eager-2.0.6/bin:$PATH
27 changes: 24 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -45,20 +45,28 @@ Additional functionality contained by the pipeline currently includes:
## Quick Start

1. Install [`nextflow`](docs/installation.md)

2. Install one of [`docker`](https://docs.docker.com/engine/installation/), [`singularity`](https://www.sylabs.io/guides/3.0/user-guide/) or [`conda`](https://conda.io/miniconda.html)

3. Download the EAGER pipeline

```bash
nextflow pull nf-core/eager
```

4. Set up your job with default parameters
4. Test the pipeline using the provided test data

```bash
nextflow run nf-core -profile <docker/singularity/conda> --reads'*_R{1,2}.fastq.gz' --fasta '<REFERENCE>.fasta'
nextflow run nf-core/eager -profile <docker/singularity/conda>,test --pairedEnd
```

5. See the overview of the run with under `<OUTPUT_DIR>/MultiQC/multiqc_report.html`
5. Start running your own ancient DNA analysis!

```bash
nextflow run nf-core/eager -profile <docker/singularity/conda> --reads'*_R{1,2}.fastq.gz' --fasta '<REFERENCE>.fasta'
```

NB. You can see an overview of the run in the MultiQC report located at `<OUTPUT_DIR>/MultiQC/multiqc_report.html`

Modifications to the default pipeline are easily made using various options
as described in the documentation.
Expand All @@ -84,6 +92,19 @@ James Fellows Yates, Raphael Eisenhofer and Judith Neukamm. If you want to
contribute, please open an issue and ask to be added to the project - happy to
do so and everyone is welcome to contribute here!

## Contributors

- [James A. Fellows-Yates](https://github.com/jfy133)
- [Stephen Clayton](https://github.com/sc13-bioinf)
- [Maxime Borry](https://github.com/maxibor)
- [Judith Neukamm](https://github.com/JudithNeukamm)
- [Raphael Eisenhofer](https://github.com/EisenRa)
- [Maxime Garcia](https://github.com/MaxUlysse)
- [Luc Venturini](https://github.com/lucventurini)
- [Hester van Schalkwyk](https://github.com/hesterjvs)

If you've contributed and you're missing in here, please let me know and I'll add you in.

## Tool References

* **EAGER v1**, CircularMapper, DeDup* Peltzer, A., Jäger, G., Herbig, A., Seitz, A., Kniep, C., Krause, J., & Nieselt, K. (2016). EAGER: efficient ancient genome reconstruction. Genome Biology, 17(1), 1–14. [https://doi.org/10.1186/s13059-016-0918-z](https://doi.org/10.1186/s13059-016-0918-z) Download: [https://github.com/apeltzer/EAGER-GUI](https://github.com/apeltzer/EAGER-GUI) and [https://github.com/apeltzer/EAGER-CLI](https://github.com/apeltzer/EAGER-CLI)
Expand Down
4 changes: 2 additions & 2 deletions Singularity
Original file line number Diff line number Diff line change
Expand Up @@ -4,10 +4,10 @@ Bootstrap:docker
%labels
MAINTAINER Alexander Peltzer <[email protected]>
DESCRIPTION Container image containing all requirements for the nf-core/eager pipeline
VERSION 2.0.5
VERSION 2.0.6

%environment
PATH=/opt/conda/envs/nf-core-eager-2.0.5/bin:$PATH
PATH=/opt/conda/envs/nf-core-eager-2.0.6/bin:$PATH
export PATH

%files
Expand Down
5 changes: 4 additions & 1 deletion conf/base.config
Original file line number Diff line number Diff line change
Expand Up @@ -31,7 +31,10 @@ process {
withName:convertBam {
cpus = { check_max(8 * task.attempt, 'cpus') }
}

withName:makeSeqDict {
memory = { check_max( 16.GB * task.attempt, 'memory' ) }
}

withName:bwa {
memory = { check_max( 16.GB * task.attempt, 'memory' ) }
cpus = { check_max(8 * task.attempt, 'cpus') }
Expand Down
1 change: 1 addition & 0 deletions conf/multiqc_config.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -9,6 +9,7 @@ top_modules:
- '*_fastqc.zip'
path_filters_exclude:
- '*.combined.prefixed_fastqc.zip'
- 'fastp'
- 'adapterRemoval'
- 'fastqc':
name: 'FastQC (post-AdapterRemoval)'
Expand Down
32 changes: 29 additions & 3 deletions docs/usage.md
Original file line number Diff line number Diff line change
Expand Up @@ -170,6 +170,10 @@ If you prefer, you can specify the full path to your reference genome when you r
```
> If you don't specify appropriate `--bwa_index`, `--fasta_index` parameters, the pipeline will create these indices for you automatically. Note, that saving these for later has to be turned on using `--saveReference`. You may also specify the path to a gzipped (`*.gz` file extension) FastA as reference genome - this will be uncompressed by the pipeline automatically for you. Note that other file extensions such as `.fna`, `.fa` are also supported but will be renamed to `.fasta` automatically by the pipeline.
### `--large_ref`

This parameter is required to be set for large reference genomes. If your reference genome is larger than 3.5GB, the `samtools index` calls in the pipeline need to generate `CSI` indices instead of `BAI` indices to accompensate for the size of the reference genome. This parameter is not required for smaller references (including a human `hg19` or `grch37`/`grch38` reference), but `>4GB` genomes have been shown to need `CSI` indices.

### `--genome` (using iGenomes)

The pipeline config files come bundled with paths to the illumina iGenomes reference index files. If running with docker or AWS, the configuration is set up to use the [AWS-iGenomes](https://ewels.github.io/AWS-iGenomes/) resource.
Expand Down Expand Up @@ -237,7 +241,7 @@ Use to set a top-limit for the default time requirement for each process.
Should be a string in the format integer-unit. eg. `--max_time '2.h'`. If not specified, will be taken from the configuration in the `-profile` flag.

### `--max_cpus`
Use to set a top-limit for the default CPU requirement for each process.
Use to set a top-limit for the default CPU requirement for each **process**. This is not the maximum number of CPUs that can be used for the whole pipeline, but the maximum number of CPUs each program can use for each program submission (known as a process). Do not set this higher than what is available on your workstation or computing node can provide. If you're unsure, ask your local IT administrator for details on compute node capabilities!
Should be a string in the format integer-unit. eg. `--max_cpus 1`. If not specified, will be taken from the configuration in the `-profile` flag.

### `--email`
Expand Down Expand Up @@ -279,12 +283,16 @@ This part of the documentation contains a list of user-adjustable parameters in

## Step skipping parameters

Some of the steps in the pipeline can be executed optionally. If you specify specific steps to be skipped, there won't be any output related to these modules.
Some of the steps in the pipeline can be executed optionally. If you specify specific steps to be skipped, there won't be any output related to these modules.

### `--skip_preseq`

Turns off the computation of library complexity estimation.

### `--skip_adapterremoval`

Turns off adaptor trimming and paired-end read merging. Equivalent to setting both `--skip_collapse` and `--skip_trim`.

### `--skip_damage_calculation`

Turns off the DamageProfiler module to compute DNA damage profiles.
Expand All @@ -299,7 +307,7 @@ Turns off duplicate removal methods DeDup and MarkDuplicates respectively. No du

## Complexity Filtering Options

### `--complexity_filter`
### `--complexity_filter_poly_g`

Performs a poly-G tail removal step in the beginning of the pipeline, if turned on. This can be useful for trimming ploy-G tails from short-fragments sequenced on two-colour Illumina chemistry such as NextSeqs (where no-fluorescence is read as a G on two-colour chemistry), which can inflate reported GC content values.

Expand Down Expand Up @@ -329,6 +337,24 @@ Defines the minimum read quality per base that is required for a base to be kept
### `--clip_min_adap_overlap` 1
Sets the minimum overlap between two reads when read merging is performed. Default is set to `1` base overlap.

### `--skip_collapse`

Turns off the paired-end read merging.

For example
```bash
--pairedEnd --skip_collapse --reads '*.fastq'
```

### `--skip_trim`

Turns off adaptor and quality trimming.

For example:
```bash
--pairedEnd --skip_trim --reads '*.fastq'
```

## Read Mapping Parameters

## BWA (default)
Expand Down
8 changes: 4 additions & 4 deletions environment.yml
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
name: nf-core-eager-2.0.5
name: nf-core-eager-2.0.6
channels:
- defaults
- bioconda
Expand All @@ -9,12 +9,12 @@ dependencies:
- bioconda::adapterremoval=2.2.2
- bioconda::adapterremovalfixprefix=0.0.4
- bioconda::bwa=0.7.17
- bioconda::picard=2.18.23
- bioconda::picard=2.18.27
- bioconda::samtools=1.9
- bioconda::dedup=0.12.3
- bioconda::angsd=0.923
- bioconda::circularmapper=1.93.4
- bioconda::gatk4=4.0.12.0
- bioconda::gatk4=4.1.0.0
- bioconda::qualimap=2.2.2b
- bioconda::vcf2genome=0.91
- bioconda::damageprofiler=0.4.4
Expand All @@ -25,6 +25,6 @@ dependencies:
- conda-forge::pigz=2.3.4
- bioconda::sequencetools=1.2.2
- bioconda::preseq=2.0.3
- bioconda::fastp=0.19.6
- bioconda::fastp=0.19.7
- bioconda::bamutil=1.0.14
#Missing Schmutzi,snpAD
Loading

0 comments on commit bc55df3

Please sign in to comment.