Skip to content

Commit

Permalink
Merge branch 'master' of github.com:rpetit3/fastq-dl
Browse files Browse the repository at this point in the history
  • Loading branch information
rpetit3 committed Oct 24, 2019
2 parents b71eb6e + ec90bf7 commit 00a0662
Showing 1 changed file with 8 additions and 9 deletions.
17 changes: 8 additions & 9 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,7 @@ conda install -c conda-forge -c bioconda fastq-dl
```

# Usage
*fastq-dl* requires a single ENA/SRA Study, Experiment, or Run accession and FASTQs for all Runs that fall under the given accession will be downloaded. For example, if a Study accession is given all Runs under that studies umbrella will be downloaded.
*fastq-dl* requires a single ENA/SRA Study, Experiment, or Run accession and FASTQs for all Runs that fall under the given accession will be downloaded. For example, if a Study accession is given all Runs under that studies umbrella will be downloaded. Which archive to download from, either *ENA* or *SRA*, is also required.

### Usage Output
```
Expand Down Expand Up @@ -72,23 +72,22 @@ Helpful Options:
### Example Usage
#### Download a Study
```
fastq-dl.py PRJNA248678 SRA
fastq-dl.py PRJNA248678 ENA
fastq-dl PRJNA248678 SRA
fastq-dl PRJNA248678 ENA
```

The above command would download 3 runs that fall under Study PRJNA248678 from either SRA or ENA. The relationship of Study to Run is a 1-to-many relationship, or there can be many Run accessions associated with a single Study Accession. You could use `--group_by_experiment` to group these runs by Experiment accession (or `--group_by_sample` for Sample accession).
The above commands will download 3 runs that fall under Study PRJNA248678 from either SRA (`fastq-dl PRJNA248678 SRA`) or ENA (`fastq-dl PRJNA248678 ENA`). The relationship of Study to Run is a 1-to-many relationship, or there can be many Run accessions associated with a single Study Accession. You could use `--group_by_experiment` to group these runs by Experiment accession (or `--group_by_sample` for Sample accession).

#### Download an Experiment
```
fastq-dl.py SRX477044 SRA
fastq-dl.py SRX477044 ENA
fastq-dl SRX477044 ENA
```

The above command would download the single run from ENA or SRA that falls under Experiment SRX477044. The relationship of Experiment to Run is a 1-to-many relationship, or there can be many Run accessions associated with a single Experiment Accession (e.g. resequencing the same sample). Although in most cases, especially for bacterial samples, it is a 1-to-1 relationship. In any case, you can use `--group_by_experiment` to merge multiple runs associated with an Experiment accession into a single FASTQ file (or `--group_by_sample` for Sample accession).
The above command would download the single run from ENA that falls under Experiment SRX477044. The relationship of Experiment to Run is a 1-to-many relationship, or there can be many Run accessions associated with a single Experiment Accession (e.g. resequencing the same sample). Although in most cases, especially for bacterial samples, it is a 1-to-1 relationship. In any case, you can use `--group_by_experiment` to merge multiple runs associated with an Experiment accession into a single FASTQ file (or `--group_by_sample` for Sample accession).

#### Download a Run
```
fastq-dl.py SRR1178105 SRA
fastq-dl SRR1178105 SRA
```

The above command would download the Run SRR1178105 from SRA. Run accessions are the end of the line (1-to-1 relationship)
Expand All @@ -97,7 +96,7 @@ The above command would download the Run SRR1178105 from SRA. Run accessions are
Users can use [Aspera Connect](https://downloads.asperasoft.com/connect2/) to speed up the download of FASTQs from ENA. Installation and setup of Aspera Connect is out of the scope of this documentation, but I can assure you its a simple installation.

```
fastq-dl.py SRR1178105 ENA --aspera /path/to/ascp
fastq-dl SRR1178105 ENA --aspera /path/to/ascp
```

The above command will attempt to download SRR1178105 using the `ascp` tool. By default it will try to use the private key file that is included during the Aspera Connect installation. If it is not found you will need to use the `--aspera_key` parameter to specify its path.

0 comments on commit 00a0662

Please sign in to comment.