Skip to content

Commit

Permalink
version upd
Browse files Browse the repository at this point in the history
  • Loading branch information
mikessh committed Oct 7, 2016
1 parent bf66933 commit 921a6eb
Show file tree
Hide file tree
Showing 3 changed files with 7 additions and 8 deletions.
3 changes: 1 addition & 2 deletions doc/body.rst
Original file line number Diff line number Diff line change
Expand Up @@ -259,8 +259,7 @@ is given below:
- In the dataset section select **Ensembl genes 84** and choose **Homo sapiens genes**.
- In the filter section and **Gene** subsection, input your gene IDs to **external references ID** textbox and select appropriate nomenclature (e.g. **HGNC symbol** when using gene symbols).
- In attributes section select **Sequences** and untick all header information features.
- Under the **SEQUENCES** menu select **Exon sequences** and input the 5' and 3' flank base count, e.g. 50 (it should be the same for both 5'
and 3' flank). This is to ensure that reads produced by exome capture techniques are fully mapped.
- Under the **SEQUENCES** menu select **Exon sequences** and input the 5' and 3' flank base count, e.g. 50 (it should be the same for both 5' and 3' flank). This is to ensure that reads produced by exome capture techniques are fully mapped.
- Select the following features (order matters here): **Associated Gene Name**, **Ensembl Exon ID**, **Chromosome Name**, **Exon Chr Start (bp)**, **Exon Chr End (bp)** and **Strand**.
- Go to results, select **unique results only** and download the resulting FASTA file (save it as ``biomart_refs.fa``).
- Download and run the following `script <https://raw.githubusercontent.com/mikessh/mageri-paper/master/get_refs/ExtractBedFormBiomartRefs.groovy>`__ (requires `Groovy <http://www.groovy-lang.org/>`__ to be installed): ``groovy ExtractBedFormBiomartRefs.groovy biomart_refs.fa refs.fa refs.bed 50``. The last argument specifies the flank size.
Expand Down
10 changes: 5 additions & 5 deletions doc/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -18,13 +18,13 @@ be browsed and post-processed by the majority of conventional bioinformatics sof
Terminology
-----------

- UMI - unique molecular identifier, a short (typically 6-20nt) degenerate nucleotide sequence, that is attach to cDNA/DNA molecules in order to trace them throughout the entire experiment.
- UMI - unique molecular identifier, a short (4-20bp) degenerate nucleotide sequence, that is attach to cDNA/DNA molecules in order to trace them throughout the entire experiment.
- Sample barcode - a short specific nucleotide sequence used to mark cDNA/DNA molecules that correspond to a given sample in a pooled sequencing library
- MIG - molecular identifier group, a set of reads or read pairs that have an identical UMI sequence
- MIG consensus - the consensus sequence of MIG, calculated from MIG position-weight matrix by taking the letter with highest frequency at each position
- CQS - consensus quality score, calculated as the maximal relative nucleotide frequency at a given PWM position. Usually scaled to [2, 40] range to fit Phred33 string representation. Indicates our confidence in the consensus sequence at a given position.
- Major variant (aka dominant variant, supermutant) - a sequence variant that is present in MIG consensus sequence
- Minor variant - a sequence variant that is present in reads within an MIG, but doesn’t get to the final MIG consensus sequence
- MIG consensus - the consensus sequence of MIG, that is, the consensus of multiple alignment of all reads in a given MIG
- CQS - consensus quality score, calculated as the fraction of reads matching the consensus sequence at a given position. Can be scaled to ``[2, 40]`` range to fit Phred33 quality representation.
- Major variant (aka dominant variant, supermutant) - a sequence variant that is present in MIG consensus, but doesn't match the reference sequence
- Minor variant - a sequence variant that differs from the consensus sequence found in one or more reads within a given MIG

Table of contents
-----------------
Expand Down
2 changes: 1 addition & 1 deletion pom.xml
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@

<groupId>com.antigenomics</groupId>
<artifactId>mageri</artifactId>
<version>1.0.1-SNAPSHOT</version>
<version>1.1.0</version>
<packaging>jar</packaging>
<name>MAGERI</name>

Expand Down

0 comments on commit 921a6eb

Please sign in to comment.