forked from smsrts/EricScript
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathREADME
67 lines (44 loc) · 2.97 KB
/
README
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
# EricScript Readme v2.1 (Feb 2016)
# Please feel free to email the author if you have any questions or issues.
# matteo.benelli AT gmail.com
#########################
######INFORMATION #######
#########################
# EricScript is a software package developed in R, perl and bash scripts.
# EricScript uses the BWA aligner to perform the mapping on the transcriptome reference and samtools to handle with SAM/BAM files. Recalibration of the exon junction reference is performed by using BLAT.
#########################
###### REQUIREMENTS #####
#########################
# Download and install R: http://cran.r-project.org/
# Download and install the "ada" R package: http://cran.r-project.org/web/packages/ada/index.html
# Download and install BWA: http://bio-bwa.sourceforge.net
# Download and install SAMtools (>0.1.17): http://samtools.sourceforge.net/
# Download and install bedtools (>2.15): http://code.google.com/p/bedtools/
# Download and install BLAT binaries: http://genome-test.cse.ucsc.edu/~kent/exe/
# Download and install seqtk: https://github.com/lh3/seqtk
# Be sure that all of these programs are included in your PATH.
#########################
###### RUNNING ERIC #####
#########################
# Once you have downloaded EricScript, extract the package
tar -xjf ericscript.tar.bz2
# Make a copy of the program folder to your favorite location. Before running for the first time EricScript, you need to make ericscript.pl executable:
chmod +x /PATH/TO/ERIC/ericscript.pl
#To get information about running EricScript, digit:
/PATH/TO/ERIC/ericscript.pl --help
# In order to perform chimeric transcript detection, you need to download and build the Ensembl Database of a genome. To list the available genomes, digit:
/PATH/TO/ERIC/ericscript.pl --printdb
# After a reference id is selected, you need to download and build the corresponding Ensembl Database. In the example below, it's shown how to prepare the database for saccharomyces cerevisiae.
/PATH/TO/ERIC/ericscript.pl --downdb --refid saccharomyces_cerevisiae -db /PATH/TO/YOUR/DBFOLDER
# You can also select a specific ensembl release (>= 70) to download
/PATH/TO/ERIC/ericscript.pl --downdb --refid saccharomyces_cerevisiae -db /PATH/TO/YOUR/DBFOLDER --ensversion 74
# To run EricScript with default parameters (if parameter "refid" is not specified the analysis takes the homo sapiens species as default):
/PATH/TO/ERIC/ericscript.pl -db /PATH/TO/YOUR/DBFOLDER --refid saccharomyces_cerevisiae -name SAMPLENAME -o /PATH/TO/OUTPUT/ YOUR_FASTQ_1 YOUR_FASTQ_2
# You can check if your database is up-to-date by the following:
/PATH/TO/ERIC/ericscript.pl --checkdb
#########################
###### OUTPUT FILES #####
#########################
# The /PATH/TO/OUTPUT/ folder contains the results of the analysis. Predicted gene fusion products are reported in 2 files:
# samplename.results.total.csv: contains all the predicted gene fusions.
# samplename.results.filtered.csv: contains the predicted gene fusions with EricScore > 0.50.