Collection of practices. Documents are currently in Wiki page.
- gb2tsv.pl: extract information in FEATURE section of genbank files into tab-delimited text format.
- isoformSummary.pl: ISOseq isoform summarizing script.
- RPFpeptidePredict.pl: Riboseq based ORF finding.
- GmtIntersectFisher.pl: Enrichment computation with GMT files.
- genOntoHeatmapData.pl: Generate heatmap data matrix based on enrichment results.
- genOntoDataframe.pl: Generate dataframe based on enrichment results.
- tsv2xlsx.pl: Translate a TSV file into a XLSX file. Perl module Excel::Writer::XLSX is required.
- xlsx2tsv.pl: Translate a data sheet in a XLSX file into a TSV file. Perl module Spreadsheet::ParseXLSX is required.
- General setup: General setup of used tools
- From ontology annotation results to enrichment computation: (i) transferring an annotation table into a GSEA GMT file (ii) steps of enrichment computation
- GFF3 handling, Araport11 AtRTD2 merge
- GFF3 handling, Araport11 example
- Ribo seq 3bp periodicity preference: Example of computing 3bp periodicity preference of Ribo-seq data
- Ribo seq based ORF prediction
- Ribo seq read mapping: A simple package for mapping Ribo-seq reads and computing expression levels. The default setting is for arabidopsis.
- Ribo seq read mapping, customization example: A walkthrough example of a multi-stage approach for mapping Ribo-seq reads (of one sample).
- Riboseq multi stage mapping and filtering: This is an index page pointing to parts of a complete procedure that we applied to a number of Riboseq read datasets for mapping them to the arabidopsis genome. In the procedure, we tried leveraging on existing genome annotation and supports from available RNAseq datasets. Given that reads been adapter clipped by tools like cutadapt and not filtered by rRNA read removal tools like sortmerna, the procedure used to map our Riboseq read datasets to the arabidopsis genome with mapping ratios more than 90%.
- Summarize ISOseq isoforms: This page describes programs and terminologies applied in the paper An improved repertoire of splicing variants and their potential roles in Arabidopsis photomorphogenic development (PMID: 35139889). The programs and scripts applied are aimed to: (i) compare isoforms detected in ISOseq reads against database gene models and report alternative splicing events in reads (ii) summarize detected existing and novel isoforms for comparing samples in terms of expressed isoforms.