Skip to content

Latest commit

 

History

History
23 lines (19 loc) · 1.69 KB

README.md

File metadata and controls

23 lines (19 loc) · 1.69 KB

GffCompare

  • compare and evaluate the accuracy of RNA-Seq transcript assemblers (Cufflinks, Stringtie).
  • collapse (merge) duplicate transcripts from multiple GTF/GFF3 files (e.g. resulted from assembly of different samples)
  • classify transcripts from one or multiple GTF/GFF3 files as they relate to reference transcripts provided in a annotation file (also in GTF/GFF3 format)

The original form of this program is also distributed as part of the Cufflinks suite, under the name "CuffCompare" (see manual: http://cole-trapnell-lab.github.io/cufflinks/cuffcompare/). All the options and parameters of CuffCompare are supported by GffCompare (for now), while new features will likely be added to GffCompare in the future.

For example, when -G option is provided and a single query GTF/GFF file is given, and a reference annotation file is given with the -r option, gffcompare switches into "annotation mode" and it generates a .annotated.gtf file (instead of the .merged.gtf produced by CuffCompare). This file has the same general format as CuffCompare's .merged.gtf file (with "class codes" assigned to transcripts as per their relationship with the matching/overlapping reference transcript), but the original transcript IDs are preserved, so gffcompare can thus be used as a simple way of annotating a set of transcripts.

This package only depends on my other code library, GCLib. In order to build GffCompare from this source package the following steps can be taken:

  cd /some/build/dir
  git clone https://github.com/gpertea/gclib
  git clone https://github.com/gpertea/gffcompare
  cd gffcompare
  make release

This should build the gffcompare binary in the current directory.