Skip to content
Jared Johnson edited this page Apr 1, 2024 · 2 revisions

EPITOME (Enhanced Phylogenetic Inference Through Optimized Mapping Efficiency) condenses a diverse set of DNA sequences (species-level or closer) into discrete, composite sequences that represent the overarching diversity of the dataset. In other words, EPITOME creates sequences that are the epitome of the dataset diversity. This is accomplished by clustering the input based on pairwise genetic distances and then averaging the nucleotides at each genomic position (ties selected at random). When the genetic distance is based on read mapping efficiency, EPITOME creates a set of reference genomes for consensus-based assembly pipelines, like VAPER or viralrecon.

Clone this wiki locally