FASTA Files
Tony Boyles edited this page May 16, 2018 · 1 revision
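A FASTA file represents one or more DNA sequences. It is a simple text format: each record is a greater-than sign (>) followed by an identifier, then a newline, then a nucleotide string (A, C, T, G) that may be wrapped across several lines. Ambiguity codes such as R (A or G) and K (G or T) can also appear in the sequence. For example: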
> ID_of_person_A
CCTCAGATCACTCTTTGGCAACGACCCCTCGTCACAATAAARATAGGRGGGCA
ACTAAAGGAAGCTCTACTAGATACAGGAGCAGATGATACAGTATTAGAAGAAC
TRAGTTTACCAGGAAGATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTT
ATCAAAGTAAGACAGTATGATCAGGTAKCCATAGAAATCTGTGGGCATAAAGC
TGTAGGTACAGTATTAGTAGGACCTACACCAGTCAACATAATTGG
> ID_of_person_B
CCTCAGATCACTCTTTGGCAACGACCCCTCGTCACAATAAARATAGGRGGGCA
ACTAAAGGAAGCTCTACTAGATACAGGAGCAGATGATACAGTATTAGAAGAAC
TRAGTTTACCAGGAAGATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTT
ATCAAAGTAAGACAGTATGATCAGGTAKCCATAGAAATCTGTGGGCATAAAGC
TGTAGGTACAGTATTAGTAGGACCTACACCAGTCAACATAATTGG
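The record layout shown above can be read with a few lines of code. The following is a minimal sketch in Python, not the parser this project uses: the function name `parse_fasta` is hypothetical, and it assumes that blank lines are skipped, that whitespace around the identifier is trimmed, and that any sequence text appearing before the first `>` line is ignored.

```python
def parse_fasta(text):
    """Return a list of (identifier, sequence) pairs from FASTA-formatted text.

    Hypothetical helper for illustration only.
    """
    records = []
    identifier = None  # identifier of the record currently being read
    chunks = []        # sequence lines collected for that record

    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # assumption: blank lines carry no meaning
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if identifier is not None:
                records.append((identifier, "".join(chunks)))
            identifier = line[1:].strip()
            chunks = []
        elif identifier is not None:
            # Wrapped sequence lines are joined into one string.
            chunks.append(line)
        # assumption: sequence text before the first header is ignored

    if identifier is not None:
        records.append((identifier, "".join(chunks)))
    return records
```

Calling `parse_fasta` on the example above would yield two pairs, one for `ID_of_person_A` and one for `ID_of_person_B`, each with its five sequence lines joined into a single string.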
The file extension varies (.FASTA, .FAS, .FA). Any file that does not have a .csv extension is parsed as a FASTA file.
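The extension rule can be sketched as a small check. This Python snippet is illustrative only: the function name `guess_format` is hypothetical, and the case-insensitive comparison is an assumption, since the page does not say how letter case in the extension is handled.

```python
import os


def guess_format(filename):
    """Return "csv" for a .csv extension and "fasta" for anything else.

    Hypothetical helper; the case-insensitive match is an assumption.
    """
    _, ext = os.path.splitext(filename)
    return "csv" if ext.lower() == ".csv" else "fasta"
```

Under this rule a file named `samples.fas`, `samples.txt`, or one with no extension at all would be treated as FASTA.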
Copyright 2017-2018 Centers for Disease Control and Prevention • Acknowledgements