Skip to content

Commit

Permalink
added script to generate draft_seeds
Browse files Browse the repository at this point in the history
  • Loading branch information
l-singh-biomsu committed Oct 30, 2024
1 parent ce97304 commit 7a4c742
Show file tree
Hide file tree
Showing 286 changed files with 15,126 additions and 1,127 deletions.
2,310 changes: 2,310 additions & 0 deletions CURATED_SET/curated_service/curatedDB/generate_draft_seeds.ipynb

Large diffs are not rendered by default.

Original file line number Diff line number Diff line change
Expand Up @@ -4661,7 +4661,7 @@
},
{
"cell_type": "code",
"execution_count": 18,
"execution_count": 15,
"id": "b7719968-002f-4229-846d-e4a8f180ec6b",
"metadata": {
"tags": []
Expand Down
Empty file modified CURATED_SET/draft_seeds/Archaeal.fasta
100755 → 100644
Empty file.
4 changes: 4 additions & 0 deletions CURATED_SET/draft_seeds/CS_H2B_(Echinoidea).fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
>Psammechinus|AAB48832.1|CS_H2B_(Echinoidea) organism=Psammechinus miliaris phylum=Echinodermata class=Echinoidea
MPAKGAATKGEKKQAVKSKAMASSRTGDKKRRRRRLESYNIYIYKVLKQVHPDTGISSKA
MSIMNSFVNDIFERIAAEASRLAQYNKKSTISSREVQTAVRLLLPGELAKHAVSEGTKAV
TKYTTSR
75 changes: 75 additions & 0 deletions CURATED_SET/draft_seeds/H1.0.fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,75 @@
>Thalassiosira|EED88841.1|H1.0 organism=Thalassiosira pseudonana CCMP1335 phylum=Bacillariophyta class=Coscinodiscophyceae
----------------------------------------MSYKAGIAKAITELKDRTGS
SSIAIKKHMQANLPADKKWMNATFLKALKDMVASGELVKTK-----ASYKLSA-------
-------VAKQKASSAGKPKKAPKKKA---------------------APKKTAPKKKAA
PKKKTATKKAATAKKPAAKKATTAK---KSTTKKTAKK-
>Esox|XP_010887142.1|H1.0 organism=Esox lucius phylum=Chordata class=Actinopteri
---MAETVAAPAP-------------KAKKAKAPKKPASHPKYSDMIKAAVQADKSRGGA
SRQSVQKYIKSHYKVGDN-ADSQIKLSLKRMVSGGLLRHTKGIGASGSFKLAKAEDTKKA
PKPKPVVKAKKSPVKAAKPKKVAKPKKVVKSPAKAKKAKVAVKKVKK-SPKKVAPKPKKV
VK-KVKAAKPAKAVKP--KKAKAAKPKPKAAAKKAAKKK
>Salmo|ACH70944.1|H1.0 organism=Salmo salar phylum=Chordata class=Actinopteri
---MAETAAAPAP-------------KAKKAKAPKKPASHPKYSDMIKAAVHADKSRGGA
SRQSVQKYIKSHYKVGDN-ADSQIKLSLKRMVSEGVLRHTKGIGASGSFKLAKAEDTKKA
PKVKAVVKAKKSPVKSAKPKKVAKPKKVAKSPAKAKKAKVAVKKVKK-SPKKAAPKPKKV
AK-KTKVAKPAKATKP--KKAKAAKPKPKAAAKKAAKKK
>Salmo|ACM08534.1|H1.0 organism=Salmo salar phylum=Chordata class=Actinopteri
---------------------------------------------MIKAAVHADKSRGGA
SRQSVQKYIKSHYKVGDN-ADSQIKLSLKRMVSEGVLRHTKGIGASGSFKLAKAEDTKKA
PKVKAVVKAKKSPVKSAKPKKVAKPKKVAKSPAKAKKAKVAVKKVKK-SPKKAAPKPKKV
VK-KTKVAKPAKATKP--KKAKAAKPKPKAAAKKAAKKK
>Salmo|ACM09660.1|H1.0 organism=Salmo salar phylum=Chordata class=Actinopteri
---------------------------------------------MIKAAVHADKSRGGA
SRQSVQKYIKSHYKVGDN-ADSQIKLSLKRMVSEGVLRHTKGIGASGSFKLAKAEDTKKA
PKVKAVVKAKKSPVKSAKPKKVAKPKKVAKSPAKAKKAKVAVKKVKK-SPKKAAPKPKKV
AK-KTKVAKPAKATKP--KKAKAAKPKPKAAAKKAAKKK
>Xenopus|NP_998836.1|H1.0 organism=Xenopus tropicalis phylum=Chordata class=Amphibia
--MTENSAAAPAG-------------KPKRSKASKKATDHPKYSDMILAAVQAEKSRSGS
SRQSIQKYIKNHYKVGEN-ADSQIKLSIKRLVTSGTLKQTKGVGASGSFRLAKADEGKKP
AK-----KPKKEIKKAASPKKAAKPKKAAKSPAKAKKPKVAEKKVKKPAKKKPAPSPKKA
KKTKTVKAKPVRASRV--KKAKPSKPKAKASPKKSGRKK
>Cairina|P06513.2|H1.0 organism=Cairina moschata phylum=Chordata class=Aves
--MTDSPIPAPAPAA-----------KPKRAKAPRKPASHPSYSEMIVAAIRAEKSRGGS
SRQSIQKYVKSHYKVGQH-ADLQIKLSIRRLLAAGVLKQTKGVGASGSYRLAKGDKAKKS
PAGRK--KKKKAARRSTSPRKAARPRK---ARSPAKKPKAA---ARK-ARKKSRASPKKA
KKPKTVKAKSLKTSKV--KKAKRSKPRAKSGARKSPKKK
>Gallus|NP_001038138.1|H1.0 organism=Gallus gallus phylum=Chordata class=Aves
--MTESLVLSPAPA------------KPKRVKASRRSASHPTYSEMIAAAIRAEKSRGGS
SRQSIQKYIKSHYKVGHN-ADLQIKLSIRRLLAAGVLKQTKGVGASGSFRLAKSDKAKRS
PG-----KKKKAVRRSTSPKKAARPRK---ARSPAKKPKAT---ARK-ARKKSRASPKKA
KKPKTVKAKSRKASKA--KKVKRSKPRAKSGARKSPKKK
>Taeniopygia|XP_004175972.1|H1.0 organism=Taeniopygia guttata phylum=Chordata class=Aves
--MTRLPSMCKASSSLMSCVHLCAFPPPKRARSARRPAAHPAYSDMVTAAVRADKSRGGA
SRQSIQKYVKSNYKVGQN-ADVQIRLAIRRLLAAGVLKQTKGVGASGSFRLAKAGKAKRS
PSR----KRKKAARRSTSPRKTARSRK---ARSPAKKPKSA---ARK-ARKKSRS-PKKA
KKPKTVKAKSLKASKP--KKARRSKSRAKSGARKSPKKK
>Bos|NP_001069955.1|H1.0 organism=Bos taurus phylum=Chordata class=Mammalia
--MTENSTSTPAA-------------KPKRAKASKKSTDHPKYSDMIVAAIQAEKNRAGS
SRQSIQKYIKSHYKVGEN-ADSQIKLSIKRLVTTGVLKQTKGVGASGSFRLAKSDEPKRS
VAFK---KTKKEVKKVATPKKAAKPKKAA-SKAPSKKPKATP--VKK-AKKKPAATPKKT
KKPKTVKAKPVKASKP--KKTKPVKPKAKSSAKRTGKKK
>Mus|NP_032223.2|H1.0 organism=Mus musculus phylum=Chordata class=Mammalia
--MTENSTSAPAA-------------KPKRAKASKKSTDHPKYSDMIVAAIQAEKNRAGS
SRQSIQKYIKSHYKVGEN-ADSQIKLSIKRLVTTGVLKQTKGVGASGSFRLAKGDEPKRS
VAFK---KTKKEVKKVATPKKAAKPKKAA-SKAPSKKPKATP--VKK-AKKKPAATPKKA
KKPKVVKVKPVKASKP--KKAKTVKPKAKSSAKRASKKK
>Pongo|NP_001127680.1|H1.0 organism=Pongo abelii phylum=Chordata class=Mammalia
--MTENSTSAPAA-------------KPKRAKASKKSTDHPKYSDMVVAAIQAEKNRAGS
SRQSIQKYIKSHYKVGEN-ADSQIKLSIKRLVTTGVLKQTKGVGASGSFRLAKSDEPKKS
VAFK---KTKKEIKKVATPKKASKPKKAA-SKAPTKKPKATP--VKK-AKKKLAATPKKA
KKPKTVKAKPVKASKP--KKAKPVKPKAKSSAKRAGKKK
>Rattus|NP_036710.1|H1.0 organism=Rattus norvegicus phylum=Chordata class=Mammalia
--MTENSTSTPAA-------------KPKRAKAAKKSTDHPKYSDMIVAAIQAEKNRAGS
SRQSIQKYIKSHYKVGEN-ADSQIKLSIKRLVTTGVLKQTKGVGASGSFRLAKGDEPKRS
VAFK---KTKKEVKKVATPKKAAKPKKAA-SKAPSKKPKATP--VKK-AKKKPAATPKKA
KKPKIVKVKPVKASKP--KKAKPVKPKAKSSAKRASKKK
>Strongylocentrotus|NP_999722.1|H1.0 organism=Strongylocentrotus purpuratus phylum=Echinodermata class=Echinoidea
MADTDAAPAAPAPSTPKKA-------AKKKASKPKTPASHPKYSDMIASALESLKEKKGS
SRQAILKYVKANFTVGDN-ANVHIKQALKRGVTSGQLRHVKGSGASGSFLLAEKTK----
-------TPKKAAAKKATPKKKPAAKK---TKKPA---------AKK-ATKKPAKKP--A
AKKKVAKPAAKKAAKPVAKKATPKKKVVKKAAKGKGKKK
>Homo|NP_005309.1|H1.0_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
--MTENSTSAPAA-------------KPKRAKASKKSTDHPKYSDMIVAAIQAEKNRAGS
SRQSIQKYIKSHYKVGEN-ADSQIKLSIKRLVTTGVLKQTKGVGASGSFRLAKSDEPKKS
VAFK---KTKKEIKKVATPKKASKPKKAA-SKAPTKKPKATP--VKK-AKKKLAATPKKA
KKPKTVKAKPVKASKP--KKAKPVKPKAKSSAKRAGKKK
5 changes: 5 additions & 0 deletions CURATED_SET/draft_seeds/H1.0_(Homo_sapiens).fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
>Homo|NP_005309.1|H1.0_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
MTENSTSAPAAKPKRAKASKKSTDHPKYSDMIVAAIQAEKNRAGSSRQSIQKYIKSHYKV
GENADSQIKLSIKRLVTTGVLKQTKGVGASGSFRLAKSDEPKKSVAFKKTKKEIKKVATP
KKASKPKKAASKAPTKKPKATPVKKAKKKLAATPKKAKKPKTVKAKPVKASKPKKAKPVK
PKAKSSAKRAGKKK
70 changes: 70 additions & 0 deletions CURATED_SET/draft_seeds/H1.0_only.fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,70 @@
>Thalassiosira|EED88841.1|H1.0 organism=Thalassiosira pseudonana CCMP1335 phylum=Bacillariophyta class=Coscinodiscophyceae
----------------------------------------MSYKAGIAKAITELKDRTGS
SSIAIKKHMQANLPADKKWMNATFLKALKDMVASGELVKTK-----ASYKLSA-------
-------VAKQKASSAGKPKKAPKKKA---------------------APKKTAPKKKAA
PKKKTATKKAATAKKPAAKKATTAK---KSTTKKTAKK-
>Esox|XP_010887142.1|H1.0 organism=Esox lucius phylum=Chordata class=Actinopteri
---MAETVAAPAP-------------KAKKAKAPKKPASHPKYSDMIKAAVQADKSRGGA
SRQSVQKYIKSHYKVGDN-ADSQIKLSLKRMVSGGLLRHTKGIGASGSFKLAKAEDTKKA
PKPKPVVKAKKSPVKAAKPKKVAKPKKVVKSPAKAKKAKVAVKKVKK-SPKKVAPKPKKV
VK-KVKAAKPAKAVKP--KKAKAAKPKPKAAAKKAAKKK
>Salmo|ACH70944.1|H1.0 organism=Salmo salar phylum=Chordata class=Actinopteri
---MAETAAAPAP-------------KAKKAKAPKKPASHPKYSDMIKAAVHADKSRGGA
SRQSVQKYIKSHYKVGDN-ADSQIKLSLKRMVSEGVLRHTKGIGASGSFKLAKAEDTKKA
PKVKAVVKAKKSPVKSAKPKKVAKPKKVAKSPAKAKKAKVAVKKVKK-SPKKAAPKPKKV
AK-KTKVAKPAKATKP--KKAKAAKPKPKAAAKKAAKKK
>Salmo|ACM08534.1|H1.0 organism=Salmo salar phylum=Chordata class=Actinopteri
---------------------------------------------MIKAAVHADKSRGGA
SRQSVQKYIKSHYKVGDN-ADSQIKLSLKRMVSEGVLRHTKGIGASGSFKLAKAEDTKKA
PKVKAVVKAKKSPVKSAKPKKVAKPKKVAKSPAKAKKAKVAVKKVKK-SPKKAAPKPKKV
VK-KTKVAKPAKATKP--KKAKAAKPKPKAAAKKAAKKK
>Salmo|ACM09660.1|H1.0 organism=Salmo salar phylum=Chordata class=Actinopteri
---------------------------------------------MIKAAVHADKSRGGA
SRQSVQKYIKSHYKVGDN-ADSQIKLSLKRMVSEGVLRHTKGIGASGSFKLAKAEDTKKA
PKVKAVVKAKKSPVKSAKPKKVAKPKKVAKSPAKAKKAKVAVKKVKK-SPKKAAPKPKKV
AK-KTKVAKPAKATKP--KKAKAAKPKPKAAAKKAAKKK
>Xenopus|NP_998836.1|H1.0 organism=Xenopus tropicalis phylum=Chordata class=Amphibia
--MTENSAAAPAG-------------KPKRSKASKKATDHPKYSDMILAAVQAEKSRSGS
SRQSIQKYIKNHYKVGEN-ADSQIKLSIKRLVTSGTLKQTKGVGASGSFRLAKADEGKKP
AK-----KPKKEIKKAASPKKAAKPKKAAKSPAKAKKPKVAEKKVKKPAKKKPAPSPKKA
KKTKTVKAKPVRASRV--KKAKPSKPKAKASPKKSGRKK
>Cairina|P06513.2|H1.0 organism=Cairina moschata phylum=Chordata class=Aves
--MTDSPIPAPAPAA-----------KPKRAKAPRKPASHPSYSEMIVAAIRAEKSRGGS
SRQSIQKYVKSHYKVGQH-ADLQIKLSIRRLLAAGVLKQTKGVGASGSYRLAKGDKAKKS
PAGRK--KKKKAARRSTSPRKAARPRK---ARSPAKKPKAA---ARK-ARKKSRASPKKA
KKPKTVKAKSLKTSKV--KKAKRSKPRAKSGARKSPKKK
>Gallus|NP_001038138.1|H1.0 organism=Gallus gallus phylum=Chordata class=Aves
--MTESLVLSPAPA------------KPKRVKASRRSASHPTYSEMIAAAIRAEKSRGGS
SRQSIQKYIKSHYKVGHN-ADLQIKLSIRRLLAAGVLKQTKGVGASGSFRLAKSDKAKRS
PG-----KKKKAVRRSTSPKKAARPRK---ARSPAKKPKAT---ARK-ARKKSRASPKKA
KKPKTVKAKSRKASKA--KKVKRSKPRAKSGARKSPKKK
>Taeniopygia|XP_004175972.1|H1.0 organism=Taeniopygia guttata phylum=Chordata class=Aves
--MTRLPSMCKASSSLMSCVHLCAFPPPKRARSARRPAAHPAYSDMVTAAVRADKSRGGA
SRQSIQKYVKSNYKVGQN-ADVQIRLAIRRLLAAGVLKQTKGVGASGSFRLAKAGKAKRS
PSR----KRKKAARRSTSPRKTARSRK---ARSPAKKPKSA---ARK-ARKKSRS-PKKA
KKPKTVKAKSLKASKP--KKARRSKSRAKSGARKSPKKK
>Bos|NP_001069955.1|H1.0 organism=Bos taurus phylum=Chordata class=Mammalia
--MTENSTSTPAA-------------KPKRAKASKKSTDHPKYSDMIVAAIQAEKNRAGS
SRQSIQKYIKSHYKVGEN-ADSQIKLSIKRLVTTGVLKQTKGVGASGSFRLAKSDEPKRS
VAFK---KTKKEVKKVATPKKAAKPKKAA-SKAPSKKPKATP--VKK-AKKKPAATPKKT
KKPKTVKAKPVKASKP--KKTKPVKPKAKSSAKRTGKKK
>Mus|NP_032223.2|H1.0 organism=Mus musculus phylum=Chordata class=Mammalia
--MTENSTSAPAA-------------KPKRAKASKKSTDHPKYSDMIVAAIQAEKNRAGS
SRQSIQKYIKSHYKVGEN-ADSQIKLSIKRLVTTGVLKQTKGVGASGSFRLAKGDEPKRS
VAFK---KTKKEVKKVATPKKAAKPKKAA-SKAPSKKPKATP--VKK-AKKKPAATPKKA
KKPKVVKVKPVKASKP--KKAKTVKPKAKSSAKRASKKK
>Pongo|NP_001127680.1|H1.0 organism=Pongo abelii phylum=Chordata class=Mammalia
--MTENSTSAPAA-------------KPKRAKASKKSTDHPKYSDMVVAAIQAEKNRAGS
SRQSIQKYIKSHYKVGEN-ADSQIKLSIKRLVTTGVLKQTKGVGASGSFRLAKSDEPKKS
VAFK---KTKKEIKKVATPKKASKPKKAA-SKAPTKKPKATP--VKK-AKKKLAATPKKA
KKPKTVKAKPVKASKP--KKAKPVKPKAKSSAKRAGKKK
>Rattus|NP_036710.1|H1.0 organism=Rattus norvegicus phylum=Chordata class=Mammalia
--MTENSTSTPAA-------------KPKRAKAAKKSTDHPKYSDMIVAAIQAEKNRAGS
SRQSIQKYIKSHYKVGEN-ADSQIKLSIKRLVTTGVLKQTKGVGASGSFRLAKGDEPKRS
VAFK---KTKKEVKKVATPKKAAKPKKAA-SKAPSKKPKATP--VKK-AKKKPAATPKKA
KKPKIVKVKPVKASKP--KKAKPVKPKAKSSAKRASKKK
>Strongylocentrotus|NP_999722.1|H1.0 organism=Strongylocentrotus purpuratus phylum=Echinodermata class=Echinoidea
MADTDAAPAAPAPSTPKKA-------AKKKASKPKTPASHPKYSDMIASALESLKEKKGS
SRQAILKYVKANFTVGDN-ANVHIKQALKRGVTSGQLRHVKGSGASGSFLLAEKTK----
-------TPKKAAAKKATPKKKPAAKK---TKKPA---------AKK-ATKKPAKKP--A
AKKKVAKPAAKKAAKPVAKKATPKKKVVKKAAKGKGKKK
5 changes: 5 additions & 0 deletions CURATED_SET/draft_seeds/H1.1.fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
>Homo|NP_005316.1|H1.1_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERGGV
SLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVETK
PGASKVATKTKATGASKKLKKATGASKKSVKTPKKAKKPAATRKSSKNPKKPKTVKPKKV
AKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK
30 changes: 30 additions & 0 deletions CURATED_SET/draft_seeds/H1.10.fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,30 @@
>Caligus|ACO10502.1|H1.10 organism=Caligus rogercresseyi phylum=Arthropoda class=Hexanauplia
MVKSEVEVTINAEEAPV--------ASSLKPAK---K---------KKNKKKKNKPGKYS
VLVLDAVKKLNERSGSSLVKIYNEAKKASWFDEQNGRTYLRYSIRALVLNNTLIQVKGMG
ANGSFRLNEDKFAKGVPKKTQS--KPAKNTTKTAKASTTKKATV-VKAKSSPKKAPDAKM
PAAKLKKLGVKKVSAAQ---K------NKKPKKASKPPAKS-PRKK--
>Oncorhynchus|ACO07616.1|H1.10 organism=Oncorhynchus mykiss phylum=Chordata class=Actinopteri
MVKSEVDVTINAEEAPV--------ASGPKPAK---K---------KKKKKKKNKPGKYS
VLVLDAVKKLNERSGSSLVKIYNEAKKASWFDEQNGRTYLRYSIRALVLNNTLIQVKGMG
ANGSFRLNEDKFAKEVPKKTQS--KPAKTTTKTAKASTTKKATVKPKAKSSPKKAPDAKK
PAAKMKKLGVKKVIAAQ---K------NKKPKKASKPPAKS-PRKK--
>Osmerus|ACO09903.1|H1.10 organism=Osmerus mordax phylum=Chordata class=Actinopteri
-MASDTEV-VPAAEAPVAAKSKKRTATKPKPKA---KPATVATSSAKKKKRKGKGPGKYS
VLVVDAIKQLGERNGSSLAKIYNKAREAIWFDQQHGRTYLRYSIRALVLNDTLIQVKGTG
ANGSFKLNKKKFETKAPKKAPTPVKAVKTKAPAKKAKAAIKTKAKPKASPKKKSTPK-KK
PAAKPKKLAAKKATPVKS--K------KPKPKKASKPAAKS-PRKK--
>Salmo|ACM09455.1|H1.10 organism=Salmo salar phylum=Chordata class=Actinopteri
MVKSEVEVTINAEEAPV--------ASSLKPAK---K---------KKNKKKKNKPGKYS
VLVLDAVKKLNERSGSSLVKIYNEAKKASWFDEQNGRTYLRYSIRALVLNNTLIQVKGMG
ANGSFRLNEDKFAKGVPKKTQS--KPAKNTTKTAKASTTKKATV-VKAKSSPKKAPDAKM
PAAKLKKLGVKKVSAAQ---K------NKKPKKASKPPAKS-PRKK--
>Xenopus|NP_001080265.1|H1.10 organism=Xenopus laevis phylum=Chordata class=Amphibia
-MALELEENLHSTEEEDEEEEEEEEGDEMRSRSTRNKGGAASSSGNKKKKKKKNQPGRYS
QLVVDTIRKLGERNGSSLAKIYSEAKKVSWFDQQNGRTYLKYSIKALVQNDTLLQVKGVG
ANGSFRLNKKKLE-GLPYDKKP--PPAKPSSSSSNKKQQQQGPSSSPSKSHKKAKPKAKA
EKEKPKTSSAKAKSPKKSAAK------GKKMKKGAKPSVRKAPKSKKA
>Homo|NP_006017.1|H1.10_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
-MSVELEEALPVTTAEGMAKKVTKAG----------GSAALSPSKKRKNSKKKNQPGKYS
QLVVETIRRLGERNGSSLAKIYTEAKKVPWFDQQNGRTYLKYSIKALVQNDTLLQVKGTG
ANGSFKLNRKKLEGGGERRGAP--AAATAPAPTAH-KAKKAAPGAAGSRRADKKPARGQK
PEQRSHKKGAGAKKDKGGKAKKTAAAGGKKVKKAAKPSVPKVPKGRK-
5 changes: 5 additions & 0 deletions CURATED_SET/draft_seeds/H1.10_(Homo_sapiens).fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
>Homo|NP_006017.1|H1.10_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
MSVELEEALPVTTAEGMAKKVTKAGGSAALSPSKKRKNSKKKNQPGKYSQLVVETIRRLG
ERNGSSLAKIYTEAKKVPWFDQQNGRTYLKYSIKALVQNDTLLQVKGTGANGSFKLNRKK
LEGGGERRGAPAAATAPAPTAHKAKKAAPGAAGSRRADKKPARGQKPEQRSHKKGAGAKK
DKGGKAKKTAAAGGKKVKKAAKPSVPKVPKGRK
25 changes: 25 additions & 0 deletions CURATED_SET/draft_seeds/H1.10_only.fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,25 @@
>Caligus|ACO10502.1|H1.10 organism=Caligus rogercresseyi phylum=Arthropoda class=Hexanauplia
MVKSEVEVTINAEEAPV--------ASSLKPAK---K---------KKNKKKKNKPGKYS
VLVLDAVKKLNERSGSSLVKIYNEAKKASWFDEQNGRTYLRYSIRALVLNNTLIQVKGMG
ANGSFRLNEDKFAKGVPKKTQS--KPAKNTTKTAKASTTKKATV-VKAKSSPKKAPDAKM
PAAKLKKLGVKKVSAAQ---KNKKPKKASKPPAKS-PRKK--
>Oncorhynchus|ACO07616.1|H1.10 organism=Oncorhynchus mykiss phylum=Chordata class=Actinopteri
MVKSEVDVTINAEEAPV--------ASGPKPAK---K---------KKKKKKKNKPGKYS
VLVLDAVKKLNERSGSSLVKIYNEAKKASWFDEQNGRTYLRYSIRALVLNNTLIQVKGMG
ANGSFRLNEDKFAKEVPKKTQS--KPAKTTTKTAKASTTKKATVKPKAKSSPKKAPDAKK
PAAKMKKLGVKKVIAAQ---KNKKPKKASKPPAKS-PRKK--
>Osmerus|ACO09903.1|H1.10 organism=Osmerus mordax phylum=Chordata class=Actinopteri
-MASDTEV-VPAAEAPVAAKSKKRTATKPKPKA---KPATVATSSAKKKKRKGKGPGKYS
VLVVDAIKQLGERNGSSLAKIYNKAREAIWFDQQHGRTYLRYSIRALVLNDTLIQVKGTG
ANGSFKLNKKKFETKAPKKAPTPVKAVKTKAPAKKAKAAIKTKAKPKASPKKKSTPK-KK
PAAKPKKLAAKKATPVKS--KKPKPKKASKPAAKS-PRKK--
>Salmo|ACM09455.1|H1.10 organism=Salmo salar phylum=Chordata class=Actinopteri
MVKSEVEVTINAEEAPV--------ASSLKPAK---K---------KKNKKKKNKPGKYS
VLVLDAVKKLNERSGSSLVKIYNEAKKASWFDEQNGRTYLRYSIRALVLNNTLIQVKGMG
ANGSFRLNEDKFAKGVPKKTQS--KPAKNTTKTAKASTTKKATV-VKAKSSPKKAPDAKM
PAAKLKKLGVKKVSAAQ---KNKKPKKASKPPAKS-PRKK--
>Xenopus|NP_001080265.1|H1.10 organism=Xenopus laevis phylum=Chordata class=Amphibia
-MALELEENLHSTEEEDEEEEEEEEGDEMRSRSTRNKGGAASSSGNKKKKKKKNQPGRYS
QLVVDTIRKLGERNGSSLAKIYSEAKKVSWFDQQNGRTYLKYSIKALVQNDTLLQVKGVG
ANGSFRLNKKKLE-GLPYDKKP--PPAKPSSSSSNKKQQQQGPSSSPSKSHKKAKPKAKA
EKEKPKTSSAKAKSPKKSAAKGKKMKKGAKPSVRKAPKSKKA
5 changes: 5 additions & 0 deletions CURATED_SET/draft_seeds/H1.1_(Homo_sapiens).fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
>Homo|NP_005316.1|H1.1_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
MSETVPPAPAASAAPEKPLAGKKAKKPAKAAAASKKKPAGPSVSELIVQAASSSKERGGV
SLAALKKALAAAGYDVEKNNSRIKLGIKSLVSKGTLVQTKGTGASGSFKLNKKASSVETK
PGASKVATKTKATGASKKLKKATGASKKSVKTPKKAKKPAATRKSSKNPKKPKTVKPKKV
AKSPAKAKAVKPKAAKARVTKPKTAKPKKAAPKKK
Empty file.
5 changes: 5 additions & 0 deletions CURATED_SET/draft_seeds/H1.2.fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
>Homo|NP_005310.1|H1.2_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
MSETAPAAPAAAPPAEKAPVKKKAAKKAGGTPRKASGPPVSELITKAVAASKERSGVSLA
ALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAKPKV
KKAGGTKPKKPVGAAKKPKKAAGGATPKKSAKKTPKKAKKPAAATVTKKVAKSPKKAKVA
KPKKAAKSAAKAVKPKAAKPKVVKPKKAAPKKK
5 changes: 5 additions & 0 deletions CURATED_SET/draft_seeds/H1.2_(Homo_sapiens).fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
>Homo|NP_005310.1|H1.2_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
MSETAPAAPAAAPPAEKAPVKKKAAKKAGGTPRKASGPPVSELITKAVAASKERSGVSLA
ALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAKPKV
KKAGGTKPKKPVGAAKKPKKAAGGATPKKSAKKTPKKAKKPAAATVTKKVAKSPKKAKVA
KPKKAAKSAAKAVKPKAAKPKVVKPKKAAPKKK
Empty file.
5 changes: 5 additions & 0 deletions CURATED_SET/draft_seeds/H1.3.fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
>Homo|NP_005311.1|H1.3_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
MSETAPLAPTIPAPAEKTPVKKKAKKAGATAGKRKASGPPVSELITKAVAASKERSGVSL
AALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEGKPK
AKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKKVKT
PQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
5 changes: 5 additions & 0 deletions CURATED_SET/draft_seeds/H1.3_(Homo_sapiens).fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
>Homo|NP_005311.1|H1.3_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
MSETAPLAPTIPAPAEKTPVKKKAKKAGATAGKRKASGPPVSELITKAVAASKERSGVSL
AALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEGKPK
AKKAGAAKPRKPAGAAKKPKKVAGAATPKKSIKKTPKKVKKPATAAGTKKVAKSAKKVKT
PQPKKAAKSPAKAKAPKPKAAKPKSGKPKVTKAKKAAPKKK
Empty file.
5 changes: 5 additions & 0 deletions CURATED_SET/draft_seeds/H1.4.fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
>Homo|NP_005312.1|H1.4_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
MSETAPAAPAAPAPAEKTPVKKKARKSAGAAKRKASGPPVSELITKAVAASKERSGVSLA
ALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAKPKA
KKAGAAKAKKPAGAAKKPKKATGAATPKKSAKKTPKKAKKPAAAAGAKKAKSPKKAKAAK
PKKAPKSPAKAKAVKPKAAKPKTAKPKAAKPKKAAAKKK
5 changes: 5 additions & 0 deletions CURATED_SET/draft_seeds/H1.4_(Homo_sapiens).fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
>Homo|NP_005312.1|H1.4_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
MSETAPAAPAAPAPAEKTPVKKKARKSAGAAKRKASGPPVSELITKAVAASKERSGVSLA
ALKKALAAAGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAKPKA
KKAGAAKAKKPAGAAKKPKKATGAATPKKSAKKTPKKAKKPAAAAGAKKAKSPKKAKAAK
PKKAPKSPAKAKAVKPKAAKPKTAKPKAAKPKKAAAKKK
Empty file.
5 changes: 5 additions & 0 deletions CURATED_SET/draft_seeds/H1.5.fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
>Homo|NP_005313.1|H1.5_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
MSETAPAETATPAPVEKSPAKKKATKKAAGAGAAKRKATGPPVSELITKAVAASKERNGL
SLAALKKALAAGGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAK
PKAKKAGAAKAKKPAGATPKKAKKAAGAKKAVKKTPKKAKKPAAAGVKKVAKSPKKAKAA
AKPKKATKSPAKPKAVKPKAAKPKAAKPKAAKPKAAKAKKAAAKKK
5 changes: 5 additions & 0 deletions CURATED_SET/draft_seeds/H1.5_(Homo_sapiens).fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
>Homo|NP_005313.1|H1.5_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
MSETAPAETATPAPVEKSPAKKKATKKAAGAGAAKRKATGPPVSELITKAVAASKERNGL
SLAALKKALAAGGYDVEKNNSRIKLGLKSLVSKGTLVQTKGTGASGSFKLNKKAASGEAK
PKAKKAGAAKAKKPAGATPKKAKKAAGAKKAVKKTPKKAKKPAAAGVKKVAKSPKKAKAA
AKPKKATKSPAKPKAVKPKAAKPKAAKPKAAKPKAAKAKKAAAKKK
Empty file.
5 changes: 5 additions & 0 deletions CURATED_SET/draft_seeds/H1.6_(Homo_sapiens).fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
>Homo|NP_005314.2|H1.6_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
MSETVPAASASAGVAAMEKLPTKKRGRKPAGLISASRKVPNLSVSKLITEALSVSQERVG
MSLVALKKALAAAGYDVEKNNSRIKLSLKSLVNKGILVQTRGTGASGSFKLSKKVIPKST
RSKAKKSVSAKTKKLVLSRDSKSPKTAKTNKRAKKPRATTPKTVRSGRKAKGAKGKQQQK
SPVKARASKSKLTQHHEVNVRKATSKK
6 changes: 6 additions & 0 deletions CURATED_SET/draft_seeds/H1.7_(Homo_sapiens).fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
>Homo|NP_861453.1|H1.7_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
MEQALTGEAQSRWPRRGGSGAMAEAPGPSGESRGHSATQLPAEKTVGGPSRGCSSSVLRV
SQLVLQAISTHKGLTLAALKKELRNAGYEVRRKSGRHEAPRGQAKATLLRVSGSDAAGYF
RVWKVPKPRRKPGRARQEEGTRAPWRTPAAPRSSRRRRQPLRKAARKAREVWRRNARAKA
KANARARRTRRARPRAKEPPCARAKEEAGATAADEGRGQAVKEDTTPRSGKDKRRSSKPR
EEKQEPKKPAQRTIQ
14 changes: 14 additions & 0 deletions CURATED_SET/draft_seeds/H1.8_(Homo_sapiens).fasta
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
>Homo|NP_001295191.1|H1.8_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
------------------------------------------------------------
------------------------------------------------------------
-------------------MAPATAPRRAGEAKGKGPKKPSEAKEDPPNVGKVKKAAKRP
AKVQKPPPKPGAATEKARKQGGAAKDTRAQSGEARKVPPKPDKAMRAPSSAGGLSRKAKA
KGSRSSQGDAEAYRKTKAESKSSKPTASKVKNGAASPTKKKVVAKAKAPKAGQGPNTKAA
APAKGSGSKVVPAHLSRKTEAPKGPRKAGLPIKASSSKVSSQRAEA
>Homo|NP_722575.1|H1.8_(Homo_sapiens) organism=Homo sapiens phylum=Chordata class=Mammalia
MAPGSVTSDISPSSTSTAGSSRSPESEKPGPSHGGVPPGGPSHSSLPVGRRHPPVLRMVL
EALQAGEQRRGTSVAAIKLYILHKYPTVDVLRFKYLLKQALATGMRRGLLARPLNSKARG
ATGSFKLVPKHKKKIQPRKMAPATAPRRAGEAKGKGPKKPSEAKEDPPNVGKVKKAAKRP
AKVQKPPPKPGAATEKARKQGGAAKDTRAQSGEARKVPPKPDKAMRAPSSAGGLSRKAKA
KGSRSSQGDAEAYRKTKAESKSSKPTASKVKNGAASPTKKKVVAKAKAPKAGQGPNTKAA
APAKGSGSKVVPAHLSRKTEAPKGPRKAGLPIKASSSKVSSQRAEA
Loading

0 comments on commit 7a4c742

Please sign in to comment.