The Fst scores calculated using the Weir and Cockerham method comparing the populations are shown in Table 1. The pairwise Fst values show lower differentiation between African-European and African-Asian pairs and higher level of differentiation between European-Asian populations. The higher level of differentiation between European and Asian populations because patterns of migration through human history show that humans moved out of Africa to eventually form the different populations. Therefore, the African population serves as the evolutionary common ancestor to the European and Asian populations. In otherwords, after migrating out of Africa, the European and Asian populations took different evolutionary paths and the lower level of differentiation between each population and the African population points to the common ancestors that each population had.
Table 1: Mean and Weighted Fst scores
Population | Mean Fst | Weighted Fst |
---|---|---|
CEU v YRI | 0.22196 | 0.072607 |
CEU v CHB | 0.19103 | 0.066182 |
YRI v CHB | 0.22196 | 0.072607 |
The top snp is located on Chromosome 2 at position 135911422. A search of the dbSNP database shows this position maps to rs17261772. This SNP is located within exon 19 of the RAB3GAP1 gene. RAB3GAP1 (RAB3 GTPase activation protein subunit 1) encodes for the catalytic subunit of Rab GTPase activating protein complex. Mutations in this gene have been found in Warburg micro syndrome. Warburg micro syndrome is an autosomal recessive syndrome that is characterized by abnormalities in the eye, central nervous system and genitalia {Aligianis 2005}. There is a possible founder effect in the Danish population indicated by a common haplotype shared by unrelated individuals in that population {Morris-Rosendahl 2010}.
rs17261772 is a tri-allelic SNP with C/G/T as possible alternative alleles. The C allele leads to a synonymous variant while the G allele leads to a missense variant. In the CHB population, the SNP is monomorphic for the C allele while in the CEU population, the T and C alleles are present. The following table shows the allele frequencies in the dataset.
Table 2: Allele frequency in dataset
## CHROM POS FREQ POP
## 2 135911422 0.2353 CEU
## 2 135911422 1.0000 CHB
Table 3: rs17261772 allele frequency in various populations
Population | T frequency | C frequency |
---|---|---|
All | 27% | 73% |
EUR | 59% | 41% |
AFR | 41% | 96% |
ASN | 0% | 100% |
In most other populations, the T allele is the minor allele. Table 3 shows the allele frequency for different populations taken from 1000 genomes data. The high frequency of T allele in the CEU population is most likely due to a founder effect where a subset of the population that migrated into Europe contained the T allele and because both the T and C alleles code for the same amino acid, Phe, there was no selective pressure to keep one allele over the other.