Wednesday, August 6, 2014
Haplotype-based PCA of West Eurasia and Europe
The Principal Component Analyses (PCA) below are based on pairwise Identity-by-Descent (IBD) sharing inferred with fastIBD. My aim was to create PCA that took into account haplotype information to see how they might differ from similar plots based on unlinked loci (such as here).
Clearly, they're less reflective of geography and isolation-by-distance, and instead more profoundly influenced by relatively recent isolation, founder effects and/or rapid expansions, especially in Northern and Eastern Europe, and in particular among the Finns, Balts and East Slavs. Unfortunately, I don't have time to say much more about these results. But feel free to post any questions or observations in the comments below. I have done something very similar in the past, but with far fewer samples (see here).
Please note, to ensure that the PCA were as informative as possible I was forced to drop several populations that produced unusual results, probably because of extreme founder effects. This is why, for instance, there are no Ashkenazi Jews on any of the plots, and the only Finns you'll find come from western Finland.
I'll try this again on a much larger dataset when more samples come in, and also include populations from Central and South Asia.
Update 7/8/2014: Apparently some people are wondering what the plots with Finns and Jews look like. Here you go...