<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" media="screen" href="/~d/styles/atom10full.xsl"?><?xml-stylesheet type="text/css" media="screen" href="http://feeds.feedburner.com/~d/styles/itemcontent.css"?><feed xmlns="http://www.w3.org/2005/Atom" xmlns:openSearch="http://a9.com/-/spec/opensearch/1.1/" xmlns:georss="http://www.georss.org/georss" xmlns:gd="http://schemas.google.com/g/2005" xmlns:thr="http://purl.org/syndication/thread/1.0" gd:etag="W/&quot;DkMCRno4cSp7ImA9WhVTFkQ.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865</id><updated>2012-03-02T15:21:07.439+02:00</updated><category term="Kurds" /><category term="Baltic" /><category term="Genomes Unzipped" /><category term="Armenians" /><category term="Central Asians" /><category term="Pygmies" /><category term="Finnic" /><category term="Iranian" /><category term="Results" /><category term="Basque" /><category term="23andMe" /><category term="Experiments" /><category term="IBS" /><category term="Indo-Europeans" /><category term="FTDNA" /><category term="Iberians" /><category term="Arabs" /><category term="GALORE" /><category term="ChromoPainter" /><category term="South Asians" /><category term="fastIBD" /><category term="Italians" /><category term="Oracle" /><category term="Sardinian" /><category term="Uralic" /><category term="Dodecad" /><category term="Germanic" /><category term="Hungarians" /><category term="Slavic" /><category term="Papuans" /><category term="Turkic" /><category term="Assyrians" /><category term="Population Concordance Ratio" /><category term="Jews" /><category term="DIYDodecad" /><category term="Romanians" /><category term="Greeks" /><category term="Africa" /><category term="Anatolia" /><category term="Mendels" /><category term="Zombies" /><category term="Europe" /><category term="Facebook" /><category term="Caucasus" /><category term="Balkans" /><category term="Siberians" /><category term="Cypriots" /><title>Dodecad Ancestry Project</title><subtitle type="html">Personal anthropology through the power of genomics</subtitle><link rel="http://schemas.google.com/g/2005#feed" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/posts/default" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/" /><link rel="next" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default?start-index=26&amp;max-results=25&amp;redirect=false&amp;v=2" /><author><name>Dienekes</name><uri>http://www.blogger.com/profile/02082684850093948970</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://4.bp.blogspot.com/-KXXemZigoEc/TlK7wIUP_EI/AAAAAAAAEAk/uJ-FlueoC6o/s220/prosopon.png" /></author><generator version="7.00" uri="http://www.blogger.com">Blogger</generator><openSearch:totalResults>201</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" type="application/atom+xml" href="http://feeds.feedburner.com/DodecadAncestryProject" /><feedburner:info xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" uri="dodecadancestryproject" /><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="hub" href="http://pubsubhubbub.appspot.com/" /><entry gd:etag="W/&quot;Ck4MRn85eSp7ImA9WhRaE0w.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-6205588921599661930</id><published>2012-02-15T14:59:00.000+02:00</published><updated>2012-02-15T15:03:07.121+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-02-15T15:03:07.121+02:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Turkic" /><category scheme="http://www.blogger.com/atom/ns#" term="Greeks" /><category scheme="http://www.blogger.com/atom/ns#" term="Experiments" /><category scheme="http://www.blogger.com/atom/ns#" term="Balkans" /><category scheme="http://www.blogger.com/atom/ns#" term="Anatolia" /><category scheme="http://www.blogger.com/atom/ns#" term="ChromoPainter" /><category scheme="http://www.blogger.com/atom/ns#" term="Slavic" /><category scheme="http://www.blogger.com/atom/ns#" term="Iranian" /><title>Correspondence between ChromoPainter clusters and ADMIXTURE components in Balkans/West Asia</title><content type="html">I took the 25 different inferred clusters from my recent &lt;a href="http://dodecad.blogspot.com/2012/02/chromopainterfinestructure-analysis-of.html"&gt;ChromoPainter analysis&lt;/a&gt;, and calculated their normalized median components in terms of the &lt;a href="http://dodecad.blogspot.com/2012/01/k12b-and-k7b-calculators.html"&gt;K12b calculator&lt;/a&gt;. This is a quite useful exercise, since it can show in what sense clusters are different from each other.&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://2.bp.blogspot.com/-d7tfmIGgYPk/TzujWHTE1xI/AAAAAAAAEfc/tSY8cEPxqqc/s1600/K12b.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="300" src="http://2.bp.blogspot.com/-d7tfmIGgYPk/TzujWHTE1xI/AAAAAAAAEfc/tSY8cEPxqqc/s400/K12b.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-WPn4zaHFx3Q/Tzujbp9fdZI/AAAAAAAAEfo/HLvvOrqdkNA/s1600/K12b_medianprops.png.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="172" src="http://3.bp.blogspot.com/-WPn4zaHFx3Q/Tzujbp9fdZI/AAAAAAAAEfo/HLvvOrqdkNA/s400/K12b_medianprops.png.jpg" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;br /&gt;&lt;/div&gt;
Here are two ways in which you may use this correspondence.
&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;1. Different clusters of a single population&lt;/b&gt;
&lt;br /&gt;
&lt;br /&gt;
For example, the Turks with partial Balkan ancestry &lt;a href="http://1.bp.blogspot.com/-YL8zClHpohE/Tzovm1pH0LI/AAAAAAAAEew/VE150K6bbDs/s1600/mappops.jpg"&gt;tend to belong&lt;/a&gt; to pop10, whereas those of Anatolian ancestry to pop13, and those from northeastern Anatolia to pop22. If we compare the admixture proportions of these three groups, we notice e.g.,
&lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;
&lt;li&gt;An excess of Atlantic_Baltic and North_European in pop10&lt;/li&gt;
&lt;li&gt;An excess of Caucasus in pop22&lt;/li&gt;
&lt;/ul&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-2-eAu_wrgw4/TzuqJEEBa-I/AAAAAAAAEf0/2jXCW3fsaIo/s1600/ADMIXTURE%2BIranians_12.jpg" imageanchor="1" style="clear: right; float: right; margin-bottom: 1em; margin-left: 1em;"&gt;&lt;img border="0" height="200" src="http://3.bp.blogspot.com/-2-eAu_wrgw4/TzuqJEEBa-I/AAAAAAAAEf0/2jXCW3fsaIo/s320/ADMIXTURE%2BIranians_12.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
Or, there is a group of 5 Iranians that belong to pop12, whereas the overwhelming majority of Iranians and Kurds belong to pop21. Strikingly, pop12 differs from all other populations in having substantial levels of East_African and Sub_Saharan. So, it seems that fineSTRUCTURE was able to infer that some Iranian individuals had this feature in common. These individuals were already evident in the Iranian population portrait (right), but fineSTRUCTURE was able to group them even though there were no African populations in the ChromoPainter analysis; presumably, the software was able to detect that these individuals shared a set of chunks that were quite different than is the norm for the Balkan/West Asian area.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;2. Related clusters&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
fineSTRUCTURE grouped the different populations in a &lt;a href="http://3.bp.blogspot.com/-A8hbOWSeAxk/TzoumCq_LvI/AAAAAAAAEeo/BrL2JPsRSB0/s1600/heatmap.png"&gt;tree structure&lt;/a&gt;. For example, it grouped pop18, the "North Balkan" cluster with pop23, the "Bulgarian-Romanian" one.&lt;br /&gt;
&lt;br /&gt;
Looking at the admixture proportions, we can tell that the two clusters do indeed seem quite similar, but there are some differences, e.g., an excess of North_European in pop18, and an excess of Caucasus in pop23. This makes sense given the geographical origin of individuals belonging to the two clusters.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-6205588921599661930?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/6205588921599661930/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2012/02/correspondence-between-chromopainter.html#comment-form" title="0 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/6205588921599661930?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/6205588921599661930?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2012/02/correspondence-between-chromopainter.html" title="Correspondence between ChromoPainter clusters and ADMIXTURE components in Balkans/West Asia" /><author><name>Dienekes</name><uri>http://www.blogger.com/profile/02082684850093948970</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://4.bp.blogspot.com/-KXXemZigoEc/TlK7wIUP_EI/AAAAAAAAEAk/uJ-FlueoC6o/s220/prosopon.png" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/-d7tfmIGgYPk/TzujWHTE1xI/AAAAAAAAEfc/tSY8cEPxqqc/s72-c/K12b.png" height="72" width="72" /><thr:total>0</thr:total></entry><entry gd:etag="W/&quot;C0ABSXk8eyp7ImA9WhRaEko.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-8348364299609618966</id><published>2012-02-14T13:24:00.000+02:00</published><updated>2012-02-15T04:09:18.773+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-02-15T04:09:18.773+02:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Balkans" /><category scheme="http://www.blogger.com/atom/ns#" term="Anatolia" /><category scheme="http://www.blogger.com/atom/ns#" term="ChromoPainter" /><category scheme="http://www.blogger.com/atom/ns#" term="Results" /><category scheme="http://www.blogger.com/atom/ns#" term="Caucasus" /><title>ChromoPainter/fineSTRUCTURE analysis of Balkans/West Asia</title><content type="html">I have carried out a &lt;a href="http://dienekes.blogspot.com/2012/01/finestructure-paper-lawson-et-al-2012.html"&gt;ChromoPainter/fineSTRUCTURE&lt;/a&gt; analysis of Balkans/West Asia. This is a slightly different dataset than the one used in the previous fastIBD &lt;a href="http://dodecad.blogspot.com/2012/01/fastibd-analysis-of-balkanswest-asia.html"&gt;analysis&lt;/a&gt; of the same region. It also took much longer (about a week, with two CPUs dedicated to the task) to complete, so it is not something that can be done routinely.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Technical details (skip if you want)&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
413 individuals from 33 populations were studied, on 258,100 SNPs, after --geno 0.03 --maf 0.01 filters were applied. Data were phased in Beagle with the default 10 iterations. Genetic maps from the HapMap were used. fineSTRUCTURE was used on ChromoPainter output, with 500,000 burnin/runtime iterations each.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;25 Inferred Populations&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
fineSTRUCTURE imposes a tree structure on a number of inferred populations. The following heatmap shows this tree structure; columns represent donor populations, rows, recipient ones.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-A8hbOWSeAxk/TzoumCq_LvI/AAAAAAAAEeo/BrL2JPsRSB0/s1600/heatmap.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="288" src="http://3.bp.blogspot.com/-A8hbOWSeAxk/TzoumCq_LvI/AAAAAAAAEeo/BrL2JPsRSB0/s320/heatmap.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
There was a total of 25 populations, labeled pop0, pop1, ..., pop24.&lt;br /&gt;
&lt;br /&gt;
The following table summarizes how many individuals from each original population were assigned to each inferred population:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://1.bp.blogspot.com/-YL8zClHpohE/Tzovm1pH0LI/AAAAAAAAEew/VE150K6bbDs/s1600/mappops.jpg" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="148" src="http://1.bp.blogspot.com/-YL8zClHpohE/Tzovm1pH0LI/AAAAAAAAEew/VE150K6bbDs/s320/mappops.jpg" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
I will limit myself to populations which include Dodecad Project members:&lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;
&lt;li&gt;pop6 includes a Project North Ossetian, as well as all Yunusbayev et al. North Ossetians&lt;/li&gt;
&lt;li&gt;pop7 is mainly Armenian&lt;/li&gt;
&lt;li&gt;pop16 is also mainly Armenian; it would be interesting to see whether this bipartite division of Armenians is in agreement with the one inferred in the previous fastIBD analysis&lt;/li&gt;
&lt;li&gt;pop8 is mainly Greek, and appears to be "continental Greek"; it also includes some other Balkan individuals&lt;/li&gt;
&lt;li&gt;pop14 is also Greek, and includes a variety of people with ancestry from Crete, the Aegean, Cyprus, Asia Minor, Cappadocia, and the Pontus as well as continental Greek. It could be labeled "eastern Greek"&lt;/li&gt;
&lt;li&gt;pop11 is Cypriot, including the single 100% Greek Cypriot of the Project, all 3 100% Turkish Cypriots, as well as a Turkish individual of partial Turkish_Cypriot ancestry&lt;/li&gt;
&lt;li&gt;pop10 is Turkish, and includes people with some ancestry from the Balkans, as well as Anatolia. It could be labelled "Balkan Turkish"&lt;/li&gt;
&lt;li&gt;pop13 is also Turkish, and seems to include people with ancestry exclusively from Anatolia, including almost all the Behar et al. Turks&lt;/li&gt;
&lt;li&gt;pop15 is Assyrian; some Assyrians also fall on the aforementioned pop16 which includes mainly Armenians&lt;/li&gt;
&lt;li&gt;pop18 could be labelled "North Balkan"; there is probably structure to be uncovered within this cluster, once more participants from the Balkans join the Project&lt;/li&gt;
&lt;li&gt;pop20 is "Georgian-Abkhazian"&lt;/li&gt;
&lt;li&gt;pop21 is "Kurdish-Iranian"&lt;/li&gt;
&lt;li&gt;pop22 could be labeled "Northeastern Anatolia" or (more classically) "Pontus-Colchis". It appears to unite various individuals from Northeastern Turkey and neighboring Georgia, having Karadeniz Turkish, Armenian, Pontic Greek, and Kartvelian ancestry. I strongly encourage participants from this region to join the Project, especially Pontic Greeks, as there are no 100% Pontic Greeks currently in the Project.&lt;/li&gt;
&lt;li&gt;pop23 is "Bulgarian-Romanian" mainly, and also includes one Serb. Once again, I emphasize that the power of this approach using haplotypes depends on participation, so I encourage all people from the Balkans to consider joining the Project.&lt;/li&gt;
&lt;/ul&gt;
&lt;b&gt;Principal Components Analysis&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
I have also used the PCA feature of fineSTRUCTURE to carry out principal components analysis. I am plotting the first two dimensions of this PCA, using my own visualization code that places labels in the average position on the plane:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://1.bp.blogspot.com/-UCP5T1pduGU/TzpBa9QbK3I/AAAAAAAAEe4/_uWuqnnb1zQ/s1600/1_2.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="320" src="http://1.bp.blogspot.com/-UCP5T1pduGU/TzpBa9QbK3I/AAAAAAAAEe4/_uWuqnnb1zQ/s320/1_2.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
&lt;b&gt;Results&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
Results for Project participants are included in the &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArJDEoCgzRKedG4wM2VFMGtiN3Y3QmFDd1phVTRNUEE"&gt;spreadsheet&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;
&lt;li&gt;Population matrix, shows how many individuals from each population were assigned to each cluster&lt;/li&gt;
&lt;li&gt;Z score population matrix, shows the normalized number of "chunks" from each donor population (columns) to each recipient (row). Do not compare across rows! The way to read this table is the following: for each row, higher values indicate more sharing. For example, the "Cypriots" population has pop11 as its main donor.&lt;/li&gt;
&lt;li&gt;Individual assignments: the pop number that all Project and reference IDs were assigned to&lt;/li&gt;
&lt;li&gt;Individual Chunkcounts: the number of chunks copied from its donor population (column) to each individual&lt;/li&gt;
&lt;li&gt;Individual PCA: your PCA co-ordinates that can help you find your dot on the Principal Components Analysis graphic (see above)&lt;/li&gt;
&lt;/ul&gt;
Averaged results were included only for populations with &amp;gt;=5 members.&lt;br /&gt;
The raw chunkcounts for all 413x413 individuals can be found &lt;a href="https://docs.google.com/open?id=0B7JDEoCgzRKeM2EzNjdmNzItYjMzOS00YTQ0LTljYzEtMmUzOWFjYmMzYTM2"&gt;here&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-8348364299609618966?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/8348364299609618966/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2012/02/chromopainterfinestructure-analysis-of.html#comment-form" title="6 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/8348364299609618966?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/8348364299609618966?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2012/02/chromopainterfinestructure-analysis-of.html" title="ChromoPainter/fineSTRUCTURE analysis of Balkans/West Asia" /><author><name>Dienekes</name><uri>http://www.blogger.com/profile/02082684850093948970</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://4.bp.blogspot.com/-KXXemZigoEc/TlK7wIUP_EI/AAAAAAAAEAk/uJ-FlueoC6o/s220/prosopon.png" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://3.bp.blogspot.com/-A8hbOWSeAxk/TzoumCq_LvI/AAAAAAAAEeo/BrL2JPsRSB0/s72-c/heatmap.png" height="72" width="72" /><thr:total>6</thr:total></entry><entry gd:etag="W/&quot;CUIFQXs4eyp7ImA9WhRbFU8.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-765175594304674776</id><published>2012-02-06T12:18:00.001+02:00</published><updated>2012-02-06T12:18:30.533+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-02-06T12:18:30.533+02:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Dodecad" /><title>Other testing companies</title><content type="html">&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;
The Dodecad Project is not affiliated with any genetic testing companies. Until now, I have included Project participants from 23andMe and FamilyTreeDNA "Family Finder" tests, but it has come to my attention that there are new players in the field, such as Ancestry.com (see post on &lt;a href="http://www.yourgeneticgenealogist.com/2012/01/update-on-new-autosomal-dna-test-from.html"&gt;Your Genetic Genealogist&lt;/a&gt;) and Lumigenix (see post on &lt;a href="http://www.genomesunzipped.org/2012/02/review-of-the-lumigenix-comprehensive-personal-genome-service.php"&gt;GenomesUnzipped&lt;/a&gt;).&lt;br /&gt;
&lt;br /&gt;
If you have data from any company entering this field, please contact me at dodecad@gmail.com (do not send data right away!). That way, I can find out how many markers are in common between the new tests and my existing datasets, and figure out how easy it will be to convert them for use in the Project and in DIYDodecad.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-765175594304674776?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/765175594304674776/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2012/02/other-testing-companies.html#comment-form" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/765175594304674776?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/765175594304674776?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2012/02/other-testing-companies.html" title="Other testing companies" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>1</thr:total></entry><entry gd:etag="W/&quot;C0ECRHo_cCp7ImA9WhRbEEk.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-8093957617440164195</id><published>2012-01-31T22:27:00.000+02:00</published><updated>2012-01-31T22:27:45.448+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-01-31T22:27:45.448+02:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="DIYDodecad" /><category scheme="http://www.blogger.com/atom/ns#" term="Results" /><category scheme="http://www.blogger.com/atom/ns#" term="Oracle" /><title>'K12b' and 'K7b' calculators</title><content type="html">&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;
I am releasing two new calculators with K=12 and K=7 components, named 'K12b' and 'K7b'. You can scroll down to the bottom if you are just interested in the downloads, or read on.&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;New Features&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
The new &lt;b&gt;'K12b'&lt;/b&gt; calculator is an update of the previous &lt;a href="http://dienekes.blogspot.com/2011/12/first-analysis-of-metspalu-et-al-2011.html"&gt;K12a&lt;/a&gt; one, that was inferred using all the new samples submitted during the last submission opportunity. The 12 components are still roughly the same, although their allele frequencies may have changed by a bit, so existing participants can expect to have slightly altered results, and new participants in the Project more so, since their data are now contributing to the creation of the new tool. Non-participants can, of course, use the new calculator with &lt;a href="http://dodecad.blogspot.com/2011/09/do-it-yourself-dodecad-v-21.html"&gt;DIYDodecad&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
I have also taken the opportunity to do some minor tweaks. I am releasing &lt;b&gt;population portraits&lt;/b&gt; for K12b (which were lacking in K12a); I've changed my visualization code so that the sample IDs of non-Dodecad populations can now be seen in the barplots. This may be useful for anyone else using these reference populations, by quickly identifying potential outliers in them.&lt;br /&gt;
&lt;br /&gt;
I have also decided to use &lt;b&gt;normalized median &lt;/b&gt;admixture proportions for the populations. For example, if 5 individuals in a population have 0, 0, 0.2, 0.5, 10.0% of a particular component, then the average is 2.14%, but the median is 0.2%. By using the median, the proportions become less susceptible to the presence of outliers (such as the 10%). However, if the median is calculated over every component separately, it is no longer guaranteed that the components will add up to 100%; this can be addressed by re-normalizing them (scaling them by a constant factor) so that they do. I believe that use of the normalized median will not only give better proportions that are less susceptible to outliers, but will also improve results of the new Dodecad Oracle for K12b.&lt;br /&gt;
&lt;br /&gt;
At the same time I am also releasing &lt;b&gt;'K7b' &lt;/b&gt;which is an update of the existing '&lt;a href="http://dodecad.blogspot.com/2011/10/eurasia7-calculator.html"&gt;eurasia7&lt;/a&gt;' calculator and which has been built on exactly the same dataset as 'K12b' but at a lower (K=7) level of detail.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Information on K7b&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
Information &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadHZ6SHpiLTNTa3lsUmZJY2pQblVRR2c"&gt;spreadsheet&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
Normalized median admixture proportions barplot for all included populations (a high resolution version of this is included in the download bundle):
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-C9ebS214BrI/TyewlRJboII/AAAAAAAAEb8/PFnwD3ygE2I/s1600/_7.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="64" src="http://3.bp.blogspot.com/-C9ebS214BrI/TyewlRJboII/AAAAAAAAEb8/PFnwD3ygE2I/s320/_7.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;br /&gt;
Table of Fst divergences:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-Lr5Pv8z6I4g/Tyewvavt5xI/AAAAAAAAEcE/94LZet-K_jY/s1600/fst.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="78" src="http://3.bp.blogspot.com/-Lr5Pv8z6I4g/Tyewvavt5xI/AAAAAAAAEcE/94LZet-K_jY/s320/fst.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
Neighbor-joining tree (based on above):&lt;br /&gt;
&lt;a href="http://3.bp.blogspot.com/-2JplvwzgtOw/Tyew3hYgzMI/AAAAAAAAEcM/vIdqDVn-fpI/s1600/nj.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em; text-align: center;"&gt;&lt;img border="0" height="320" src="http://3.bp.blogspot.com/-2JplvwzgtOw/Tyew3hYgzMI/AAAAAAAAEcM/vIdqDVn-fpI/s320/nj.png" width="320" /&gt;&lt;/a&gt;
&lt;br /&gt;
&lt;b&gt;Information on K12b&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
Information &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArJDEoCgzRKedEY4Y3lTUVBaaFp0bC1zZlBDcTZEYlE"&gt;spreadsheet&lt;/a&gt;.&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
Normalized median admixture proportions barplot for all included populations (a high resolution version of this is included in the download bundle):&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://4.bp.blogspot.com/-E7FSQjgXIkI/TybKSHs8BhI/AAAAAAAAEb0/EFVxw--IEVU/s1600/_12.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="64" src="http://4.bp.blogspot.com/-E7FSQjgXIkI/TybKSHs8BhI/AAAAAAAAEb0/EFVxw--IEVU/s320/_12.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
Table of Fst divergences:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://1.bp.blogspot.com/-kXZ8Mxu5dns/TybJ7CQJuPI/AAAAAAAAEbk/QYJc4rvQ3ww/s1600/fst.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="62" src="http://1.bp.blogspot.com/-kXZ8Mxu5dns/TybJ7CQJuPI/AAAAAAAAEbk/QYJc4rvQ3ww/s320/fst.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
Neighbor-joining tree (based on above):&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://4.bp.blogspot.com/-cYC4bY-koB0/TybKBMtXRSI/AAAAAAAAEbs/zka5fg8fmRQ/s1600/nj.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="320" src="http://4.bp.blogspot.com/-cYC4bY-koB0/TybKBMtXRSI/AAAAAAAAEbs/zka5fg8fmRQ/s320/nj.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;b&gt;Multidimensional Scaling Plots of K12b and K7b&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
I have created MDS plots using synthetic individuals representing the 12 ancestral components of K12b and the 7 ancestral components of K7b. By including both in the same plot, one gets an idea of the relationship of the components at different resolution. The first 10 dimensions can be seen below:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-o3aBKiyqagc/TyfglQUIlPI/AAAAAAAAEdI/LQj6qi7LE8k/s1600/1_2.png" imageanchor="1"&gt;&lt;img border="0" height="200" src="http://3.bp.blogspot.com/-o3aBKiyqagc/TyfglQUIlPI/AAAAAAAAEdI/LQj6qi7LE8k/s200/1_2.png" width="200" /&gt;&lt;/a&gt;&lt;a href="http://1.bp.blogspot.com/-CvJXLQdObYU/TyfggcZGXfI/AAAAAAAAEc8/LzgFqC_xbhU/s1600/3_4.png" imageanchor="1"&gt;&lt;img border="0" height="200" src="http://1.bp.blogspot.com/-CvJXLQdObYU/TyfggcZGXfI/AAAAAAAAEc8/LzgFqC_xbhU/s200/3_4.png" width="200" /&gt;&lt;/a&gt;&lt;a href="http://3.bp.blogspot.com/-cu1NH3d5Xhw/Tyfgb6JeF7I/AAAAAAAAEcw/HakO3HnTREo/s1600/5_6.png" imageanchor="1"&gt;&lt;img border="0" height="200" src="http://3.bp.blogspot.com/-cu1NH3d5Xhw/Tyfgb6JeF7I/AAAAAAAAEcw/HakO3HnTREo/s200/5_6.png" width="200" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://4.bp.blogspot.com/-Q6zbEHJdk04/TyfgOfOdC_I/AAAAAAAAEck/0cLNMa6lYFw/s1600/7_8.png" imageanchor="1"&gt;&lt;img border="0" height="200" src="http://4.bp.blogspot.com/-Q6zbEHJdk04/TyfgOfOdC_I/AAAAAAAAEck/0cLNMa6lYFw/s200/7_8.png" width="200" /&gt;&lt;/a&gt;&lt;a href="http://1.bp.blogspot.com/-LJeGxIzZZaU/TyfgJ9_AH9I/AAAAAAAAEcY/EUibFcYHTGI/s1600/9_10.png" imageanchor="1"&gt;&lt;img border="0" height="200" src="http://1.bp.blogspot.com/-LJeGxIzZZaU/TyfgJ9_AH9I/AAAAAAAAEcY/EUibFcYHTGI/s200/9_10.png" width="200" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;br /&gt;&lt;/div&gt;
Here is a blowup of the main West Eurasian groups from the plot of the first two dimensions:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://2.bp.blogspot.com/-gtp9YiXJjkc/TyfhuRvn8uI/AAAAAAAAEdU/-fgt57DJRio/s1600/thesix_global.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="380" src="http://2.bp.blogspot.com/-gtp9YiXJjkc/TyfhuRvn8uI/AAAAAAAAEdU/-fgt57DJRio/s400/thesix_global.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
Some observations:&lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;
&lt;li&gt;The Atlantic_Med component which is bi-modal in Basques and Sardinians occupies the apex of the figure; this makes sense, since Southwest Europe is quite distant (along land routes) to both Asia and Africa.&lt;/li&gt;
&lt;li&gt;The Caucasus component is surrounded by most of the others; this is consistent with my theory elaborated in &lt;a href="http://dienekes.blogspot.com/2011/12/womb-of-nations-how-west-eurasians-came.html"&gt;The womb of nations: how West Eurasians came to be&lt;/a&gt;.&lt;/li&gt;
&lt;li&gt;The Atlantic_Baltic component (from K=7) is intermediate between the Atlantic_Med and North_European components.&lt;/li&gt;
&lt;li&gt;Similarly, the West_Asian component (from K=7) is intermediate between the Caucasus and Gedrosia components; the Gedrosia component diverges in the direction of the Asian groups (not shown in this figure), and in particular of South Asians. This divergence can also be seen in the plot of dimension #3.&lt;/li&gt;
&lt;li&gt;The Northwest_African component diverges in the direction of Sub-Saharan Africans.&lt;/li&gt;
&lt;/ul&gt;
&lt;br /&gt;
&lt;b&gt;Technical Details&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
A dataset of 268 populations/3,115 individuals was assembled. A total of&amp;nbsp;265,519 SNPs are in common in the various source datasets as well as the 23andMe v2/v3 and Family Finder platforms. Iterative removal of distant relatives was performed by removing one individual from each pair within a population if that pair had a RATIO of 2.5 or greater or more than the mean and two standard deviations in IBD analysis performed in PLINK 1.07. A total of 2,675 individuals remained. 4 individuals were removed for low genotyping rate (less than 97%).&amp;nbsp;264,328 SNPs remained after removal of SNPs with less than 97% genotyping rate or 1% minor allele frequency. 166,770 SNPs remained after linkage-based disequilibrium pruning (--indep-pairwise 200 25 0.4). The final set thus consisted of 2,671 individuals/268 populations/166,770 SNPs. Ancestral populations (components) were inferred using ADMIXTURE 1.21, with K=7 and K=12 and default parameters.&lt;br /&gt;
&lt;br /&gt;
No individuals were removed from the source datasets, except in the case of the Armenians_Y sample, where one individual (ID: armenia3) was dropped because he/she was the same as a Dodecad Project participant.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Downloads&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
K7b population &lt;a href="https://docs.google.com/open?id=0B7JDEoCgzRKeYTA0ZGE1N2ItZTE0ZC00YjdmLWE1NWItYTk0NjdjMjc1OGZm"&gt;portraits&lt;/a&gt;, &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadHZ6SHpiLTNTa3lsUmZJY2pQblVRR2c"&gt;spreadsheet&lt;/a&gt;, and DIYDodecad &lt;a href="https://docs.google.com/open?id=0B7AJcY18g2GaZGVlMDJiNWItMDdmMy00YWYxLTljNTAtMzcyNzRkMzc1MTRj"&gt;files&lt;/a&gt;.&lt;br /&gt;
K12b population &lt;a href="https://docs.google.com/open?id=0B7JDEoCgzRKeMzgzOWVhNjUtZWIxYy00MjI0LTlkYTMtNGNkZjhmMmI3NjQz"&gt;portraits&lt;/a&gt;, &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArJDEoCgzRKedEY4Y3lTUVBaaFp0bC1zZlBDcTZEYlE&amp;amp;hl=en_US#gid=0"&gt;spreadsheet&lt;/a&gt;, and DIYDodecad &lt;a href="https://docs.google.com/open?id=0B7AJcY18g2GaZWFhZTYxOTUtNDI2Yi00M2VlLWEzZGYtODIyNzUxNWJlZTdl"&gt;files&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
Dodecad Oracle (K12b edition) can be downloaded from &lt;a href="https://docs.google.com/open?id=0B7AJcY18g2GaYWNiZjI3ZGYtM2EwYy00OTdjLTgwNjUtMWZkODFhNDQ5NjFi"&gt;here&lt;/a&gt;. Please read the instructions of the &lt;a href="http://dodecad.blogspot.com/2011/12/dodecad-oracle-k12a-edition.html"&gt;previous&lt;/a&gt; Oracle on how to use this tool. Note that the number of populations is now 223.&lt;br /&gt;
&lt;br /&gt;
To use either calculator with &lt;a href="http://dodecad.blogspot.com/2011/09/do-it-yourself-dodecad-v-21.html"&gt;DIYDodecad&lt;/a&gt;, with your 23andMe or Family Finder data, follow the instructions in the README file, but substitute 'K12b' or 'K7b' for 'dv3'.&lt;br /&gt;
&lt;br /&gt;
Project participant results for both K7b and K12b are found in the spreadsheets in the Individual Results tab.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Terms of Use&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
You are free to use K12b and K7b, including all downloaded files for any non-commercial purpose, as long as you attribute them to the Dodecad Project and to Dienekes Pontikos as follows:&lt;br /&gt;
&lt;br /&gt;
The [K7b/K12b] admixture calculator is courtesy of &lt;a href="http://dienekes.blogspot.com/"&gt;Dienekes Pontikos&lt;/a&gt; and was developed as part of the &lt;a href="http://dodecad.blogspot.com/"&gt;Dodecad Ancestry Project&lt;/a&gt;;&amp;nbsp;more information &lt;a href="http://dodecad.blogspot.com/2012/01/k12b-and-k7b-calculators.html"&gt;here&lt;/a&gt;.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-8093957617440164195?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/8093957617440164195/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2012/01/k12b-and-k7b-calculators.html#comment-form" title="26 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/8093957617440164195?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/8093957617440164195?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2012/01/k12b-and-k7b-calculators.html" title="'K12b' and 'K7b' calculators" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://3.bp.blogspot.com/-C9ebS214BrI/TyewlRJboII/AAAAAAAAEb8/PFnwD3ygE2I/s72-c/_7.png" height="72" width="72" /><thr:total>26</thr:total></entry><entry gd:etag="W/&quot;CEcERnY_fSp7ImA9WhRUE0o.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-7415433921685442670</id><published>2012-01-24T04:10:00.000+02:00</published><updated>2012-01-24T04:26:47.845+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-01-24T04:26:47.845+02:00</app:edited><title>Submission Opportunity is OVER</title><content type="html">&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;
Thank you everyone for submitting their data. I will not accept any more data at this time. A couple of submissions came in at the last second, so I accepted one more than I promised, who got the brand new DPD001 ID.&lt;br /&gt;
&lt;br /&gt;
Those who submitted in time will get their IDs and their results will be posted in the &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArJDEoCgzRKedGdRbkxKMDdlZkJWc21tdkpldWxwVmc"&gt;K12a spreadsheet&lt;/a&gt;.&lt;br /&gt;
Additionally, I will run all participants over &lt;a href="http://dodecad.blogspot.com/2011/12/world9-calculator.html"&gt;world9&lt;/a&gt;, so that &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadGlpc3JQaVdQbS1QTWF3SzNjTVdfZEE"&gt;spreadsheet&lt;/a&gt; will also include everybody.&lt;br /&gt;
&lt;br /&gt;
From now on, I will be reworking some of the Project tools to make use of newer samples submitted during this submission opportunity.&lt;br /&gt;
&lt;br /&gt;
If you wish to submit your data during this off period, note that you must contact me at dodecad@gmail.com. &lt;b&gt;Do not send data at this time, unless I indicate that I can accept it!&lt;/b&gt; I will let you know if I can process it, and note that I will normally only consider those who matched the &lt;a href="http://dodecad.blogspot.com/2012/01/submission-opportunity-january-2012.html"&gt;eligibility criteria&lt;/a&gt; of the most recent submission period.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-7415433921685442670?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/7415433921685442670/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2012/01/submission-opportunity-is-over.html#comment-form" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/7415433921685442670?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/7415433921685442670?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2012/01/submission-opportunity-is-over.html" title="Submission Opportunity is OVER" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>1</thr:total></entry><entry gd:etag="W/&quot;CEcCQ3czeSp7ImA9WhRUE0o.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-3650502082685212000</id><published>2012-01-23T22:23:00.000+02:00</published><updated>2012-01-24T04:27:42.981+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-01-24T04:27:42.981+02:00</app:edited><title>Open submission for everybody until DOD999</title><content type="html">&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;
&lt;b&gt;SUBMISSION OPPORTUNITY IS NOW &lt;a href="http://dodecad.blogspot.com/2012/01/submission-opportunity-is-over.html"&gt;OVER&lt;/a&gt;&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Everyone on the planet is invited to submit their data, regardless of their ancestry&lt;/b&gt;.&lt;br /&gt;
&lt;br /&gt;
All other &lt;a href="http://dodecad.blogspot.com/2012/01/submission-opportunity-january-2012.html"&gt;rules&lt;/a&gt; apply, especially the &lt;b&gt;no relatives&lt;/b&gt; clause. Additionally, I will accept &lt;b&gt;a single submission from each submitter&lt;/b&gt;, so don't submit all your friends. Moreover, regardless of your ancestry, you should &lt;b&gt;let me know the origin of your four grandparents.&lt;/b&gt;&lt;br /&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
There are 35 spots open, so hurry, since last time I had a free-for-all I had to close it down after about 12 hours due to overwhelming demand. I will close project submission after I assign DOD999.&lt;/div&gt;
&lt;div&gt;
&lt;br /&gt;&lt;/div&gt;
&lt;div&gt;
All submissions after I post the end-of-submission announcement on the blog will be ignored. If you post this in any forums or mailing lists, include this post link so that people will know whether the opportunity is over.&lt;/div&gt;
&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-3650502082685212000?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/3650502082685212000/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2012/01/open-submission-for-everybody-until.html#comment-form" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/3650502082685212000?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/3650502082685212000?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2012/01/open-submission-for-everybody-until.html" title="Open submission for everybody until DOD999" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>1</thr:total></entry><entry gd:etag="W/&quot;DU8MRHo9eSp7ImA9WhRUEUo.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-1507163015182312324</id><published>2012-01-21T22:27:00.000+02:00</published><updated>2012-01-21T22:31:25.461+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-01-21T22:31:25.461+02:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="fastIBD" /><category scheme="http://www.blogger.com/atom/ns#" term="Africa" /><category scheme="http://www.blogger.com/atom/ns#" term="GALORE" /><category scheme="http://www.blogger.com/atom/ns#" term="Assyrians" /><category scheme="http://www.blogger.com/atom/ns#" term="Results" /><category scheme="http://www.blogger.com/atom/ns#" term="Jews" /><category scheme="http://www.blogger.com/atom/ns#" term="Arabs" /><title>fastIBD analysis of Afroasiatic groups (Jews, Arabs, Assyrians, Berbers, Somalis, Amharas, etc.)</title><content type="html">Please refer to the previous analysis on the&amp;nbsp;&lt;a href="http://dodecad.blogspot.com/2012/01/fastibd-analysis-of-balkanswest-asia.html"&gt;Balkans/West Asia&lt;/a&gt;&amp;nbsp;for more information about the interpretation of this type of analysis.&lt;br /&gt;
&lt;br /&gt;
I am very pleased with the way this analysis of Afroasiatic groups has turned out, revealing an exceptional degree of resolution. I invite individuals from the Near East and Africa who are&lt;a href="http://dodecad.blogspot.com/2012/01/submission-opportunity-january-2012.html"&gt; eligible&lt;/a&gt;, to submit their data, so that they can be included in future runs of this kind.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Clusters Galore&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
45 clusters were inferred with 29 dimensions.&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://2.bp.blogspot.com/-lfHdp5xcSPg/TxsWaDQak1I/AAAAAAAAAtQ/8_QUwn5jEvE/s1600/galore_afroasiatic.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="197" src="http://2.bp.blogspot.com/-lfHdp5xcSPg/TxsWaDQak1I/AAAAAAAAAtQ/8_QUwn5jEvE/s320/galore_afroasiatic.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
I can't comment on all 45 clusters, so I'll just limit myself to the ones that are significantly represented among Project participants: &lt;b&gt;1.&lt;/b&gt; Ashkenazi, &lt;b&gt;4.&lt;/b&gt; Assyrian/Mandaean, &lt;b&gt;6.&lt;/b&gt; Somali, &lt;b&gt;7.&lt;/b&gt; Moroccan, &lt;b&gt;8.&lt;/b&gt; Algerian/Tunisian, &lt;b&gt;9.&lt;/b&gt; Sephardic, &lt;b&gt;10.&lt;/b&gt; Morocco Jews, &lt;b&gt;11.&lt;/b&gt; Iran/Iraq Jews, &lt;b&gt;12.&lt;/b&gt; Non-Jewish Ethiopians, &lt;b&gt;13.&lt;/b&gt; Saudi, &lt;b&gt;14.&lt;/b&gt; Arab #1, &lt;b&gt;15.&lt;/b&gt; Arab #2, &lt;b&gt;16.&lt;/b&gt; Egyptian&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Inter-Population IBD&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://1.bp.blogspot.com/-4Pc47nT1Fos/TxsYtZpGoYI/AAAAAAAAAtY/DhsZ5FIADFo/s1600/heatmap.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="320" src="http://1.bp.blogspot.com/-4Pc47nT1Fos/TxsYtZpGoYI/AAAAAAAAAtY/DhsZ5FIADFo/s320/heatmap.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;b&gt;Results for Project Participants&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
The results can be found in the &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadGIxMmNvMEVJYlR2ZWdGYzJtaV9HV1E"&gt;spreadsheet&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
I have also added the full &lt;a href="https://docs.google.com/open?id=0B7AJcY18g2GaMzVhY2M5ZDgtMzk5NC00MDJlLWI0ZjUtOWQ4YzMyMzY5MDE2"&gt;IBD sharing matrix&lt;/a&gt;&amp;nbsp;which lists how many Morgans of sequence are estimated to be IBD with probability greater than 10^-6 between all pairs of individuals.&lt;br /&gt;
&lt;br /&gt;
You can google any non-Project sample IDs to get some more information about their origin.&amp;nbsp;For example,&amp;nbsp;&lt;a href="https://www.google.com/search?sourceid=chrome&amp;amp;ie=UTF-8&amp;amp;q=GSM536710"&gt;GSM536710&lt;/a&gt;&amp;nbsp;is an Iraqi Jew who shares about half his genome with&amp;nbsp;&lt;a href="https://www.google.com/search?sourceid=chrome&amp;amp;ie=UTF-8&amp;amp;q=GSM536714"&gt;GSM536714&lt;/a&gt;, also an Iraqi Jew. These two samples are almost certainly first-degree relatives. Or,&amp;nbsp;GSM537032, a Samaritan shares 740-1,480cM with the other 2 Samaritans, an exceptional amount in this small and probably highly inbred population.&lt;br /&gt;
&lt;br /&gt;
You can manipulate this matrix in R. After you download it and unzip it, you can load it into R as follows:&lt;br /&gt;
&lt;br /&gt;
X&amp;lt;-read.table('afroasiatic_ibd_sharing.txt',row.names=1,header=T)&lt;br /&gt;
&lt;br /&gt;
Then, you can, for example, sort the IBD sharing for a particular individual, as follows:&lt;br /&gt;
&lt;br /&gt;
sort(X['DOD026',])&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-1507163015182312324?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/1507163015182312324/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2012/01/fastibd-analysis-of-afroasiatic-groups.html#comment-form" title="16 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/1507163015182312324?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/1507163015182312324?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2012/01/fastibd-analysis-of-afroasiatic-groups.html" title="fastIBD analysis of Afroasiatic groups (Jews, Arabs, Assyrians, Berbers, Somalis, Amharas, etc.)" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/-lfHdp5xcSPg/TxsWaDQak1I/AAAAAAAAAtQ/8_QUwn5jEvE/s72-c/galore_afroasiatic.png" height="72" width="72" /><thr:total>16</thr:total></entry><entry gd:etag="W/&quot;CUEFRH4-eCp7ImA9WhRUEUQ.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-1013930157208072251</id><published>2012-01-21T11:49:00.000+02:00</published><updated>2012-01-22T02:53:35.050+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-01-22T02:53:35.050+02:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="fastIBD" /><category scheme="http://www.blogger.com/atom/ns#" term="Baltic" /><category scheme="http://www.blogger.com/atom/ns#" term="Hungarians" /><category scheme="http://www.blogger.com/atom/ns#" term="Greeks" /><category scheme="http://www.blogger.com/atom/ns#" term="GALORE" /><category scheme="http://www.blogger.com/atom/ns#" term="Germanic" /><category scheme="http://www.blogger.com/atom/ns#" term="Slavic" /><category scheme="http://www.blogger.com/atom/ns#" term="Results" /><category scheme="http://www.blogger.com/atom/ns#" term="Europe" /><category scheme="http://www.blogger.com/atom/ns#" term="Romanians" /><title>fastIBD analysis of Central/Eastern Europe</title><content type="html">Please refer to the previous analysis on the&amp;nbsp;&lt;a href="http://dodecad.blogspot.com/2012/01/fastibd-analysis-of-balkanswest-asia.html"&gt;Balkans/West Asia&lt;/a&gt;&amp;nbsp;for more information about the interpretation of this type of analysis.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Clusters Galore&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
The Clusters Galore can be found in the &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadDVVQnlkS2xKTnhPZE42R0toN3JGQ3c"&gt;spreadsheet&lt;/a&gt;. After inspection of the 23 clusters inferred with 21 dimensions, they could be described as:&lt;br /&gt;
&lt;br /&gt;
&lt;ol&gt;
&lt;li&gt;Mordvin&lt;/li&gt;
&lt;li&gt;East Slavic&lt;/li&gt;
&lt;li&gt;Polish-Ukrainian&lt;/li&gt;
&lt;li&gt;East Balkan&lt;/li&gt;
&lt;li&gt;Vologda Russians&lt;/li&gt;
&lt;li&gt;Lithuanian&lt;/li&gt;
&lt;li&gt;Central European (combining many groups with small sample sizes)&lt;/li&gt;
&lt;li&gt;A couple of related (?) individuals&lt;/li&gt;
&lt;li&gt;Anatolian&lt;/li&gt;
&lt;li&gt;Greek&lt;/li&gt;
&lt;li&gt;Chuvash&lt;/li&gt;
&lt;li&gt;Ossetian&lt;/li&gt;
&lt;li&gt;A couple of related individuals&lt;/li&gt;
&lt;li&gt;A couple of related individuals
&lt;/li&gt;
&lt;li&gt;Balkar&lt;/li&gt;
&lt;li&gt;A couple of related individuals
&lt;/li&gt;
&lt;li&gt;Chechen&lt;/li&gt;
&lt;li&gt;Kumyk&lt;/li&gt;
&lt;li&gt;A couple of related individuals
&lt;/li&gt;
&lt;li&gt;Adygei&lt;/li&gt;
&lt;li&gt;Lezgin #1 (main)&lt;/li&gt;
&lt;li&gt;Lezgin #2&lt;/li&gt;
&lt;li&gt;Lezgin #3&lt;/li&gt;
&lt;/ol&gt;
If you belong to a population with few other participants, you might end up latching onto a cluster dominated by a bigger group. This does not mean that your population is not distinctive, only that there are not enough samples to reveal its distinctiveness if it exists.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Inter-Population IBD&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://2.bp.blogspot.com/-Gj9uvbFOC9E/TxqJGFkYGZI/AAAAAAAAAtI/4o1b_Aa9o2Q/s1600/heatmap.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="320" src="http://2.bp.blogspot.com/-Gj9uvbFOC9E/TxqJGFkYGZI/AAAAAAAAAtI/4o1b_Aa9o2Q/s320/heatmap.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;b&gt;Results for Dodecad Participants&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
Results can be found in the &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadDVVQnlkS2xKTnhPZE42R0toN3JGQ3c"&gt;spreadsheet&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
If you have joined the Project, please consider leaving a comment in the&lt;a href="http://dodecad.blogspot.com/2010/11/information-about-project-samples.html"&gt; Information about Project samples&lt;/a&gt; thread. That will help others make better sense of their results, e.g., if you find that you belong in the same cluster with some other individual, you might want to know something about their origins.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;UPDATE: &lt;/b&gt;I have added the &lt;a href="https://docs.google.com/open?id=0B7AJcY18g2GaMTc1YzQ5NzMtZDkxNi00MWJiLWJlODQtZjRjNDkzNzVlYzc5"&gt;IBD sharing matrix&lt;/a&gt;.See &lt;a href="http://dodecad.blogspot.com/2012/01/fastibd-analysis-of-afroasiatic-groups.html"&gt;here &lt;/a&gt;on how to use it.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-1013930157208072251?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/1013930157208072251/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2012/01/fastibd-analysis-of-centraleastern.html#comment-form" title="11 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/1013930157208072251?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/1013930157208072251?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2012/01/fastibd-analysis-of-centraleastern.html" title="fastIBD analysis of Central/Eastern Europe" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/-Gj9uvbFOC9E/TxqJGFkYGZI/AAAAAAAAAtI/4o1b_Aa9o2Q/s72-c/heatmap.png" height="72" width="72" /><thr:total>11</thr:total></entry><entry gd:etag="W/&quot;DUUHQXwycCp7ImA9WhRVGUs.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-5008815011829643773</id><published>2012-01-19T12:00:00.001+02:00</published><updated>2012-01-19T12:00:30.298+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-01-19T12:00:30.298+02:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="fastIBD" /><category scheme="http://www.blogger.com/atom/ns#" term="South Asians" /><category scheme="http://www.blogger.com/atom/ns#" term="GALORE" /><category scheme="http://www.blogger.com/atom/ns#" term="Results" /><title>fastIBD analysis of South Asia</title><content type="html">Please refer to the previous analysis on the &lt;a href="http://dodecad.blogspot.com/2012/01/fastibd-analysis-of-balkanswest-asia.html"&gt;Balkans/West Asia&lt;/a&gt; for more information about the interpretation of this type of analysis.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Clusters Galore&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
The Clusters Galore analysis can be found in the &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadGZBWHNoNDdSMl8tQVBKbHRIWE1tZ3c"&gt;spreadsheet&lt;/a&gt;. 59 clusters were inferred with 47 MDS dimensions. The very fine-scale structure (I only considered the first 50 dimensions, but many more seemed significant than in any previous experiment) is probably the result of the size of the South Asian population, as well as the practice of endogamy associated with the caste system. High intra-population IBD sharing is also evident in the following (notice how well-defined the diagonal is):&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Inter-Population IBD&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://4.bp.blogspot.com/-_nWv2f5aqGo/TxfmRLO-SwI/AAAAAAAAAtA/VKDJcyrznYw/s1600/heatmap.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="320" src="http://4.bp.blogspot.com/-_nWv2f5aqGo/TxfmRLO-SwI/AAAAAAAAAtA/VKDJcyrznYw/s320/heatmap.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;Results for Dodecad participants&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
They can be found in the &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadGZBWHNoNDdSMl8tQVBKbHRIWE1tZ3c"&gt;spreadsheet&lt;/a&gt;. Many Project participants belong to a population with 1 or 2 individuals, so cluster #1 seems to be a generalized catch-all for many such individuals. Individuals from he two sub-populations that I've identified recently Iyer_D, and Jatt_D all belong to the same cluster. The Iyer_D cluster (#4) also seems to include the Iyengar project participants as might be expected.&lt;br /&gt;
&lt;br /&gt;
It is also interesting how all Dodecad participants fall in just 7 of the 59 clusters. This goes to show how truly diverse people from the Indian subcontinent are. I fully expect that with more participation further structure will be revealed, since it seems that due to endogamy it only takes a few participants from each ethnic group for a specific cluster pertaining to that group to be identified. So, I invite people from South Asia to join the Project during this &lt;a href="http://dodecad.blogspot.com/2012/01/submission-opportunity-january-2012.html"&gt;submission opportunity&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-5008815011829643773?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/5008815011829643773/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2012/01/fastibd-analysis-of-south-asia.html#comment-form" title="1 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/5008815011829643773?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/5008815011829643773?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2012/01/fastibd-analysis-of-south-asia.html" title="fastIBD analysis of South Asia" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://4.bp.blogspot.com/-_nWv2f5aqGo/TxfmRLO-SwI/AAAAAAAAAtA/VKDJcyrznYw/s72-c/heatmap.png" height="72" width="72" /><thr:total>1</thr:total></entry><entry gd:etag="W/&quot;D0EFSHc5fyp7ImA9WhRVF0g.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-5230012061156602892</id><published>2012-01-17T01:13:00.003+02:00</published><updated>2012-01-17T01:13:39.927+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-01-17T01:13:39.927+02:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="fastIBD" /><category scheme="http://www.blogger.com/atom/ns#" term="Italians" /><category scheme="http://www.blogger.com/atom/ns#" term="Greeks" /><category scheme="http://www.blogger.com/atom/ns#" term="GALORE" /><category scheme="http://www.blogger.com/atom/ns#" term="Iberians" /><category scheme="http://www.blogger.com/atom/ns#" term="Results" /><category scheme="http://www.blogger.com/atom/ns#" term="Jews" /><title>fastIBD analysis of Iberia, France, Italy, Balkans, Anatolia and European Jews</title><content type="html">&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;
On the heels of the previous analysis of &lt;a href="http://dodecad.blogspot.com/2012/01/fastibd-analysis-of-balkanswest-asia.html"&gt;Balkans/West Asia&lt;/a&gt;, a new experiment on a different set of populations. &lt;b&gt;Please refer to the earlier post for some thoughts/explanations about this type of analysis&lt;/b&gt;, I'll stick to "just the data" for this post.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Clusters Galore&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://4.bp.blogspot.com/-hUvuuWcLzA0/TxSoiQ1vTKI/AAAAAAAAAsc/4CLsIXGSLz0/s1600/galore_iberia_france_italy_balkans_anatolia_jews.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="320" src="http://4.bp.blogspot.com/-hUvuuWcLzA0/TxSoiQ1vTKI/AAAAAAAAAsc/4CLsIXGSLz0/s320/galore_iberia_france_italy_balkans_anatolia_jews.png" width="222" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
24 clusters inferred with 17 MDS dimensions.&lt;br /&gt;
&lt;br /&gt;
The Galore analysis provides increased resolution within Iberia (#6-9, 11), Italy, and the Ashkenazi Jewish group (#14-16).&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://4.bp.blogspot.com/-NuEy53KCwEU/TxStEVsoN0I/AAAAAAAAAsw/zOyg7Ma3F4g/s1600/spain.png" imageanchor="1" style="clear: left; float: left; margin-bottom: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="306" src="http://4.bp.blogspot.com/-NuEy53KCwEU/TxStEVsoN0I/AAAAAAAAAsw/zOyg7Ma3F4g/s320/spain.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
The &lt;b&gt;Iberian&lt;/b&gt; results are particularly interesting, showing the power of this approach compared to the one with &lt;a href="http://dienekes.blogspot.com/2011/12/lack-of-significant-population.html"&gt;unlinked data&lt;/a&gt;. There appear to be:&lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;
&lt;li&gt;a Spanish Basque (#6),&amp;nbsp;&lt;/li&gt;
&lt;li&gt;French Basque (#11) cluster, as well as&amp;nbsp;&lt;/li&gt;
&lt;li&gt;a Portuguese/Galician/Castilla Y Leon (#9) cluster, and&amp;nbsp;&lt;/li&gt;
&lt;li&gt;a complementary Castilla La Manch/Cantabria/Andalucia/Murcia (#7) cluster, and&amp;nbsp;&lt;/li&gt;
&lt;li&gt;a smaller Aragon/Cataluna cluster (#8).&amp;nbsp;&lt;/li&gt;
&lt;/ul&gt;
There is overlap between these clusters, but the geographical contrasts are quite evident. I did not go through the results of Spanish Project participants (all the Portuguese fall in the Galician cluster, and our Basque member in the Basque cluster as expeccted), so it would be interesting to hear whether they fall in the cluster(s) which exist in their regions of origin.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Inter-Population IBD&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-L4sURdlfOko/TxSsebMbBoI/AAAAAAAAAsk/LXEmqi8DFz4/s1600/heatmap.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="320" src="http://3.bp.blogspot.com/-L4sURdlfOko/TxSsebMbBoI/AAAAAAAAAsk/LXEmqi8DFz4/s320/heatmap.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;Results for Project Participants&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
The results can be found in the &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadFprc1NrTGtSaDk0T0hFWGxjY05WSWc"&gt;spreadsheet&lt;/a&gt;.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-5230012061156602892?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/5230012061156602892/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2012/01/fastibd-analysis-of-iberia-france-italy.html#comment-form" title="35 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/5230012061156602892?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/5230012061156602892?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2012/01/fastibd-analysis-of-iberia-france-italy.html" title="fastIBD analysis of Iberia, France, Italy, Balkans, Anatolia and European Jews" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://4.bp.blogspot.com/-hUvuuWcLzA0/TxSoiQ1vTKI/AAAAAAAAAsc/4CLsIXGSLz0/s72-c/galore_iberia_france_italy_balkans_anatolia_jews.png" height="72" width="72" /><thr:total>35</thr:total></entry><entry gd:etag="W/&quot;CE8ARXwzeCp7ImA9WhRVFUk.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-7549664593489628697</id><published>2012-01-14T13:22:00.001+02:00</published><updated>2012-01-14T14:07:24.280+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-01-14T14:07:24.280+02:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Turkic" /><category scheme="http://www.blogger.com/atom/ns#" term="fastIBD" /><category scheme="http://www.blogger.com/atom/ns#" term="Greeks" /><category scheme="http://www.blogger.com/atom/ns#" term="Balkans" /><category scheme="http://www.blogger.com/atom/ns#" term="GALORE" /><category scheme="http://www.blogger.com/atom/ns#" term="Assyrians" /><category scheme="http://www.blogger.com/atom/ns#" term="Slavic" /><category scheme="http://www.blogger.com/atom/ns#" term="Results" /><category scheme="http://www.blogger.com/atom/ns#" term="Armenians" /><category scheme="http://www.blogger.com/atom/ns#" term="Iranian" /><category scheme="http://www.blogger.com/atom/ns#" term="Romanians" /><category scheme="http://www.blogger.com/atom/ns#" term="Caucasus" /><title>fastIBD analysis of Balkans/West Asia</title><content type="html">Now that I've discovered a way to boost Clusters Galore analysis even further by using &lt;a href="http://dienekes.blogspot.com/2012/01/clusters-galore-fastibd-edition.html"&gt;fastIBD&lt;/a&gt;, I will start experimenting with different regional populations.&amp;nbsp;This analysis took about 5 hours to complete, so it appears to be quite practical.&lt;br /&gt;
&lt;br /&gt;
For my first experiment, I carry out an analysis of various populations from the Balkans and West Asia.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Clusters Galore&lt;/b&gt;&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://1.bp.blogspot.com/-bhJGvh-Y7AU/TxFPpuAhShI/AAAAAAAAEbE/Puw_i0FW0DA/s1600/galore_balkans_west_asia.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="226" src="http://1.bp.blogspot.com/-bhJGvh-Y7AU/TxFPpuAhShI/AAAAAAAAEbE/Puw_i0FW0DA/s320/galore_balkans_west_asia.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
27 different clusters were inferred with 17 MDS dimensions. Some interesting findings:&lt;br /&gt;
&lt;ul&gt;
&lt;li&gt;For the first time there emerge a couple of clusters that appear to be quite specific to Armenians (#2 and #3).&amp;nbsp;&lt;/li&gt;
&lt;li&gt;Similarly, Assyrians are broken to a few clusters that appear fairly specific to them &amp;nbsp;(#9-11)&lt;/li&gt;
&lt;li&gt;Georgians are split into three clusters, one of which (#14) is linked with the neighboring Abkhasians, who in turn have their own exclusive cluster (#25)&lt;/li&gt;
&lt;li&gt;The cluster modal in Greeks (#6) includes 14 of 19 Greek participants, and a few Greeks are also in the Balkan cluster (#8) and an Iranian-Turkish cluster (#4)&lt;/li&gt;
&lt;li&gt;The Behar Cypriot sample also splits into two, and the few Turkish Cypriot participants link to one of them (#13)&lt;/li&gt;
&lt;li&gt;The Ossetian project participant links to one of the three North_Ossetian clusters&lt;/li&gt;
&lt;li&gt;The major Balkan cluster (#8) still defies resolution. I am certain, however, that structure in this cluster will be uncovered with more participation. MCLUST adapts the cluster size and shape, and a "big", inclusive cluster spanning the Balkans appears more parsimonious than smaller clusters centered on the different groups. With larger participation, I anticipate that regional structure will be uncovered in the Balkans as well.&lt;/li&gt;
&lt;/ul&gt;
&lt;b&gt;I cannot stress the importance of &lt;a href="http://dodecad.blogspot.com/2012/01/submission-opportunity-january-2012.html"&gt;participation&lt;/a&gt; strongly enough.&lt;/b&gt; When groups have more participants, it is possible to both:&lt;br /&gt;
&lt;br /&gt;
&lt;ol&gt;
&lt;li&gt;Discover group-specific clusters, by identifying what is &lt;i&gt;common &lt;/i&gt;between members of groups&lt;/li&gt;
&lt;li&gt;Discover within-group clusters, by identifying what is &lt;i&gt;different &lt;/i&gt;between members of groups&lt;/li&gt;
&lt;/ol&gt;
For example, the great participation of Armenians in the Project has now allowed me to discover structure within the Armenian population. It appears, that cluster #2 corresponds to a more "western" Armenian group, and #3 to a more "eastern" one, with some overlap between the two.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Inter-population IBD&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
You can also see a visual representation of inter-population IBD:&lt;br /&gt;
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://2.bp.blogspot.com/-5eo8_aXs88Y/TxFUVXAd9OI/AAAAAAAAEbQ/JRMypoUYz8M/s1600/heatmap.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="320" src="http://2.bp.blogspot.com/-5eo8_aXs88Y/TxFUVXAd9OI/AAAAAAAAEbQ/JRMypoUYz8M/s320/heatmap.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
I have only included populations with 5+ participants in this representation. Reddish shades express high IBD sharing; bluish ones low one. The heatmap has been scaled by row.&lt;br /&gt;
&lt;br /&gt;
As you might expect, values across the diagonal are "reddish", since individuals within populations tend to have high IBD sharing with each other.&lt;br /&gt;
&lt;br /&gt;
A few features "pop out" of the screen. Going from top to bottom:&lt;br /&gt;
&lt;ul&gt;
&lt;li&gt;Intra-Iranic sharing&lt;/li&gt;
&lt;li&gt;Intra-Armenian sharing&lt;/li&gt;
&lt;li&gt;Intra-Balkan sharing&lt;/li&gt;
&lt;li&gt;Georgian-Abkhaz sharing&lt;/li&gt;
&lt;/ul&gt;
You can probably get more out of the figure, but these appear to be the most salient features.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Results for Project Participants&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
The results can be found in the &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadFJ4UFFFUlVWUG5zaXM0LVNaQUROalE"&gt;spreadsheet&lt;/a&gt;, and include:&lt;br /&gt;
&lt;ul&gt;
&lt;li&gt;Probabilities of assignment in each of the 27 clusters of the Clusters Galore analysis&lt;/li&gt;
&lt;li&gt;Z-scores of IBD between each individual and each of the 20 populations with 5+ participants. Higher values mean more IBD sharing. Note that Z-scores have been calculated for each row, hence each participant must scan his own row to find populations with an excess (+) or deficiency (-) of IBD sharing, and people &lt;i&gt;should not &lt;/i&gt;compare across different rows.&lt;/li&gt;
&lt;/ul&gt;
&lt;b&gt;Last but not least, I want to remind new project participants to leave a message in the &lt;a href="http://dodecad.blogspot.com/2010/11/information-about-project-samples.html"&gt;Information about Project samples&lt;/a&gt; thread. Your comment will not appear immediately, since comment moderation is on, and also note that there are multiple pages of comments.&amp;nbsp;&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;If you haven't joined the Project yet, I encourage you to do so if you are &lt;a href="http://dodecad.blogspot.com/2012/01/submission-opportunity-january-2012.html"&gt;eligible&lt;/a&gt;.&lt;/b&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-7549664593489628697?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/7549664593489628697/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2012/01/fastibd-analysis-of-balkanswest-asia.html#comment-form" title="11 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/7549664593489628697?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/7549664593489628697?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2012/01/fastibd-analysis-of-balkanswest-asia.html" title="fastIBD analysis of Balkans/West Asia" /><author><name>Dienekes</name><uri>http://www.blogger.com/profile/02082684850093948970</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://4.bp.blogspot.com/-KXXemZigoEc/TlK7wIUP_EI/AAAAAAAAEAk/uJ-FlueoC6o/s220/prosopon.png" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://1.bp.blogspot.com/-bhJGvh-Y7AU/TxFPpuAhShI/AAAAAAAAEbE/Puw_i0FW0DA/s72-c/galore_balkans_west_asia.png" height="72" width="72" /><thr:total>11</thr:total></entry><entry gd:etag="W/&quot;CU8NRno5eCp7ImA9WhRVEkU.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-2099682999739661663</id><published>2012-01-11T14:11:00.002+02:00</published><updated>2012-01-11T14:11:37.420+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-01-11T14:11:37.420+02:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="GALORE" /><category scheme="http://www.blogger.com/atom/ns#" term="Results" /><category scheme="http://www.blogger.com/atom/ns#" term="Europe" /><title>Clusters Galore (fastIBD edition) for some northern European participants</title><content type="html">You can find some new Clusters Galore results &lt;a href="http://dienekes.blogspot.com/2012/01/clusters-galore-fastibd-edition.html"&gt;here&lt;/a&gt;&amp;nbsp;(scroll down for spreadsheet link). The new methodology described in that post has made it possible to infer even finer-level population structure than "classical" Clusters Galore.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-2099682999739661663?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/2099682999739661663/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2012/01/clusters-galore-fastibd-edition-for.html#comment-form" title="2 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/2099682999739661663?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/2099682999739661663?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2012/01/clusters-galore-fastibd-edition-for.html" title="Clusters Galore (fastIBD edition) for some northern European participants" /><author><name>Dienekes</name><uri>http://www.blogger.com/profile/02082684850093948970</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://4.bp.blogspot.com/-KXXemZigoEc/TlK7wIUP_EI/AAAAAAAAEAk/uJ-FlueoC6o/s220/prosopon.png" /></author><thr:total>2</thr:total></entry><entry gd:etag="W/&quot;CUQMSHg_fip7ImA9WhRWGEk.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-7614200531195435292</id><published>2012-01-03T16:09:00.002+02:00</published><updated>2012-01-06T11:49:49.646+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2012-01-06T11:49:49.646+02:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Dodecad" /><title>Submission opportunity (January 2012)</title><content type="html">&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;
&lt;b&gt;Who is eligible&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
Anyone who:&lt;br /&gt;
&lt;br /&gt;
&lt;ul&gt;
&lt;li&gt;has 23andMe or Family Finder autosomal data,&lt;/li&gt;
&lt;li&gt;is not related to any other Project participants,&lt;/li&gt;
&lt;li&gt;has 4 grandparents from the same African, European, or Asian ethnic group or country (e.g., 4 Albanian grandparents, 4 grandparents born in Ethiopia, 4 Kazakh grandparents, etc.)&lt;/li&gt;
&lt;/ul&gt;
Any ineligible submissions will be blacklisted. Do not send data if you do not meet the eligibility criteria.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;What to send&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
Send your compressed autosomal data (ending .zip or .gz) that you can download from your testing company.&lt;br /&gt;
Send to dodecad@gmail.com as an attachment, and include in your e-mail as much information about your ancestry as you can (e.g., birthplace of grandparents, spoken languages, practiced religions, ethnic affiliation, etc.). Samples without adequate ancestral information will be ignored.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Data Privacy Statement&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
Your raw data or genealogical information will not be shared or distributed in any manner, and it will not be analyzed for any other purpose than assessment of ancestry (i.e., not for any physical or health-related traits). It will be identified by a unique ID, known to you and me, and results will be posted in the blog using that ID. I will continue to analyze your data for ancestry, and new results will be posted using that same ID. Also, I will report aggregate results for populations with at least 5 participants.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;What you will receive&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
I will add you to the &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArJDEoCgzRKedGdRbkxKMDdlZkJWc21tdkpldWxwVmc"&gt;K12a spreadsheet&lt;/a&gt;&amp;nbsp;of the &lt;a href="http://dienekes.blogspot.com/2011/12/first-analysis-of-metspalu-et-al-2011.html"&gt;K12a calculator&lt;/a&gt;. You will also be eligible to participate in future data analyses, and newer results will be posted in this blog with your ID.&lt;br /&gt;
&lt;br /&gt;
&lt;i&gt;Clarification (added on 6 Jan, 2012):&lt;/i&gt; The results which you will receive will be based on the K12a calculator whose components were inferred in &lt;a href="http://dienekes.blogspot.com/2011/12/first-analysis-of-metspalu-et-al-2011.html"&gt;December 2011&lt;/a&gt;, and hence included only those who had submitted their data up to that time.&amp;nbsp;As new members of the Project, your data will be used for the development of the next version of the admixture analysis, and this will -in all likelihood- lead to a subtle redrawing of the ancestral components and different ancestral proportions (see &lt;a href="http://dienekes.blogspot.com/2011/10/further-caution-on-admixture-estimates.html"&gt;technical note&lt;/a&gt;).&lt;br /&gt;
&lt;br /&gt;
By participating in the Project, you help better draw both the basic ancestral components underlying genomic variation in Africa/Europe/Asia, and create more robust samples of different populations. This is helpful both to the Project, and to yourself, because it helps you get increasingly better results with newer versions of the analysis. All newer analysis tools are announced on this blog.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;End of Submission Opportunity&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
The end of this submission opportunity will be announced on this blog.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-7614200531195435292?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/7614200531195435292/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2012/01/submission-opportunity-january-2012.html#comment-form" title="15 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/7614200531195435292?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/7614200531195435292?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2012/01/submission-opportunity-january-2012.html" title="Submission opportunity (January 2012)" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>15</thr:total></entry><entry gd:etag="W/&quot;A0EFQX0yeyp7ImA9WhRXEkQ.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-3097113742390202214</id><published>2011-12-19T15:00:00.000+02:00</published><updated>2011-12-19T15:00:10.393+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-12-19T15:00:10.393+02:00</app:edited><title>'world9' calculator</title><content type="html">&lt;div dir="ltr" style="text-align: left;" trbidi="on"&gt;
I have consistently received requests for an assessment of Amerindian ancestry. While the focus of the Project is, and will remain, the region of Eurasia, I thought it was a good idea to release a tool that could be used by persons of partial Amerindian ancestry.&lt;br /&gt;
&lt;br /&gt;
I have also included the two Australasian populations currently available, namely Bougainville Melanesians (NAN_Melanesian) and Papuans from the HGDP.&lt;br /&gt;
&lt;br /&gt;
The inferred components at K=9 are quite similar to those of '&lt;a href="http://dodecad.blogspot.com/2011/10/eurasia7-calculator.html"&gt;eurasia7&lt;/a&gt;', with the addition of the Australasian and Amerindian components. I have also included the Kalash in this experiment, which caused the 'West_Asian' component to be modal in them, although the Kalash's difference in terms of this component to other populations is not so great as to render it strongly population-specific; I have called this component 'Caucasus_Gedrosia' and it -like the 'eurasia7' West Asian component- ought to be quite similar to the k5 component inferred by &lt;a href="http://dienekes.blogspot.com/2011/12/population-structure-in-south-asia.html"&gt;Metspalu et al. (2011)&lt;/a&gt;.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;It is unfortunate that there are only two Australasian populations currently available as public data.&lt;/b&gt; There are many more Amerindian and Mestizo ones, but it should be noted that the Amazonian populations on which the 'Amerindian' component is modal are some of the most lacking in genetic diversity in my entire database. As a result, Eurasians who lack any Amerindian or Australasian ancestry can expect to see a little of it in their results as noise.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;This is a very important caveat for Americans who suspect that they may have an Amerindian ancestor. &lt;/b&gt;Small levels of this component may be noise, and this component is also found in Siberia, and may represent either backflow from the Americas or the &lt;a href="http://dienekes.blogspot.com/2011/06/interpretation-of-admixture-results.html"&gt;common ancestry&lt;/a&gt; of Siberian and Amerindian populations. If you are interested in the detection of Amerindian ancestry, I recommend that you use DIYDodecad's 'byseg', 'bychr', and 'target' modes to drill down deeper in your genomes.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Download Files&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
&lt;ul style="text-align: left;"&gt;
&lt;li&gt;The &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadGlpc3JQaVdQbS1QTWF3SzNjTVdfZEE"&gt;spreadsheet&lt;/a&gt; contains admixture proportions, the table of Fst distances, and individual results in the Individual Results tab.&lt;/li&gt;
&lt;li&gt;The &lt;a href="https://docs.google.com/open?id=0B7AJcY18g2GaZDUzZGNlY2YtZTMyOS00OGNmLTkwOTAtYjIyZDZjMmExNDJk"&gt;RAR file&lt;/a&gt; contains files for use with &lt;a href="http://dodecad.blogspot.com/2011/09/do-it-yourself-dodecad-v-21.html"&gt;DIYDodecad&lt;/a&gt;. Extract its contents to the working directory of DIYDodecad. In order to run the calculator, you follow the instructions of the README file, but type 'world9' instead of 'dv3'.&lt;/li&gt;
&lt;/ul&gt;
&lt;br /&gt;
&lt;b&gt;Terms of use:&lt;/b&gt;&lt;br /&gt;
&lt;br /&gt;
'world9', including all files in the downloaded RAR file is free for non-commercial personal use. Commercial uses are forbidden. Contact me for non-personal uses of the calculator.&lt;br /&gt;
&lt;br /&gt;
&lt;b&gt;Information&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
Admixture proportions barplot:
&lt;br /&gt;
&lt;br /&gt;
&lt;a href="http://imageshack.us/photo/my-images/39/admixture9.jpg/" target="_blank"&gt;&lt;img border="0" src="http://img39.imageshack.us/img39/5844/admixture9.th.jpg" /&gt;&lt;/a&gt;
&lt;br /&gt;
&lt;br /&gt;
The nine ancestral components are:&lt;br /&gt;
&lt;br /&gt;
&lt;ul style="text-align: left;"&gt;
&lt;li&gt;Amerindian&lt;/li&gt;
&lt;li&gt;East_Asian&lt;/li&gt;
&lt;li&gt;African&lt;/li&gt;
&lt;li&gt;Atlantic_Baltic&lt;/li&gt;
&lt;li&gt;Australasian&lt;/li&gt;
&lt;li&gt;Siberian&lt;/li&gt;
&lt;li&gt;Caucasus_Gedrosia&lt;/li&gt;
&lt;li&gt;Southern&lt;/li&gt;
&lt;li&gt;South_Asian&lt;/li&gt;
&lt;/ul&gt;
&lt;div&gt;
Table of Fst divergences:&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://2.bp.blogspot.com/-lOghfpBOkR8/Tu5OibysxnI/AAAAAAAAArA/q16vk8i9hbU/s1600/fst.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="87" src="http://2.bp.blogspot.com/-lOghfpBOkR8/Tu5OibysxnI/AAAAAAAAArA/q16vk8i9hbU/s400/fst.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;br /&gt;
Neighbor-joining tree of Fst distances; the long branch lengths of the Australasian (and to a less degree the Amerindian) branch is due to the high level of inbreeding in the populations for which this component is modal.&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://4.bp.blogspot.com/-26cdLmzSHR8/Tu5N4p9FH1I/AAAAAAAAAqs/Zcs9Jx0-hqE/s1600/nj.png" imageanchor="1" style="margin-left: 1em; margin-right: 1em;"&gt;&lt;img border="0" height="400" src="http://4.bp.blogspot.com/-26cdLmzSHR8/Tu5N4p9FH1I/AAAAAAAAAqs/Zcs9Jx0-hqE/s400/nj.png" width="400" /&gt;&lt;/a&gt;&lt;/div&gt;
First 8 dimensions of multi-dimensional scaling (MDS):
&lt;br /&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://4.bp.blogspot.com/-t0MOLOTUfgI/Tu5aUZeKIzI/AAAAAAAAArw/fTwRazYfvBk/s1600/1_2.png" imageanchor="1"&gt;&lt;img border="0" height="320" src="http://4.bp.blogspot.com/-t0MOLOTUfgI/Tu5aUZeKIzI/AAAAAAAAArw/fTwRazYfvBk/s320/1_2.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://2.bp.blogspot.com/-KJGS44LuZbc/Tu5aX5LCQlI/AAAAAAAAAr8/z0WrH863wJs/s1600/3_4.png" imageanchor="1"&gt;&lt;img border="0" height="320" src="http://2.bp.blogspot.com/-KJGS44LuZbc/Tu5aX5LCQlI/AAAAAAAAAr8/z0WrH863wJs/s320/3_4.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://3.bp.blogspot.com/-Bl7Nmb9ytMw/Tu5abJ5GWTI/AAAAAAAAAsI/O4LiJ1azxOE/s1600/5_6.png" imageanchor="1"&gt;&lt;img border="0" height="320" src="http://3.bp.blogspot.com/-Bl7Nmb9ytMw/Tu5abJ5GWTI/AAAAAAAAAsI/O4LiJ1azxOE/s320/5_6.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;div class="separator" style="clear: both; text-align: center;"&gt;
&lt;a href="http://2.bp.blogspot.com/-phvQ8RKVhkI/Tu5ae7WOZyI/AAAAAAAAAsU/d5zpJ9AmKtk/s1600/7_8.png" imageanchor="1"&gt;&lt;img border="0" height="320" src="http://2.bp.blogspot.com/-phvQ8RKVhkI/Tu5ae7WOZyI/AAAAAAAAAsU/d5zpJ9AmKtk/s320/7_8.png" width="320" /&gt;&lt;/a&gt;&lt;/div&gt;
&lt;b&gt;Technical Details&lt;/b&gt;&lt;br /&gt;
&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;br /&gt;
A dataset of 3,548 individuals/265,519 SNPs/284 populations was assembled. Pruning for distantly related individuals was performed by iterative pruning of a single individual from each pair showing IBD RATIO greater than the mean plus 2 standard deviations, or greater than 2.5. 3,026 individuals remained. An additional 14 individuals were removed because they had less than 97% genotype rate. The marker set was thinned to remove SNPs with less than 97% genotype rate or 1% minor allele frequency. Linkage-disequilibrium based pruning with a window of 200 SNPs, advanced by 25 SNPs, and an R-squared of 0.4 was performed. A total of 3,012 individuals and 170,822 SNPs survived these filtering steps. &lt;a href="http://pngu.mgh.harvard.edu/~purcell/plink/"&gt;PLINK&lt;/a&gt; 1.07 and &lt;a href="http://www.genetics.ucla.edu/software/admixture/"&gt;ADMIXTURE&lt;/a&gt; 1.21 were used in the analyses.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-3097113742390202214?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/3097113742390202214/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2011/12/world9-calculator.html#comment-form" title="7 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/3097113742390202214?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/3097113742390202214?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2011/12/world9-calculator.html" title="'world9' calculator" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/-lOghfpBOkR8/Tu5OibysxnI/AAAAAAAAArA/q16vk8i9hbU/s72-c/fst.png" height="72" width="72" /><thr:total>7</thr:total></entry><entry gd:etag="W/&quot;DUYNRHkzcSp7ImA9WhRQFk4.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-5989670149051676639</id><published>2011-12-11T22:33:00.003+02:00</published><updated>2011-12-11T22:59:55.789+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-12-11T22:59:55.789+02:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Dodecad" /><category scheme="http://www.blogger.com/atom/ns#" term="Oracle" /><title>Dodecad Oracle (K12a edition)</title><content type="html">I have created a new version of the &lt;a href="https://docs.google.com/open?id=0B7AJcY18g2GaNzFmYTM3YmItYzY4OC00YzYzLThkMjktNWU3Y2UwZjc2MTk4"&gt;Dodecad Oracle&lt;/a&gt; for use with the &lt;a href="http://dodecad.blogspot.com/2011/12/participant-results-for-k12a-calculator.html"&gt;K12a calculator&lt;/a&gt;.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;You can refer to the original &lt;a href="http://dodecad.blogspot.com/2011/07/dodecad-oracle-v1.html"&gt;Dodecad Oracle&lt;/a&gt; for detailed usage instructions. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;(The only difference in the use of the program is that the number of populations is 204, so make sure to use this if you plan to remove any reference populations, as mentioned in the instructions)&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;In short: &lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;you first load the file &lt;a href="https://docs.google.com/open?id=0B7AJcY18g2GaNzFmYTM3YmItYzY4OC00YzYzLThkMjktNWU3Y2UwZjc2MTk4"&gt;DodecadOracleK12a.RData&lt;/a&gt; in R. You can do this by double-clicking on this file in Windows, or using the File-&amp;gt;Load Workspace menu. In Linux, you can use the "load" command, e.g., load('/home/ubuntu/Desktop/DodecadOracleK12a.RData')&lt;/li&gt;&lt;li&gt;You then enter commands at the command prompt&lt;/li&gt;&lt;/ul&gt;Some examples:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;Comparing a population against other populations&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;DodecadOracle("Somali_D")&lt;/div&gt;&lt;div&gt;      [,1]             [,2]     &lt;/div&gt;&lt;div&gt; [1,] "Somali_D"       "0"      &lt;/div&gt;&lt;div&gt; [2,] "Ethiopian_Jews" "12.3049"&lt;/div&gt;&lt;div&gt; [3,] "Ethiopians"     "12.3309"&lt;/div&gt;&lt;div&gt; [4,] "Sandawe_He"     "38.2093"&lt;/div&gt;&lt;div&gt; [5,] "MKK25"          "40.7983"&lt;/div&gt;&lt;div&gt; [6,] "Egyptans"       "63.2307"&lt;/div&gt;&lt;div&gt; [7,] "Yemenese"       "69.1628"&lt;/div&gt;&lt;div&gt; [8,] "Moroccans"      "72.6233"&lt;/div&gt;&lt;div&gt; [9,] "Jordanians"     "73.1838"&lt;/div&gt;&lt;div&gt;[10,] "Palestinian"    "74.2867"&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;Comparing a population against 2-way population mixes:&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;DodecadOracle("Pathan",mixedmode=T)&lt;/div&gt;&lt;div&gt;      [,1]                                     [,2]    &lt;/div&gt;&lt;div&gt; [1,] "Pathan"                                 "0"     &lt;/div&gt;&lt;div&gt; [2,] "79.5% Sindhi + 20.5% Lezgins"           "3.948" &lt;/div&gt;&lt;div&gt; [3,] "82% Sindhi + 18% Chechens_Y"            "4.0251"&lt;/div&gt;&lt;div&gt; [4,] "16.7% Adygei + 83.3% Sindhi"            "4.5471"&lt;/div&gt;&lt;div&gt; [5,] "83.4% Sindhi + 16.6% Balkars_Y"         "4.6487"&lt;/div&gt;&lt;div&gt; [6,] "80.8% Sindhi + 19.2% Kumyks_Y"          "4.7067"&lt;/div&gt;&lt;div&gt; [7,] "83.7% Sindhi + 16.3% North_Ossetians_Y" "4.8352"&lt;/div&gt;&lt;div&gt; [8,] "80.9% Sindhi + 19.1% Nogais_Y"          "4.8821"&lt;/div&gt;&lt;div&gt; [9,] "66.6% Sindhi + 33.4% Tajiks_Y"          "5.6708"&lt;/div&gt;&lt;div&gt;[10,] "86.4% Sindhi + 13.6% Georgians"         "6.2927"&lt;/div&gt;&lt;div style="font-weight: bold; "&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font-weight: bold; "&gt;Comparing an individual against populations&lt;/div&gt;&lt;div style="font-weight: bold; "&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;DodecadOracle(c(8.4, 0, 2.8, 6, 2.2, 0.1, 40.3, 25.9, 0.3, 11.9, 1.5, 0.5))&lt;/div&gt;&lt;div&gt;      [,1]              [,2]     &lt;/div&gt;&lt;div&gt; [1,] "Iranian_D"       "2.2405" &lt;/div&gt;&lt;div&gt; [2,] "Kurd_D"          "3.8092" &lt;/div&gt;&lt;div&gt; [3,] "Kurds_Y"         "5.4945" &lt;/div&gt;&lt;div&gt; [4,] "Iranians"        "6.634"  &lt;/div&gt;&lt;div&gt; [5,] "Uzbekistan_Jews" "12.8957"&lt;/div&gt;&lt;div&gt; [6,] "Turks"           "17.3173"&lt;/div&gt;&lt;div&gt; [7,] "Turkmens_Y"      "17.7316"&lt;/div&gt;&lt;div&gt; [8,] "Iranian_Jews"    "18.14"  &lt;/div&gt;&lt;div&gt; [9,] "Assyrian_D"      "18.8968"&lt;/div&gt;&lt;div&gt;[10,] "Azerbaijan_Jews" "18.9444"&lt;/div&gt;&lt;div style="font-weight: bold; "&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font-weight: bold; "&gt;Comparing an individual against 2-way population mixes&lt;/div&gt;&lt;div style="font-weight: bold; "&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;DodecadOracle(c(28, 0.8, 1.6, 49.9, 1.9, 0, 10.6, 4.1, 0, 2.4, 0, 0.6),mixedmode=T)&lt;/div&gt;&lt;div&gt;      [,1]                                   [,2]    &lt;/div&gt;&lt;div&gt; [1,] "47.7% French_D + 52.3% Mordovians_Y"  "2.5849"&lt;/div&gt;&lt;div&gt; [2,] "48.3% French + 51.7% Mordovians_Y"    "2.6012"&lt;/div&gt;&lt;div&gt; [3,] "36.3% Spaniards + 63.7% Mordovians_Y" "2.9985"&lt;/div&gt;&lt;div&gt; [4,] "36% Spanish_D + 64% Mordovians_Y"     "3.0577"&lt;/div&gt;&lt;div&gt; [5,] "65.9% Russian_D + 34.1% Spaniards"    "3.0923"&lt;/div&gt;&lt;div&gt; [6,] "35.9% IBS + 64.1% Mordovians_Y"       "3.0943"&lt;/div&gt;&lt;div&gt; [7,] "40% French + 60% Ukranians_Y"         "3.1662"&lt;/div&gt;&lt;div&gt; [8,] "66.4% Russian_D + 33.6% IBS"          "3.2359"&lt;/div&gt;&lt;div&gt; [9,] "24.5% Swedish_D + 75.5% Hungarians"   "3.3021"&lt;/div&gt;&lt;div&gt;[10,] "39.3% French_D + 60.7% Ukranians_Y"   "3.4046"&lt;/div&gt;&lt;div style="font-weight: bold; "&gt;&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The numbers to the right of each result represent the "goodness" of the match; the lower, the better. If you wanted to list the top-30 results, in any of the above commands, you would enter, e.g.,&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;DodecadOracle(c(28, 0.8, 1.6, 49.9, 1.9, 0, 10.6, 4.1, 0, 2.4, 0, 0.6),mixedmode=T, k=30)&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;If you recently joined the Project, please consider leaving a brief comment in the &lt;a href="http://dodecad.blogspot.com/2010/11/information-about-project-samples.html"&gt;Information about Project Samples&lt;/a&gt; thread.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-5989670149051676639?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/5989670149051676639/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2011/12/dodecad-oracle-k12a-edition.html#comment-form" title="8 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/5989670149051676639?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/5989670149051676639?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2011/12/dodecad-oracle-k12a-edition.html" title="Dodecad Oracle (K12a edition)" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>8</thr:total></entry><entry gd:etag="W/&quot;DEMGQ38yeCp7ImA9WhRQFk0.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-1647568206030070356</id><published>2011-12-11T14:24:00.003+02:00</published><updated>2011-12-11T14:27:02.190+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-12-11T14:27:02.190+02:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Dodecad" /><category scheme="http://www.blogger.com/atom/ns#" term="Results" /><title>Participant results for 'K12a' calculator</title><content type="html">The participant results can be found in the "Individual Results" tab of the &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArJDEoCgzRKedGdRbkxKMDdlZkJWc21tdkpldWxwVmc"&gt;K12a spreadsheet&lt;/a&gt;.&lt;div&gt;You can read more about the &lt;a href="http://dienekes.blogspot.com/2011/12/first-analysis-of-metspalu-et-al-2011.html"&gt;K12a calculator&lt;/a&gt; at my other blog; if you are not a Project participant, you can also find a DIY version of it there, which can be used in conjunction with &lt;a href="http://dodecad.blogspot.com/2011/09/do-it-yourself-dodecad-v-21.html"&gt;DIYDodecad 2.1&lt;/a&gt;.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-1647568206030070356?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/1647568206030070356/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2011/12/participant-results-for-k12a-calculator.html#comment-form" title="21 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/1647568206030070356?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/1647568206030070356?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2011/12/participant-results-for-k12a-calculator.html" title="Participant results for 'K12a' calculator" /><author><name>Dienekes</name><uri>http://www.blogger.com/profile/02082684850093948970</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://4.bp.blogspot.com/-KXXemZigoEc/TlK7wIUP_EI/AAAAAAAAEAk/uJ-FlueoC6o/s220/prosopon.png" /></author><thr:total>21</thr:total></entry><entry gd:etag="W/&quot;DUMAQH85fSp7ImA9WhRSGUg.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-6951237764230394700</id><published>2011-10-31T12:35:00.023+02:00</published><updated>2011-11-22T12:24:01.125+02:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-11-22T12:24:01.125+02:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="South Asians" /><category scheme="http://www.blogger.com/atom/ns#" term="Experiments" /><title>Origin of Kalash inferred with Eurogenes K=10 "test" calculator</title><content type="html">&lt;div style="text-align: left;"&gt;Vasishta, is &lt;a href="http://www.forumbiodiversity.com/showthread.php?t=14387&amp;amp;page=254"&gt;asking&lt;/a&gt; Eurogenes for help in demonstrating that the Kalash have Northern-European-specific segments:&lt;/div&gt;&lt;blockquote&gt;Yes. &lt;b&gt;He keeps citing the Kalash as proof that the Indo-Iranians were an almost exclusively a West-Asian like population, even though I personally think the mainly West Asian-South Asian assortment of the Kalash in his analyses might be an artifact of their inbreeding and isolation, thus confusing ADMIXTURE.&lt;/b&gt; Zack's &lt;b&gt;K=11 at Harappa has shown that the Kalash display around 22% of the component modal in Lithuanians.&lt;/b&gt; Yet, he ignores the North/Eastern European admixture in Northwest Indians and North Indian Brahmins (in his own analyses at that!). Interestingly enough the aforementioned groups tend to score a sliver of Northeast European admixture in Dr.Doug McDonald's analyses, with the top matches for that sliver usually being Lithuanians, Russians and Finns; in that order. It (NEU) is even found in frequencies of around 4-6% in Dravidian-speaking southern Brahmins. &lt;b&gt;As much as I hate to say it, he is indeed rather stubborn and has somewhat of an underlying agenda.&lt;/b&gt;&lt;br /&gt;&lt;br /&gt;&lt;b&gt;David, I think you should look into proving that the Kalash do indeed have some NEU-specific segments.&lt;/b&gt; I would be super-surprised if they didn't, given that more mixed populations south of their geographical area display it themselves.&lt;/blockquote&gt;&lt;br /&gt;It appears that Vasishta disagrees with me because &lt;b&gt;he "personally thinks"&lt;/b&gt; that the admixture proportions of the Kalash are due to inbreeding and the limitations of ADMIXTURE. &lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;Does he cite any studies or make any argument why ADMIXTURE would remove precisely the component that he is so eager to be present? &lt;/b&gt;No. While genetic drift in an isolated population could indeed lead to the loss of genetic diversity, there is no reason to think that this would lead &lt;i&gt;preferentially &lt;/i&gt;to the loss of Northern-European segments. It is strange that Vasishta accuses me of bias and yet, at the same time, invokes &lt;b&gt;the magic of some unspecified flaw&lt;/b&gt; of ADMIXTURE for the loss of his favorite component.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Vasishta invokes the &lt;b&gt;Harappa Ancestry Project &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0AuW3R0Ys-P4HdGI3V2Z0SEs5WmRPcVoybDJXNzRIWXc&amp;amp;hl=en#gid=0"&gt;K=11 admixture&lt;/a&gt;&lt;/b&gt; analysis in support of his idea that the Kalash have 22% of the component modal in Lithuanians. However, he neglects to mention that at K=11 there is no West-Asian or Caucasus centered component in the HAP analysis, but rather only "European" (modal in Lithuanians) and "SW Asian" (modal in Yemen Jews). It is indeed strange that he accuses me of bias for providing &lt;i&gt;evidence &lt;/i&gt;about the relationship of the Kalash with West Asia, while at the same time, showing &lt;b&gt;preference for a level of analysis where such a component is lacking.&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;The West Eurasian cline between Arabia and Northeastern Europe is evident in the &lt;a href="http://dodecad.blogspot.com/2011/09/weac-calculator.html"&gt;'weac'&lt;/a&gt; admixture analysis, where the European-centered component (Atlantic-Baltic) is present in populations such as Assyrians and Armenians whereas it is lacking at the appropriate level of &lt;a href="http://dodecad.blogspot.com/2011/10/eurasia7-calculator.html"&gt;resolution&lt;/a&gt;. &lt;b&gt;Therefore, the fact that the Kalash show "European" admixture at the level of Europe vs. Near East does not mean that they ought to show such admixture at the level of Europe vs. West Asia/Caucasus vs. Arabia.&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;One of the benefits of DIYDodecad has been the availability of data from projects that have hitherto been black boxes.&lt;/b&gt; In the interest of transparency, I have taken the &lt;a href="http://bga101.blogspot.com/2011/09/genetic-substructures-across-west.html"&gt;Eurogenes K=10 "test" calculator&lt;/a&gt; and repeated my analysis of the Kalash, that had been previously shown &lt;a href="http://dodecad.blogspot.com/2011/05/how-to-create-zombies-from-admixture.html"&gt;by&lt;/a&gt; &lt;a href="http://dodecad.blogspot.com/2011/10/eurasia7-calculator.html"&gt;me&lt;/a&gt; to be a fairly simple West/South Asian mix. I could have waited for him to get around to it, but since he's quick on the talk and slow on the trigger, I decided to do it for him.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;a href="http://2.bp.blogspot.com/-TATm-XBBfBE/Tq6Lm4oyadI/AAAAAAAAEP0/xPVgSHOkyLg/s1600/Kalash_Eurogenes_10.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;img src="http://2.bp.blogspot.com/-TATm-XBBfBE/Tq6Lm4oyadI/AAAAAAAAEP0/xPVgSHOkyLg/s400/Kalash_Eurogenes_10.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5669622481060784594" style="display: block; margin-top: 0px; margin-right: auto; margin-bottom: 10px; margin-left: auto; text-align: center; cursor: pointer; width: 400px; height: 256px; " /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;The admixture proportions of the Kalash, according to the Eurogenes K=10 are: 40.3% S_Asian, 58.7% W_Asian, 0.9% N_E_Euro, 0.1% N_Asian, and hence the analysis based on the Eurogenes K=10 components confirms the analysis based on my &lt;a href="http://dodecad.blogspot.com/2011/10/eurasia7-calculator.html"&gt;eurasia 7&lt;/a&gt;&lt;/b&gt;, "showing the Kalash to be a "West Asian" population (62.4%) with substantial "South Asian" admixture (37.1%), and near-complete absence of any other genetic components." &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Eurogenes alleges, not without his usual charm, that:&lt;/div&gt;&lt;div&gt;&lt;blockquote&gt;Dienekes has a keen eye for things he wants to see. But he hasn't yet noticed that in all accurate analyses, there's significant Eastern European admix in North India. &lt;b&gt;His monocle got fogged up in that instance.&lt;/b&gt;&lt;/blockquote&gt;&lt;/div&gt;Let us consider some pertinent facts: the Indian peninsula has been invaded multiple times from Central Asia, a process that continued long after the establishment of the Indo-Aryans during the 2nd millennium BC. Eurogenes may want to think that the "Eastern European" admixture in South Asia dates to his mythological Polish Indo-Europeans, galloping across the steppes on their horses, but there is, at present, no particular reason to think that this is the case&lt;br /&gt;&lt;br /&gt;Furthermore, the "West Asian" component as a fraction of the "West Asian" + "Atlantic-Baltic" component reaches a &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadExjVnpKbHFEeGVZOEZPOXBxWnA2Wnc#gid=0"&gt;minimum&lt;/a&gt; of 77% in the Pathans in populations from the northern parts of the Indian subcontinent. &lt;b&gt;His own monocle is surely in greater need of de-fogging if I miss the 23% and he misses the &amp;gt;77%.&lt;/b&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Indeed, the Europe vs. Caucasus ratio in Indian subcontinental populations is similar to that found in people from the Middle East and Caucasus region. &lt;b&gt;It is not surprising that Eurogenes has abandoned his &lt;a href="http://bga101.blogspot.com/2011/03/reconstructing-ancestral-north-indian.html"&gt;search&lt;/a&gt; for North European components in South Asia&lt;/b&gt;, going as far as reconstructing Ancestral North Indians as "Northern Europeans". Needless to say, he &lt;a href="http://dodecad.blogspot.com/2011/05/more-zombies-ancestral-north-indians.html"&gt;was&lt;/a&gt; &lt;a href="http://dienekes.blogspot.com/2011/05/beware-of-sample-sizes-why-ancestral.html"&gt;wrong&lt;/a&gt;. The West Eurasian ancestry of the population of the Indian subcontinent is similar to that found in modern West Asian populations, not Slavs.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Eurogenes promises:&lt;/div&gt;&lt;div&gt;&lt;blockquote&gt;This shouldn't be too difficult. I'll use Dienekes' calculator for the job, and then check the results with LAMP.How poetic.&lt;br /&gt;&lt;/blockquote&gt;Been there, done that. It will be fun to see what "Northern European" components he will be able to squeeze out of the 0.9% N_E_Euro component that my software, in conjunction with his "test" calculator produces.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;div&gt;&lt;b&gt;Why are the Kalash important?&lt;/b&gt;&lt;/div&gt;&lt;br /&gt;&lt;div&gt;There are three reasons why the Kalash are important in the study of Eurasian prehistory:&lt;/div&gt;&lt;div&gt;&lt;ol&gt;&lt;li&gt;Their mountainous habitat contributed to isolation and relative immunity from historical population movements&lt;/li&gt;&lt;li&gt;Their non-Islamic religion has definitely preserved them from recent gene inflow&lt;/li&gt;&lt;li&gt;Their &lt;a href="http://www.ethnologue.com/show_language.asp?code=kls"&gt;language&lt;/a&gt; is unique within the Indo-Aryan family, and it often considered &lt;a href="http://www.utexas.edu/cola/centers/lrc/general/ie-lg/Indo-Iranian.html"&gt;today&lt;/a&gt; as part of a separate Dardic family of Indo-Iranian in addition to the more populous Iranian and Indo-Aryan families. &lt;/li&gt;&lt;/ol&gt;The Kalash are crucial for those interested in the origins of Indo-Iranians, and the fact that they are, indeed, a simple West/South Asian mix is not without significance for that question.&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;UPDATE:&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;Here is the result of a PCA analysis of the Kalash together with 50 synthetic individuals from each of the S_Asian, W_Asian, and N_E_Euro components of Eurogenes K=10 "test". This was calculated with &lt;i&gt;smartpca &lt;/i&gt;with &lt;i&gt;numoutlieriter&lt;/i&gt; set to 0.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;a href="http://1.bp.blogspot.com/-JtOF0L6c9SQ/Tq7ZflaK_rI/AAAAAAAAEQA/cYvit-mVFoE/s1600/1_2.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;img src="http://1.bp.blogspot.com/-JtOF0L6c9SQ/Tq7ZflaK_rI/AAAAAAAAEQA/cYvit-mVFoE/s400/1_2.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5669708117547089586" style="display: block; margin-top: 0px; margin-right: auto; margin-bottom: 10px; margin-left: auto; text-align: center; cursor: pointer; width: 400px; height: 400px; " /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;It is evident that the Kalash appear to fall on the S_Asian to W_Asian line, and toward the W_Asian pole, consistent with being a population of those two origins, with the W_Asian component predominating.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;UPDATE II:&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;As mentioned in the &lt;a href="http://dodecad.blogspot.com/2011/10/eurasia7-calculator.html"&gt;eurasia7&lt;/a&gt; post, the Kalash tend to form population-specific components in ADMIXTURE analyses, so they are generally not included in my runs. So, I run the K=7 analysis again, but this time I included the Kalash. Here are the top populations of the component that was modal in the Kalash:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;div&gt;[186,] "Kurd_D"               "50.2"&lt;/div&gt;&lt;div&gt;[187,] "Kurds_Y"              "50.7"&lt;/div&gt;&lt;div&gt;[188,] "Armenian_D"           "50.9"&lt;/div&gt;&lt;div&gt;[189,] "Armenians_Y"          "51.2"&lt;/div&gt;&lt;div&gt;[190,] "Adygei"               "51.5"&lt;/div&gt;&lt;div&gt;[191,] "Chechens_Y"           "53"  &lt;/div&gt;&lt;div&gt;[192,] "North_Ossetians_Y"    "53.2"&lt;/div&gt;&lt;div&gt;[193,] "Lezgins"              "54.4"&lt;/div&gt;&lt;div&gt;[194,] "Georgians"            "59.8"&lt;/div&gt;&lt;div&gt;[195,] "Georgian_D"           "60.1"&lt;/div&gt;&lt;div&gt;[196,] "Abhkasians_Y"         "60.5"&lt;/div&gt;&lt;div&gt;[197,] "Kalash"               "63.2"&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Here are their exact admixture proportions in this &lt;b&gt;unsupervised &lt;/b&gt;ADMIXTURE run:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;Kalash N=23  &lt;/b&gt;&lt;/div&gt;&lt;div&gt;East_Asian: 0.5  &lt;/div&gt;&lt;div&gt;Atlantic_Baltic: 1.5 &lt;/div&gt;&lt;div&gt;South_Asian: 32.9   &lt;/div&gt;&lt;div&gt;Sub_Saharan: 0.0  &lt;/div&gt;&lt;div&gt;Southern: 0.0  &lt;/div&gt;&lt;div&gt;Siberian: 1.8 &lt;/div&gt;&lt;div&gt;West Asian: 63.2&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;UPDATE III (November 22)&lt;/b&gt;: Eurogenes &lt;a href="http://eurogenes.blogspot.com/2011/11/on-origins-and-expansions-of-r1a-and.html"&gt;estimates&lt;/a&gt; that there is 4% "Northeast European" admixture for Kalash individual HGDP00302. He managed to avoid the creation of a Kalash-specific component by including only a single Kalash individual in an ADMIXTURE experiment.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The Kalash do tend to create their own Kalash-specific component, and a good way to avoid such a component is to include each of them individually, and repeat the analysis 23 times. An alternative, and less time consuming way, is to create a single synthetic individual using the allele frequencies of the Kalash population as a whole. Even simpler, one could randomly pick a single individual (such as HGDP00302), but at the risk of picking an individual that has either much  more or much less than average a particular type of ancestry.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Below are the admixture proportions of all the 23 Kalash individuals from the unsupervised ADMIXTURE run of UPDATE II. Individual HDGP00302 is 4th of 23 in terms of their "Atlantic_Baltic" component that peaks in Lithuanians (3%). The Kalash have 1.5% "Atlantic_Baltic" on average (median=1%, standard deviation=2.1%).&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;      &lt;p&gt;&lt;/p&gt;&lt;table cellspacing="0" cellpadding="3"&gt; &lt;tbody&gt;&lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;ID&lt;/td&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;East_Asian&lt;/td&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;Atlantic_Baltic&lt;/td&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;South_Asian&lt;/td&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;Sub_Saharan&lt;/td&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;Southern&lt;/td&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;Siberian&lt;/td&gt; &lt;td colspan="2" valign="bottom" align="left" style=" font-size:10pt;"&gt;West_Asian&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00279&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.007&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.081&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.361&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.031&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.521&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00307&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.004&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.059&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.336&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.018&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.583&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00315&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.019&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.036&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.338&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.606&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00302&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.006&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.03&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.337&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.02&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.608&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00311&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.014&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.029&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.325&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.021&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.611&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00285&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.027&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.319&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.019&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.635&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00333&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.02&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.324&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.018&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.638&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00277&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.016&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.334&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.021&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.63&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00298&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.012&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.016&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.325&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.016&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.631&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00281&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.011&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.015&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.332&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.01&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.633&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00304&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.007&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.012&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.329&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.013&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.638&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00290&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.007&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.01&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.325&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.021&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.637&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00274&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.007&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.004&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.341&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.013&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.635&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00309&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.007&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.317&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.019&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.656&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00330&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.335&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.026&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.639&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00319&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.011&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.328&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.01&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.651&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00288&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.004&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.339&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.013&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.644&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00286&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.329&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.018&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.653&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00313&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.351&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.015&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.634&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00328&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.31&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.023&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.667&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00267&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.332&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.022&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.647&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00326&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.307&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.03&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.663&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;tr&gt; &lt;td valign="bottom" align="left" style=" font-size:10pt;"&gt;HGDP00323&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.002&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.304&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.013&lt;/td&gt; &lt;td valign="bottom" align="right" style=" font-size:10pt;"&gt;0.68&lt;/td&gt; &lt;td&gt;&lt;/td&gt; &lt;/tr&gt; &lt;/tbody&gt;&lt;/table&gt;&lt;p&gt;&lt;/p&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-6951237764230394700?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/6951237764230394700/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2011/10/origin-of-kalash-inferred-with.html#comment-form" title="37 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/6951237764230394700?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/6951237764230394700?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2011/10/origin-of-kalash-inferred-with.html" title="Origin of Kalash inferred with Eurogenes K=10 &quot;test&quot; calculator" /><author><name>Dienekes</name><uri>http://www.blogger.com/profile/02082684850093948970</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://4.bp.blogspot.com/-KXXemZigoEc/TlK7wIUP_EI/AAAAAAAAEAk/uJ-FlueoC6o/s220/prosopon.png" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/-TATm-XBBfBE/Tq6Lm4oyadI/AAAAAAAAEP0/xPVgSHOkyLg/s72-c/Kalash_Eurogenes_10.png" height="72" width="72" /><thr:total>37</thr:total></entry><entry gd:etag="W/&quot;Dk4BSHg9cSp7ImA9WhdaFkk.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-7427955214628047298</id><published>2011-10-26T17:16:00.020+03:00</published><updated>2011-10-26T19:02:39.669+03:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-26T19:02:39.669+03:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Dodecad" /><category scheme="http://www.blogger.com/atom/ns#" term="South Asians" /><category scheme="http://www.blogger.com/atom/ns#" term="Experiments" /><category scheme="http://www.blogger.com/atom/ns#" term="DIYDodecad" /><category scheme="http://www.blogger.com/atom/ns#" term="Africa" /><category scheme="http://www.blogger.com/atom/ns#" term="Results" /><category scheme="http://www.blogger.com/atom/ns#" term="Europe" /><category scheme="http://www.blogger.com/atom/ns#" term="Caucasus" /><category scheme="http://www.blogger.com/atom/ns#" term="Central Asians" /><title>'eurasia7' calculator</title><content type="html">&lt;div style="text-align: left;"&gt;This calculator was made with &lt;b&gt;196 different populations&lt;/b&gt; and &lt;b&gt;2,659 individuals&lt;/b&gt;, including &lt;b&gt;518 project participants&lt;/b&gt;. The following Dodecad populations do not have 5 individuals yet, so they are included in the OTHERS_D generic category:&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;/div&gt;&lt;/div&gt;&lt;blockquote&gt;&lt;div&gt;&lt;div&gt;Algerian_D, North_African_Jews_D, Slovenian_D, Mixed_Scandinavian_D, Danish_D, Moroccan_D, Tunisian_D, Serb_D, Austrian_D, Saudi_D, Pakistani_D, Tatar_Various_D, Palestinian_D, Greek_Italian_D, Romanian_D, Swiss_German_D, Szekler_D, Mandaean_D, Azeri_D, Czech_D, Georgian_D, Belgian_D, Latvian_D, Estonian_D, Bangladesh_D, Yemenese_D, Sri_Lanka_D, Hungarian_D, Basque_D, Udmurt_D, Egyptian_D&lt;/div&gt;&lt;/div&gt;&lt;/blockquote&gt;As always, I encourage people with 4 grandparents from the same country or ethnic group of Eurasia, North or East Africa to contact me (do not send data!) for possible inclusion in the Project. If I have overlooked any such individuals, drop me a line (my e-mail address is at the bottom of the blog). I usually start a new _D population whenever individuals with 4 grandparents from the same group are submitted, but I may have missed some.&lt;div&gt;&lt;div&gt;&lt;br /&gt;Note that all individuals from the reference populations have also been included, including outliers; you should be aware of this when reading the population averages, and consult the Outliers tab in the &lt;a href="https://spreadsheets.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadDUyeEtjNnBmY09EbnowN3M3UWRyNnc&amp;amp;hl=en_US&amp;amp;authkey=COCa89AJ"&gt;v3 spreadsheet&lt;/a&gt; for some instances of outliers.&lt;div&gt;&lt;a href="http://3.bp.blogspot.com/-qt_LdB168BM/TqgYDr3-NwI/AAAAAAAAAnU/jcJ5XaJLrWs/s1600/eurasia7_7.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;img src="http://3.bp.blogspot.com/-qt_LdB168BM/TqgYDr3-NwI/AAAAAAAAAnU/jcJ5XaJLrWs/s400/eurasia7_7.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5667806582641932034" style="display: block; margin-top: 0px; margin-right: auto; margin-bottom: 10px; margin-left: auto; text-align: center; cursor: pointer; width: 400px; height: 128px; " /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;Due to image size restrictions in Picasa, the labels are not visible well. A large version of the above plot can be found in the download bundle.&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The seven ancestral populations inferred at this level of resolution are:&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;Sub_Saharan&lt;/li&gt;&lt;li&gt;West_Asian&lt;/li&gt;&lt;li&gt;Atlantic_Baltic&lt;/li&gt;&lt;li&gt;East_Asian&lt;/li&gt;&lt;li&gt;Southern&lt;/li&gt;&lt;li&gt;South_Asian&lt;/li&gt;&lt;li&gt;Siberian&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;As usual, you should take these names as useful labels, and interpret them in conjunction with the components' distribution in different populations, and their Fst distances, both of which can be found in the spreadsheet.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The table of Fst distances:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;a href="http://3.bp.blogspot.com/-aIGsQu-slxc/TqgjAXnT4kI/AAAAAAAAAoQ/3HWJQwMEdoQ/s1600/fst.jpg" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;img src="http://3.bp.blogspot.com/-aIGsQu-slxc/TqgjAXnT4kI/AAAAAAAAAoQ/3HWJQwMEdoQ/s400/fst.jpg" border="0" alt="" id="BLOGGER_PHOTO_ID_5667818620291637826" style="display: block; margin-top: 0px; margin-right: auto; margin-bottom: 10px; margin-left: auto; text-align: center; cursor: pointer; width: 400px; height: 82px; " /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Below you can see a neighbor-joining tree based on inter-population Fst distances:&lt;/div&gt;&lt;div&gt;&lt;a href="http://1.bp.blogspot.com/-FZMAil1P-qo/TqgZJrbYCgI/AAAAAAAAAng/1bLd9cTHYqs/s1600/nj.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;img src="http://1.bp.blogspot.com/-FZMAil1P-qo/TqgZJrbYCgI/AAAAAAAAAng/1bLd9cTHYqs/s400/nj.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5667807785112832514" style="display: block; margin-top: 0px; margin-right: auto; margin-bottom: 10px; margin-left: auto; text-align: center; cursor: pointer; width: 400px; height: 400px; " /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;The first six dimensions of a multi-dimensional scaling of the same:&lt;/div&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;a href="http://2.bp.blogspot.com/-c5kB3hdfZ-Q/TqgbiTNOmVI/AAAAAAAAAoE/XID5tQ2Vb_I/s1600/1_2.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;img src="http://2.bp.blogspot.com/-c5kB3hdfZ-Q/TqgbiTNOmVI/AAAAAAAAAoE/XID5tQ2Vb_I/s400/1_2.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5667810407131027794" style="cursor: pointer; width: 400px; height: 400px; " /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a href="http://4.bp.blogspot.com/-5A6y2QJCxKw/Tqgbft0X9_I/AAAAAAAAAn4/ysUo90DIJ4M/s1600/3_4.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;img src="http://4.bp.blogspot.com/-5A6y2QJCxKw/Tqgbft0X9_I/AAAAAAAAAn4/ysUo90DIJ4M/s400/3_4.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5667810362734934002" style="cursor: pointer; width: 400px; height: 400px; " /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a href="http://1.bp.blogspot.com/-Xpy2Imk23IU/TqgbcJL2CAI/AAAAAAAAAns/CUWGE8ToCD4/s1600/5_6.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;img src="http://1.bp.blogspot.com/-Xpy2Imk23IU/TqgbcJL2CAI/AAAAAAAAAns/CUWGE8ToCD4/s400/5_6.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5667810301361653762" style="cursor: pointer; width: 400px; height: 400px; " /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;Calculator Files:&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;The &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadExjVnpKbHFEeGVZOEZPOXBxWnA2Wnc"&gt;spreadsheet&lt;/a&gt; contains population averages, the table of Fst distances, and individual results for included Project participants.&lt;/li&gt;&lt;li&gt;The download RAR file (&lt;a href="https://docs.google.com/open?id=0B7AJcY18g2GaOTYyZTQ0ZDAtNGRkYi00ZjUyLTg3NmUtODNmMjI5MzQwM2Yy"&gt;Google Docs&lt;/a&gt; or &lt;a href="http://www.sendspace.com/file/i60p7k"&gt;Sendspace&lt;/a&gt;) contains all the files needed to run the calculator. You must download and install &lt;a href="http://dodecad.blogspot.com/2011/09/do-it-yourself-dodecad-v-21.html"&gt;DIYDodecad 2.1&lt;/a&gt; first. In order to run the calculator, you follow the instructions of the README file, but type 'eurasia7' instead of 'dv3'.&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;Terms of use: &lt;/b&gt;'eurasia7', including all files in the downloaded RAR file is free for non-commercial personal use. Commercial uses are forbidden. Contact me for non-personal uses of the calculator.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;Technical Details:&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;The calculator is built using allele frequencies of K=7 ancestral components inferred by &lt;a href="http://www.genetics.ucla.edu/software/admixture/"&gt;ADMIXTURE 1.21&lt;/a&gt; analysis of 2,659 individuals. Markers included in the source datasets, as well as the Family Finder and 23andMe (as of Oct 21) platforms were included. The marker set was thinned of markers with less than 99.5% genotype rate and less than 0.5% minor allele frequency. Linkage-disequilibrium based pruning was carried out with a window size of 250 SNPs, advanced by 25 SNPs and R-squared greater than 0.4. A total of 164,990 SNPs remained after these filtering steps.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;All relevant populations available to me, and genotyped at a sufficient number of markers were included. Inclusion of the &lt;a href="http://en.wikipedia.org/wiki/Kalash_people"&gt;Kalash&lt;/a&gt; population resulted in a population-specific component at K=7, and hence their admixture components were inferred a posteriori. Their proportions are &lt;a href="http://dodecad.blogspot.com/2011/05/how-to-create-zombies-from-admixture.html"&gt;consistent&lt;/a&gt; with previous results, showing them to be a "West Asian" population (62.4%) with substantial "South Asian" admixture (37.1%), and near-complete absence of any other genetic components.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-7427955214628047298?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/7427955214628047298/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2011/10/eurasia7-calculator.html#comment-form" title="34 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/7427955214628047298?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/7427955214628047298?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2011/10/eurasia7-calculator.html" title="'eurasia7' calculator" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://3.bp.blogspot.com/-qt_LdB168BM/TqgYDr3-NwI/AAAAAAAAAnU/jcJ5XaJLrWs/s72-c/eurasia7_7.png" height="72" width="72" /><thr:total>34</thr:total></entry><entry gd:etag="W/&quot;DEADQnwzeCp7ImA9WhdaEUQ.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-3963958272077582643</id><published>2011-10-21T14:18:00.004+03:00</published><updated>2011-10-21T14:32:53.280+03:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-21T14:32:53.280+03:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="23andMe" /><title>23andMe data file changes</title><content type="html">A few recent submissions to the Project have alerted me to the fact that 23andMe has been making changes to its data download. It is unfortunate that such changes are made apparently "silently", as they may negatively impact third-party tools built around the 23andMe data.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Apparently, the orientation of some SNPs has been changed. This should not be a problem for DIYDodecad, as it handles orientation of different companies automatically. A different problem is that apparently some SNPs have been dropped from the data file altogether. This &lt;i&gt;is &lt;/i&gt;a problem if you are not using the latest 2.1 version of the DIYDodecad software, so you should upgrade to &lt;a href="http://dodecad.blogspot.com/2011/09/do-it-yourself-dodecad-v-21.html"&gt;2.1&lt;/a&gt;. It seems that about 80 of the SNPs expected by the 'dv3' calculator have been removed from the data file download, and these will appear as "absent". I do not expect 80 absent SNPs to have a huge impact on results, as they make up less than 0.1% of all SNPs in dv3.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;The change in format has other consequences as well; as many of you know, I have been working on Dodecad v4 for some time now. This would use common markers between 23andMe v2 and v3 platforms and Family Finder Illumina platform. However, I will now have to backtrack on it, to make sure that the marker set used is actually consistent with people's current 23andMe downloads. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;If you have a fresh 23andMe downloaded file and DIYDodecad 2.1 and you are unable to run 'dv3' or any other Project calculators, drop me a line.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-3963958272077582643?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/3963958272077582643/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2011/10/23andme-data-file-changes.html#comment-form" title="3 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/3963958272077582643?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/3963958272077582643?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2011/10/23andme-data-file-changes.html" title="23andMe data file changes" /><author><name>Dienekes</name><uri>http://www.blogger.com/profile/02082684850093948970</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://4.bp.blogspot.com/-KXXemZigoEc/TlK7wIUP_EI/AAAAAAAAEAk/uJ-FlueoC6o/s220/prosopon.png" /></author><thr:total>3</thr:total></entry><entry gd:etag="W/&quot;A0IBSHc8fCp7ImA9WhdaGEw.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-757367533271642614</id><published>2011-10-21T09:06:00.007+03:00</published><updated>2011-10-28T19:32:39.974+03:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-28T19:32:39.974+03:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Dodecad" /><title>Eurogenes is upset</title><content type="html">Eurogenes seems to be upset this week, first throwing a tantrum at Dr. McDonald and then at myself. You can probably find the cached text in Google for some time, although Eurogenes has deleted his anti-&lt;a href="http://webcache.googleusercontent.com/search?q=cache:7UKgyAmRr34J:bga101.blogspot.com/2011/10/tit-for-tat-with-dr-mcdonald-aka.html+tit+for+tat+mcdonald"&gt;McDonald tantrum&lt;/a&gt;, and changed the verbiage on the one directed &lt;a href="http://webcache.googleusercontent.com/search?q=cache:MPajC41qAVcJ:eurogenes.blogspot.com/2011/10/erroneous-results-from-dodecad-aka.html"&gt;against me&lt;/a&gt; on advice of some more cool-headed people. Here is the epilogue of his original anti-Dienekes rant:&lt;br /&gt;&lt;blockquote&gt;Dienekes, you've got a spreadsheet online showing all sorts of weird things. You need &lt;b&gt;stop being a prat&lt;/b&gt;, and do something about it ASAP.&lt;br /&gt;&lt;/blockquote&gt;Eurogenes' animus towards me is not surprising for those who have followed our interactions since the old days. Of course he is benefiting from my work (I have pointed him towards &lt;a href="http://dienekes.blogspot.com/2010/11/multidimensional-scaling-and-admixture.html?showComment=1289139148126#c8841741588815557417"&gt;data&lt;/a&gt; he didn't know existed, he is using DIYDodecad, as well as the &lt;a href="http://bga101.blogspot.com/2011/09/genetic-map-of-atlantic-fringe.html"&gt;1000Genomes&lt;/a&gt; data extracted with my code by the &lt;a href="http://magnusducatus.blogspot.com/2011/07/admixture-results-for-all-participants.html"&gt;MDLP&lt;/a&gt;), so &lt;b&gt;one would think that if he had any criticism against me, he would at least express it in a more dignified way.&lt;/b&gt;&lt;div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;Of course, being rude, ungrateful and mean-spirited does not mean one is wrong!&lt;/b&gt; So, what has Eurogenes actually discovered?&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;He noted the high Ukrainian West/East European ratio produced by Dodecad v3, and objected to my idea that Ukrainians were transitional to the Balkans and the Caucasus. Actually, according to the &lt;a href="http://4.bp.blogspot.com/-ztxlQ68e19Q/TnC0Z0rW6bI/AAAAAAAAEHc/MQny_v-ygqQ/s1600/pca-caucasus.png"&gt;PCA plot&lt;/a&gt; of the Yunusbayev et al. (2011) paper, they &lt;i&gt;are &lt;/i&gt;transitional, being situated toward both the Balkans and the Caucasus, relative to Belorussians/Lithuanians, i.e., the populations that generally show peaks of East European-related components. This is also supported by the &lt;a href="http://2.bp.blogspot.com/-AiqxfdH1kqo/TnC1DOX7yUI/AAAAAAAAEHk/LanedZAS19Q/s1600/admixture-caucasus.png"&gt;ADMIXTURE&lt;/a&gt; analysis that reveals Ukrainians to possess a Caucasus-centered component largely lacking in other Eastern Slavs, but shared with Balkan/Caucasus populations.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Should I have not tested the new Yunusbayev data with Dodecad v3 and reported their results? Of course not. &lt;b&gt;When one has a measuring instrument, one uses it on new data to test its performance and reports what he sees. &lt;/b&gt;This is exactly what I have done. At the same time, &lt;b&gt;one uses the new data to create new measuring instruments&lt;/b&gt; that have been trained using all available data, which is also what I have done with &lt;a href="http://dodecad.blogspot.com/2011/09/euro7-calculator.html"&gt;euro7&lt;/a&gt; and the upcoming Dodecad v4. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;To make matters worse, Eurogenes suggests that my euro7 analysis agrees with his &lt;a href="http://bga101.blogspot.com/2011/09/genetic-substructures-across-west.html"&gt;K=10&lt;/a&gt; which was presented two weeks later. &lt;b&gt;So, apparently, I am posting correct information about Ukrainians 2 weeks before he does, and this means that &lt;i&gt;I &lt;/i&gt;am turning around to &lt;i&gt;his &lt;/i&gt;way of thinking rather than vice versa.&lt;/b&gt; Go figure.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Eurogenes continues with his posting of supposed MDS/PCA plots supporting his thesis. &lt;b&gt;Actually, what he has posted are plots based on metric distances in the space of admixture proportions; these are &lt;i&gt;not &lt;/i&gt;genetic distances&lt;/b&gt; because e.g., a +/- 1% difference in a Sub-Saharan component results in the same Euclidean distance difference as a +/-1% in a European one, although the former affects genetic distance much more strongly than the latter. Metric distances are fine to quickly determine closeness of samples in the space of admixture proportions, but they are certainly no substitute for real genetic distances. I have already linked above with evidence that Ukrainians &lt;i&gt;are &lt;/i&gt;transitional to the Balkans and the Caucasus relative to the Yunusbaeyev et al. populations.&lt;/div&gt;&lt;meta equiv="content-type" content="text/html; charset=utf-8"&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I am also, apparently, accused of neglecting to point out the deficiencies of Dodecad v3, and I am invited by Eurogenes to retract it completely! This proposal is equivalent to the idea that we should burn old topographic maps that were based on measurements with sticks, ropes, and trigonometers, because we can now measure distances with laser beams. &lt;b&gt;And, it is funny indeed that I am supposedly neglecting the deficiencies of Dodecad v3 when, &lt;a href="http://dienekes.blogspot.com/2011/10/further-caution-on-admixture-estimates.html"&gt;3 weeks&lt;/a&gt; before the Eurogenes rant, I post exactly what its limitations are, and how it can be made better.&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;It is unfortunate that Eurogenes has chosen to go down that path. Envy is not a good guide to behavior, and perhaps, instead of &lt;a href="http://dodecad.blogspot.com/2011/10/comparing-different-admixture-runs.html?showComment=1319128382012#c5303451825267018085"&gt;relishing&lt;/a&gt; at the prospect of putting others down, he could spend a little more time inventing something of his own. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;As for myself, I will continue to work on my tools, and to &lt;a href="http://blogs.discovermagazine.com/gnxp/2011/09/dodecad-ancestry-project-is-at-10000/"&gt;encourage&lt;/a&gt; &lt;a href="http://dodecad.blogspot.com/2011/10/comparing-different-admixture-runs.html"&gt;cross-pollination&lt;/a&gt; &lt;a href="http://dodecad.blogspot.com/2011/09/third-party-tools-based-on-dodecad.html"&gt;between&lt;/a&gt; &lt;a href="http://dodecad.blogspot.com/2011/08/how-to-make-your-own-calculator-for.html"&gt;different&lt;/a&gt; projects for the benefit of all.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;UPDATE: &lt;/span&gt;In a newer post, Eurogenes attempts to justify his mishandling of MDS, by suggesting that he presented results based on raw SNP data. This is of course nonsense, since Eurogenes does not have the raw SNP data of the Dodecad populations. He is comparing apples and oranges by comparing plots made on raw data with those made in the space of admixture proportions. Furthermore, his supposed findings have no bearing on the Yunusbayev et al. ADMIXTURE and PCA results, posted above.&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-757367533271642614?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/757367533271642614/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2011/10/eurogenes-is-upset.html#comment-form" title="18 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/757367533271642614?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/757367533271642614?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2011/10/eurogenes-is-upset.html" title="Eurogenes is upset" /><author><name>Dienekes</name><uri>http://www.blogger.com/profile/02082684850093948970</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="32" height="32" src="http://4.bp.blogspot.com/-KXXemZigoEc/TlK7wIUP_EI/AAAAAAAAEAk/uJ-FlueoC6o/s220/prosopon.png" /></author><thr:total>18</thr:total></entry><entry gd:etag="W/&quot;Ck8FR3w9fip7ImA9WhdaEUw.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-8420561648076804576</id><published>2011-10-20T13:05:00.013+03:00</published><updated>2011-10-20T14:40:16.266+03:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-20T14:40:16.266+03:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Zombies" /><category scheme="http://www.blogger.com/atom/ns#" term="Experiments" /><title>Comparing different ADMIXTURE runs using Zombies</title><content type="html">&lt;div style="text-align: left;"&gt;My idea of using &lt;a href="http://dodecad.blogspot.com/2011/05/how-to-create-zombies-from-admixture.html"&gt;zombies&lt;/a&gt; with ADMIXTURE is the gift that keeps on giving. Remember that "zombies" are synthetic individuals created from ADMIXTURE output, representing the K inferred ancestral components. They can be viewed as hypothetical ancestral individuals representing each of these K components without any admixture from any of the others.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;An interesting problem that often comes up is to compare across different ADMIXTURE runs. I can think of at least three different applications of this:&lt;/div&gt;&lt;div&gt;&lt;ol&gt;&lt;li&gt;To compare components across &lt;b&gt;different K&lt;/b&gt;; for example, how does a "West Asian"-centered component at K=5 differ from a similarly-centered component at K=12?&lt;/li&gt;&lt;li&gt;To compare components across &lt;b&gt;different datasets&lt;/b&gt;; for example, how does a "West Asian"-centered component inferred from an existing dataset (e.g., the current &lt;a href="http://dodecad.blogspot.com/2011/06/design-of-dodecad-v3.html"&gt;Dodecad v3&lt;/a&gt;) differ from a "West Asian"-centered one from a new dataset (e.g., the upcoming Dodecad v4, which will also be trained on the valuable new populations of &lt;a href="http://dodecad.blogspot.com/2011/09/yunusbayev-et-al-2011-data-assessed.html"&gt;Yunusbayev et al. 2011&lt;/a&gt;)&lt;/li&gt;&lt;li&gt;To compare components across &lt;b&gt;different projects&lt;/b&gt;; there has been a proliferation of different ancestry projects since the launching of Dodecad nearly a year ago, and since all of them slightly different individuals/SNPs/terminology, it is quite useful to be able to gauge how one component from one project maps onto other components in other projects.&lt;/li&gt;&lt;/ol&gt;As proof of concept, I took the &lt;a href="http://magnusducatus.blogspot.com/2011/08/i-have-modified-diydodecad-calculator.html"&gt;MDLP&lt;/a&gt; calculator from the &lt;a href="http://magnusducatus.blogspot.com/"&gt;Magnus Ducatus Lituaniae Project&lt;/a&gt; and generated 50 zombies for each of its 7 ancestral components:&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;ol&gt;&lt;li&gt;Scandinavian&lt;/li&gt;&lt;li&gt;Volga_Region&lt;/li&gt;&lt;li&gt;Altaic&lt;/li&gt;&lt;li&gt;Celto_Germanic&lt;/li&gt;&lt;li&gt;Caucassian_Anatolian_Balkanic&lt;/li&gt;&lt;li&gt;Balto_Slavic&lt;/li&gt;&lt;li&gt;North_Atlantic&lt;/li&gt;&lt;/ol&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;I then inferred the ancestry of the MDLP zombies using Dodecad v3, and vice versa. Since Dodecad v3 also includes populations (e.g., Africans) not considered by MDLP, I did not try to map those onto MDLP.&lt;/div&gt;&lt;div&gt;&lt;meta equiv="content-type" content="text/html; charset=utf-8"&gt;&lt;meta equiv="content-type" content="text/html; charset=utf-8"&gt;&lt;a href="http://3.bp.blogspot.com/-nqiLJqxauNI/Tp_4sYaQpUI/AAAAAAAAAmk/tIVhzcQST5w/s1600/_7.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;img src="http://3.bp.blogspot.com/-nqiLJqxauNI/Tp_4sYaQpUI/AAAAAAAAAmk/tIVhzcQST5w/s400/_7.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5665520297605899586" style="display: block; margin-top: 0px; margin-right: auto; margin-bottom: 10px; margin-left: auto; text-align: center; cursor: pointer; width: 400px; height: 256px; " /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;div style="text-align: center;"&gt;&lt;span class="Apple-style-span"&gt;&lt;u&gt;&lt;br /&gt;&lt;/u&gt;&lt;/span&gt;&lt;/div&gt;&lt;a href="http://4.bp.blogspot.com/-YUl_aoCCXnc/Tp_4pKsWHrI/AAAAAAAAAmY/X2Dnj4KJl5s/s1600/MDLPOndv3.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;img src="http://4.bp.blogspot.com/-YUl_aoCCXnc/Tp_4pKsWHrI/AAAAAAAAAmY/X2Dnj4KJl5s/s400/MDLPOndv3.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5665520242384051890" style="display: block; margin-top: 0px; margin-right: auto; margin-bottom: 10px; margin-left: auto; text-align: center; cursor: pointer; width: 400px; height: 256px; " /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div style="text-align: center;"&gt;&lt;meta equiv="content-type" content="text/html; charset=utf-8"&gt;&lt;a href="http://3.bp.blogspot.com/-nqiLJqxauNI/Tp_4sYaQpUI/AAAAAAAAAmk/tIVhzcQST5w/s1600/_7.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;br /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;I will comment on the MDLP-to-dv3 mapping:&lt;/div&gt;&lt;div&gt;&lt;ol&gt;&lt;li&gt;The MDLP "Scandinavian" component appears to be West/East European with a little Mediterranean and a little Northeast Asian&lt;/li&gt;&lt;li&gt;The MDLP "Volga_Region" component appears to be East European with some Northeast Asian&lt;/li&gt;&lt;li&gt;The MDLP "Altaic" component is West Asian+Northeast Asian+Southeast Asian. Note that in Dodecad v3, the Northeast Asian component peaks at Chukchi, Nganasan, and Koryak, and most other east Eurasian populations have much less of it&lt;/li&gt;&lt;li&gt;The MDLP "Celto-Germanic" component is (surprisingly) Mediterranean-dominated. One possible interpretation is that in the context of MDLP this captures one aspect of the difference between Southwestern and Northeastern Europe -higher Mediterranean in the former-, whereas the...&lt;/li&gt;&lt;li&gt;... MDLP "North-Atlantic" component seems to be entirely West European, and is capturing a different aspect of east-west variation in Europe. &lt;/li&gt;&lt;li&gt;The MDLP "Balto-Slavic" appears the reverse of the "Celto-Germanic" with lower Mediterranean and reversed East/West European&lt;/li&gt;&lt;li&gt;Finally, the MDLP "Caucassian_Anatolian_Balkanic" component is predictably mainly West Asian, but with a little Mediterranean and Southwest Asian as well&lt;/li&gt;&lt;/ol&gt;A different way of comparing the different components is to include them all in a joint MDS plot, or calculate various types of distances between them (e.g., Fst). &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;meta equiv="content-type" content="text/html; charset=utf-8"&gt;&lt;div&gt;For example, the first couple of dimensions are dominated by the African/Asian components of Dodecad v3 that are not present in MDLP. Notice, however, the position of "Altaic", right where one might expect to find it between West and East Eurasians.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;a href="http://3.bp.blogspot.com/-bmpdrBXRQV8/TqABqNqILEI/AAAAAAAAAmw/XrIXPcpgO2s/s1600/MDLP_dv3.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;img src="http://3.bp.blogspot.com/-bmpdrBXRQV8/TqABqNqILEI/AAAAAAAAAmw/XrIXPcpgO2s/s400/MDLP_dv3.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5665530155964574786" style="display: block; margin-top: 0px; margin-right: auto; margin-bottom: 10px; margin-left: auto; text-align: center; cursor: pointer; width: 400px; height: 400px; " /&gt;&lt;/a&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;Limiting ourselves to only European populations, we obtain:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;meta equiv="content-type" content="text/html; charset=utf-8"&gt;&lt;a href="http://2.bp.blogspot.com/-7x7WY7OHUSk/TqAHTw3UP0I/AAAAAAAAAnI/gDCnw1IVyEk/s1600/plink.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;img src="http://2.bp.blogspot.com/-7x7WY7OHUSk/TqAHTw3UP0I/AAAAAAAAAnI/gDCnw1IVyEk/s400/plink.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5665536367347908418" style="display: block; margin-top: 0px; margin-right: auto; margin-bottom: 10px; margin-left: auto; text-align: center; cursor: pointer; width: 400px; height: 400px; " /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;It appears that the "North_Atlantic" component may be centered on a small number of related individuals.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;I encourage other genome bloggers to try their own hand at comparing their components with those of other projects, or even their own.&lt;/b&gt; This process will be made possible if people using ADMIXTURE follow the simple instructions to convert &lt;a href="http://dodecad.blogspot.com/2011/08/how-to-make-your-own-calculator-for.html"&gt;their output&lt;/a&gt; for use with DIYDodecad.&lt;/div&gt;&lt;div style="text-align: left;"&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Once Dodecad v4 is off the ground, and if I find time to fully automate the process, I will perhaps try to map all my past calculators (i.e., the initial K=10, Dodecad v3, 'bat', 'euro7', 'weac', 'africa9') onto the new golden standard of the Project.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;PS: This analysis was done on ~63k SNPs in common between MDLP and Dodecad v3&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-8420561648076804576?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/8420561648076804576/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2011/10/comparing-different-admixture-runs.html#comment-form" title="10 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/8420561648076804576?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/8420561648076804576?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2011/10/comparing-different-admixture-runs.html" title="Comparing different ADMIXTURE runs using Zombies" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://3.bp.blogspot.com/-nqiLJqxauNI/Tp_4sYaQpUI/AAAAAAAAAmk/tIVhzcQST5w/s72-c/_7.png" height="72" width="72" /><thr:total>10</thr:total></entry><entry gd:etag="W/&quot;CEIMR3ozeip7ImA9WhdUFE8.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-2952912629604507029</id><published>2011-09-30T12:07:00.017+03:00</published><updated>2011-10-01T01:43:06.482+03:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-10-01T01:43:06.482+03:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="DIYDodecad" /><category scheme="http://www.blogger.com/atom/ns#" term="Anatolia" /><category scheme="http://www.blogger.com/atom/ns#" term="Europe" /><category scheme="http://www.blogger.com/atom/ns#" term="Caucasus" /><title>'euro7' calculator</title><content type="html">&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/-F9XoyN2wE-w/ToZBZZT7jdI/AAAAAAAAAmI/qwmGu64Xvwk/s1600/admixture.png"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 213px; height: 400px;" src="http://1.bp.blogspot.com/-F9XoyN2wE-w/ToZBZZT7jdI/AAAAAAAAAmI/qwmGu64Xvwk/s400/admixture.png" alt="" id="BLOGGER_PHOTO_ID_5658281886384623058" border="0" /&gt;&lt;/a&gt;I am releasing a new calculator for &lt;span style="font-weight: bold;"&gt;Europeans, including their immediate neighboring populations around the Black Sea (Caucasus and Anatolia)&lt;/span&gt;. The calculator can be used with &lt;a href="http://dodecad.blogspot.com/2011/09/do-it-yourself-dodecad-v-21.html"&gt;DIYDodecad&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;There are additional African and Far-Asian population controls, so, in principle, the calculator could be used by non-Europeans/Anatolians/Caucasians, although I would be less confident of their results. For example, people of South Asian ancestry may obtain a Far-Asian result if they use this calculator, due to the deep affinity of Ancestral South Indians with East Asians. Other West Eurasians and West Eurasian-admixed peoples, not from the studied regions (e.g., Arabians or East Africans) will have their West Eurasian components mapped onto the ones used in this calculator.&lt;br /&gt;&lt;br /&gt;'euro7' uses 7 ancestral components:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Caucasus&lt;/li&gt;&lt;li&gt;Northwestern&lt;/li&gt;&lt;li&gt;Northeastern&lt;/li&gt;&lt;li&gt;Southeastern&lt;/li&gt;&lt;li&gt;African&lt;/li&gt;&lt;li&gt;Far_Asian&lt;/li&gt;&lt;li&gt;Southwestern&lt;/li&gt;&lt;/ul&gt;These names represent 7 ancestral populations inferred by ADMIXTURE, and have been chosen based on the geographical regions where each of them achieves its maximum representation. You should always refer to &lt;a href="http://dienekes.blogspot.com/2011/03/note-of-caution-on-admixture-estimates.html"&gt;A note of caution on admixture estimates&lt;/a&gt;, &lt;a href="http://dienekes.blogspot.com/2011/06/interpretation-of-admixture-results.html"&gt;Interpretation of ADMIXTURE results: component sharing&lt;/a&gt;, as well as the average population values in the &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadGd1UEFIbzVlUEtpbTd0S0RLcnVYTEE&amp;amp;hl=en_US"&gt;spreadsheet&lt;/a&gt; when interpreting your individual results.&lt;br /&gt;&lt;br /&gt;The distribution of these 7 components can be seen in the barplot on the top left, and precise admixture proportions can be found in the &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadGd1UEFIbzVlUEtpbTd0S0RLcnVYTEE&amp;amp;hl=en_US"&gt;spreadsheet&lt;/a&gt;. Note that additional samples have been used to infer these components, but as these come from Dodecad populations with less than 5 participants, I am not reporting average values for them, as per the usual project policy.&lt;br /&gt;&lt;br /&gt;Here is the neighbor-joining tree based on the Fst divergences between the 7 ancestral components:&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/-vph_yaHMvAk/ToWLgozFHhI/AAAAAAAAAl8/IPV77uzAscM/s1600/nj.png"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 400px; height: 400px;" src="http://3.bp.blogspot.com/-vph_yaHMvAk/ToWLgozFHhI/AAAAAAAAAl8/IPV77uzAscM/s400/nj.png" alt="" id="BLOGGER_PHOTO_ID_5658081899684634130" border="0" /&gt;&lt;/a&gt;&lt;span style="font-weight: bold;"&gt;Instructions:&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;You can download the calculator RAR from &lt;a href="https://docs.google.com/viewer?a=v&amp;amp;pid=explorer&amp;amp;chrome=true&amp;amp;srcid=0B7AJcY18g2GaM2ZlOGQ0NjMtYzRlMS00YjA5LWIzZmUtM2RkMDIyNDEzZWZm&amp;amp;hl=en_US"&gt;here&lt;/a&gt; (Google docs; File-&amp;gt;Download original), or &lt;a href="http://www.sendspace.com/file/qfo0kv"&gt;here&lt;/a&gt; (sendspace).&lt;br /&gt;&lt;br /&gt;You need to extract the contents of the RAR file to the working directory of &lt;a href="http://dodecad.blogspot.com/2011/09/do-it-yourself-dodecad-v-21.html"&gt;DIYDodecad&lt;/a&gt;. You use it by following exactly the instructions of the DIYDodecad README, but always type 'euro7' instead of 'dv3' in these instructions.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Terms of use:&lt;/span&gt; 'euro7', including all files in the downloaded RAR file is free for non-commercial personal use. Commercial uses are forbidden. Contact me for non-personal uses of the calculator.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Calculators released by the Dodecad Project: &lt;/span&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://dodecad.blogspot.com/2011/09/bat-calculator-balkans-anatolia-turkic.html"&gt;bat&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://dodecad.blogspot.com/2011/09/africa9-calculator.html"&gt;africa9&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://dodecad.blogspot.com/2011/09/weac-calculator.html"&gt;weac&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://dodecad.blogspot.com/2011/09/euro7-calculator.html"&gt;euro7&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-2952912629604507029?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/2952912629604507029/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2011/09/euro7-calculator.html#comment-form" title="44 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/2952912629604507029?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/2952912629604507029?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2011/09/euro7-calculator.html" title="'euro7' calculator" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://1.bp.blogspot.com/-F9XoyN2wE-w/ToZBZZT7jdI/AAAAAAAAAmI/qwmGu64Xvwk/s72-c/admixture.png" height="72" width="72" /><thr:total>44</thr:total></entry><entry gd:etag="W/&quot;C0IDQXg7fSp7ImA9WhdUEEk.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-6733427098571529244</id><published>2011-09-25T23:26:00.010+03:00</published><updated>2011-09-26T15:52:50.605+03:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-09-26T15:52:50.605+03:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="Dodecad" /><category scheme="http://www.blogger.com/atom/ns#" term="Turkic" /><category scheme="http://www.blogger.com/atom/ns#" term="Balkans" /><category scheme="http://www.blogger.com/atom/ns#" term="Slavic" /><category scheme="http://www.blogger.com/atom/ns#" term="Armenians" /><category scheme="http://www.blogger.com/atom/ns#" term="Iranian" /><category scheme="http://www.blogger.com/atom/ns#" term="Caucasus" /><title>Yunusbayev et al. (2011) data assessed with Dodecad v3</title><content type="html">&lt;div style="text-align: left;"&gt;I have acquired the &lt;a href="http://www.evolutsioon.ut.ee/MAIT/caucasus_data/"&gt;data&lt;/a&gt; from the recent &lt;a href="http://dienekes.blogspot.com/2011/09/caucasus-revisited-yunusbayev-et-al.html"&gt;Yunusbayev et al. (2011)&lt;/a&gt; paper on the Caucasus. This includes the following populations:&lt;/div&gt;&lt;div&gt;&lt;div&gt;&lt;div&gt;&lt;ul&gt;&lt;li&gt;Kurds_Y&lt;span class="Apple-tab-span" style="white-space:pre"&gt; &lt;/span&gt;6&lt;/li&gt;&lt;li&gt;Bulgarians_Y&lt;span class="Apple-tab-span" style="white-space:pre"&gt; &lt;/span&gt;13&lt;/li&gt;&lt;li&gt;Ukranians_Y&lt;span class="Apple-tab-span" style="white-space:pre"&gt; &lt;/span&gt;20&lt;/li&gt;&lt;li&gt;Mordovians_Y&lt;span class="Apple-tab-span" style="white-space:pre"&gt; &lt;/span&gt;15&lt;/li&gt;&lt;li&gt;Armenians_Y&lt;span class="Apple-tab-span" style="white-space:pre"&gt; &lt;/span&gt;16&lt;/li&gt;&lt;li&gt;Abhkasians_Y&lt;span class="Apple-tab-span" style="white-space:pre"&gt; &lt;/span&gt;20&lt;/li&gt;&lt;li&gt;Balkars_Y&lt;span class="Apple-tab-span" style="white-space:pre"&gt; &lt;/span&gt;19&lt;/li&gt;&lt;li&gt;North_Ossetians_Y&lt;span class="Apple-tab-span" style="white-space:pre"&gt; &lt;/span&gt;15&lt;/li&gt;&lt;li&gt;Chechens_Y&lt;span class="Apple-tab-span" style="white-space:pre"&gt; &lt;/span&gt;20&lt;/li&gt;&lt;li&gt;Nogais_Y&lt;span class="Apple-tab-span" style="white-space:pre"&gt; &lt;/span&gt;16&lt;/li&gt;&lt;li&gt;Kumyks_Y&lt;span class="Apple-tab-span" style="white-space:pre"&gt; &lt;/span&gt;14&lt;/li&gt;&lt;li&gt;Turkmens_Y&lt;span class="Apple-tab-span" style="white-space:pre"&gt; &lt;/span&gt;15&lt;/li&gt;&lt;li&gt;Tajiks_Y&lt;span class="Apple-tab-span" style="white-space:pre"&gt; &lt;/span&gt;15&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div&gt;It is a valuable new addition to the Project, and it is commendable that it has been made publicly and easily available so swiftly after the appearance of the &lt;a href="http://mbe.oxfordjournals.org/content/early/2011/09/13/molbev.msr221.abstract"&gt;Yunusbayev et al. (2011)&lt;/a&gt; paper.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;To get the ball rolling on the new Yunusbayev et al. data, I will map the new populations onto the &lt;a href="http://dodecad.blogspot.com/2011/06/design-of-dodecad-v3.html"&gt;Dodecad v3&lt;/a&gt; components; they will be added to the &lt;a href="https://spreadsheets.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadDUyeEtjNnBmY09EbnowN3M3UWRyNnc&amp;amp;hl=en_US&amp;amp;authkey=COCa89AJ"&gt;Dodecad v3 spreadsheet&lt;/a&gt; as they are calculated.&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;I have been laboriously designing a new global (including Amerindians and Australasians) Dodecad X1&lt;b&gt; &lt;/b&gt;experimental calculator with 3,010 individuals for a few weeks now, but I guess I will now have to reboot it with 3,214.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Together with some other new data I recently discovered, I now have &lt;b&gt;9,799 individuals&lt;/b&gt; (some duplicates from different sources) in my global database. My Dodecad dataset of 511 individuals from a single country or ethnic group isn't too shabby either. Let's hope for a new data release that will push the data collection above the magic 10,000.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;UDDATE:&lt;/b&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;&lt;br /&gt;&lt;/b&gt;&lt;/div&gt;&lt;div&gt;I have added the first 7 populations to the spreadsheet; the others are being calculated as we speak. Most of them seem in line with expectations, but the Abkhasian sample has one outlier individual (abh27), and has thus been placed in the "Outliers" tab of the &lt;a href="https://spreadsheets.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadDUyeEtjNnBmY09EbnowN3M3UWRyNnc&amp;amp;hl=en_US&amp;amp;authkey=COCa89AJ"&gt;spreadsheet&lt;/a&gt;; a new set of admixture proportions, minus that outlier individual, will be calculated anew:&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;a href="http://1.bp.blogspot.com/-NHwD_RC7NCU/ToAPkb9lZLI/AAAAAAAAAl0/psVvuC9Tn6g/s1600/ADMIXTURE%2BAbhkasians_Y_12.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;img src="http://1.bp.blogspot.com/-NHwD_RC7NCU/ToAPkb9lZLI/AAAAAAAAAl0/psVvuC9Tn6g/s400/ADMIXTURE%2BAbhkasians_Y_12.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5656538250633110706" style="display: block; margin-top: 0px; margin-right: auto; margin-bottom: 10px; margin-left: auto; text-align: center; cursor: pointer; width: 400px; height: 250px; " /&gt;&lt;/a&gt;&lt;/div&gt;&lt;div&gt;&lt;b&gt;UPDATE II: &lt;/b&gt;The population portraits have been uploaded to Google Docs as a &lt;a href="https://docs.google.com/viewer?a=v&amp;amp;pid=explorer&amp;amp;chrome=true&amp;amp;srcid=0B7AJcY18g2GaMDA5NjE3NTMtNzkwNi00MTdhLTk5MTMtY2NlM2ZjY2IwNGFi&amp;amp;hl=en"&gt;rar file&lt;/a&gt; (Sendspace &lt;a href="http://www.sendspace.com/file/262nty"&gt;mirror&lt;/a&gt;). Average admixture results have all been entered to the &lt;a href="https://spreadsheets.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadDUyeEtjNnBmY09EbnowN3M3UWRyNnc&amp;amp;hl=en_US&amp;amp;authkey=COCa89AJ"&gt;spreadsheet&lt;/a&gt;.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-6733427098571529244?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/6733427098571529244/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2011/09/yunusbayev-et-al-2011-data-assessed.html#comment-form" title="31 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/6733427098571529244?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/6733427098571529244?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2011/09/yunusbayev-et-al-2011-data-assessed.html" title="Yunusbayev et al. (2011) data assessed with Dodecad v3" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://1.bp.blogspot.com/-NHwD_RC7NCU/ToAPkb9lZLI/AAAAAAAAAl0/psVvuC9Tn6g/s72-c/ADMIXTURE%2BAbhkasians_Y_12.png" height="72" width="72" /><thr:total>31</thr:total></entry><entry gd:etag="W/&quot;CkEERngyfSp7ImA9WhdVFk4.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-3699585454680721264</id><published>2011-09-21T21:18:00.004+03:00</published><updated>2011-09-21T21:43:27.695+03:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-09-21T21:43:27.695+03:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="DIYDodecad" /><title>'weac' calculator</title><content type="html">&lt;a href="http://2.bp.blogspot.com/-qAdi1qnYmi4/TnorECBBe3I/AAAAAAAAAls/JDPdP56KMtY/s1600/weac_4.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;img style="float:left; margin:0 10px 10px 0;cursor:pointer; cursor:hand;width: 400px; height: 256px;" src="http://2.bp.blogspot.com/-qAdi1qnYmi4/TnorECBBe3I/AAAAAAAAAls/JDPdP56KMtY/s400/weac_4.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5654879630377712498" /&gt;&lt;/a&gt;&lt;br /&gt;This new calculator places individuals on the West Eurasian cline. This cline is the first-order description of variation in West Eurasians, with populations from northern and western Europe falling on one end, and those from the Near East on the other.&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;On the left, you can see the populations on which the calculator is based, sorted on their average "Atlantic-Baltic" component. The raw data can be found in the &lt;a href="https://docs.google.com/spreadsheet/ccc?key=0ArAJcY18g2GadDZBNmxiWG45WDBJNHlrN1YzMFBKRWc&amp;amp;hl=en_US"&gt;spreadsheet&lt;/a&gt;. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Note that the main purpose of the calculator is to place European and Near Eastern samples on the West Eurasian cline, and to do so, some African and East Eurasian populations are used as controls. Other types of ancestry (e.g., South Asian or Amerindian) may register as Far-Asian in the context of this test.&lt;br /&gt;&lt;div&gt;&lt;br /&gt;You can download the calculator RAR from &lt;a href="https://docs.google.com/viewer?a=v&amp;amp;pid=explorer&amp;amp;chrome=true&amp;amp;srcid=0B7AJcY18g2GaYzI1MDZlNmQtOGU1ZC00MmE3LThhOTAtMzcyNGE1NjU0ZDQ2&amp;amp;hl=en_US"&gt;here&lt;/a&gt; (Google docs), or &lt;a href="http://www.sendspace.com/file/q4s2vi"&gt;here&lt;/a&gt; (sendspace).&lt;br /&gt;&lt;br /&gt;You need to extract the contents of the RAR file to the working directory of &lt;a href="http://dodecad.blogspot.com/2011/09/do-it-yourself-dodecad-v-21.html"&gt;DIYDodecad&lt;/a&gt;. You use it by following exactly the instructions of the DIYDodecad README, but always type 'weac' instead of 'dv3' in these instructions.&lt;br /&gt;&lt;br /&gt;Terms of use: 'weac', including all files in the downloaded RAR file is free for non-commercial personal use. Commercial uses are forbidden. Contact me for non-personal uses of the calculator.&lt;br /&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-3699585454680721264?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/3699585454680721264/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2011/09/weac-calculator.html#comment-form" title="4 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/3699585454680721264?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/3699585454680721264?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2011/09/weac-calculator.html" title="'weac' calculator" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><media:thumbnail xmlns:media="http://search.yahoo.com/mrss/" url="http://2.bp.blogspot.com/-qAdi1qnYmi4/TnorECBBe3I/AAAAAAAAAls/JDPdP56KMtY/s72-c/weac_4.png" height="72" width="72" /><thr:total>4</thr:total></entry><entry gd:etag="W/&quot;CkcAQng7cCp7ImA9WhdVE0k.&quot;"><id>tag:blogger.com,1999:blog-6533996127304587865.post-3579669265043471437</id><published>2011-09-18T12:30:00.006+03:00</published><updated>2011-09-18T13:00:43.608+03:00</updated><app:edited xmlns:app="http://www.w3.org/2007/app">2011-09-18T13:00:43.608+03:00</app:edited><category scheme="http://www.blogger.com/atom/ns#" term="DIYDodecad" /><title>Do-It-Yourself Dodecad v 2.1</title><content type="html">&lt;div&gt;DIYDodecad v 2.1 allows incomplete genotype files to be used, i.e., genotype&lt;/div&gt;&lt;div&gt;files that do not include all expected SNP markers used in a calculator. This &lt;/div&gt;&lt;div&gt;is useful to individuals having older genotype files from their testing &lt;/div&gt;&lt;div&gt;companies, and allows the tool to be used with any type of genotype data, and&lt;/div&gt;&lt;div&gt;not only the Illumina platforms currently used by 23andMe and FamilyTreeDNA. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;There is a minimum requirement of at least 100 usable SNPs, i.e., SNPs that are&lt;/div&gt;&lt;div&gt;in the genotype file and do not have no-calls. &lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;If you had previously followed the instructions &lt;i&gt;carefully&lt;/i&gt;, and got an "end of file reached" error, this was most likely due to your genotype file lacking some of the expected markers used in the calculator. Version 2.1 should work for you.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;You can download it from &lt;a href="https://docs.google.com/viewer?a=v&amp;amp;pid=explorer&amp;amp;chrome=true&amp;amp;srcid=0B7AJcY18g2GaZGU4OWQ5OWItMzY2NC00NzI1LWIzNWMtMzUxYWI4NjRmMTlk&amp;amp;hl=en_US"&gt;here&lt;/a&gt; (Google Docs, File-&amp;gt;Download Original), or &lt;a href="http://www.sendspace.com/file/ti3ey7"&gt;here&lt;/a&gt; (Sendspace). Uncompress DIYDodecad2.1.rar to a local directory on your computer, and follow the instructions in the README.txt file.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;Past versions: &lt;a href="http://dodecad.blogspot.com/2011/08/do-it-yourself-dodecad-v-20.html"&gt;2.0&lt;/a&gt;, &lt;a href="http://dodecad.blogspot.com/2011/07/do-it-yourself-dodecad-v-10.html"&gt;1.0&lt;/a&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/6533996127304587865-3579669265043471437?l=dodecad.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel="replies" type="application/atom+xml" href="http://dodecad.blogspot.com/feeds/3579669265043471437/comments/default" title="Post Comments" /><link rel="replies" type="text/html" href="http://dodecad.blogspot.com/2011/09/do-it-yourself-dodecad-v-21.html#comment-form" title="4 Comments" /><link rel="edit" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/3579669265043471437?v=2" /><link rel="self" type="application/atom+xml" href="http://www.blogger.com/feeds/6533996127304587865/posts/default/3579669265043471437?v=2" /><link rel="alternate" type="text/html" href="http://dodecad.blogspot.com/2011/09/do-it-yourself-dodecad-v-21.html" title="Do-It-Yourself Dodecad v 2.1" /><author><name>Dodecad Project</name><uri>http://www.blogger.com/profile/10447516703222698752</uri><email>noreply@blogger.com</email><gd:image rel="http://schemas.google.com/g/2005#thumbnail" width="16" height="16" src="http://img2.blogblog.com/img/b16-rounded.gif" /></author><thr:total>4</thr:total></entry></feed>

