Best of EvX: How Turkic is Turkey?


map of the spread of farming in Turkey/Anatolia/Europe

Hello my Turkish and Turkic readers! In honor of having written a lot on this blog, we’re taking a look back at our most popular posts, and today’s is on the genetic history of Turkey and the Turkic peoples.

Since my original post, I have learned many things about Turkey–mostly that Turks and other Turkic peoples love their culture and heritage. Note: I will probably use “Turkey” and “Anatolia”, interchangeably in this post. Turkey is the name for the modern state located in the region; Anatolia is a more generic name for the geography. I know that “Turkey” as a state or even a people didn’t exist 8,000 years ago.

Turkey has a long and fascinating history. It is possibly the cradle of civilization, as sites like Gobekli Tepe attest, and one of the birthplaces of agriculture.

1280px-j228y-dna29Early farmers spread out from Anatolia into Europe and Asia, contributing much of the modern European gene pool. There are many Y-DNA haplogroups in modern Turkey, which most likely means the Turkish male population hasn’t been completely replaced in recent invasions. (It’s not uncommon for an invasion to wipe out 80+% of the male population in an area.) About 24% of Turkish men carry haplogroup J2, which might not have originated in Turkey all of those centuries ago, but by 12,000 years ago it was common throughout Turkey (and today remains the most common haplogroup). This lineage spread with the Anatolian farmers into Europe around 8,000 years ago. and presumably Asia, as well.

From Haak et al

The second most common Y-haplogroup, at 16%, is good old R1b, which was carried into Turkey around 5-6,000 years ago by the Indo-European invaders. (The Indo-European invasion in Spain apparently wiped out all of the local men, but was not nearly so bad in Turkey.) These invaders spoke the Anatolian branch of the Indo-European tree, including Hittite and Luwian.

The Anatolian languages went extinct following Anatolia’s conquest by Alexander the Great in the 4th century BC (though it took several centuries for the languages to fall completely out of use.)

Haplogroup G–11%–is most common in the Caucasus, spread thinly over much of Anatolia and Iran, and even more thinly through Europe, North Africa, and central Asia. It’s probably a pretty old group–Otzi the Iceman was a member of the G clade.

Haplogroup E-M215 is found in about 10% of Turks and is most common in North Africa and the Horn of Africa, but is also quite common in Bedouin populations. It seems likely to be a very old haplogroup.

J1–9%–is common throughout the Middle East and amusingly reaches 46% among Jewish men named “Cohen.”

The rest of Turkish Y-chromosomes hail either from related haplogroups, like R1a, or represent smaller fractions of the population, like Q, 2%, commonly found in Siberia and Native Americans.

(Information on all Turkish Y-haplogroups.)

TurkmenSo how much Turkish DNA hails from Turkic peoples?

Modern Turks don’t speak Anatolian or Greek. They speak a Turkic language, which hails originally from an area near Mongolia. The Turkic-speaking peoples migrated into Anatolia around a thousand years ago, after a long migration/expansion through central Eurasia that culminated with the conquering of Constantinople. Today, the most notable Turkic-speaking groups are the Turks of Turkey,  AzerbaijanisUzbeksKazakhsTurkmen and Kyrgyz people.

The difficulty with tracing Turkic DNA is that, unlike the Mongols, Turkic DNA isn’t terribly homogeneous. The Mongols left a definite genetic signature wherever they went, but imparted less of their language–that is, they killed, raped, and taxed, but didn’t mix much with the locals. By contrast, the Turkic peoples seem to have mixed with their neighbors as they spread, imparting their language and probably not massacring too many people.

Asian, Australian, and Melanesian ethic groups (including Indian, Middle Eastern, and Chinese) from Haak et al’s dataset

According to Wikipedia:

The largest autosomal study on Turkish genetics (on 16 individuals) concluded the weight of East Asian (presumably Central Asian) migration legacy of the Turkish people is estimated at 21.7%.[1]

Note that Turkey shares haplogroup J2 with its Turkic neighbors. This raises an interesting possibility: early Anatolian farmers spread into central Eurasia, mixed with local nomadic Turkic speakers, and then migrated back into Turkey. But 16 people isn’t much of a study.

“South Asian contribution to Turkey’s population was significantly higher than East/Central Asian contributions, suggesting that the genetic variation of medieval Central Asian populations may be more closely related to South Asian populations, or that there was continued low level migration from South Asia into Anatolia.”

“South Asian” here I assume means that Turkey looks more like Iran than Uzbekistan, which is true. The Turkic wanderers likely passed through Iran on their way to Turkey, picking up Iranian culture (such as Islam) and DNA–plus the pre-existing Anatolian population was probably closer to Iran than Uzbekistan anyway.

… the exact kinship between current East Asians and the medieval Oghuz Turks is uncertain. For instance, genetic pools of Central Asian Turkic peoples is particularly diverse and modern Oghuz Turkmens living in Central Asia are with higher West Eurasian genetic component than East Eurasian.[2][3][4]

I think “West Eurasian” is a euphemism for “Caucasian.” East Eurasian (aka Asian) DNA, you can see in the map above, tends to be red+yellow, tending toward all red in Siberia and all yellow in Taiwan. Indo-European groups, including Iranians, tend to have a teal/blue/orange pattern. Turkmen, Uzbeks, and Uygurs, as you can see in the graph, have a combination of both sets of DNA. The Turks also have a small amount of east Asian DNA–but much less–while their neighbors in Iran and central Eurasia share a little Indian DNA.

Several studies have concluded that the genetic haplogroups indigenous to Western Asia have the largest share in the gene pool of the present-day Turkish population.[5][6][7][8][5][9][10][11] An admixture analysis determined that the Anatolian Turks share most of their genetic ancestry with non-Turkic populations in the region and the 12th century is set as an admixture date.[12]

Western Asia=Middle East.

So Turkish DNA is about 22% Turkic, from nomads who entered the country via Iran, and about 78% ancient Anatolian, from the people who had already lived there on the Anatolian plateau for centuries.

But as the Turkic peoples (and many of the comments on my original post) show, culture doesn’t have to be genetic, and many Turkic people feel a strong cultural connection to each other. (And many people report that various Turkic languages are pretty easy to understand if you speak one Turkic language–EG:

hello everyone I’m an Uzbek,

… tatars played a great role in Genghis’s empire and they had an empire after dividing the empire called Golden Horde, it was mongol state but after it became to turki with a time. and their sons are kazakh and kirgiz. Thats why we uzbeks can understand turkish easly more than our neighboors kazakhs. and we uzbeks are not mongoloid like kazakhs.because uzbek language has oghuz and karluk dialect. uzbek-uygur are like turkish-azerbaijani or turkish-crimean tatar. thats why uzbek dialect is most understandable language for every turkic people. but we can understand %95 uygur, %85 turkish-turkmen, %70 azerbaijani %50 kazakh.

Our Uzbeki friend’s full comment is very interesting, and I recommend you read the whole thing.

For that matter, many thanks to everyone who has left interesting comments sharing your family’s histories or personal perspectives on Turkish/Turkic culture and history over the years–I hope you have enjoyed this update.


Which Groups Have the most Neanderthal and Denisovan DNA?  


Neanderthal and Denisovan contributions to different populations by chromosome (source)

Here are the numbers I’ve found so far for Neanderthal and Denisovan DNA in different populations:

 et al, in The Combined Landscape of Denisovan and Neanderthal Ancestry in Present-Day Humans, 2016, report:

Native Americans: 1.37%
Central Asia: 1.4%
East Asia: 1.39%
Oceana (Melanesians): 1.54%
South Asia: 1.19%
Europeans: 1.06%

(I have seen it claimed that the high Neanderthal percents for Oceanan populations (that is, Melanesians and their relatives,) could be a result of Denisovan DNA being incompletely distinguished from Neanderthal.)

Prufer et al, [pdf] 2017, report somewhat higher values:

East Asians: 2.3–2.6%
Europeans: 1.8–2.4%

While Lohse and Frantz estimate an even higher rate of between 3.4–7.3% for Europeans and East Asians. (They found 5.9% in their Chinese sample and 5.3% in their European.)

The Mixe and Karitiana people of Brazil have 0.2% Denisovan (source); other estimates for the amount of Denisovan DNA in Native populations are much lower–ie, 0.05%.

I found an older paper by Prufer et al with estimates for three Hispanic populations, but doesn’t clarify if they have Native American ancestry:

Population Individuals Neandertal ancestry (%)
Autosomes X
Europeans CEU–Euros from Utah 85 1.17±0.08 0.21±0.17
FIN–Finnish 93 1.20±0.07 0.19±0.14
GBR–British 89 1.15±0.08 0.20±0.15
IBS–Spain 14 1.07±0.06 0.23±0.18
TSI–Tuscan 98 1.11±0.07 0.25±0.20

East Asians CHB–Han Chinese Beijing 97 1.40±0.08 0.30±0.21
CHS–Han Chinese South 100 1.37±0.08 0.27±0.21
JPT–Japan, Tokyo 89 1.38±0.10 0.26±0.21

Americans CLM: Colombians from Medellin 60 1.14±0.12 0.22±0.16
MXL: Mexicans from LA 66 1.22±0.09 0.21±0.15
PUR: Puerto Ricans 55 1.05±0.12 0.20±0.15

Africans LWK: Luhya in Webuye, Kenya 97 0.08±0.02 0.04±0.07
ASW: African Americans South West US 61 0.34±0.22 0.07±0.11

Since the paper is older, all of its estimates are lower than current estimates, because we now have more Neanderthal DNA to compare against. However, you can still see the general trend.

The difference between “autosomes” and “X” highlighted here is that (IIRC) autosomes includes all chromosomes except the XY pair, and X is the X from that pair. They’re breaking them up this way because the X chromosome tends to have very little Neanderthal on it (and the Y even less), probably because Neanderthal DNA on these particular chromosomes was selected against.

Neanderthal DNA appears to have been selected for in areas that control hair and skin–people who had just left Africa were adapted to the African environment, and Neanderthal hair and skin traits helped them survive in colder, darker winters. We also see a lot of Neanderthal DNA influencing inflammation/immune response–these may have helped people fend off new diseases. But we see almost no Neanderthal (or Denisovan) DNA in areas of the genome that code for sperm, eggs, testes, ovaries, etc. These parts of people were probably already finely tuned to work together, didn’t need to change with the environment, and changing anything probably just made them less efficient–so Neanderthal (and Denisovan) DNA on the X and Y chromosomes has been purged from the Homo Sapiens gene pool.

North African Populations Carry Signature of Admixture with Neanderthals reports its data relative to the European average (which I believe is the CEU pop, 1.17%, so I’ll do the math for you to figure how much Neanderthal they have.)

Algeria 44.57% = 0.52% Neanderthal
Tunisia 100.16% = 1.172 N
Tunisia 138.13% = 1.6% N (This is an interesting population that has been highly endogamous and thus better reflects historical populations in the area.)
Egypt 58.45% = 0.68% N
Libya 56.36% = 0.66% N
Morroco North 69.17% = 0.81% N
Morocco South 17.90% = 0.21% N
Saharawi 50.90% = 0.6% N
Canary Island* 101.44% = 1.187% N
China Beijing 193.43% = 2.26 % N
China 195.41% = 2.29% N
Texas Indu Gupti 84.37% =0.987% N
Andalusia*118.66% = 1.39% N
Tuscan 94.90% = 1.11% N
Basque BASC 129.48% = 1.51% N
Galicia* GAL 115.86% = 1.36% N
Yoruba YRI  0.00% = 0% N
Luyha LWK  −14.89% = N

The authors note that they are not sure how the Luyha received a negative score–perhaps the presence of admixed DNA from yet another species is interfering with the results.

According to Wikipedia:

Denisovan DNA is most commonly found in Melanesians, Papulans, Aboriginal Australians and Aboriginal Filipinos, who all have similar amounts around 4-6%, indicating that they probably were all one group when their ancestors met the Denisovans. However, the similar-looking but historically quite isolated Onge people have no Denisovan–so they split off before the event.

In Papuans, Neanderthal DNA tends to be expressed in brain tissue, Denisovan in bones and other tissues.

Asians have a small amount of Denisovan DNA; Tibetans have a particular gene that lets them absorb oxygen effectively at high altitudes that they got from the Denisovans.

The Mende People of Sierra Leon may derive 13% of their DNA from an as-yet unknown hominin species (ancient DNA and bones do not preserve well in parts of Africa, so finding remains and identifying the species may be difficult.)

The Yoruba derive 8 or 9% of their DNA from the same hominin.

Masai have a small fraction of Neanderthal–since they are 30% non-African, probably about 0.35% of their genome–but you can read the paper yourself. 

Biaka Pygmies and Bushmen (San): 2% from an unknown archaic.

With more testing, better and more comprehensive numbers are sure to turn up.


So I still haven’t tracked down anything I consider a good source on the percent of Neanderthal DNA in people from particular regions of Europe, and I should note that at this point, I consider pretty much *everything* in the field of Neanderthal DNA in modern homo Sapiens quite speculative and not nearly has solid as people make it ought to be. But it’s still really interesting stuff, so here goes. I’ll start with a little background information in case you haven’t been following along:

1. Modern people tend to have a little bit of archaic DNA from non-human hominins. Pretty much everyone outside of Africa (including African Americans and even some Africans) has Neanderthal DNA; folks down in Papua New Guinea also have Denisovan DNA (I can’t remember if people outside of PNG have Denisovan.) People in Africa, I hear, have their own admixture from whoever else was living down in Africa. Oh, and people in Tibet have admixture from the hominins who used to live in Tibet before them.

2. Interestingly, the Neanderthal DNA does not appear to be concentrated where you’d think it would be. Sure, Neanderthals themselves hung out primarily in Europe and the Middle East, but American Indians seem to have the most Neanderthal DNA, followed by East Asians. (I consider these findings especially speculative.)

I have noticed in studying maps of different waves of human migrations that where one wave follows another, the first wave ends up way out in the fringes. Take, for example, the parts of Europe where people speak Celtic languages. Celtic languages were once widespread in Europe, covering France, Spain, Britain, Ireland, Switzerland, and a few other places, like possibly a town in Turkey. Today, Celtic languages are spoken in isolated pockets on the outer costs of Ireland, Scotland, and Wales. There are a few other isolated spots on the mainland coast where the languages persisted until fairly recently. Germanic peoples invaded all of these countries, taking them over and imposing their languages, until the Celtic languages are only left at the fringes, in places protected by their isolation.

So, likewise, perhaps Europe itself, being closer to Africa, had more invasions and so ended up with less Neanderthal DNA than the Americas, which were really bloody hard to invade.

There are a lot of other ways one group could get more Neanderthal DNA than another.

3. So how Neanderthal are you? On average, I believe non-African people have about 2.5% Neanderthal DNA.

For perspective, you have 32 great-great-great grandparents, who were probably born around 150 years ago, and 64 great-great-great-great grandparents, whom we’ll just neatly say were born around 170 years ago.

1/32 = 3.1%, so you’d expect to receive about 3.1% of your DNA from each of your 3xGreat Grandparents. 1/64 = 1.6%, so you get about 1.6% from each of your 4X Great Grandparents. So that’s where the average person’s Neanderthal contribution is–it’s like having one Neanderthal ancestor from the mid 1800s.

A lot of people claim to be ethnically Irish based on less.

4. I suspect Europe’s Neanderthal hotspot is Sardinia.

Speculations later.

Study: The genome-wide structure of the Jewish people

The genome-wide structure of the Jewish people

From a study published in Nature, 08 July 2010: “… we use high-density bead arrays to genotype individuals from 14 Jewish Diaspora communities and compare these patterns of genome-wide diversity with those from 69 Old World non-Jewish populations, of which 25 have not previously been reported. These samples were carefully chosen to provide comprehensive comparisons between Jewish and non-Jewish populations in the Diaspora, as well as with non-Jewish populations from the Middle East and north Africa. …”


My conclusion: Most Jewish communities are highly admixed with the local communities.