Missing Heritability Revisited
Following up on Sasha
Sasha Gusev had an excellent, informative post last week about all the new methods of estimating heritability and what they mean. Between families, within families, common variants, rare variants, whole genomes. I refer you to his post for all the technical parts. Here, I want to reflect for a second on what all this means for how we think about human behavioral differences.
When you get into a conversation about heritability, it is very easy to get sucked into a discussion of what the heritability of something is, as though the heritability of BMI or intelligence is some kind of evolved biological parameter that we are working to estimate as accurately as we can. But one of the most often-repeated and quickly-forgotten facts about heritability is that it is not a property of a trait, it is a property of a trait in a given population. That is repeated so often as to seem pedantic, but it is important. A heritability is a ratio of variances, and variances (especially human behavioral variances) depend on the time and place. We can ask, what is the variance of income in the US in 2025, but to ask in some general way what the variance of income is makes no sense. In a heritability coefficient we have to worry about this both in the numerator and the denominator, which just makes the indeterminacy worse.
To make his points about how different methods of estimating heritability change the values that result, Sasha makes the unusual decision to take the mean heritabilities across a wide range of phenotypes, broken down by method. I understand why he did this, and I learned a lot from all the sorting through the different ways to go about it, but it risks leaving the impression that there is some heritability floating out there that we will eventually nail down. There isn’t, not for individual phenotypes and certainly not for phenotypes in general.
The context for all of this is the missing heritability problem. Back in the twin study era, the heritabilities of most phenotypes were coming in shockingly high, at .6 if not higher. (Arthur Jensen corrected IQ scores for unreliability and arrived at .8) DNA-based molecular heritabilities are almost always much lower than that. Were the twin studies wrong? Are the molecular studies missing something important? Sasha’s post provides a clear path through all the modern work, showing that twin study estimates were almost certainly inflated, that rare variants probably don’t add much to the equation, and that the most up to date within family estimates of genetic transmission give the best guess as to how heritability (remember those caveats) are going to work out in the long run. Taking that funny mean across phenotypes it comes out around .3.
Like every non-zero estimate of heritability that has ever been suggested, this has led to celebration and gloating on hereditarian Twitter. Gusev admits that the heritability of IQ is .3! But first of all, that isn’t what he said— he said that the average phenotypic heritability was around .3. He states specifically that there still aren’t good estimates of within family heritability of IQ. Second, it is the hereditarians who are moving the goal posts here. Speaking for myself, I have never once suggested that the heritability of anything was going to turn out to be zero. Of course genes matter for how we differ in our intelligence. But it is a long way from Jensen’s .8 to .3, and I suspect that .3 is going to turn out to be on the high side anyway.
Another consideration is that what we are talking about here— differences in numerical coefficients— is only one leg, and probably the least important, of what Lucas Matthews and I have called the three legs of the missing heritability problem. The other two legs are the “prediction gap” and the “mechanism gap.” The prediction gap refers to the contrast between polygenic scores and what could be done by simply predicting phenotypes from the phenotypes of first degree relatives. Phenotypic parental IQ accounts for something like half the variance in offspring IQ. A properly controlled PGS accounts for 5% on a good day. (The embryo testing company Herasight has recently made big claims that they can do better, but I will wait until I see a peer-reviewed paper.) Sasha doesn’t get into polygenic scores in this post, but I think they are ultimately much more important than heritabilities, because they have real-world applications that heritability coefficients do not.
The third leg of the missing heritability problem is the mechanism gap. This refers to the fact that no matter what our estimate of heritability is, for human behavioral phenotypes we have essentially zero knowledge of how genetic differences produce behavioral differences. We can analyze variance, in Lewontin’s famous phrase, but we can’t analyze causes. All of the interesting within and between family work that Sasha catalogs is a step in the right direction, but the truth is there isn’t much progress toward actual mechanistic understanding. I personally doubt there ever will be (for complex human behavior), but that is another topic. The mechanism gap is crucial to my bet with Charles Murray, for example. Murray bet that by 2025 we would have IQ differences pretty much figured out, and that entails more than a reasonable estimate of a black-box heritability coefficient.
So where does all this leave us? For most human behavioral traits, genetic differences have a significant, non-ignorable effect on outcomes, but not the dominant, deterministic effects that have been anticipated by hereditarians since Galton. If you are conducting studies of how parental personalities are related to the personalities of their children, you can’t just correlate them and assume it is a matter of behavioral modeling. Try it in adoptive families and see what happens.
If that sounds disappointing coming from an anti-hereditarian like me, it shouldn’t be. I have always said it would be naive to expect the genetic effect on anything to be zero, and by now there is a deep pool of data showing that it never is. And the truth is, there aren’t many people writing these days who are willing to adopt a strictly zero-genetic-influence position. Jay Joseph is pretty much the only one I can think of.
On the other hand, our inability to define anything even resembling deterministic causal mechanisms underlying heritability coefficients places strict limitations on their application in the real world, or on our theoretical accounts of how human inequality (in the nonpejorative evolutionary sense) comes to be. If we ask, “if I had different genes, would I have a different IQ,” the answer is probably, yes. If we ask, “if I had these specific genes, how would my IQ be different?” (never mind why) the answer is almost entirely, we don’t know. We don’t know because it depends on a thousand other human contextual variables that we have no hope of controlling.


“Speaking for myself, I have never once suggested that the heritability of anything was going to turn out to be zero”
Look, I know you think that there is some smaller heritability for behavioral traits and that this is a sign of being reasonable and that anyone who suggests otherwise is an ideologue, but I have to say that your field has made some assumptions about heritability existing (whether large or small), but not zero for decades. This is a bad assumption, and frankly, bad science to start with and, in my view, has caused a lot of wasted time. All of your studies start with the assumption that traits are heritable (because of twin studies) and it’s a matter of to what extent. Studies should start with the null hypothesis and whether your results are better than the null (and statistically significant). However, the studies to date have never once tested against actual null traits. Instead, there is simply an implied null of zero. I’m sure you know that even a null trait (chop sticks, social security number, zip code, etc.) will not give a zero result. Or, if you think they will, show me the money. I’ve been making this point for 25 years. So if IQ and zip code and major depression all get similar results for “heritability,” what can you say about the results? Your field has an ideological bent and I would argue that, consciously or unconsciously, testing against the null is taboo. This has led to endless attempts to explain why results haven’t stood up to replication. This is why it took 15 years to see that candidate studies were giving spurious results and this is why GWAS won’t die, even though they have produced almost nothing of substance. There is no need to argue the point, really. If you want to show that Jay Joseph (and me) are wrong in suggesting the null, it is within your power to prove it. Otherwise, you are just spouting opinions.
I'm no expert, but let me repeat a point I've made before. Imagine a mom and dad who have their first child at 30 and their 5th child at 40. Won't their egg and sperm cells at age 40 be different in ways that are likely to affect the embryos? And won't they be different parents with their fifth child? And won't their parenting methods be different? And won't the fifth child be affected by the four siblings? (Perhaps an older sibling will become a surrogate parent.)So, isn't w/i family heritability hugely impacted by environment?