Discussion about this post

User's avatar
Sasha Gusev's avatar

It is unfortunate that while this post makes more of an effort to engage with modern data than the typical response, it also makes several egregious errors that ultimately undermine that effort.

1. "Gusev claims that “IQ is much less heritable” than height, based on molecular genetic studies with vastly different statistical power to detect genetic effects on the two traits"

This is incorrect. The estimation of molecular heritability does not depend on the power to detect genetic effects at all, it is a single genome-wide parameter aggregated over all variants in the study regardless of whether they are significant.

I explained this point in my first post on IQ with a reference (https://theinfinitesimal.substack.com/p/no-intelligence-is-not-like-height): "But prediction accuracy depends on sample size, could the findings drastically change with more samples in the future? In fact, through the magic of statistics, we actually know that this claim will always to be true. We know this because we have estimated a parameter called molecular heritability, which tells us the upper bound on what a genetic predictor could ever achieve".

And then I explained it again in my second post responding to comments (https://theinfinitesimal.substack.com/i/148251755/isnt-gwas-heritability-only-quantifying-the-mechanisms-we-currently-understand): "Isn’t GWAS heritability only quantifying the mechanisms we currently understand? This is a typical misconception: that GWAS only quantifies the heritability from individual significant associations or genes we understand. In fact, GWAS heritability is defined as the phenotypic variance explained by all genetic variation that has been measured, whether it is significant or not.".

I can explain it a third time, I guess. The population-scale molecular heritability of IQ is 10-20% and the population-scale molecular heritability of height is 40-50%. These estimates come with very low uncertainty, and the fact that 40% is larger than 20% is well established. The inability of this post to grapple with the basic operation of these methods nor their basic estimates should raise red flags about the other claims.

2. "He singles out one galaxy brain paper, Bingley et al 2023, as evidence that heritability can be made very low and shared environmental influence very high. This paper gets this result by assuming equal avuncular shared environments, in other words, by making the clearly wrong assumption that all differences between the families of cousins are environmental."

This is incorrect. Bingley et al. propose a quantitative method to directly test the equal environment assumption, something the author should be pleased to see. Their method assumes equal shared environments between twins and their nieces/nephews as between twin spouses and their nieces/nephews ("To avoid adding parameters, we assume that the degree of environmental sharing between twins and their niblings (Equation 15) is the same as that between twins’ spouses and niblings (Equation 16)"). This does not imply that "all differences between the families of cousins are environmental", it is a constraint *specifically* on the shared environments between these relationship classes. This constraint is clearly much less restrictive than the assumption from classic twin models that MZs and DZs have identical shared environments, which we know they do not. In addition to the Bingley paper, I describe a number of other studies that evaluated the EEA assumption with other extended family designs and found it to be violated (https://theinfinitesimal.substack.com/p/twin-heritability-models-can-tell). Again the author appears not to understand the model but it is confident that the model is wrong because it produces uncomfortable results.

3. "In the first, [Gusev] keeps making the argument that if a molecular method (RDR) finds lower heritability estimates then the quantitative estimate then the latter is wrong, which, as we saw, is a fallacy."

The author first presents RDR as "quantitative genetics with extra frills" but then later discards RDR because it is a molecular method. Well, which is it? What the fallacy is in reporting the RDR results is also never stated. It is odd to simply discount the strongest evidence against one's position as "a fallacy", typically this is the evidence one would addresses first and in most detail. I'll remind readers that the paper that introduced the RDR method is titled "Relatedness disequilibrium regression estimates heritability without environmental bias" (https://pubmed.ncbi.nlm.nih.gov/30104764/) and the entire motivation of the method was to estimate complete additive heritability (using Identical By Descent relatedness estimates) without the environmental assumptions of classic twin models. When applied to a large number of traits, the average RDR heritability was 32%, compared to an average twin heritability of 61% (and a SNP-based heritability of 26%); results that I also summarized in my post, which also goes into how RDR works (https://theinfinitesimal.substack.com/i/148251755/okay-but-why-do-you-think-twin-estimates-so-much-higher). Since RDR and twin estimates are biased to the same extent by assortative mating, this gap is explained by environmental bias in twin studies. So the quantitative genetics (with extra frills) models the author likes also tell us that twin estimates are 200% inflated on average, and that SNP/GWAS estimates are about 80% of the total heritability on average. These relative estimates are consistent with a variety of other studies and theory showing that common variants (i.e. those in GWAS) explain the majority of total heritability, so everything hangs together nicely.

---

TLDR: This post misunderstands how molecular genetic studies actually estimate heritability, then misunderstands how quantitative extended twin models have probed the equal environment assumption and found it violated, then finally does a sleight of hand by pretending that RDR is not a quantitative genetics method because the findings are inconvenient.

Finally, I do appreciate that this post finally acknowledges that behavioral GWAS were greatly inflated by population stratification and, for hereditarians, a disappointment. But I don't think we can simply let hereditarians off the hook, because they were the ones who bet hard on GWAS methods to begin with. Charles Murray predicted for over a decade that molecular studies were just about to prove him right (https://x.com/evopsychgoogle/status/1188441459860443137); Razib Khan argued that we just need to wait a little bit longer for GWAS to fully explain the genetics of intelligence (https://undark.org/2017/02/28/race-science-razib-khan-racism/); Steve Hsu predicted that GWAS would explain 60% of the total variance of intelligence by 2016 (https://x.com/SashaGusevPosts/status/1613724821480677378); Emil Kirkegaard has been running bogus polygenic score comparisons pretending stratification is a non-issue and getting the results completely backwards (https://theinfinitesimal.substack.com/p/how-population-stratification-led). Even setting aside motivated hereditarians, here's what internet celebrity Gwern said about the first GWAS of educational attainment: "Reading it was a revelation. The debate was over: behavioral genetics was right, and the critics were wrong. Kamin, Gould, Lewontin, Shalizi, the whole sorry pack—annihilated. IQ was indeed highly heritable, polygenic, and GWASes would only get better for it" (https://www.lesswrong.com/posts/sRchPdp6mCqY2ekJX/what-progress-gwern-s-10-year-retrospective). As we now know, this is not what actually happened.

And this ultimately gets at my issue with the modern hereditarian movement, my point is not to attack their *theories*, everyone is entitled to speculate about genetic processes and test out their speculation. What I am attacking is the pattern of overfitting and p-hacking in their *methods*. When GWAS provided a cudgel to beat up on Lewontin and Shalizi, then suddenly it was "a revelation" that will only get better and better. Now that GWAS falsifies these claims, it is deemed "something of an embarrassment". When the RDR method is first described, it is just "quantitative genetics with extra frills" (laudatory) but when it yields uncomfortable results, it is now "molecular genetics" (derogatory) and using it is "a fallacy"". Methods and findings are not used to understand how the world works, but to stitch together a narrative that supports the author's predetermined conclusion.

Expand full comment
Sebastian Jensen's avatar

Nice post. But I'm a bit bummed because I have been writing a similar piece for some time...

I guess I have to make it better now.

Expand full comment
61 more comments...

No posts