“A Saturated Map of Common Genetic Variants Associated With Human Height from 5.4 Million Individuals of Diverse Ancestries”, Loïc Yengo, Sailaja Vedantam, Eirini Marouli, Julia Sidorenko, Eric Bartell, Saori Sakaue, Marielisa Graff, Anders U. Eliasen, Yunxuan Jiang, Sridharan Raghavan, Jenkai Miao, Joshua D. Arias, Ronen E. Mukamel, Cassandra N. Spracklen, Xianyong Yin, Shyh-Huei Chen, Teresa Ferreira, Yingjie Ji, Tugce Karedera, Kreete Lull, Kuang Lin, Deborah E. Malden, Carolina Medina-Gomez, Moara Machado, Amy Moore, Sina Rueger, GIANT Consortium, 23andMe, V. A. Million Veteran Program, DiscovEHR (DiscovEHR, MyCode Community Health Initiative), eMERGE (Electronic Medical Records, Genomics Network), Lifelines Cohort Study, Regeneron Genetics Center, The PRACTICAL Consortium, Understanding Society Scientific Group, Daniel I. Chasman, Yoon Shin Cho, Iris M. Heid, Mark I. McCarthy, Maggie C. Y. Ng, Christopher J. O’Donnell, Fernando Rivadeneira, Unnur Thorsteinsdottir, Yan V. Sun, E. Shyong Thai, Michael Boehnke, Panos Deloukas, Anne E. Justice, Cecilia M. Lindgren, Ruth Loos, Karen L. Mohlke, Kari E. North, Kari Stefansson, Robin G. Walters, Thomas W. Winkler, Kristin L. Young, Po-Ru Loh, Jian Yang, Tõnu Esko, Themistocles L. Assimes, Adam Auton, Gonçalo Abecasis, Cristen Jennifer Willer, Adam E. Locke, Sonja I. Berndt, Guillaume Lettre, Timothy Frayling, Yukinori Okada, Andrew R. Wood, Peter M. Visscher, Joel N. Hirschhorn2022-01-10 (; backlinks; similar)⁠:

Common SNPs are predicted to collectively explain 40–50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes.

Here we show, using GWAS data from 5.4 million individuals of diverse ancestries, that:

12,111 independent SNPs that are statistically-significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a median size of ~90 kb, covering ~21% of the genome. The density of independent associations varies across the genome and the regions of elevated density are enriched for biologically relevant genes.

In out-of-sample estimation and prediction, the 12,111 SNPs account for 40% of phenotypic variance in European ancestry populations but only ~10%–20% in other ancestries. Effect sizes, associated regions, and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely explained by linkage disequilibrium and allele frequency differences within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than needed to implicate causal genes and variants.

Overall, this study, the largest GWAS to date, provides an unprecedented saturated map of specific genomic regions containing the vast majority of common height-associated variants.