We present 4,157 whole-genome sequences (Korea4K) coupled with 107 health check-up parameters as the largest whole genomic resource of Koreans.
Korea4K provides 45,537,252 variants and encompasses most of the common and rare variants in Koreans. We identified 1,356 new geno-phenotype associations which were not found by the previous Korea1K dataset. Phenomics analyses revealed 24 genetic correlations, 1,131 pleiotropic variants, and 127 causal relationships from Mendelian Randomization. Moreover, the Korea4K imputation reference panel showed a superior imputation performance to Korea1K.
Collectively, Korea4K provides the most extensive genomic and phenomic data resources for discovering clinically relevant novel genome-phenome associations in Koreans.
Figure 4: Genetic correlation and Phenotypic correlation in Korea4K.
(a) Genetic heritability of 27 traits that showed at least a marginal statistical-significance. (b) Genetic correlation and phenotypic correlation between the 27 traits. The upper triangle indicates phenotypic correlation coefficient (Pearson’s r) and lower triangle indicates genetic correlation coefficient (rg).