TY - JOUR
T1 - Assessment of polygenic risk score performance in East Asian populations for ten common diseases
AU - Jung, Hae Un
AU - Jung, Hyein
AU - Baek, Eun Ju
AU - Kang, Ji One
AU - Kwon, Shin Young
AU - You, Jaeyoon
AU - Lim, Ji Eun
AU - Oh, Bermseok
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - Polygenic risk score (PRS) uses genetic variants to assess disease susceptibility. While PRS performance is well-studied in Europeans, its accuracy in East Asians is less explored. This study evaluated PRSs for ten diseases in the Health Examinees (HEXA) cohort (n = 55,870) in Korea. Single-population PRSs were constructed using PRS-CS, LDpred2, and Lassosum based on East Asian GWAS summary statistics (sample sizes: 51,442–341,204), while cross-population PRSs were developed using PRS-CSx and CT-SLEB by integrating European and East Asian GWAS data. PRS-CS consistently outperformed other single-population methods across key metrics, including the likelihood ratio test (LRT), odds ratio per standard deviation (perSD OR), net reclassification improvement (NRI), and area under the curve (AUC). Cross-population PRSs further improved predictive performance, with average increases of 1.08-fold (LRT), 1.07-fold (perSD OR), and 1.15-fold (NRI) across seven diseases with statistical significance, and a 1.01-fold improvement in AUC. Differences in R² between single- and cross-population PRSs were statistically significant for five diseases, showing an average increase of 1.13%. Cross-population PRSs achieved 87.8% of the predictive performance observed in European PRSs. These findings highlight the benefits of integrating European GWAS data while underscoring the need for larger East Asian datasets to improve prediction accuracy.
AB - Polygenic risk score (PRS) uses genetic variants to assess disease susceptibility. While PRS performance is well-studied in Europeans, its accuracy in East Asians is less explored. This study evaluated PRSs for ten diseases in the Health Examinees (HEXA) cohort (n = 55,870) in Korea. Single-population PRSs were constructed using PRS-CS, LDpred2, and Lassosum based on East Asian GWAS summary statistics (sample sizes: 51,442–341,204), while cross-population PRSs were developed using PRS-CSx and CT-SLEB by integrating European and East Asian GWAS data. PRS-CS consistently outperformed other single-population methods across key metrics, including the likelihood ratio test (LRT), odds ratio per standard deviation (perSD OR), net reclassification improvement (NRI), and area under the curve (AUC). Cross-population PRSs further improved predictive performance, with average increases of 1.08-fold (LRT), 1.07-fold (perSD OR), and 1.15-fold (NRI) across seven diseases with statistical significance, and a 1.01-fold improvement in AUC. Differences in R² between single- and cross-population PRSs were statistically significant for five diseases, showing an average increase of 1.13%. Cross-population PRSs achieved 87.8% of the predictive performance observed in European PRSs. These findings highlight the benefits of integrating European GWAS data while underscoring the need for larger East Asian datasets to improve prediction accuracy.
UR - http://www.scopus.com/inward/record.url?scp=86000058188&partnerID=8YFLogxK
U2 - 10.1038/s42003-025-07767-9
DO - 10.1038/s42003-025-07767-9
M3 - Article
C2 - 40045046
AN - SCOPUS:86000058188
SN - 2399-3642
VL - 8
JO - Communications Biology
JF - Communications Biology
IS - 1
M1 - 374
ER -