Authors
- Taegun Kim
- Jaeseung Song
- Jong Wha J Joo
Abstract
Cardiovascular disease is a leading cause of mortality and rising healthcare costs worldwide. Fortunately, the disease is preventable, and addressing risk factors can significantly reduce its effects. Over the past decade, risk prediction models have advanced significantly, notably through polygenic risk score analysis, which is often combined with clinical health information for prediction. However, most previous cardiovascular disease prediction studies based on polygenic risk scores have focused on a single specific disease or event, such as cardiac events. Given the complex nature of cardiovascular disease, which involves a combination of genetic and environmental factors, a comprehensive analysis of the disease prediction results is essential. In this study, we investigated the genetic and environmental factors contributing to cardiovascular disease using data from the Framingham Heart Study, a leading cardiovascular cohort. We compared the prediction performance of different methods across various scenarios and assessed performance with several evaluation metrics to identify the best-fitting model for six cardiovascular-related diseases. We also analyzed the feature importance of genetic and clinical variables, noting that different variables had varying effects on each disease. Our findings demonstrated the performance of prediction algorithms that use genetic and clinical factors to forecast cardiovascular disease and highlighted the importance of each feature in disease prediction. While models relying solely on polygenic risk scores showed relatively low prediction performance for some diseases, integrating genetic information with clinical data improved prediction performance in most cases. For certain diseases, particularly those known to be heritable, polygenic risk scores demonstrated predictive ability, suggesting that they may serve as standalone predictive tools.
We believe our study reveals the value of combining polygenic risk scores with clinical variables and expect that our thorough analysis can inform study designs tailored to specific diseases and research objectives.
Suggested Citation
Taegun Kim & Jaeseung Song & Jong Wha J Joo, 2026.
"Risk prediction for cardiovascular related diseases using PRS and EHR in the Framingham Heart Study,"
PLOS ONE, Public Library of Science, vol. 21(4), pages 1-17, April.
Handle:
RePEc:plo:pone00:0345914
DOI: 10.1371/journal.pone.0345914