Author
Listed:
- Zhaotong Lin
- Wei Pan
- Haoran Xue
Abstract
Inferring a causal network among multiple traits is essential for unraveling complex biological relationships and informing interventions. Mendelian randomization (MR) has emerged as a powerful tool for causal inference, utilizing genetic variants as instrumental variables (IVs) to estimate causal effects. However, when the directions of causal relationships among traits are unknown, reconstructing the underlying causal network becomes challenging. In particular, the presence of cycles or feedback loops, which are common in biological systems, poses additional challenges for causal network inference, and remains largely under-studied with standard MR approaches and existing IV-based network inference methods. To address these issues, we introduce MR2G, a new statistical framework that enables robust inference of causal networks, including those with cycles, directly from GWAS summary statistics. MR2G is built on a formally defined recursive causal graph model that rigorously links direct causal effects to (univariable) MR estimands. It recovers a biologically interpretable causal network from pairwise MR effect estimates, while incorporating a network-informed IV screening strategy to reduce pleiotropic bias and improve robustness. Through realistic simulations, MR2G demonstrates superior accuracy and robustness in recovering complex causal structures, including those involving feedback loops. We apply MR2G to GWAS summary statistics for six complex diseases and nine cardiometabolic risk factors. MR2G not only recovers well-established causal pathways but also uncovers multiple feedback relationships, highlighting its utility in disentangling complex and biologically plausible causal networks from large-scale genetic data.Author summary: Unraveling how complex traits causally influence each others is essential for understanding disease mechanisms and guiding effective interventions. However, inferring such a causal network from observational data remains challenging due to hidden confounding, and most existing methods assume that the underlying network has no feedback loops, which is often unrealistic in biological systems. In this work, we developed MR2G, a statistical framework that infers a causal network of multiple traits directly from GWAS summary statistics, while allowing for cycles and feedback loops. Our approach is based on a recursive causal model that formally links direct causal effects to Mendelian randomization estimands, and incorporates a novel instrumental variable screening strategy to reduce bias from horizontal pleiotropy. We applied MR2G to 15 complex traits including cardiometabolic risk factors and diseases, and identified biologically supported causal pathways, such as the direct effects of blood pressure and LDL cholesterol on coronary artery disease, and a bidirectional relationship between atrial fibrillation and coronary artery disease.
Suggested Citation
Zhaotong Lin & Wei Pan & Haoran Xue, 2026.
"MR2G: A novel framework for causal network inference using GWAS summary data,"
PLOS Genetics, Public Library of Science, vol. 22(5), pages 1-21, May.
Handle:
RePEc:plo:pgen00:1012144
DOI: 10.1371/journal.pgen.1012144
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pgen00:1012144. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosgenetics (email available below). General contact details of provider: https://journals.plos.org/plosgenetics/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.