IDEAS home Printed from https://ideas.repec.org/a/plo/pgen00/1010823.html

Relaxing parametric assumptions for non-linear Mendelian randomization using a doubly-ranked stratification method

Author

Listed:
  • Haodong Tian
  • Amy M Mason
  • Cunhao Liu
  • Stephen Burgess

Abstract

Non-linear Mendelian randomization is an extension to standard Mendelian randomization to explore the shape of the causal relationship between an exposure and outcome using an instrumental variable. A stratification approach to non-linear Mendelian randomization divides the population into strata and calculates separate instrumental variable estimates in each stratum. However, the standard implementation of stratification, referred to as the residual method, relies on strong parametric assumptions of linearity and homogeneity between the instrument and the exposure to form the strata. If these stratification assumptions are violated, the instrumental variable assumptions may be violated in the strata even if they are satisfied in the population, resulting in misleading estimates. We propose a new stratification method, referred to as the doubly-ranked method, that does not require strict parametric assumptions to create strata with different average levels of the exposure such that the instrumental variable assumptions are satisfied within the strata. Our simulation study indicates that the doubly-ranked method can obtain unbiased stratum-specific estimates and appropriate coverage rates even when the effect of the instrument on the exposure is non-linear or heterogeneous. Moreover, it can also provide unbiased estimates when the exposure is coarsened (that is, rounded, binned into categories, or truncated), a scenario that is common in applied practice and leads to substantial bias in the residual method. We applied the proposed doubly-ranked method to investigate the effect of alcohol intake on systolic blood pressure, and found evidence of a positive effect of alcohol intake, particularly at higher levels of alcohol consumption.Author summary: Various exposures, such as alcohol consumption, may exhibit different effect sizes on health outcomes at different exposure levels. Studying the non-linear shape of these effects can provide valuable insights to predict the impact of interventions for different individuals. Mendelian randomization is an epidemiological approach to make causal inferences from observational data. It uses genetic variants to divide the population into subgroups which behave similarly to arms of a randomized trial. The current residual stratification method most often implemented for non-linear Mendelian randomization relies on strong parametric assumptions and can yield misleading results when these assumptions are violated. We propose a new stratification method that relaxes these assumptions. Our new doubly-ranked method demonstrates superior performance over the residual method across a wide range of simulation scenarios. Furthermore, in cases where the exposure is coarsened (for example, its value is rounded, say to the nearest whole number), the doubly-ranked method achieves good results while the residual method fails to handle such situations adequately. The doubly-ranked method can test the assumptions underlying the residual method, thereby assessing the validity of previously published results. We advocate for the doubly-ranked method to be used in preference to the residual stratification method for non-linear Mendelian randomization.

Suggested Citation

  • Haodong Tian & Amy M Mason & Cunhao Liu & Stephen Burgess, 2023. "Relaxing parametric assumptions for non-linear Mendelian randomization using a doubly-ranked stratification method," PLOS Genetics, Public Library of Science, vol. 19(6), pages 1-22, June.
  • Handle: RePEc:plo:pgen00:1010823
    DOI: 10.1371/journal.pgen.1010823
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1010823
    Download Restriction: no

    File URL: https://journals.plos.org/plosgenetics/article/file?id=10.1371/journal.pgen.1010823&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pgen.1010823?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Magne Mogstad & Matthew Wiswall, 2009. "How Linear Models Can Mask Non-Linear Causal Relationships. An Application to Family Size and Children's Education," Discussion Papers 586, Statistics Norway, Research Department.
    2. Amemiya, Takeshi, 1974. "The nonlinear two-stage least-squares estimator," Journal of Econometrics, Elsevier, vol. 2(2), pages 105-110, July.
    3. Joel L. Horowitz, 2011. "Applied Nonparametric Instrumental Variables Estimation," Econometrica, Econometric Society, vol. 79(2), pages 347-394, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sara Pedron & Xiao Tan & Juliane Maushagen & Anna-Janina Stephan & Jacob Burns & Eleanor Sanderson & Kaitlin Wade & Michael Laxy, 2026. "Estimating the causal effect of cardiometabolic conditions on socioeconomic and healthcare outcomes: a scoping review of Mendelian randomization studies," Health Economics Review, Springer, vol. 16(1), pages 1-21, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shoya Ishimaru, 2024. "Empirical Decomposition of the IV-OLS Gap with Heterogeneous and Nonlinear Effects," The Review of Economics and Statistics, MIT Press, vol. 106(2), pages 505-520, March.
    2. Poterba, James M. & Summers, Lawrence H., 1983. "Dividend taxes, corporate investment, and `Q'," Journal of Public Economics, Elsevier, vol. 22(2), pages 135-167, November.
    3. Kadilli, Anjeza & Krishnakumar, Jaya, 2022. "Smooth Transition Simultaneous Equation Models," Journal of Economic Dynamics and Control, Elsevier, vol. 145(C).
    4. Hassan, Rashid M. & D'Silva, Brian & Hallam, A., 1989. "Normative Supply Response Analysis under Production Uncertainty: Irrigated Multicrop Farming Sector of Sudan," 1989 Occasional Paper Series No. 5 197677, International Association of Agricultural Economists.
    5. Ben Deaner, 2025. "The Trade-Off between Flexibility and Robustness in Instrumental Variables Analysis," American Economic Review, American Economic Association, vol. 115(11), pages 3975-3998, November.
    6. Ben Deaner, 2018. "Proxy Controls and Panel Data," Papers 1810.00283, arXiv.org, revised Nov 2023.
    7. Krief, Jerome M., 2017. "Direct instrumental nonparametric estimation of inverse regression functions," Journal of Econometrics, Elsevier, vol. 201(1), pages 95-107.
    8. Haoming Liu, 2014. "The quality–quantity trade-off: evidence from the relaxation of China’s one-child policy," Journal of Population Economics, Springer;European Society for Population Economics, vol. 27(2), pages 565-602, April.
    9. Ravallion, Martin & Datt, Gaurav, 1995. "Growth and poverty in rural India," Policy Research Working Paper Series 1405, The World Bank.
    10. Christiane Hinerasky, 2014. "Advances in Training Evaluation - Psychological, Educational, Economic, and Econometric Perspectives on the Kirkpatrick Model," Working Papers Dissertations 14, Paderborn University, Faculty of Business Administration and Economics.
    11. Atella, Vincenzo & Deb, Partha, 2008. "Are primary care physicians, public and private sector specialists substitutes or complements? Evidence from a simultaneous equations model for count data," Journal of Health Economics, Elsevier, vol. 27(3), pages 770-785, May.
    12. Donald, Stephen G. & Imbens, Guido W. & Newey, Whitney K., 2003. "Empirical likelihood estimation and consistent tests with conditional moment restrictions," Journal of Econometrics, Elsevier, vol. 117(1), pages 55-93, November.
    13. Irene Botosaru & Chris Muris & Senay Sokullu, 2022. "Time-Varying Linear Transformation Models with Fixed Effects and Endogeneity for Short Panels," Department of Economics Working Papers 2022-01, McMaster University.
    14. Sara Cools & Rannveig Kaldager Hart, 2015. "The effect of childhood family size on fertility in adulthood. New evidence form IV estimation," Discussion Papers 802, Statistics Norway, Research Department.
    15. Degl’Innocenti, Marta & Matousek, Roman & Tzeremes, Nickolaos G., 2019. "The interconnections of academic research and universities’ “third mission”: Evidence from the UK," Research Policy, Elsevier, vol. 48(9), pages 1-1.
    16. Guido W. Imbens, 2020. "Potential Outcome and Directed Acyclic Graph Approaches to Causality: Relevance for Empirical Practice in Economics," Journal of Economic Literature, American Economic Association, vol. 58(4), pages 1129-1179, December.
    17. Xiaohong Chen & Timothy Christensen, 2013. "Optimal Sup-norm Rates, Adaptivity and Inference in Nonparametric Instrumental Variables Estimation," Cowles Foundation Discussion Papers 1923R, Cowles Foundation for Research in Economics, Yale University, revised Apr 2015.
    18. Bakhitov, Edvard, 2025. "On machine learning instrumental variable estimators," Economics Letters, Elsevier, vol. 256(C).
    19. Denis Chetverikov & Dongwoo Kim & Daniel Wilhelm, 2018. "Nonparametric instrumental-variable estimation," Stata Journal, StataCorp LLC, vol. 18(4), pages 937-950, December.
    20. Lundborg, Petter & Nilsson, Anton & Rooth, Dan-Olof, 2012. "Parental Education and Offspring Outcomes: Evidence from the Swedish Compulsory Schooling Reform," IZA Discussion Papers 6570, IZA Network @ LISER.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pgen00:1010823. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosgenetics (email available below). General contact details of provider: https://journals.plos.org/plosgenetics/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.