IDEAS home Printed from https://ideas.repec.org/a/bpj/causin/v6y2018i1p27n2.html
   My bibliography  Save this article

Detecting Confounding in Multivariate Linear Models via Spectral Analysis

Author

Listed:
  • Janzing Dominik

    (Deaprtment ‘Empirical Inference’,Max Planck Institute for Intelligent Systems,Spemannstr. 36, 70569Tübingen,Germany)

  • Schölkopf Bernhard

    (Deaprtment ‘Empirical Inference’,Max Planck Institute for Intelligent Systems,Tübingen,Germany)

Abstract

We study a model where one target variable Y$Y$ is correlated with a vector X:=(X1,…,Xd)$\textbf{X}:=(X_1,\dots,X_d)$ of predictor variables being potential causes of Y$Y$. We describe a method that infers to what extent the statistical dependences between X$\textbf{X}$ and Y$Y$ are due to the influence of X$\textbf{X}$ on Y$Y$ and to what extent due to a hidden common cause (confounder) of X$\textbf{X}$ and Y$Y$. The method relies on concentration of measure results for large dimensions d$d$ and an independence assumption stating that, in the absence of confounding, the vector of regression coefficients describing the influence of each X$\textbf{X}$ on Y$Y$ typically has ‘generic orientation’ relative to the eigenspaces of the covariance matrix of X$\textbf{X}$. For the special case of a scalar confounder we show that confounding typically spoils this generic orientation in a characteristic way that can be used to quantitatively estimate the amount of confounding (subject to our idealized model assumptions).

Suggested Citation

  • Janzing Dominik & Schölkopf Bernhard, 2018. "Detecting Confounding in Multivariate Linear Models via Spectral Analysis," Journal of Causal Inference, De Gruyter, vol. 6(1), pages 1-27, March.
  • Handle: RePEc:bpj:causin:v:6:y:2018:i:1:p:27:n:2
    DOI: 10.1515/jci-2017-0013
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/jci-2017-0013
    Download Restriction: no

    File URL: https://libkey.io/10.1515/jci-2017-0013?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Karlin, Samuel & Rinott, Yosef, 1980. "Classes of orderings of measures and related correlation inequalities. I. Multivariate totally positive distributions," Journal of Multivariate Analysis, Elsevier, vol. 10(4), pages 467-498, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Khaledi, Baha-Eldin & Shaked, Moshe, 2010. "Stochastic comparisons of multivariate mixtures," Journal of Multivariate Analysis, Elsevier, vol. 101(10), pages 2486-2498, November.
    2. Colangelo Antonio, 2005. "Multivariate hazard orderings of discrete random vectors," Economics and Quantitative Methods qf05010, Department of Economics, University of Insubria.
    3. Chi, Chang Koo & Murto, Pauli & Valimaki, Juuso, 2017. "All-Pay Auctions with Affiliated Values," MPRA Paper 80799, University Library of Munich, Germany.
    4. Arnaud Costinot & Jonathan Vogel, 2010. "Matching and Inequality in the World Economy," Journal of Political Economy, University of Chicago Press, vol. 118(4), pages 747-786, August.
    5. Rinott, Yosef & Scarsini, Marco, 2006. "Total positivity order and the normal distribution," Journal of Multivariate Analysis, Elsevier, vol. 97(5), pages 1251-1261, May.
    6. repec:dau:papers:123456789/698 is not listed on IDEAS
    7. Vikram Krishnamurthy & Udit Pareek, 2015. "Myopic Bounds for Optimal Policy of POMDPs: An Extension of Lovejoy’s Structural Results," Operations Research, INFORMS, vol. 63(2), pages 428-434, April.
    8. Ilse Lindenlaub & Fabien Postel-Vinay, 2023. "Multidimensional Sorting under Random Search," Journal of Political Economy, University of Chicago Press, vol. 131(12), pages 3497-3539.
    9. Baha-Eldin Khaledi & Subhash Kochar, 2001. "Dependence Properties of Multivariate Mixture Distributions and Their Applications," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 53(3), pages 620-630, September.
    10. Müller, Alfred & Scarsini, Marco, 2005. "Archimedean copulæ and positive dependence," Journal of Multivariate Analysis, Elsevier, vol. 93(2), pages 434-445, April.
    11. Arnaud Costinot, 2009. "An Elementary Theory of Comparative Advantage," Econometrica, Econometric Society, vol. 77(4), pages 1165-1192, July.
    12. Barmalzan, Ghobad & Akrami, Abbas & Balakrishnan, Narayanaswamy, 2020. "Stochastic comparisons of the smallest and largest claim amounts with location-scale claim severities," Insurance: Mathematics and Economics, Elsevier, vol. 93(C), pages 341-352.
    13. Junbo Son & Yeongin Kim & Shiyu Zhou, 2022. "Alerting patients via health information system considering trust-dependent patient adherence," Information Technology and Management, Springer, vol. 23(4), pages 245-269, December.
    14. Jian Yang, 2023. "A Partial Order for Strictly Positive Coalitional Games and a Link from Risk Aversion to Cooperation," Papers 2304.10652, arXiv.org.
    15. Belzunce, Félix & Mercader, José A. & Ruiz, José M., 2003. "Multivariate aging properties of epoch times of nonhomogeneous processes," Journal of Multivariate Analysis, Elsevier, vol. 84(2), pages 335-350, February.
    16. Battey, H.S. & Cox, D.R., 2022. "Some aspects of non-standard multivariate analysis," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    17. Bhattacharya, Bhaskar, 2006. "Maximum entropy characterizations of the multivariate Liouville distributions," Journal of Multivariate Analysis, Elsevier, vol. 97(6), pages 1272-1283, July.
    18. Ligtvoet, R., 2015. "A test for using the sum score to obtain a stochastic ordering of subjects," Journal of Multivariate Analysis, Elsevier, vol. 133(C), pages 136-139.
    19. Huang, Wen-Tao & Xu, Bing, 2002. "Some maximal inequalities and complete convergences of negatively associated random sequences," Statistics & Probability Letters, Elsevier, vol. 57(2), pages 183-191, April.
    20. Eden, Maya, 2012. "Financial distortions and the distribution of global volatility," Policy Research Working Paper Series 5929, The World Bank.
    21. Laniado, Henry & Lillo, Rosa E. & Pellerey, Franco & Romo, Juan, 2012. "Portfolio selection through an extremality stochastic order," Insurance: Mathematics and Economics, Elsevier, vol. 51(1), pages 1-9.

    More about this item

    Keywords

    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:causin:v:6:y:2018:i:1:p:27:n:2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyterbrill.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.