IDEAS home Printed from https://ideas.repec.org/p/cwl/cwldpp/2497.html

On Local Overidentification and Efficiency Gains in Modern Causal Inference and Data Combination

Author

Listed:
  • Xiaohong Chen

    (Yale University)

  • Haitian Xie

    (Yale University)

Abstract

This paper studies nonparametric local (over-)identification and the semiparametric efficiency in modern causal frameworks. We develop a unified approach that begins by translating structural models with latent variables into their induced statistical models of observables and then analyzes local overidentification through conditional moment restrictions. We apply this approach to three popular classes of causal models: (1) the general treatment model under unconfoundedness; (2) the negative control model, and (3) the long-term causal inference model under unobserved confounding. The first model yields a locally just-identified statistical model, implying that all regular asymptotically linear estimators of the treatment effect have the same asymptotic variance, which equals the (trivial) semiparametric efficient variance bound. In contrast, the latter two models involve nonparametric endogeneity and are naturally locally overidentified; consequently, some doubly robust orthogonal moment estimators of the average treatment effect are inefficient. Whereas existing work typically imposes strong conditions to restore local just-identification to justify the efficiency of their doubly robust orthogonal moment estimators, we characterize the semiparametric efficient variance bounds, along with efficient estimators, for the (locally) overidentified models (2) and (3). A small real data application, along with a simulation study, illustrates the semiparametric efficiency gains in model (3).

Suggested Citation

  • Xiaohong Chen & Haitian Xie, 2026. "On Local Overidentification and Efficiency Gains in Modern Causal Inference and Data Combination," Cowles Foundation Discussion Papers 2497, Cowles Foundation for Research in Economics, Yale University.
  • Handle: RePEc:cwl:cwldpp:2497
    as

    Download full text from publisher

    File URL: https://cowles.yale.edu/sites/default/files/2026-03/d2497.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Daniel Ackerberg & Xiaohong Chen & Jinyong Hahn & Zhipeng Liao, 2014. "Asymptotic Efficiency of Semiparametric Two-step GMM," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 81(3), pages 919-943.
    2. Xiaohong Chen & Pedro H. C. Sant'Anna & Haitian Xie, 2025. "Efficient Difference-in-Differences and Event Study Estimators," Papers 2506.17729, arXiv.org.
    3. Sokbae Lee & Ryo Okui & Yoon†Jae Whang, 2017. "Doubly robust uniform confidence band for the conditional average treatment effect function," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 32(7), pages 1207-1225, November.
    4. Minsu Chang & Sokbae Lee & Yoon‐Jae Whang, 2015. "Nonparametric tests of conditional treatment effects with an application to single‐sex schooling on academic achievements," Econometrics Journal, Royal Economic Society, vol. 18(3), pages 307-346, October.
    5. Sergio Firpo, 2007. "Efficient Semiparametric Estimation of Quantile Treatment Effects," Econometrica, Econometric Society, vol. 75(1), pages 259-276, January.
    6. Xiaohong Chen & Demian Pouzo, 2015. "Sieve Wald and QLR Inferences on Semi/Nonparametric Conditional Moment Models," Econometrica, Econometric Society, vol. 83(3), pages 1013-1079, May.
    7. Xiaohong Chen & Andres Santos, 2018. "Overidentification in Regular Models," Econometrica, Econometric Society, vol. 86(5), pages 1771-1817, September.
    8. Cattaneo, Matias D., 2010. "Efficient semiparametric estimation of multi-valued treatment effects under ignorability," Journal of Econometrics, Elsevier, vol. 155(2), pages 138-154, April.
    9. Guido Imbens & Nathan Kallus & Xiaojie Mao & Yuhao Wang, 2022. "Long-term Causal Inference Under Persistent Confounding via Data Combination," Papers 2202.07234, arXiv.org, revised Aug 2024.
    10. Jinyong Hahn & Guido Kuersteiner & Andres Santos & Wavid Willigrod, 2024. "Overidentification in Shift-Share Designs," Papers 2404.17049, arXiv.org.
    11. Xiaohong Chen & Pedro H. C. SantÕAnna & Haitian Xie, 2025. "Efficient Difference-in-Differences and Event Study Estimators," Cowles Foundation Discussion Papers 2470, Cowles Foundation for Research in Economics, Yale University.
    12. Manu Navjeevan & Rodrigo Pinto & Andres Santos, 2023. "Identification and Estimation in a Class of Potential Outcomes Models," Papers 2310.05311, arXiv.org.
    13. Susan Athey & Raj Chetty & Guido Imbens, 2025. "The Experimental Selection Correction Estimator: Using Experiments to Remove Biases in Observational Estimates," NBER Working Papers 33817, National Bureau of Economic Research, Inc.
    14. Jiafeng Chen & David M. Ritzwoller, 2021. "Semiparametric Estimation of Long-Term Treatment Effects," Papers 2107.14405, arXiv.org, revised Aug 2023.
    15. Chen, Jiafeng & Ritzwoller, David M., 2023. "Semiparametric estimation of long-term treatment effects," Journal of Econometrics, Elsevier, vol. 237(2).
    16. Han, Heejoon & Linton, Oliver & Oka, Tatsushi & Whang, Yoon-Jae, 2016. "The cross-quantilogram: Measuring quantile dependence and testing directional predictability between time series," Journal of Econometrics, Elsevier, vol. 193(1), pages 251-270.
    17. Linton, O. & Whang, Yoon-Jae, 2007. "The quantilogram: With an application to evaluating directional predictability," Journal of Econometrics, Elsevier, vol. 141(1), pages 250-282, November.
    18. Peter Z. Schochet & John Burghardt & Sheena McConnell, 2008. "Does Job Corps Work? Impact Findings from the National Job Corps Study," American Economic Review, American Economic Association, vol. 98(5), pages 1864-1886, December.
    19. repec:mpr:mprres:6097 is not listed on IDEAS
    20. Chen, Jiafeng & Chen, Xiaohong & Tamer, Elie, 2023. "Efficient estimation of average derivatives in NPIV models: Simulation comparisons of neural network estimators," Journal of Econometrics, Elsevier, vol. 235(2), pages 1848-1875.
    21. Chen, Xiaohong & Liu, Ying & Ma, Shujie & Zhang, Zheng, 2024. "Causal inference of general treatment effects using neural networks with a diverging number of confounders," Journal of Econometrics, Elsevier, vol. 238(1).
    22. Ai, Chunrong & Chen, Xiaohong, 2007. "Estimation of possibly misspecified semiparametric conditional moment restriction models with different conditioning variables," Journal of Econometrics, Elsevier, vol. 141(1), pages 5-43, November.
    23. Chunrong Ai & Xiaohong Chen, 2003. "Efficient Estimation of Models with Conditional Moment Restrictions Containing Unknown Functions," Econometrica, Econometric Society, vol. 71(6), pages 1795-1843, November.
    24. Jinyong Hahn, 1998. "On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects," Econometrica, Econometric Society, vol. 66(2), pages 315-332, March.
    25. Keisuke Hirano & Guido W. Imbens & Geert Ridder, 2003. "Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score," Econometrica, Econometric Society, vol. 71(4), pages 1161-1189, July.
    26. Ai, Chunrong & Chen, Xiaohong, 2012. "The semiparametric efficiency bound for models of sequential moment restrictions containing unknown functions," Journal of Econometrics, Elsevier, vol. 170(2), pages 442-457.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xiaohong Chen & Haitian Xie, 2025. "Local Overidentification and Efficiency Gains in Modern Causal Inference and Data Combination," Cowles Foundation Discussion Papers 2467, Cowles Foundation for Research in Economics, Yale University.
    2. Xiaohong Chen & Haitian Xie, 2025. "On Local Overidentification and Efficiency Gains in Modern Causal Inference and Data Combination," Papers 2510.16683, arXiv.org, revised Feb 2026.
    3. Ganesh Karapakula, 2023. "Stable Probability Weighting: Large-Sample and Finite-Sample Estimation and Inference Methods for Heterogeneous Causal Effects of Multivalued Treatments Under Limited Overlap," Papers 2301.05703, arXiv.org, revised Jan 2023.
    4. Hidehiko Ichimura & Whitney K. Newey, 2022. "The influence function of semiparametric estimators," Quantitative Economics, Econometric Society, vol. 13(1), pages 29-61, January.
    5. Sant’Anna, Pedro H.C. & Zhao, Jun, 2020. "Doubly robust difference-in-differences estimators," Journal of Econometrics, Elsevier, vol. 219(1), pages 101-122.
    6. Chen, Xiaohong & Pouzo, Demian & Powell, James L., 2019. "Penalized sieve GEL for weighted average derivatives of nonparametric quantile IV regressions," Journal of Econometrics, Elsevier, vol. 213(1), pages 30-53.
    7. Ai, Chunrong & Linton, Oliver & Zhang, Zheng, 2022. "Estimation and inference for the counterfactual distribution and quantile functions in continuous treatment models," Journal of Econometrics, Elsevier, vol. 228(1), pages 39-61.
    8. Sasaki, Yuya & Ura, Takuya, 2023. "Estimation and inference for policy relevant treatment effects," Journal of Econometrics, Elsevier, vol. 234(2), pages 394-450.
    9. Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2022. "Locally Robust Semiparametric Estimation," Econometrica, Econometric Society, vol. 90(4), pages 1501-1535, July.
    10. Zeqi Wu & Meilin Wang & Wei Huang & Zheng Zhang, 2025. "A New and Efficient Debiased Estimation of General Treatment Models by Balanced Neural Networks Weighting," Papers 2507.04044, arXiv.org.
    11. Qingliang Fan & Yu-Chin Hsu & Robert P. Lieli & Yichong Zhang, 2022. "Estimation of Conditional Average Treatment Effects With High-Dimensional Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(1), pages 313-327, January.
    12. Haitian Xie, 2020. "Efficient and Robust Estimation of the Generalized LATE Model," Papers 2001.06746, arXiv.org, revised Feb 2022.
    13. Edvard Bakhitov, 2026. "Penalized GMM Framework for Inference on Functionals of Nonparametric Instrumental Variable Estimators," Papers 2603.29889, arXiv.org, revised Apr 2026.
    14. Sung Jae Jun & Sokbae Lee, 2024. "Causal Inference Under Outcome-Based Sampling with Monotonicity Assumptions," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 42(3), pages 998-1009, July.
    15. Halbert White & Karim Chalak, 2013. "Identification and Identification Failure for Treatment Effects Using Structural Systems," Econometric Reviews, Taylor & Francis Journals, vol. 32(3), pages 273-317, November.
    16. Michael Jansson & Demian Pouzo, 2017. "Towards a General Large Sample Theory for Regularized Estimators," Papers 1712.07248, arXiv.org, revised Jul 2020.
    17. Firpo, Sergio Pinheiro & Pinto, Rafael de Carvalho Cayres, 2012. "Combining Strategies for the Estimation of Treatment Effects," Brazilian Review of Econometrics, Sociedade Brasileira de Econometria - SBE, vol. 32(1), March.
    18. Hu, Yingyao, 2017. "The Econometrics of Unobservables -- Latent Variable and Measurement Error Models and Their Applications in Empirical Industrial Organization and Labor Economics [The Econometrics of Unobservables]," Economics Working Paper Archive 64578, The Johns Hopkins University,Department of Economics, revised 2021.
    19. Wei Huang & Oliver Linton & Zheng Zhang, 2022. "A Unified Framework for Specification Tests of Continuous Treatment Effect Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(4), pages 1817-1830, October.
    20. Ying-Ying Lee, 2014. "Partial Mean Processes with Generated Regressors: Continuous Treatment Effects and Nonseparable Models," Economics Series Working Papers 706, University of Oxford, Department of Economics.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cwl:cwldpp:2497. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Brittany Ladd (email available below). General contact details of provider: https://edirc.repec.org/data/cowleus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.