IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2510.16683.html
   My bibliography  Save this paper

Local Overidentification and Efficiency Gains in Modern Causal Inference and Data Combination

Author

Listed:
  • Xiaohong Chen
  • Haitian Xie

Abstract

This paper studies nonparametric local (over-)identification, in the sense of Chen and Santos (2018), and the associated semiparametric efficiency in modern causal frameworks. We develop a unified approach that begins by translating structural models with latent variables into their induced statistical models of observables and then analyzes local overidentification through conditional moment restrictions. We apply this approach to three leading models: (i) the general treatment model under unconfoundedness, (ii) the negative control model, and (iii) the long-term causal inference model under unobserved confounding. The first design yields a locally just-identified statistical model, implying that all regular asymptotically linear estimators of the treatment effect share the same asymptotic variance, equal to the (trivial) semiparametric efficiency bound. In contrast, the latter two models involve nonparametric endogeneity and are naturally locally overidentified; consequently, some doubly robust orthogonal moment estimators of the average treatment effect are inefficient. Whereas existing work typically imposes strong conditions to restore just-identification before deriving the efficiency bound, we relax such assumptions and characterize the general efficiency bound, along with efficient estimators, in the overidentified models (ii) and (iii).

Suggested Citation

  • Xiaohong Chen & Haitian Xie, 2025. "Local Overidentification and Efficiency Gains in Modern Causal Inference and Data Combination," Papers 2510.16683, arXiv.org.
  • Handle: RePEc:arx:papers:2510.16683
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2510.16683
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Daniel Ackerberg & Xiaohong Chen & Jinyong Hahn & Zhipeng Liao, 2014. "Asymptotic Efficiency of Semiparametric Two-step GMM," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 81(3), pages 919-943.
    2. Xiaohong Chen & Pedro H. C. Sant'Anna & Haitian Xie, 2025. "Efficient Difference-in-Differences and Event Study Estimators," Papers 2506.17729, arXiv.org.
    3. Xiaohong Chen & Demian Pouzo, 2015. "Sieve Wald and QLR Inferences on Semi/Nonparametric Conditional Moment Models," Econometrica, Econometric Society, vol. 83(3), pages 1013-1079, May.
    4. Gordon B. Dahl & Lance Lochner, 2012. "The Impact of Family Income on Child Achievement: Evidence from the Earned Income Tax Credit," American Economic Review, American Economic Association, vol. 102(5), pages 1927-1956, August.
    5. Ai, Chunrong & Chen, Xiaohong, 2012. "The semiparametric efficiency bound for models of sequential moment restrictions containing unknown functions," Journal of Econometrics, Elsevier, vol. 170(2), pages 442-457.
    6. Sergio Firpo, 2007. "Efficient Semiparametric Estimation of Quantile Treatment Effects," Econometrica, Econometric Society, vol. 75(1), pages 259-276, January.
    7. Xiaohong Chen & Andres Santos, 2018. "Overidentification in Regular Models," Econometrica, Econometric Society, vol. 86(5), pages 1771-1817, September.
    8. Gordon B. Dahl & Lance Lochner, 2017. "The Impact of Family Income on Child Achievement: Evidence from the Earned Income Tax Credit: Reply," American Economic Review, American Economic Association, vol. 107(2), pages 629-631, February.
    9. Cattaneo, Matias D., 2010. "Efficient semiparametric estimation of multi-valued treatment effects under ignorability," Journal of Econometrics, Elsevier, vol. 155(2), pages 138-154, April.
    10. Guido Imbens & Nathan Kallus & Xiaojie Mao & Yuhao Wang, 2022. "Long-term Causal Inference Under Persistent Confounding via Data Combination," Papers 2202.07234, arXiv.org, revised Aug 2024.
    11. Jinyong Hahn & Guido Kuersteiner & Andres Santos & Wavid Willigrod, 2024. "Overidentification in Shift-Share Designs," Papers 2404.17049, arXiv.org.
    12. Manu Navjeevan & Rodrigo Pinto & Andres Santos, 2023. "Identification and Estimation in a Class of Potential Outcomes Models," Papers 2310.05311, arXiv.org.
    13. Susan Athey & Raj Chetty & Guido Imbens, 2025. "The Experimental Selection Correction Estimator: Using Experiments to Remove Biases in Observational Estimates," NBER Working Papers 33817, National Bureau of Economic Research, Inc.
    14. Keisuke Hirano & Guido W. Imbens & Geert Ridder, 2003. "Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score," Econometrica, Econometric Society, vol. 71(4), pages 1161-1189, July.
    15. Chunrong Ai & Oliver Linton & Kaiji Motegi & Zheng Zhang, 2021. "A unified framework for efficient estimation of general treatment models," Quantitative Economics, Econometric Society, vol. 12(3), pages 779-816, July.
    16. Jiafeng Chen & David M. Ritzwoller, 2021. "Semiparametric Estimation of Long-Term Treatment Effects," Papers 2107.14405, arXiv.org, revised Aug 2023.
    17. Jacob Carlson & Melissa Dell, 2025. "A Unifying Framework for Robust and Efficient Inference with Unstructured Data," Papers 2505.00282, arXiv.org, revised Jul 2025.
    18. Chen, Jiafeng & Ritzwoller, David M., 2023. "Semiparametric estimation of long-term treatment effects," Journal of Econometrics, Elsevier, vol. 237(2).
    19. Isaiah Andrews & Jiafeng Chen & Otavio Tecchio, 2025. "The purpose of an estimator is what it does: Misspecification, estimands, and over-identification," Papers 2508.13076, arXiv.org, revised Aug 2025.
    20. Chen, Jiafeng & Chen, Xiaohong & Tamer, Elie, 2023. "Efficient estimation of average derivatives in NPIV models: Simulation comparisons of neural network estimators," Journal of Econometrics, Elsevier, vol. 235(2), pages 1848-1875.
    21. Chen, Xiaohong & Liu, Ying & Ma, Shujie & Zhang, Zheng, 2024. "Causal inference of general treatment effects using neural networks with a diverging number of confounders," Journal of Econometrics, Elsevier, vol. 238(1).
    22. Ai, Chunrong & Chen, Xiaohong, 2007. "Estimation of possibly misspecified semiparametric conditional moment restriction models with different conditioning variables," Journal of Econometrics, Elsevier, vol. 141(1), pages 5-43, November.
    23. Jinyong Hahn, 1998. "On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects," Econometrica, Econometric Society, vol. 66(2), pages 315-332, March.
    24. Chunrong Ai & Xiaohong Chen, 2003. "Efficient Estimation of Models with Conditional Moment Restrictions Containing Unknown Functions," Econometrica, Econometric Society, vol. 71(6), pages 1795-1843, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hidehiko Ichimura & Whitney K. Newey, 2022. "The influence function of semiparametric estimators," Quantitative Economics, Econometric Society, vol. 13(1), pages 29-61, January.
    2. Ganesh Karapakula, 2023. "Stable Probability Weighting: Large-Sample and Finite-Sample Estimation and Inference Methods for Heterogeneous Causal Effects of Multivalued Treatments Under Limited Overlap," Papers 2301.05703, arXiv.org, revised Jan 2023.
    3. Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2022. "Locally Robust Semiparametric Estimation," Econometrica, Econometric Society, vol. 90(4), pages 1501-1535, July.
    4. Sant’Anna, Pedro H.C. & Zhao, Jun, 2020. "Doubly robust difference-in-differences estimators," Journal of Econometrics, Elsevier, vol. 219(1), pages 101-122.
    5. Chen, Xiaohong & Pouzo, Demian & Powell, James L., 2019. "Penalized sieve GEL for weighted average derivatives of nonparametric quantile IV regressions," Journal of Econometrics, Elsevier, vol. 213(1), pages 30-53.
    6. Zeqi Wu & Meilin Wang & Wei Huang & Zheng Zhang, 2025. "A New and Efficient Debiased Estimation of General Treatment Models by Balanced Neural Networks Weighting," Papers 2507.04044, arXiv.org.
    7. Wei Huang & Oliver Linton & Zheng Zhang, 2022. "A Unified Framework for Specification Tests of Continuous Treatment Effect Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 40(4), pages 1817-1830, October.
    8. Chunrong Ai & Oliver Linton & Kaiji Motegi & Zheng Zhang, 2021. "A unified framework for efficient estimation of general treatment models," Quantitative Economics, Econometric Society, vol. 12(3), pages 779-816, July.
    9. Ai, Chunrong & Linton, Oliver & Zhang, Zheng, 2022. "Estimation and inference for the counterfactual distribution and quantile functions in continuous treatment models," Journal of Econometrics, Elsevier, vol. 228(1), pages 39-61.
    10. Chen, Xiaohong & Liu, Ying & Ma, Shujie & Zhang, Zheng, 2024. "Causal inference of general treatment effects using neural networks with a diverging number of confounders," Journal of Econometrics, Elsevier, vol. 238(1).
    11. Haitian Xie, 2020. "Efficient and Robust Estimation of the Generalized LATE Model," Papers 2001.06746, arXiv.org, revised Feb 2022.
    12. Xiaohong Chen & Andres Santos, 2018. "Overidentification in Regular Models," Econometrica, Econometric Society, vol. 86(5), pages 1771-1817, September.
    13. Sung Jae Jun & Sokbae Lee, 2024. "Causal Inference Under Outcome-Based Sampling with Monotonicity Assumptions," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 42(3), pages 998-1009, July.
    14. Halbert White & Karim Chalak, 2013. "Identification and Identification Failure for Treatment Effects Using Structural Systems," Econometric Reviews, Taylor & Francis Journals, vol. 32(3), pages 273-317, November.
    15. Mammen, Enno & Rothe, Christoph & Schienle, Melanie, 2016. "Semiparametric Estimation With Generated Covariates," Econometric Theory, Cambridge University Press, vol. 32(5), pages 1140-1177, October.
    16. Michael Jansson & Demian Pouzo, 2017. "Towards a General Large Sample Theory for Regularized Estimators," Papers 1712.07248, arXiv.org, revised Jul 2020.
    17. Firpo, Sergio Pinheiro & Pinto, Rafael de Carvalho Cayres, 2012. "Combining Strategies for the Estimation of Treatment Effects," Brazilian Review of Econometrics, Sociedade Brasileira de Econometria - SBE, vol. 32(1), March.
    18. Hu, Yingyao, 2017. "The Econometrics of Unobservables -- Latent Variable and Measurement Error Models and Their Applications in Empirical Industrial Organization and Labor Economics [The Econometrics of Unobservables]," Economics Working Paper Archive 64578, The Johns Hopkins University,Department of Economics, revised 2021.
    19. Ying-Ying Lee, 2014. "Partial Mean Processes with Generated Regressors: Continuous Treatment Effects and Nonseparable Models," Economics Series Working Papers 706, University of Oxford, Department of Economics.
    20. Ying-Ying Lee, 2015. "Efficient propensity score regression estimators of multi-valued treatment effects for the treated," Economics Series Working Papers 738, University of Oxford, Department of Economics.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2510.16683. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.