IDEAS home Printed from https://ideas.repec.org/p/ifs/cemmap/62-18.html
   My bibliography  Save this paper

Semiparametrically efficient estimation of the average linear regression function

Author

Listed:
  • Bryan S. Graham

    (Institute for Fiscal Studies and University of California, Berkeley)

  • Cristine Campos de Xavier Pinto

    (Institute for Fiscal Studies)

Abstract

Let Y be an outcome of interest, X a vector of treatment measures, and W a vector of pre-treatment control variables. Here X may include (combinations of) continuous, discrete, and/or non-mutually exclusive “treatments”. Consider the linear regression of Y onto X in a subpopulation homogenous in W = w (formally a conditional linear predictor). Let b0 (w) be the coefficient vector on X in this regression. We introduce a semiparametrically efficient estimate of the average ß0 = E [b0 (W)]. When X is binary-valued (multi-valued) our procedure recovers the (a vector of) average treatment effect(s). When X is continuously-valued, or consists of multiple non-exclusive treatments, our estimand coincides with the average partial effect (APE) of X on Y when the underlying potential response function is linear in X, but otherwise heterogenous across agents. When the potential response function takes a general nonlinear/heterogenous form, and X is continuously-valued, our procedure recovers a weighted average of the gradient of this response across individuals and values of X. We provide a simple, and semiparametrically efficient, method of covariate adjustment for settings with complicated treatment regimes. Our method generalizes familiar methods of covariate adjustment used for program evaluation as well as methods of semiparametric regression (e.g., the partially linear regression model).

Suggested Citation

  • Bryan S. Graham & Cristine Campos de Xavier Pinto, 2018. "Semiparametrically efficient estimation of the average linear regression function," CeMMAP working papers CWP62/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
  • Handle: RePEc:ifs:cemmap:62/18
    as

    Download full text from publisher

    File URL: https://www.ifs.org.uk/uploads/cemmap/wps/CWP621818.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Wooldridge, Jeffrey M., 1999. "Distribution-free estimation of some nonlinear panel data models," Journal of Econometrics, Elsevier, vol. 90(1), pages 77-97, May.
    2. Ruud, Paul A., 1986. "Consistent estimation of limited dependent variable models despite misspecification of distribution," Journal of Econometrics, Elsevier, vol. 32(1), pages 157-187, June.
    3. Robinson, Peter M, 1988. "Root- N-Consistent Semiparametric Regression," Econometrica, Econometric Society, vol. 56(4), pages 931-954, July.
    4. Cattaneo, Matias D., 2010. "Efficient semiparametric estimation of multi-valued treatment effects under ignorability," Journal of Econometrics, Elsevier, vol. 155(2), pages 138-154, April.
    5. Bryan S. Graham, 2011. "Efficiency Bounds for Missing Data Models With Semiparametric Restrictions," Econometrica, Econometric Society, vol. 79(2), pages 437-452, March.
    6. Keisuke Hirano & Guido W. Imbens & Geert Ridder, 2003. "Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score," Econometrica, Econometric Society, vol. 71(4), pages 1161-1189, July.
    7. Jeffrey M Wooldridge, 2010. "Econometric Analysis of Cross Section and Panel Data," MIT Press Books, The MIT Press, edition 2, volume 1, number 0262232588, April.
    8. Bryan S. Graham & Cristine Campos de Xavier Pinto & Daniel Egel, 2016. "Efficient Estimation of Data Combination Models by the Method of Auxiliary-to-Study Tilting (AST)," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(2), pages 288-301, April.
    9. Bryan S. Graham & Cristine Campos De Xavier Pinto & Daniel Egel, 2012. "Inverse Probability Tilting for Moment Condition Models with Missing Data," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 79(3), pages 1053-1079.
    10. Newey, Whitney K., 1994. "Kernel Estimation of Partial Means and a General Variance Estimator," Econometric Theory, Cambridge University Press, vol. 10(2), pages 1-21, June.
    11. Joshua D. Angrist & Jörn-Steffen Pischke, 2009. "Mostly Harmless Econometrics: An Empiricist's Companion," Economics Books, Princeton University Press, edition 1, number 8769.
    12. Newey, Whitney K, 1990. "Semiparametric Efficiency Bounds," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 5(2), pages 99-135, April-Jun.
    13. repec:eee:labchp:v:1:y:1986:i:c:p:3-102 is not listed on IDEAS
    14. Alexandre Belloni & Victor Chernozhukov & Christian Hansen, 2014. "Inference on Treatment Effects after Selection among High-Dimensional Controlsâ€," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 81(2), pages 608-650.
    15. Sloczynski, Tymon, 2018. "A General Weighted Average Representation of the Ordinary and Two-Stage Least Squares Estimands," IZA Discussion Papers 11866, Institute of Labor Economics (IZA).
    16. Jinyong Hahn, 1998. "On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects," Econometrica, Econometric Society, vol. 66(2), pages 315-332, March.
    17. Heejung Bang & James M. Robins, 2005. "Doubly Robust Estimation in Missing Data and Causal Inference Models," Biometrics, The International Biometric Society, vol. 61(4), pages 962-973, December.
    18. Kline, Patrick, 2014. "A note on variance estimation for the Oaxaca estimator of average treatment effects," Economics Letters, Elsevier, vol. 122(3), pages 428-431.
    19. Jeffrey M. Wooldridge, 2004. "Estimating average partial effects under conditional moment independence assumptions," CeMMAP working papers CWP03/04, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    20. Matias D. Cattaneo, 2010. "multi-valued treatment effects," The New Palgrave Dictionary of Economics,, Palgrave Macmillan.
    21. Wooldridge, Jeffrey M., 2007. "Inverse probability weighted estimation for general missing data problems," Journal of Econometrics, Elsevier, vol. 141(2), pages 1281-1301, December.
    22. Hitomi, Kohtaro & Nishiyama, Yoshihiko & Okui, Ryo, 2008. "A Puzzling Phenomenon In Semiparametric Estimation Problems With Infinite-Dimensional Nuisance Parameters," Econometric Theory, Cambridge University Press, vol. 24(6), pages 1717-1728, December.
    23. Chamberlain, Gary, 1987. "Asymptotic efficiency in estimation with conditional moment restrictions," Journal of Econometrics, Elsevier, vol. 34(3), pages 305-334, March.
    24. Tymon Słoczyński, 2015. "The Oaxaca–Blinder Unexplained Component as a Treatment Effects Estimator," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 77(4), pages 588-604, August.
    25. Joshua D. Angrist, 1998. "Estimating the Labor Market Impact of Voluntary Military Service Using Social Security Data on Military Applicants," Econometrica, Econometric Society, vol. 66(2), pages 249-288, March.
    26. Joshua D. Angrist & Kathryn Graddy & Guido W. Imbens, 2000. "The Interpretation of Instrumental Variables Estimators in Simultaneous Equations Models with an Application to the Demand for Fish," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 67(3), pages 499-527.
    27. Markus Frolich, 2004. "A Note on the Role of the Propensity Score for Estimating Average Treatment Effects," Econometric Reviews, Taylor & Francis Journals, vol. 23(2), pages 167-174.
    28. Bryan S. Graham & Guido W. Imbens & Geert Ridder, 2010. "Measuring the Effects of Segregation in the Presence of Social Spillovers: A Nonparametric Approach," NBER Working Papers 16499, National Bureau of Economic Research, Inc.
    29. Imbens,Guido W. & Rubin,Donald B., 2015. "Causal Inference for Statistics, Social, and Biomedical Sciences," Cambridge Books, Cambridge University Press, number 9780521885881, January.
    30. Yitzhaki, Shlomo, 1996. "On Using Linear Regressions in Welfare Economics," Journal of Business & Economic Statistics, American Statistical Association, vol. 14(4), pages 478-486, October.
    31. Chamberlain, Gary, 1992. "Efficiency Bounds for Semiparametric Regression," Econometrica, Econometric Society, vol. 60(3), pages 567-596, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Whitney K. Newey & Sami Stouli, 2018. "Heterogenous Coefficients, Discrete Instruments, and Identification of Treatment Effects," Papers 1811.09837, arXiv.org.
    2. Mert Demirer & Vasilis Syrgkanis & Greg Lewis & Victor Chernozhukov, 2019. "Semi-Parametric Efficient Policy Learning with Continuous Actions," CeMMAP working papers CWP34/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    3. Ohanisian Alina & Levchenko Nataliia & Shyshkanova Ganna & Abuselidze George & Prykhodko Volodymyr & Banchuk-Petrosova Olena, 2022. "Organic farms are the fundamental basis for the sustainable foreign economic activities of agrarians in Ukraine," Environmental & Socio-economic Studies, Sciendo, vol. 10(2), pages 49-61, June.
    4. Tymon Słoczyński, 2022. "Interpreting OLS Estimands When Treatment Effects Are Heterogeneous: Smaller Groups Get Larger Weights," The Review of Economics and Statistics, MIT Press, vol. 104(3), pages 501-509, May.
    5. W K Newey & S Stouli, 2022. "Heterogeneous coefficients, control variables and identification of multiple treatment effects [Multivalued treatments and decomposition analysis: An application to the WIA program]," Biometrika, Biometrika Trust, vol. 109(3), pages 865-872.
    6. Winkelmann Rainer, 2024. "Neglected Heterogeneity, Simpson’s Paradox, and the Anatomy of Least Squares," Journal of Econometric Methods, De Gruyter, vol. 13(1), pages 131-144, January.
    7. Stijn Vansteelandt & Oliver Dukes, 2022. "Assumption‐lean inference for generalised linear model parameters," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(3), pages 657-685, July.
    8. Alejandro Sanchez-Becerra, 2022. "The Network Propensity Score: Spillovers, Homophily, and Selection into Treatment," Papers 2209.14391, arXiv.org.
    9. DiTraglia, Francis J. & García-Jimeno, Camilo & O’Keeffe-O’Donovan, Rossa & Sánchez-Becerra, Alejandro, 2023. "Identifying causal effects in experiments with spillovers and non-compliance," Journal of Econometrics, Elsevier, vol. 235(2), pages 1589-1624.
    10. Rainer Winkelmann, 2023. "Neglected heterogeneity, Simpson’s paradox, and the anatomy of least squares," ECON - Working Papers 426, Department of Economics - University of Zurich, revised Jul 2023.
    11. Max H. Farrell & Tengyuan Liang & Sanjog Misra, 2020. "Deep Learning for Individual Heterogeneity: An Automatic Inference Framework," Papers 2010.14694, arXiv.org, revised Jul 2021.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sant’Anna, Pedro H.C. & Zhao, Jun, 2020. "Doubly robust difference-in-differences estimators," Journal of Econometrics, Elsevier, vol. 219(1), pages 101-122.
    2. Chris Muris, 2020. "Efficient GMM Estimation with Incomplete Data," The Review of Economics and Statistics, MIT Press, vol. 102(3), pages 518-530, July.
    3. Victor Chernozhukov & Juan Carlos Escanciano & Hidehiko Ichimura & Whitney K. Newey & James M. Robins, 2022. "Locally Robust Semiparametric Estimation," Econometrica, Econometric Society, vol. 90(4), pages 1501-1535, July.
    4. Guido W. Imbens & Jeffrey M. Wooldridge, 2009. "Recent Developments in the Econometrics of Program Evaluation," Journal of Economic Literature, American Economic Association, vol. 47(1), pages 5-86, March.
    5. Bryan S. Graham & Guido W. Imbens & Geert Ridder, 2020. "Identification and Efficiency Bounds for the Average Match Function Under Conditionally Exogenous Matching," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 38(2), pages 303-316, April.
    6. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    7. Farrell, Max H., 2015. "Robust inference on average treatment effects with possibly more covariates than observations," Journal of Econometrics, Elsevier, vol. 189(1), pages 1-23.
    8. Michael C. Knaus, 2021. "A double machine learning approach to estimate the effects of musical practice on student’s skills," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(1), pages 282-300, January.
    9. Firpo, Sergio Pinheiro & Pinto, Rafael de Carvalho Cayres, 2012. "Combining Strategies for the Estimation of Treatment Effects," Brazilian Review of Econometrics, Sociedade Brasileira de Econometria - SBE, vol. 32(1), March.
    10. Frölich, Markus & Huber, Martin & Wiesenfarth, Manuel, 2017. "The finite sample performance of semi- and non-parametric estimators for treatment effects and policy evaluation," Computational Statistics & Data Analysis, Elsevier, vol. 115(C), pages 91-102.
    11. Difang Huang & Jiti Gao & Tatsushi Oka, 2022. "Semiparametric Single-Index Estimation for Average Treatment Effects," Papers 2206.08503, arXiv.org, revised Apr 2024.
    12. A. Belloni & V. Chernozhukov & I. Fernández‐Val & C. Hansen, 2017. "Program Evaluation and Causal Inference With High‐Dimensional Data," Econometrica, Econometric Society, vol. 85, pages 233-298, January.
    13. Susan Athey & Guido W. Imbens, 2017. "The State of Applied Econometrics: Causality and Policy Evaluation," Journal of Economic Perspectives, American Economic Association, vol. 31(2), pages 3-32, Spring.
    14. Ganesh Karapakula, 2023. "Stable Probability Weighting: Large-Sample and Finite-Sample Estimation and Inference Methods for Heterogeneous Causal Effects of Multivalued Treatments Under Limited Overlap," Papers 2301.05703, arXiv.org, revised Jan 2023.
    15. Sasaki, Yuya & Ura, Takuya, 2023. "Estimation and inference for policy relevant treatment effects," Journal of Econometrics, Elsevier, vol. 234(2), pages 394-450.
    16. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2016. "Double/Debiased Machine Learning for Treatment and Causal Parameters," Papers 1608.00060, arXiv.org, revised Nov 2024.
    17. Huber, Martin & Lechner, Michael & Wunsch, Conny, 2013. "The performance of estimators based on the propensity score," Journal of Econometrics, Elsevier, vol. 175(1), pages 1-21.
    18. Bryan S. Graham & Guido Imbens & Geert Ridder, 2016. "Identification and efficiency bounds for the average match function under conditionally exogenous matching," CeMMAP working papers 10/16, Institute for Fiscal Studies.
    19. Słoczyński, Tymon, 2012. "New Evidence on Linear Regression and Treatment Effect Heterogeneity," MPRA Paper 39524, University Library of Munich, Germany.
    20. Alexandre Belloni & Victor Chernozhukov & Ivan Fernandez-Val & Christian Hansen, 2013. "Program evaluation with high-dimensional data," CeMMAP working papers CWP77/13, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.

    More about this item

    Keywords

    Conditional Linear Predictor; Causal Inference; Average Treatment Effect; Propensity Score; Semiparametric Efficiency; Semiparametric Regression;
    All these keywords.

    JEL classification:

    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C21 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models
    • C31 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models; Quantile Regressions; Social Interaction Models

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ifs:cemmap:62/18. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Emma Hyman (email available below). General contact details of provider: https://edirc.repec.org/data/cmifsuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.