Printed from https://ideas.repec.org/p/ifs/cemmap/44-19.html

Inference on a distribution from noisy draws

Author

Listed:
  • Koen Jochmans

    (Institute for Fiscal Studies and University of Cambridge)

  • Martin Weidner

    (Institute for Fiscal Studies and University College London)

Abstract

We consider a situation where the distribution of a random variable is estimated by the empirical distribution of noisy measurements of that variable. This is common practice in, for example, teacher value-added models and other fixed-effect models for panel data. We use an asymptotic embedding in which the noise shrinks with the sample size to calculate the leading bias in the empirical distribution arising from the presence of noise. The leading bias in the empirical quantile function is obtained as well. These calculations are new to the literature, where only results on smooth functionals such as the mean and variance have been derived. Given a closed-form expression for the bias, bias-corrected estimators of the distribution function and quantile function can be constructed. We provide both analytical and jackknife corrections that recenter the limit distribution and yield confidence intervals with correct coverage in large samples. These corrections are non-parametric and easy to implement. Our approach can be connected to corrections for selection bias and shrinkage estimation and is to be contrasted with deconvolution. Simulation results confirm the much-improved sampling behavior of the corrected estimators.
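
The bias described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' estimator: it assumes a balanced panel with normal latent effects and normal noise, estimates each unit's value by its time average, and applies a generic split-panel jackknife to the empirical distribution function. The function names (`ecdf_at`, `split_jackknife_cdf`) and all parameter values are hypothetical choices made for illustration.

```python
import math
import numpy as np


def ecdf_at(values, x):
    """Empirical distribution function of `values`, evaluated at the point x."""
    return float(np.mean(values <= x))


def split_jackknife_cdf(panel, x):
    """Split-panel jackknife estimate of the latent CDF at x.

    `panel` has shape (n, T) with T even. Each half-panel mean carries twice
    the noise variance of the full-panel mean, so the combination
    2 * full - average(halves) cancels the leading O(1/T) bias term.
    """
    n, T = panel.shape
    half = T // 2
    full = ecdf_at(panel.mean(axis=1), x)
    first = ecdf_at(panel[:, :half].mean(axis=1), x)
    second = ecdf_at(panel[:, half:].mean(axis=1), x)
    return 2.0 * full - 0.5 * (first + second)


# Assumed data-generating process: latent theta_i ~ N(0, 1),
# measurements x_it = theta_i + sigma * eps_it with eps_it ~ N(0, 1).
rng = np.random.default_rng(0)
n, T, sigma = 50_000, 10, 2.0
theta = rng.standard_normal(n)
panel = theta[:, None] + sigma * rng.standard_normal((n, T))

x = -1.0
truth = 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))  # standard normal CDF at x
naive = ecdf_at(panel.mean(axis=1), x)               # uses noisy unit means
corrected = split_jackknife_cdf(panel, x)

print(f"truth={truth:.4f} naive={naive:.4f} jackknife={corrected:.4f}")
```

In this design the noisy unit means are more dispersed than the latent effects, so the uncorrected estimate overstates the mass in the lower tail, and the jackknife estimate lies closer to the true value. The correction trades some extra sampling variance for the reduction in bias.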

Suggested Citation

  • Koen Jochmans & Martin Weidner, 2019. "Inference on a distribution from noisy draws," CeMMAP working papers CWP44/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
  • Handle: RePEc:ifs:cemmap:44/19

    Download full text from publisher

    File URL: https://www.ifs.org.uk/uploads/CWP4419-Inference-on-a-distribution-from-noisy-draws.pdf
    Download Restriction: no

    Citations

    Cited by:

    1. Okui, Ryo & Yanagi, Takahide, 2019. "Panel data analysis with heterogeneous dynamics," Journal of Econometrics, Elsevier, vol. 212(2), pages 451-475.
    2. Ryo Okui & Takahide Yanagi, 2020. "Kernel estimation for panel data with heterogeneous dynamics [Econometric tools for analyzing market outcomes]," The Econometrics Journal, Royal Economic Society, vol. 23(1), pages 156-175.
    3. Magnac, Thierry & Roux, Sébastien, 2021. "Heterogeneity and wage inequalities over the life cycle," European Economic Review, Elsevier, vol. 134(C).
    4. Laurent Barras & Patrick Gagliardini & Olivier Scaillet, 2022. "Skill, Scale, and Value Creation in the Mutual Fund Industry," Journal of Finance, American Finance Association, vol. 77(1), pages 601-638, February.
    5. Gobillon, Laurent & Magnac, Thierry & Roux, Sébastien, 2022. "Lifecycle Wages and Human Capital Investments: Selection and Missing Data," TSE Working Papers 22-1299, Toulouse School of Economics (TSE).
    6. Stéphane Bonhomme & Martin Weidner, 2019. "Posterior Average Effects," Papers 1906.06360, arXiv.org, revised Sep 2021.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Geert Dhaene & Koen Jochmans, 2015. "Split-panel Jackknife Estimation of Fixed-effect Models," Review of Economic Studies, Oxford University Press, vol. 82(3), pages 991-1030.
    2. Okui, Ryo & Yanagi, Takahide, 2019. "Panel data analysis with heterogeneous dynamics," Journal of Econometrics, Elsevier, vol. 212(2), pages 451-475.
    3. Dhaene, Geert & Jochmans, Koen, 2016. "Likelihood Inference In An Autoregression With Fixed Effects," Econometric Theory, Cambridge University Press, vol. 32(5), pages 1178-1215, October.
    4. Galvao, Antonio F. & Gu, Jiaying & Volgushev, Stanislav, 2020. "On the unbiased asymptotic normality of quantile regression with fixed effects," Journal of Econometrics, Elsevier, vol. 218(1), pages 178-215.
    5. Rasmus Lentz & Jean Marc Robin & Suphanit Piyapromdee, 2018. "On Worker and Firm Heterogeneity in Wages and Employment Mobility: Evidence from Danish Register Data," 2018 Meeting Papers 469, Society for Economic Dynamics.
    6. L. Hospido, 2012. "Modelling heterogeneity and dynamics in the volatility of individual wages," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 27(3), pages 386-414, April.
    7. Manuel Arellano & Stéphane Bonhomme, 2012. "Identifying Distributional Characteristics in Random Coefficients Panel Data Models," Review of Economic Studies, Oxford University Press, vol. 79(3), pages 987-1020.
    8. Xuan Leng & Jiaming Mao & Yutao Sun, 2023. "Debiased inference for dynamic nonlinear models with two-way fixed effects," Papers 2305.03134, arXiv.org, revised Oct 2023.
    9. Kunz, J.S.; & Staub, K.E.; & Winkelmann, R.;, 2018. "Predicting fixed effects in panel probit models," Health, Econometrics and Data Group (HEDG) Working Papers 18/23, HEDG, c/o Department of Economics, University of York.
    10. Ivan Fernandez-Val & Martin Weidner, 2015. "Individual and time effects in nonlinear panel models with large N , T," CeMMAP working papers 17/15, Institute for Fiscal Studies.
    11. Koen Jochmans & Martin Weidner, 2019. "Fixed‐Effect Regressions on Network Data," Econometrica, Econometric Society, vol. 87(5), pages 1543-1560, September.
    12. Magnac, Thierry & Pistolesi, Nicolas & Roux, Sébastien, 2013. "Post schooling human capital investments and the life cycle variance of earnings," TSE Working Papers 13-380, Toulouse School of Economics (TSE).
    13. Carneiro, Anabela & Portugal, Pedro & Raposo, Pedro & Rodrigues, Paulo M.M., 2023. "The persistence of wages," Journal of Econometrics, Elsevier, vol. 233(2), pages 596-611.
    14. Hu, Yingyao, 2017. "The econometrics of unobservables: Applications of measurement error models in empirical industrial organization and labor economics," Journal of Econometrics, Elsevier, vol. 200(2), pages 154-168.
    15. Fernández-Val, Iván & Weidner, Martin, 2016. "Individual and time effects in nonlinear panel models with large N, T," Journal of Econometrics, Elsevier, vol. 192(1), pages 291-312.
    16. Stéphane Bonhomme & Martin Weidner, 2019. "Posterior average effects," CeMMAP working papers CWP43/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    17. Jochmans, Koen & Higgins, Ayden, 2022. "Bootstrap inference for fixed-effect models," TSE Working Papers 22-1328, Toulouse School of Economics (TSE), revised Dec 2023.
    18. Hospido, Laura, 2015. "Wage dynamics in the presence of unobserved individual and job heterogeneity," Labour Economics, Elsevier, vol. 33(C), pages 81-93.
    19. Karol Jan Borowiecki, 2022. "Good Reverberations? Teacher Influence in Music Composition since 1450," Journal of Political Economy, University of Chicago Press, vol. 130(4), pages 991-1090.

    More about this item

    JEL classification:

    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C23 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Models with Panel Data; Spatio-temporal Models
