IDEAS home Printed from https://ideas.repec.org/p/ifs/ifsewp/cwp04-25.html
   My bibliography  Save this paper

Prediction sets and conformal inference with censored outcomes

Author

Listed:
  • Áureo de Paula

    (Institute for Fiscal Studies)

  • Elie Tamer

    (Institute for Fiscal Studies)

  • Weiguang Liu

    (UCL)

Abstract

Given data on a scalar random variable π‘Œ, a prediction set for π‘Œ with miscoverage level 𝛼 is a set of values for π‘Œ that contains a randomly drawn π‘Œ with probability 1 βˆ’ 𝛼, where 𝛼 ∈ (0, 1). Among all prediction sets that satisfy this coverage property, the oracle prediction set is the one with the smallest volume. This paper provides estimation methods of such prediction sets given observed conditioning covariates when π‘Œ is censored or measured in intervals. We first characterise the oracle prediction set under interval censoring and develop a consistent estimator for the shortest prediction interval that satisfies this coverage property. We then extend these consistency results to accommodate cases where the prediction set consists of multiple disjoint intervals. Second, we use conformal inference to construct a prediction set that achieves a particular notion of finite-sample validity under censoring and maintains consistency as sample size increases. This notion exploits exchangeability to obtain finite sample guarantees on coverage using a specially constructed conformity score function. The procedure accomodates the prediction uncertainty that is irreducible (due to the stochastic nature of outcomes), the modelling uncertainty due to partial identification and also sampling uncertainty that gets reduced as samples get larger. We conduct a set of Monte Carlo simulations and an application to data from the Current Population Survey. The results highlight the robustness and efficiency of the proposed methods.
(This abstract was borrowed from another version of this item.)

Suggested Citation

  • Áureo de Paula & Elie Tamer & Weiguang Liu, 2025. "Prediction sets and conformal inference with censored outcomes," IFS Working Papers WCWP04/25, Institute for Fiscal Studies.
  • Handle: RePEc:ifs:ifsewp:cwp04/25
    as

    Download full text from publisher

    File URL: https://ifs.org.uk/sites/default/files/2025-01/CWP0425-Prediction-sets-and-conformal-inference-with-censored-outcomes.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Lillard, Lee & Smith, James P & Welch, Finis, 1986. "What Do We Really Know about Wages? The Importance of Nonreporting and Census Imputation," Journal of Political Economy, University of Chicago Press, vol. 94(3), pages 489-506, June.
    2. Degui Li & Qi Li & Zheng Li, 2021. "Nonparametric Quantile Regression Estimation With Mixed Discrete and Continuous Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 39(3), pages 741-756, July.
    3. Christopher R. Bollinger & Barry T. Hirsch & Charles M. Hokayem & James P. Ziliak, 2019. "Trouble in the Tails? What We Know about Earnings Nonresponse 30 Years after Lillard, Smith, and Welch," Journal of Political Economy, University of Chicago Press, vol. 127(5), pages 2143-2185.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Vladimir Hlasny & Paolo Verme, 2022. "The Impact of Top Incomes Biases on the Measurement of Inequality in the United States," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 84(4), pages 749-788, August.
    2. Paolo Verme, 2023. "Predicting Poverty with Missing Incomes," Working Papers 642, ECINEQ, Society for the Study of Economic Inequality.
    3. Engelhardt, Gary V. & Purcell, Patrick J., 2021. "The minimum wage and annual earnings inequality," Economics Letters, Elsevier, vol. 207(C).
    4. James P. Ziliak & Charles Hokayem & Christopher R. Bollinger, 2022. "Trends in Earnings Volatility Using Linked Administrative and Survey Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(1), pages 12-19, December.
    5. Eggleston Jonathan, 2019. "Item Response Rates for Composite Variables," Journal of Official Statistics, Sciendo, vol. 35(2), pages 387-408, June.
    6. Klee, Mark A. & Chenevert, Rebecca L. & Wilkin, Kelly R., 2019. "Revisiting the shape of earnings nonresponse," Economics Letters, Elsevier, vol. 184(C).
    7. Ivan Fernandez-Val & Franco Peracchi & Francis Vella & Aico van Vuuren, 2019. "Decomposing Changes in the Distribution of Real Hourly Wages in the U.S," CeMMAP working papers CWP61/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    8. Gopi Shah Goda & Emilie Jackson & Lauren Hersch Nicholas & Sarah See Stith, 2023. "The impact of Covid-19 on older workers’ employment and Social Security spillovers," Journal of Population Economics, Springer;European Society for Population Economics, vol. 36(2), pages 813-846, April.
    9. James P. Ziliak, 2021. "Food Hardship during the COVID‐19 Pandemic and Great Recession," Applied Economic Perspectives and Policy, John Wiley & Sons, vol. 43(1), pages 132-152, March.
    10. Christian Awuku-Budu & Dirk van Duym, 2022. "Developing Statistics on the Distribution of State Personal Income: Methodology and Preliminary Results," BEA Working Papers 0197, Bureau of Economic Analysis.
    11. Dennis Fixler & Marina Gindelsky & David S. Johnson, 2020. "Distributing Personal Income: Trends over Time," NBER Chapters, in: Measuring Distribution and Mobility of Income and Wealth, pages 589-603, National Bureau of Economic Research, Inc.
    12. Riphahn, Regina, 1999. "Immigrant Participation in Social Assistance Programs: Evidence from German Guestworkers," CEPR Discussion Papers 2318, C.E.P.R. Discussion Papers.
    13. Mathias Silva, 2023. "Parametric models of income distributions integrating misreporting and non-response mechanisms," AMSE Working Papers 2311, Aix-Marseille School of Economics, France.
    14. Anton Korinek & Johan Mistiaen & Martin Ravallion, 2006. "Survey nonresponse and the distribution of income," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 4(1), pages 33-55, April.
    15. Meyer, Bruce D. & Mittag, Nikolas, 2019. "Combining Administrative and Survey Data to Improve Income Measurement," IZA Discussion Papers 12266, Institute of Labor Economics (IZA).
    16. Bruce D. Meyer & Derek Wu & Victoria R. Mooers & Carla Medalia, 2019. "The use and misuse of income data and extreme poverty in the United States," AEI Economics Working Papers 1018925, American Enterprise Institute.
    17. Jonathan Fisher & Bradley L. Hardy, 2023. "Money matters: consumption variability across the income distribution," Fiscal Studies, John Wiley & Sons, vol. 44(3), pages 275-298, September.
    18. Korinek, Anton & Mistiaen, Johan A. & Ravallion, Martin, 2007. "An econometric method of correcting for unit nonresponse bias in surveys," Journal of Econometrics, Elsevier, vol. 136(1), pages 213-235, January.
    19. McGovern, Mark E. & Canning, David & BΓ€rnighausen, Till, 2018. "Accounting for non-response bias using participation incentives and survey design: An application using gift vouchers," Economics Letters, Elsevier, vol. 171(C), pages 239-244.
    20. Christian Dustmann & Francesca Fabbri, 2005. "Gender and Ethnicity--Married Immigrants in Britain," Oxford Review of Economic Policy, Oxford University Press and Oxford Review of Economic Policy Limited, vol. 21(3), pages 462-484, Autumn.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ifs:ifsewp:cwp04/25. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Emma Hyman (email available below). General contact details of provider: https://edirc.repec.org/data/ifsssuk.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.