Printed from https://ideas.repec.org/p/arx/papers/2501.10117.html

Prediction Sets and Conformal Inference with Interval Outcomes

Author

Listed:
  • Weiguang Liu
  • Áureo de Paula
  • Elie Tamer

Abstract

Given data on a random variable \(Y\), a prediction set with miscoverage level \(\alpha \in (0,1)\) is a set that contains a new draw of \(Y\) with probability \(1-\alpha\). Among all prediction sets satisfying this coverage property, the oracle prediction set is the one with minimal volume. The oracle prediction set offers a complementary view of the distribution of \(Y\), beyond point estimators such as the mean and quantiles, and has attracted considerable interest recently. This paper develops methods for estimating such prediction sets conditional on observed covariates when \(Y\) is censored or interval-valued. We characterise the oracle prediction set under partial identification induced by interval censoring and propose consistent estimators for both oracle prediction intervals and more general oracle prediction sets consisting of multiple disjoint intervals. In addition, we apply conformal inference to construct finite-sample valid prediction sets for interval outcomes that remain consistent as the sample size grows, using a conformity score tailored to interval data. The proposed procedure accounts for three sources of uncertainty: irreducible prediction uncertainty due to the stochastic nature of outcomes, modelling uncertainty arising from partial identification, and sampling uncertainty that vanishes as the sample size increases. We report Monte Carlo simulations and two empirical applications, using UK job postings data and the US Current Population Survey. The results demonstrate the robustness and efficiency of the proposed methods.
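
As a rough illustration of the conformal step described in the abstract, the following is a minimal sketch of split conformal prediction when only a bracket [y_lo, y_hi] around each outcome is observed. Everything in it is an assumption made for illustration: the simulated data, the placeholder `base_interval`, and in particular `interval_score`, which is a simple CQR-style score and not the conformity score proposed in the paper. It calibrates an initial interval so that it contains the observed bracket, and therefore the latent outcome inside it, with the target frequency.

```python
import math
import random


def conformal_quantile(scores, alpha):
    """Order statistic of rank ceil((n + 1) * (1 - alpha)) among calibration scores."""
    n = len(scores)
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        return math.inf  # too few calibration points for this alpha
    return sorted(scores)[k - 1]


def interval_score(lo_pred, hi_pred, y_lo, y_hi):
    """Hypothetical score: how far [lo_pred, hi_pred] must be widened
    on each side to contain the observed bracket [y_lo, y_hi]."""
    return max(lo_pred - y_lo, y_hi - hi_pred)


def simulate(n, rng):
    """Assumed design: latent y = 2x + noise, reported only in unit-width brackets."""
    data = []
    for _ in range(n):
        x = rng.uniform(0, 1)
        y = 2 * x + rng.gauss(0, 1)
        y_lo = float(math.floor(y))
        data.append((x, y, y_lo, y_lo + 1.0))
    return data


def base_interval(x):
    """Placeholder initial interval; in practice it would be fitted on a training split."""
    return 2 * x - 1.0, 2 * x + 1.0


rng = random.Random(0)
alpha = 0.1
calib = simulate(1000, rng)
test = simulate(2000, rng)

# Calibrate: one score per calibration point, then the conformal quantile.
scores = [interval_score(*base_interval(x), y_lo, y_hi) for x, _, y_lo, y_hi in calib]
q = conformal_quantile(scores, alpha)


def prediction_interval(x):
    """Initial interval widened by q on both sides."""
    lo, hi = base_interval(x)
    return lo - q, hi + q


def covers(x, y):
    lo, hi = prediction_interval(x)
    return lo <= y <= hi


# Coverage of the latent outcome, which is available here only because the data are simulated.
coverage = sum(covers(x, y) for x, y, _, _ in test) / len(test)
print(f"q = {q:.3f}, latent coverage = {coverage:.3f}")
```

Because this score forces the whole bracket inside the prediction interval, the resulting set is valid for the latent outcome but can be wider than necessary. Reducing that slack while keeping validity is the kind of efficiency question a score tailored to interval data is meant to address.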

Suggested Citation

  • Weiguang Liu & Áureo de Paula & Elie Tamer, 2025. "Prediction Sets and Conformal Inference with Interval Outcomes," Papers 2501.10117, arXiv.org, revised Feb 2026.
  • Handle: RePEc:arx:papers:2501.10117

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2501.10117
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Lillard, Lee & Smith, James P & Welch, Finis, 1986. "What Do We Really Know about Wages? The Importance of Nonreporting and Census Imputation," Journal of Political Economy, University of Chicago Press, vol. 94(3), pages 489-506, June.
    2. Christopher R. Bollinger & Barry T. Hirsch & Charles M. Hokayem & James P. Ziliak, 2019. "Trouble in the Tails? What We Know about Earnings Nonresponse 30 Years after Lillard, Smith, and Welch," Journal of Political Economy, University of Chicago Press, vol. 127(5), pages 2143-2185.
    3. Victor Chernozhukov & Kaspar Wuthrich & Yinchu Zhu, 2019. "Distributional conformal prediction," Papers 1909.07889, arXiv.org, revised Aug 2021.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Vladimir Hlasny & Paolo Verme, 2022. "The Impact of Top Incomes Biases on the Measurement of Inequality in the United States," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 84(4), pages 749-788, August.
    2. Paolo Verme, 2025. "Predicting Poverty," Papers 2505.05958, arXiv.org.
    3. Lustig, Nora & Vigorito, Andrea, 2025. "The “Missing Rich” in Household Surveys: Causes and Correction Approaches," SocArXiv 97ng6_v1, Center for Open Science.
    4. Verme, Paolo, 2023. "Predicting Poverty with Missing Incomes," GLO Discussion Paper Series 1260, Global Labor Organization (GLO).
    5. Engelhardt, Gary V. & Purcell, Patrick J., 2021. "The minimum wage and annual earnings inequality," Economics Letters, Elsevier, vol. 207(C).
    6. James P. Ziliak & Charles Hokayem & Christopher R. Bollinger, 2022. "Trends in Earnings Volatility Using Linked Administrative and Survey Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 41(1), pages 12-19, December.
    7. Nora Lustig & Andrea Vigorito, 2025. "The "Missing Rich" in Household Surveys: Causes and Correction Approaches Extended Version with Technical Appendixes," Working Papers 2512, Tulane University, Department of Economics.
    8. Áureo de Paula & Elie Tamer & Weiguang Liu, 2025. "Prediction sets and conformal inference with censored outcomes," CeMMAP working papers 04/25, Institute for Fiscal Studies.
    9. Eggleston Jonathan, 2019. "Item Response Rates for Composite Variables," Journal of Official Statistics, Sciendo, vol. 35(2), pages 387-408, June.
    10. Katy Bergstrom & William Dodds & Nicholas Lacoste & Juan Rios, 2025. "Estimating the Welfare Cost of Labor Supply Frictions," Working Papers 2503, Tulane University, Department of Economics.
    11. Ferreira, Francisco H. G. & Brunori, Paolo, 2024. "Inherited inequality, meritocracy, and the purpose of economic growth," LSE Research Online Documents on Economics 126263, London School of Economics and Political Science, LSE Library.
    12. Klee, Mark A. & Chenevert, Rebecca L. & Wilkin, Kelly R., 2019. "Revisiting the shape of earnings nonresponse," Economics Letters, Elsevier, vol. 184(C).
    13. Ivan Fernandez-Val & Franco Peracchi & Francis Vella & Aico van Vuuren, 2019. "Decomposing Changes in the Distribution of Real Hourly Wages in the U.S," CeMMAP working papers CWP61/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    14. Gopi Shah Goda & Emilie Jackson & Lauren Hersch Nicholas & Sarah See Stith, 2023. "The impact of Covid-19 on older workers’ employment and Social Security spillovers," Journal of Population Economics, Springer;European Society for Population Economics, vol. 36(2), pages 813-846, April.
    15. James P. Ziliak, 2021. "Food Hardship during the COVID‐19 Pandemic and Great Recession," Applied Economic Perspectives and Policy, John Wiley & Sons, vol. 43(1), pages 132-152, March.
    16. Christian Awuku-Budu & Dirk van Duym, 2022. "Developing Statistics on the Distribution of State Personal Income: Methodology and Preliminary Results," BEA Working Papers 0197, Bureau of Economic Analysis.
    17. Ghosal, Rahul & Matabuena, Marcos & Ghosh, Sujit K., 2025. "Functional time transformation model with applications to digital health," Computational Statistics & Data Analysis, Elsevier, vol. 207(C).
    18. Dennis Fixler & Marina Gindelsky & David S. Johnson, 2020. "Distributing Personal Income: Trends over Time," NBER Chapters, in: Measuring Distribution and Mobility of Income and Wealth, pages 589-603, National Bureau of Economic Research, Inc.
    19. Riphahn, Regina, 1999. "Immigrant Participation in Social Assistance Programs: Evidence from German Guestworkers," CEPR Discussion Papers 2318, C.E.P.R. Discussion Papers.
    20. Mathias Silva, 2023. "Parametric models of income distributions integrating misreporting and non-response mechanisms," AMSE Working Papers 2311, Aix-Marseille School of Economics, France.
