IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2604.08798.html

Identification of Latent Group Effects under Conditional Calibration

Author

Listed:
  • Marcell T. Kurbucz

Abstract

We study identification of a structural group effect when the group indicator $G\in\{0,1\}$ is unobserved but the analyst observes a calibrated probability score $p\in[0,1]$ satisfying $\mathbb{E}[G|p,X]=p$. Under a constant-coefficient structural mean model, the latent-group coefficient $\tau$ is point-identified from the joint law of observables $(Y,X,p)$ by a simple ratio of weighted moments: the covariance of the signed score $2p-1$ with the covariate-partialled outcome, divided by twice the residual variance of the score after conditioning on covariates. Identification fails if and only if the score is a deterministic function of $X$; we establish this by constructing an explicit continuum of observationally equivalent models indexed by arbitrary values of $\tau$. The identified coefficient differs from the marginal latent mean gap by a compositional term that is unidentified without further assumptions; we give a necessary and sufficient condition for the two to coincide. The oracle estimator is $\sqrt{n}$-consistent and asymptotically normal with a closed-form sandwich variance. Under calibration error bounded uniformly by $\delta$, the bias is bounded by $|\tau|\,\mathbb{E}[|2p-1|]\,\delta\,(2V^*)^{-1}$, a bound that is sharp over all calibration error functions of that magnitude. Hard-threshold classification at $p=1/2$ attenuates the estimated gap by a factor strictly less than one. Monte Carlo experiments confirm the asymptotic theory, trace the divergence of RMSE as $V^*\to 0$, illustrate the attenuation bias of hard-threshold classification, and verify identification of the variance-weighted estimand under heterogeneous effects.

Suggested Citation

  • Marcell T. Kurbucz, 2026. "Identification of Latent Group Effects under Conditional Calibration," Papers 2604.08798, arXiv.org.
  • Handle: RePEc:arx:papers:2604.08798
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2604.08798
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Robinson, Peter M, 1988. "Root- N-Consistent Semiparametric Regression," Econometrica, Econometric Society, vol. 56(4), pages 931-954, July.
    2. Victor Chernozhukov & Denis Chetverikov & Mert Demirer & Esther Duflo & Christian Hansen & Whitney Newey & James Robins, 2018. "Double/debiased machine learning for treatment and structural parameters," Econometrics Journal, Royal Economic Society, vol. 21(1), pages 1-68, February.
    3. Kasahara, Hiroyuki & Shimotsu, Katsumi, 2022. "Identification Of Regression Models With A Misclassified And Endogenous Binary Regressor," Econometric Theory, Cambridge University Press, vol. 38(6), pages 1117-1139, December.
    4. Arthur Lewbel, 2007. "Estimation of Average Treatment Effects with Misclassification," Econometrica, Econometric Society, vol. 75(2), pages 537-551, March.
    5. Yingyao Hu & Susanne M. Schennach, 2008. "Instrumental Variable Treatment of Nonclassical Measurement Error Models," Econometrica, Econometric Society, vol. 76(1), pages 195-216, January.
    6. Aprajit Mahajan, 2006. "Identification and Estimation of Regression Models with Misclassification," Econometrica, Econometric Society, vol. 74(3), pages 631-665, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. DiTraglia, Francis J. & García-Jimeno, Camilo, 2019. "Identifying the effect of a mis-classified, binary, endogenous regressor," Journal of Econometrics, Elsevier, vol. 209(2), pages 376-390.
    2. Francis J. DiTraglia & Camilo García-Jimeno, 2017. "Mis-classified, Binary, Endogenous Regressors: Identification and Inference," NBER Working Papers 23814, National Bureau of Economic Research, Inc.
    3. Jochmans, Koen & Henry, Marc & Salanié, Bernard, 2017. "Inference On Two-Component Mixtures Under Tail Restrictions," Econometric Theory, Cambridge University Press, vol. 33(3), pages 610-635, June.
    4. Lorenzo Almada & Ian McCarthy & Rusty Tchernis, 2016. "What Can We Learn about the Effects of Food Stamps on Obesity in the Presence of Misreporting?," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 98(4), pages 997-1017.
    5. Song, Suyong, 2015. "Semiparametric estimation of models with conditional moment restrictions in the presence of nonclassical measurement errors," Journal of Econometrics, Elsevier, vol. 185(1), pages 95-109.
    6. Hu, Yingyao, 2017. "The econometrics of unobservables: Applications of measurement error models in empirical industrial organization and labor economics," Journal of Econometrics, Elsevier, vol. 200(2), pages 154-168.
    7. Takahide Yanagi, 2019. "Inference on local average treatment effects for misclassified treatment," Econometric Reviews, Taylor & Francis Journals, vol. 38(8), pages 938-960, September.
    8. Acerenza, Santiago & Ban, Kyunghoon & Kedagni, Desire, 2021. "Marginal Treatment Effects with Misclassified Treatment," ISU General Staff Papers 202106180700001132, Iowa State University, Department of Economics.
    9. Francis DiTraglia & Camilo Garcia-Jimeno, 2015. "On Mis-measured Binary Regressors: New Results And Some Comments on the Literature, Second Version," PIER Working Paper Archive 15-039, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania, revised 11 Nov 2015.
    10. Francis DiTraglia & Camilo Garcia-Jimeno, 2015. "On Mis-measured Binary Regressors: New Results And Some Comments on the Literature, Third Version," PIER Working Paper Archive 15-040, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania, revised 24 Nov 2015.
    11. Anish Agarwal & Rahul Singh, 2021. "Causal Inference with Corrupted Data: Measurement Error, Missing Values, Discretization, and Differential Privacy," Papers 2107.02780, arXiv.org, revised Feb 2024.
    12. Wossen, Tesfamicheal & Alene, Arega & Abdoulaye, Tahirou & Feleke, Shiferaw & Manyong, Victor, 2019. "Agricultural technology adoption and household welfare: Measurement and evidence," Food Policy, Elsevier, vol. 87(C), pages 1-1.
    13. Shiu, Ji-Liang, 2016. "Identification and estimation of endogenous selection models in the presence of misclassification errors," Economic Modelling, Elsevier, vol. 52(PB), pages 507-518.
    14. Hu, Yingyao & Sasaki, Yuya, 2015. "Closed-form estimation of nonparametric models with non-classical measurement errors," Journal of Econometrics, Elsevier, vol. 185(2), pages 392-408.
    15. Lina Zhang, 2020. "Spillovers of Program Benefits with Missing Network Links," Papers 2009.09614, arXiv.org, revised Aug 2024.
    16. Denni Tommasi & Lina Zhang, 2024. "Identifying program benefits when participation is misreported," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(6), pages 1123-1148, September.
    17. Nguimkeu, Pierre & Denteh, Augustine & Tchernis, Rusty, 2019. "On the estimation of treatment effects with endogenous misreporting," Journal of Econometrics, Elsevier, vol. 208(2), pages 487-506.
    18. Kyle Colangelo & Ying-Ying Lee, 2019. "Double debiased machine learning nonparametric inference with continuous treatments," CeMMAP working papers CWP72/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    19. Santiago Acerenza & Kyunghoon Ban & D'esir'e K'edagni, 2021. "Local Average and Marginal Treatment Effects with a Misclassified Treatment," Papers 2105.00358, arXiv.org, revised Sep 2024.
    20. Xinkun Nie & Stefan Wager, 2017. "Quasi-Oracle Estimation of Heterogeneous Treatment Effects," Papers 1712.04912, arXiv.org, revised Aug 2020.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2604.08798. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.