IDEAS home Printed from https://ideas.repec.org/p/iza/izadps/dp14404.html

Finite Mixture Models for Linked Survey and Administrative Data: Estimation and Post-estimation

Author

Listed:
  • Jenkins, Stephen P.

    (London School of Economics)

  • Rios-Avila, Fernando

    (Levy Economics Institute)

Abstract

Researchers use finite mixture models to analyze linked survey and administrative data on labour earnings (or similar variables), taking account of various types of measurement error in each data source. Different combinations of error-ridden and/or error-free observations characterize latent classes. Latent class probabilities depend on the probabilities of the different types of error. We introduce a set of Stata commands to fit a general class of finite mixture models to fit to linked survey-administrative data We also provide post-estimation commands for assessment of reliability, marginal effects, data simulation, and prediction of hybrid earnings variables that combine information from both data sources.

Suggested Citation

  • Jenkins, Stephen P. & Rios-Avila, Fernando, 2021. "Finite Mixture Models for Linked Survey and Administrative Data: Estimation and Post-estimation," IZA Discussion Papers 14404, IZA Network @ LISER.
  • Handle: RePEc:iza:izadps:dp14404
    as

    Download full text from publisher

    File URL: https://docs.iza.org/dp14404.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Jenkins, Stephen P. & Rios-Avila, Fernando, 2021. "Reconciling Reports: Modelling Employment Earnings and Measurement Errors Using Linked Survey and Administrative Data," IZA Discussion Papers 14405, IZA Network @ LISER.
    2. Jenkins, Stephen P. & Rios-Avila, Fernando, 2020. "Modelling errors in survey and administrative data on employment earnings: Sensitivity to the fraction assumed to have error-free earnings," Economics Letters, Elsevier, vol. 192(C).
    3. John M. Abowd & Martha H. Stinson, 2013. "Estimating Measurement Error in Annual Job Earnings: A Comparison of Survey and Administrative Data," The Review of Economics and Statistics, MIT Press, vol. 95(5), pages 1451-1467, December.
    4. Dean R. Hyslop & Wilbur Townsend, 2020. "Earnings Dynamics and Measurement Error in Matched Survey and Administrative Data," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 38(2), pages 457-469, April.
    5. repec:taf:jnlbes:v:30:y:2012:i:2:p:191-201 is not listed on IDEAS
    6. Stephen P. Jenkins & Fernando Rios‐Avila, 2021. "Measurement error in earnings data: Replication of Meijer, Rohwedder, and Wansbeek's mixture model approach to combining survey and register data," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 36(4), pages 474-483, June.
    7. Arie Kapteyn & Jelmer Y. Ypma, 2007. "Measurement Error and Misclassification: A Comparison of Survey and Administrative Data," Journal of Labor Economics, University of Chicago Press, vol. 25(3), pages 513-551.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Stella Martin, 2025. "We Might Both Be Wrong - Reconciliation of Survey and Administrative Earnings Measurements," CQE Working Papers 11025, Center for Quantitative Economics (CQE), University of Muenster.
    2. Apostolos Davillas & Victor Hugo Oliveira & Andrew M. Jones, 2024. "A model of errors in BMI based on self-reported and measured anthropometrics with evidence from Brazilian data," Empirical Economics, Springer, vol. 67(5), pages 2371-2410, November.
    3. Jenkins, Stephen P. & Rios-Avila, Fernando, 2021. "Reconciling Reports: Modelling Employment Earnings and Measurement Errors Using Linked Survey and Administrative Data," IZA Discussion Papers 14405, IZA Network @ LISER.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jenkins, Stephen P. & Rios-Avila, Fernando, 2021. "Reconciling Reports: Modelling Employment Earnings and Measurement Errors Using Linked Survey and Administrative Data," IZA Discussion Papers 14405, IZA Network @ LISER.
    2. Nico Thurow, 2025. "Characterizing Measurement Error in the German Socio-Economic Panel Using Linked Survey and Administrative Data," Papers 2501.03015, arXiv.org, revised Aug 2025.
    3. Apostolos Davillas & Victor Hugo Oliveira & Andrew M. Jones, 2024. "A model of errors in BMI based on self-reported and measured anthropometrics with evidence from Brazilian data," Empirical Economics, Springer, vol. 67(5), pages 2371-2410, November.
    4. Stella Martin, 2025. "We Might Both Be Wrong - Reconciliation of Survey and Administrative Earnings Measurements," CQE Working Papers 11025, Center for Quantitative Economics (CQE), University of Muenster.
    5. Emmanuel Flachaire & Nora Lustig & Andrea Vigorito, 2023. "Underreporting of Top Incomes and Inequality: A Comparison of Correction Methods using Simulations and Linked Survey and Tax Data," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 69(4), pages 1033-1059, December.
    6. Schmillen, Achim & Umkehrer, Matthias & Wachter, Till von, 2024. "Measurement error in longitudinal earnings data: evidence from Germany," Journal for Labour Market Research, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany], vol. 58, pages 1-008.
    7. Katy Bergstrom & William Dodds & Nicholas Lacoste & Juan Rios, 2025. "Estimating the Welfare Cost of Labor Supply Frictions," Working Papers 2503, Tulane University, Department of Economics.
    8. Ferreira, Francisco H. G. & Brunori, Paolo, 2024. "Inherited inequality, meritocracy, and the purpose of economic growth," LSE Research Online Documents on Economics 126263, London School of Economics and Political Science, LSE Library.
    9. Nora Lustig & Andrea Vigorito, 2025. "The "Missing Rich" in Household Surveys: Causes and Correction Approaches Extended Version with Technical Appendixes," Working Papers 2512, Tulane University, Department of Economics.
    10. Jenkins, Stephen P. & Rios-Avila, Fernando, 2020. "Modelling errors in survey and administrative data on employment earnings: Sensitivity to the fraction assumed to have error-free earnings," Economics Letters, Elsevier, vol. 192(C).
    11. Hyslop, Dean R. & Townsend, Wilbur, 2017. "Employment misclassification in survey and administrative reports," Economics Letters, Elsevier, vol. 155(C), pages 19-23.
    12. Madeira, Carlos & Margaretic, Paula, 2022. "The impact of financial literacy on the quality of self-reported financial information," Journal of Behavioral and Experimental Finance, Elsevier, vol. 34(C).
    13. Lustig, Nora & Vigorito, Andrea, 2025. "The “Missing Rich” in Household Surveys: Causes and Correction Approaches," SocArXiv 97ng6_v1, Center for Open Science.
    14. R. Bollinger, Christopher & Valentinova Tasseva, Iva, 2022. "Income source confusion using the SILC," ISER Working Paper Series 2022-04, Institute for Social and Economic Research.
    15. Tullio, Federico, 2025. "Dynamics and measurement error in household income data collected with single questions," MPRA Paper 124151, University Library of Munich, Germany.
    16. Daniel L. Millimet & Christopher F. Parmeter, 2025. "The impact of measurement error on trends in earnings inequality in the USA," Empirical Economics, Springer, vol. 69(5), pages 2727-2753, November.
    17. Crossley, Thomas F. & Fisher, Paul & Hussein, Omar, 2023. "Assessing data from summary questions about earnings and income," Labour Economics, Elsevier, vol. 81(C).
    18. Zachary H. Seeskin, 2016. "Evaluating the Use of Commercial Data to Improve Survey Estimates of Property Taxes," CARRA Working Papers 2016-06, Center for Economic Studies, U.S. Census Bureau.
    19. Michele Lalla & Patrizio Frederic & Daniela Mantovani, 2022. "The inextricable association of measurement errors and tax evasion as examined through a microanalysis of survey data matched with fiscal data: a case study," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 31(5), pages 1375-1401, December.
    20. Quinn Moore & Irma Perez-Johnson & Robert Santillano, 2018. "Decomposing Differences in Impacts on Survey- and Administrative-Measured Earnings From a Job Training Voucher Experiment," Evaluation Review, , vol. 42(5-6), pages 515-549, October.

    More about this item

    Keywords

    ;
    ;
    ;

    JEL classification:

    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • C83 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Survey Methods; Sampling Methods
    • D31 - Microeconomics - - Distribution - - - Personal Income and Wealth Distribution

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:iza:izadps:dp14404. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Mark Fallak (email available below). General contact details of provider: https://edirc.repec.org/data/izaaalu.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.