
A model‐based approach to predict employee compensation components

Author

Listed:
  • Andreea L. Erciulescu
  • Jean D. Opsomer

Abstract

The demand for official statistics at fine levels of detail is motivating researchers to explore estimation methods that extend beyond traditional survey‐based estimation. For this work, the challenge originated with the US Bureau of Labor Statistics, which conducts the National Compensation Survey to collect compensation data from a nationwide sample of establishments. The objective is to obtain predictions of the wage and non‐wage components of compensation for a large number of employment domains defined by detailed job characteristics. Survey estimates are available for only a small subset of these domains. To address the objective, we developed a bivariate hierarchical Bayes model that jointly predicts the wage and non‐wage compensation components for these domains. We also discuss solutions to some practical challenges encountered in implementing small area estimation methods in large‐scale settings, including methods for defining the prediction space, for constructing and selecting the information that serves as model input, and for obtaining stable survey variance and covariance estimates.

Suggested Citation

  • Andreea L. Erciulescu & Jean D. Opsomer, 2022. "A model‐based approach to predict employee compensation components," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), pages 1503-1520, November.
  • Handle: RePEc:bla:jorssc:v:71:y:2022:i:5:p:1503-1520
    DOI: 10.1111/rssc.12587

    Download full text from publisher

    File URL: https://doi.org/10.1111/rssc.12587
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Harm Jan Boonstra & Jan A. Van Den Brakel & Bart Buelens & Sabine Krieg & Marc Smeets, 2008. "Towards small area estimation at Statistics Netherlands," Metron - International Journal of Statistics, Dipartimento di Statistica, Probabilità e Statistiche Applicate - University of Rome, vol. 0(1), pages 21-49.
    2. Benedicte Sjo Tislevoll & Monica Hellesøy & Oda Helen Eck Fagerholt & Stein-Erik Gullaksen & Aashish Srivastava & Even Birkeland & Dimitrios Kleftogiannis & Pilar Ayuda-Durán & Laure Piechaczyk & Dagi, 2023. "Early response evaluation by single cell signaling profiling in acute myeloid leukemia," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    3. Matthew F Dixon, 2017. "A High Frequency Trade Execution Model for Supervised Learning," Papers 1710.03870, arXiv.org, revised Dec 2017.
    4. Lixia Diao & David D. Smith & Gauri Sankar Datta & Tapabrata Maiti & Jean D. Opsomer, 2014. "Accurate Confidence Interval Estimation of Small Area Parameters Under the Fay–Herriot Model," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 41(2), pages 497-515, June.
    5. Zhixuan Fu & Shuangge Ma & Haiqun Lin & Chirag R. Parikh & Bingqing Zhou, 2017. "Penalized Variable Selection for Multi-center Competing Risks Data," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(2), pages 379-405, December.
    6. Andreas Groll & Gerhard Tutz, 2017. "Variable selection in discrete survival models including heterogeneity," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(2), pages 305-338, April.
    7. Yoshimori, Masayo & Lahiri, Partha, 2014. "A new adjusted maximum likelihood method for the Fay–Herriot small area model," Journal of Multivariate Analysis, Elsevier, vol. 124(C), pages 281-294.
    8. M. D. Ugarte & A. F. Militino & T. Goicoa, 2008. "Adjusting economic estimates in business surveys," Journal of Applied Statistics, Taylor & Francis Journals, vol. 35(11), pages 1253-1265.
    9. G. Datta & M. Ghosh & R. Steorts & J. Maples, 2011. "Bayesian benchmarking with applications to small area estimation," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 20(3), pages 574-588, November.
    10. Matthew F Dixon, 2017. "Sequence Classification of the Limit Order Book using Recurrent Neural Networks," Papers 1707.05642, arXiv.org.
    11. Angelo Moretti, 2023. "Estimation of small area proportions under a bivariate logistic mixed model," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(4), pages 3663-3684, August.
    12. Katarzyna Reluga & María‐José Lombardía & Stefan Sperlich, 2023. "Simultaneous inference for linear mixed model parameters with an application to small area estimation," International Statistical Review, International Statistical Institute, vol. 91(2), pages 193-217, August.
    13. Chakraborty Adrijo & Datta Gauri Sankar & Mandal Abhyuday, 2016. "A Two-Component Normal Mixture Alternative to the Fay-Herriot Model," Statistics in Transition New Series, Polish Statistical Association, vol. 17(1), pages 67-90, March.
    14. Esteban, M.D. & Morales, D. & Pérez, A. & Santamaría, L., 2012. "Small area estimation of poverty proportions under area-level time models," Computational Statistics & Data Analysis, Elsevier, vol. 56(10), pages 2840-2855.
    15. Jie Xiong & Zhitong Bing & Yanlin Su & Defeng Deng & Xiaoning Peng, 2014. "An Integrated mRNA and microRNA Expression Signature for Glioblastoma Multiforme Prognosis," PLOS ONE, Public Library of Science, vol. 9(5), pages 1-8, May.
    16. Maria Rosaria Ferrante & Silvia Pacei, 2017. "Small domain estimation of business statistics by using multivariate skew normal models," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 180(4), pages 1057-1088, October.
    17. Dehnel Grażyna & Wawrowski Łukasz, 2020. "Robust estimation of wages in small enterprises: the application to Poland’s districts," Statistics in Transition New Series, Polish Statistical Association, vol. 21(1), pages 137-157, March.
    18. Paul A. Smith & Chiara Bocci & Nikos Tzavidis & Sabine Krieg & Marc J. E. Smeets, 2021. "Robust estimation for small domains in business surveys," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(2), pages 312-334, March.
    19. Torabi, Mahmoud & Rao, J.N.K., 2014. "On small area estimation under a sub-area level model," Journal of Multivariate Analysis, Elsevier, vol. 127(C), pages 36-55.
    20. Liao Zhu & Robert A. Jarrow & Martin T. Wells, 2021. "Time-Invariance Coefficients Tests with the Adaptive Multi-Factor Model," Quarterly Journal of Finance (QJF), World Scientific Publishing Co. Pte. Ltd., vol. 11(04), pages 1-30, December.
