IDEAS home Printed from https://ideas.repec.org/a/taf/jnlasa/v112y2017i517p24-36.html
   My bibliography  Save this article

A Directional Mixed Effects Model for Compositional Expenditure Data

Author

Listed:
  • J. L. Scealy
  • A. H. Welsh

Abstract

Compositional data are vectors of proportions defined on the unit simplex and this type of constrained data occur frequently in Government surveys. It is also possible for the compositional data to be correlated due to the clustering or grouping of the observations within small domains or areas. We propose a new class of the mixed model for compositional data based on the Kent distribution for directional data, where the random effects also have Kent distributions. One useful property of the new directional mixed model is that the marginal mean direction has a closed form and is interpretable. The random effects enter the model in a multiplicative way via the product of a set of rotation matrices and the conditional mean direction is a random rotation of the marginal mean direction. In small area estimation settings, the mean proportions are usually of primary interest and these are shown to be simple functions of the marginal mean direction. For estimation, we apply a quasi-likelihood method which results in solving a new set of generalized estimating equations and these are shown to have low bias in typical situations. For inference, we use a nonparametric bootstrap method for clustered data which does not rely on estimates of the shape parameters (shape parameters are difficult to estimate in Kent models). We analyze data from the 2009–2010 Australian Household Expenditure Survey CURF (confidentialized unit record file). We predict the proportions of total weekly expenditure on food and housing costs for households in a chosen set of domains. The new approach is shown to be more tractable than the traditional approach based on the logratio transformation.

Suggested Citation

  • J. L. Scealy & A. H. Welsh, 2017. "A Directional Mixed Effects Model for Compositional Expenditure Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(517), pages 24-36, January.
  • Handle: RePEc:taf:jnlasa:v:112:y:2017:i:517:p:24-36
    DOI: 10.1080/01621459.2016.1189336
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/01621459.2016.1189336
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/01621459.2016.1189336?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Samanta, Mayukh & Welsh, A.H., 2013. "Bootstrapping for highly unbalanced clustered data," Computational Statistics & Data Analysis, Elsevier, vol. 59(C), pages 70-81.
    2. Isabel Molina & Ayoub Saei & M. José Lombardía, 2007. "Small area estimates of labour force participation under a multinomial logit mixed model," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 170(4), pages 975-1000, October.
    3. J. L. Scealy & A. H. Welsh, 2011. "Regression for compositional data by using distributions defined on the hypersphere," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 73(3), pages 351-375, June.
    4. Field, C. A. & Pang, Zhen & Welsh, A. H., 2010. "Bootstrapping Robust Estimates for Clustered Data," Journal of the American Statistical Association, American Statistical Association, vol. 105(492), pages 1606-1616.
    5. Militino, A.F. & Goicoa, T. & Ugarte, M.D., 2012. "Estimating the percentage of food expenditure in small areas using bias-corrected P-spline based estimators," Computational Statistics & Data Analysis, Elsevier, vol. 56(10), pages 2934-2948.
    6. J. L. Scealy & Patrice de Caritat & Eric C. Grunsky & Michail T. Tsagris & A. H. Welsh, 2015. "Robust Principal Component Analysis for Power Transformed Compositional Data," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 136-148, March.
    7. Jiming Jiang & P. Lahiri, 2006. "Mixed model prediction and small area estimation," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 15(1), pages 1-96, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Janice L. Scealy & David Heslop & Jia Liu & Andrew T. A. Wood, 2022. "Directions Old and New: Palaeomagnetism and Fisher (1953) Meet Modern Statistics," International Statistical Review, International Statistical Institute, vol. 90(2), pages 237-258, August.
    2. María Dolores Esteban & María José Lombardía & Esther López-Vizcaíno & Domingo Morales & Agustín Pérez, 2020. "Small area estimation of proportions under area-level compositional mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(3), pages 793-818, September.
    3. Arthur Pewsey & Eduardo García-Portugués, 2021. "Recent advances in directional statistics," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(1), pages 1-58, March.
    4. Shang, Han Lin & Haberman, Steven & Xu, Ruofan, 2022. "Multi-population modelling and forecasting life-table death counts," Insurance: Mathematics and Economics, Elsevier, vol. 106(C), pages 239-253.
    5. Joscha Krause & Jan Pablo Burgard & Domingo Morales, 2022. "Robust prediction of domain compositions from uncertain data using isometric logratio transformations in a penalized multivariate Fay–Herriot model," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 76(1), pages 65-96, February.
    6. Dawber James & Smith Paul A. & Tzavidis Nikos & Würz Nora & Flower Tanya & Thomas Heledd & Schmid Timo, 2022. "Experimental UK Regional Consumer Price Inflation with Model-Based Expenditure Weights," Journal of Official Statistics, Sciendo, vol. 38(1), pages 213-237, March.
    7. María Dolores Esteban & María José Lombardía & Esther López-Vizcaíno & Domingo Morales & Agustín Pérez, 2023. "Small area estimation of average compositions under multivariate nested error regression models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 32(2), pages 651-676, June.
    8. María Dolores Esteban & María José Lombardía & Esther López‐Vizcaíno & Domingo Morales & Agustín Pérez, 2022. "Empirical best prediction of small area bivariate parameters," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(4), pages 1699-1727, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Angelo Moretti, 2023. "Estimation of small area proportions under a bivariate logistic mixed model," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(4), pages 3663-3684, August.
    2. O’Shaughnessy, P.Y. & Welsh, A.H., 2018. "Bootstrapping longitudinal data with multiple levels of variation," Computational Statistics & Data Analysis, Elsevier, vol. 124(C), pages 117-131.
    3. Shonosuke Sugasawa & Tatsuya Kubokawa & J. N. K. Rao, 2018. "Small area estimation via unmatched sampling and linking models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(2), pages 407-427, June.
    4. M. Giovanna Ranalli & Giorgio E. Montanari & Cecilia Vicarelli, 2018. "Estimation of small area counts with the benchmarking property," METRON, Springer;Sapienza Università di Roma, vol. 76(3), pages 349-378, December.
    5. K. Shuvo Bakar & Nicholas Biddle & Philip Kokic & Huidong Jin, 2020. "A Bayesian spatial categorical model for prediction to overlapping geographical areas in sample surveys," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(2), pages 535-563, February.
    6. Alfredo A. Romero, 2014. "Where do Moderation Terms Come from in Binary Choice Models?," Central European Journal of Economic Modelling and Econometrics, Central European Journal of Economic Modelling and Econometrics, vol. 6(1), pages 57-68, March.
    7. J. N. K. Rao, 2015. "Inferential issues in model-based small area estimation: some new developments," Statistics in Transition new series, Główny Urząd Statystyczny (Polska), vol. 16(4), pages 491-510, December.
    8. Jan Pablo Burgard & Ralf Münnich, 2015. "Sae Teaching Using Simulations," Statistics in Transition New Series, Polish Statistical Association, vol. 16(4), pages 603-610, December.
    9. María Dolores Esteban & María José Lombardía & Esther López-Vizcaíno & Domingo Morales & Agustín Pérez, 2020. "Small area estimation of proportions under area-level compositional mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(3), pages 793-818, September.
    10. Xiongtao Dai & Zhenhua Lin & Hans‐Georg Müller, 2021. "Modeling sparse longitudinal data on Riemannian manifolds," Biometrics, The International Biometric Society, vol. 77(4), pages 1328-1341, December.
    11. Sedeño-Noda, A. & González-Dávila, E. & González-Martín, C. & González-Yanes, A., 2009. "Preemptive benchmarking problem: An approach for official statistics in small areas," European Journal of Operational Research, Elsevier, vol. 196(1), pages 360-369, July.
    12. Johannes Gräb & Michael Grimm, 2008. "Spatial inequalities explained - Evidence from Burkina Faso," Ibero America Institute for Econ. Research (IAI) Discussion Papers 173, Ibero-America Institute for Economic Research.
    13. N. Salvati & N. Tzavidis & M. Pratesi & R. Chambers, 2012. "Small area estimation via M-quantile geographically weighted regression," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 21(1), pages 1-28, March.
    14. Molina, Isabel, 2006. "Uncertainty under a multivariate nested-error regression model with logarithmic transformation," DES - Working Papers. Statistics and Econometrics. WS ws066117, Universidad Carlos III de Madrid. Departamento de Estadística.
    15. Valéry Dongmo Jiongo & Pierre Nguimkeu, 2018. "Bootstrapping Mean Squared Errors of Robust Small-Area Estimators: Application to the Method-of-Payments Data," Staff Working Papers 18-28, Bank of Canada.
    16. Miguel Boubeta & María José Lombardía & Domingo Morales, 2016. "Empirical best prediction under area-level Poisson mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(3), pages 548-569, September.
    17. Lixia Diao & David D. Smith & Gauri Sankar Datta & Tapabrata Maiti & Jean D. Opsomer, 2014. "Accurate Confidence Interval Estimation of Small Area Parameters Under the Fay–Herriot Model," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 41(2), pages 497-515, June.
    18. Datta, Gauri S. & Torabi, Mahmoud & Rao, J.N.K. & Liu, Benmei, 2018. "Small area estimation with multiple covariates measured with errors: A nested error linear regression approach of combining multiple surveys," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 49-59.
    19. Napoleón Vargas Jurado & Kent M. Eskridge & Stephen D. Kachman & Ronald M. Lewis, 2018. "Using a Bayesian Hierarchical Linear Mixing Model to Estimate Botanical Mixtures," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 23(2), pages 190-207, June.
    20. Francis K. C. Hui & Samuel Müller & Alan H. Welsh, 2021. "Random Effects Misspecification Can Have Severe Consequences for Random Effects Inference in Linear Mixed Models," International Statistical Review, International Statistical Institute, vol. 89(1), pages 186-206, April.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:taf:jnlasa:v:112:y:2017:i:517:p:24-36. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Longhurst (email available below). General contact details of provider: http://www.tandfonline.com/UASA20 .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.