IDEAS home Printed from https://ideas.repec.org/a/zbw/espost/218721.html
   My bibliography  Save this article

Generating reliable tourist accommodation statistics: Bootstrapping regression model for overdispersed long-tailed data

Author

Listed:
  • Van Truong, Nguyen
  • Shimizu, Tetsuo
  • Choi, Sunkyung

Abstract

Purpose: Few studies have applied count data analysis to tourist accommodation data. This study was undertaken to investigate the characteristics and to seek for the most fitting models for population total estimation in relation to tourist accommodation data. Methods: Based on the data of 10,503 hotels, obtained from by a nationwide Japanese survey, the bootstrap resampling method was applied for re-randomisation of the data. Training and test sets were derived by randomly splitting each of the bootstrap samples. Six count models were fitted to the training set and validated with the test set. Bootstrap distributions for parameters of significance were used for model evaluation. Results: The outcome variable (number of guests), was found to be heterogenous, over dispersed and long-tailed, with excessive zero counts. The hurdle negative binomial and zero-inflated negative binomial models outperformed the other models. The accuracy (se) of the estimation of total guests with training sets that ranged from 5% to 85%, was from 3.7 to 0.4 respectively. Results appear rather overestimated. Implications: Findings indicated that the integration of the bootstrap resampling method and count regression provide a statistical tool for generating reliable tourist accommodation statistics. The use of bootstrap would help to detect and correct the bias of the estimation.

Suggested Citation

  • Van Truong, Nguyen & Shimizu, Tetsuo & Choi, Sunkyung, 2020. "Generating reliable tourist accommodation statistics: Bootstrapping regression model for overdispersed long-tailed data," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 6(2), pages 30-37.
  • Handle: RePEc:zbw:espost:218721
    DOI: 10.5281/zenodo.3835847
    as

    Download full text from publisher

    File URL: https://www.econstor.eu/bitstream/10419/218721/1/6-2-4.pdf
    Download Restriction: no

    File URL: https://libkey.io/10.5281/zenodo.3835847?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Deb, Partha & Trivedi, Pravin K., 2002. "The structure of demand for health care: latent class versus two-part models," Journal of Health Economics, Elsevier, vol. 21(4), pages 601-625, July.
    2. A. Georges Assaf & Frank W. Agbola, 2011. "Modelling the Performance of Australian Hotels: A DEA Double Bootstrap Approach," Tourism Economics, , vol. 17(1), pages 73-89, February.
    3. Chou, Ming Che, 2013. "Does tourism development promote economic growth in transition countries? A panel data analysis," Economic Modelling, Elsevier, vol. 33(C), pages 226-232.
    4. Simar, Leopold & Wilson, Paul W., 2007. "Estimation and inference in two-stage, semi-parametric models of production processes," Journal of Econometrics, Elsevier, vol. 136(1), pages 31-64, January.
    5. Hausman, Jerry & Hall, Bronwyn H & Griliches, Zvi, 1984. "Econometric Models for Count Data with an Application to the Patents-R&D Relationship," Econometrica, Econometric Society, vol. 52(4), pages 909-938, July.
    6. Deb, Partha & Trivedi, Pravin K, 1997. "Demand for Medical Care by the Elderly: A Finite Mixture Approach," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 12(3), pages 313-336, May-June.
    7. Gergaud, Olivier & Livat, Florine & Song, Haiyan, 2018. "Terrorism and Wine Tourism: The Case of Museum Attendance," Journal of Wine Economics, Cambridge University Press, vol. 13(4), pages 375-383, November.
    8. Duarte, R. & Escario, J.J., 2006. "Alcohol abuse and truancy among Spanish adolescents: A count-data approach," Economics of Education Review, Elsevier, vol. 25(2), pages 179-187, April.
    9. Shrestha, Ram K. & Seidl, Andrew F. & Moraes, Andre S., 2002. "Value of recreational fishing in the Brazilian Pantanal: a travel cost analysis using count data models," Ecological Economics, Elsevier, vol. 42(1-2), pages 289-299, August.
    10. Chen, Rong & Fomby, Thomas B, 1999. "Forecasting with Stable Seasonal Pattern Models with an Application to Hawaiian Tourism Data," Journal of Business & Economic Statistics, American Statistical Association, vol. 17(4), pages 497-504, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Del Chiappa, Giacomo & Bregoli, Ilenia & Fotiadis, Anestis K., 2021. "The impact of COVID-19 on Italian accommodation: A supply-perspective," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 7(1), pages 13-22.
    2. Diunugala, Hemantha Premakumara & Mombeuil, Claudel, 2020. "Modeling and predicting foreign tourist arrivals to Sri Lanka: A comparison of three different methods," MPRA Paper 103779, University Library of Munich, Germany.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Majo, M.C. & van Soest, A.H.O., 2011. "The Fixed-Effects Zero-Inflated Poisson Model with an Application to Health Care Utilization," Other publications TiSEM 68cf0f9b-fc68-4017-97a9-a, Tilburg University, School of Economics and Management.
    2. Majo, M.C., 2010. "A microeconometric analysis of health care utilization in Europe," Other publications TiSEM 1cf5fd2f-8146-4ef8-8eb5-e, Tilburg University, School of Economics and Management.
    3. João Cotter Salvado, 2008. "The Determinants of Health Care Utilization in Portugal: An Approach with Count Data Models," Swiss Journal of Economics and Statistics (SJES), Swiss Society of Economics and Statistics (SSES), vol. 144(III), pages 437-458, September.
    4. McLeod, Logan, 2011. "A nonparametric vs. latent class model of general practitioner utilization: Evidence from Canada," Journal of Health Economics, Elsevier, vol. 30(6), pages 1261-1279.
    5. Sergi Jiménez‐Martín & José M. Labeaga & Maite Martínez‐Granado, 2002. "Latent class versus two‐part models in the demand for physician services across the European Union," Health Economics, John Wiley & Sons, Ltd., vol. 11(4), pages 301-321, June.
    6. Óscar Lourenço & Carlota Quintal & Pedro Lopes Ferreira & Pedro Pita Barros, 2007. "A equidade na utilização de cuidados de saúde em Portugal: Uma avaliação baseada em modelos de contagem," Notas Económicas, Faculty of Economics, University of Coimbra, issue 25, pages 6-26, June.
    7. Prayaga, Prabha, 2017. "Estimating the value of beach recreation for locals in the Great Barrier Reef Marine Park, Australia," Economic Analysis and Policy, Elsevier, vol. 53(C), pages 9-18.
    8. Brown, Sarah & Greene, William H. & Harris, Mark N. & Taylor, Karl, 2015. "An inverse hyperbolic sine heteroskedastic latent class panel tobit model: An application to modelling charitable donations," Economic Modelling, Elsevier, vol. 50(C), pages 228-236.
    9. Erik Schokkaert & Tom Van Ourti & Diana De Graeve & Ann Lecluyse & Carine Van de Voorde, 2010. "Supplemental health insurance and equality of access in Belgium," Health Economics, John Wiley & Sons, Ltd., vol. 19(4), pages 377-395, April.
    10. Hsin‐Fan Chen & Sheng‐Hung Chen & Jie‐Min Lee & Huei‐Yann Jeng, 2010. "Who Are the Potential Smokers of Smuggled Cigarettes?," Asian Economic Journal, East Asian Economic Association, vol. 24(3), pages 221-234, September.
    11. Daniele Fabbri & Chiara Monfardini, 2016. "Opt Out or Top Up? Voluntary Health Care Insurance and the Public vs. Private Substitution," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 78(1), pages 75-93, February.
    12. Jones, A.M, 2010. "Models For Health Care," Health, Econometrics and Data Group (HEDG) Working Papers 10/01, HEDG, c/o Department of Economics, University of York.
    13. Nabanita Datta Gupta & Jane Greve, 2011. "Overweight and obesity and the utilization of primary care physicians," Health Economics, John Wiley & Sons, Ltd., vol. 20(S1), pages 53-67, September.
    14. Morakinyo O Adetutu & Kayode A Odusanya & Eleni Stathopoulou & Thomas G Weyman-Jones, 2023. "Environmental regulation, taxes, and activism," Oxford Economic Papers, Oxford University Press, vol. 75(2), pages 460-489.
    15. Jie Q. Guo & Pravin K. Trivedi, 2002. "Flexible Parametric Models for Long‐tailed Patent Count Distributions," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 64(1), pages 63-82, February.
    16. Fabbri, Daniele & Monfardini, Chiara, 2009. "Rationing the public provision of healthcare in the presence of private supplements: Evidence from the Italian NHS," Journal of Health Economics, Elsevier, vol. 28(2), pages 290-304, March.
    17. Partha Deb & Furio Rosati, 2002. "Determinants of Child Labor and School Attendance: The Role of Household Unobservables," Economics Working Paper Archive at Hunter College 02/9, Hunter College Department of Economics.
    18. Alegre, Joaquín & Mateo, Sara & Pou, Llorenç, 2011. "A latent class approach to tourists’ length of stay," Tourism Management, Elsevier, vol. 32(3), pages 555-563.
    19. Gómez Vega, Mafalda & Herrero Prieto, Luis César, 2017. "Determinantes de la eficiencia en la captación de turismo cultural nacional y extranjero en España: Un análisis regional/Drivers of Efficiency in Attracting Domestic and Foreign Cultural Tourism to Sp," Estudios de Economia Aplicada, Estudios de Economia Aplicada, vol. 35, pages 849-872, Agosto.
    20. Stefanie Schurer, 2008. "Discrete Heterogeneity in the Impact of Health Shocks on Labour Market Outcomes," Ruhr Economic Papers 0071, Rheinisch-Westfälisches Institut für Wirtschaftsforschung, Ruhr-Universität Bochum, Universität Dortmund, Universität Duisburg-Essen.

    More about this item

    Keywords

    tourism statistics; bootstrap; econometrics; over dispersed data; zero-inflated data;
    All these keywords.

    JEL classification:

    • C4 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics
    • L8 - Industrial Organization - - Industry Studies: Services
    • C24 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Truncated and Censored Models; Switching Regression Models; Threshold Regression Models
    • Z3 - Other Special Topics - - Tourism Economics

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:zbw:espost:218721. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ZBW - Leibniz Information Centre for Economics (email available below). General contact details of provider: https://edirc.repec.org/data/zbwkide.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.