
Modelling tick bite risk by combining random forests and count data regression models

Authors

  • Irene Garcia-Marti
  • Raul Zurita-Milla
  • Arno Swart

Abstract

The socio-economic and demographic changes that occurred over the past 50 years have dramatically expanded urban areas around the globe, thus bringing urban settlers in closer contact with nature. Ticks have trespassed the limits of forests and grasslands to start inhabiting green spaces within metropolitan areas. Hence, the transmission of pathogens causing tick-borne diseases is an important threat to public health. Using volunteered tick bite reports collected by two Dutch initiatives, here we present a method to model tick bite risk using human exposure and tick hazard predictors. Our method represents a step forward in risk modelling, since we combine a well-known ensemble learning method, Random Forest, with four count data models of the (zero-inflated) Poisson family. This combination allows us to better model the disproportions inherent in the volunteered tick bite reports. Unlike canonical machine learning models, our method can capture the overdispersion or zero-inflation inherent in data, thus yielding tick bite risk predictions that resemble the original signal captured by volunteers. Mapping model predictions enables a visual inspection of the spatial patterns of tick bite risk in the Netherlands. The Veluwe national park and the Utrechtse Heuvelrug forest, which are large forest-urban interfaces with several cities, are areas with high tick bite risk. This is expected, since these are popular places for recreation and tick activity is high in forests. However, our model can also predict high risk in less-intensively visited recreational areas, such as the patchy forests in the northeast of the country, the natural areas along the coastline, or some of the Frisian Islands. Our model could help public health specialists to design mitigation strategies for tick-borne diseases, and to target risky areas with awareness and prevention campaigns.
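
The abstract names the zero-inflated Poisson family but gives no formulas, so the following is only a minimal sketch of the textbook zero-inflated Poisson (ZIP) distribution, written to illustrate what "zero-inflation" and "overdispersion" mean for count data such as tick bite reports. The function names and the parameter values (`lam`, `pi`) are illustrative assumptions, not taken from the paper, and the sketch does not reproduce the authors' Random Forest combination.

```python
import math


def poisson_pmf(k: int, lam: float) -> float:
    """Probability of observing k events under a plain Poisson(lam)."""
    return math.exp(-lam) * lam ** k / math.factorial(k)


def zip_pmf(k: int, lam: float, pi: float) -> float:
    """Zero-inflated Poisson pmf.

    With probability pi the count is a structural zero (e.g. a location
    where nobody reports); otherwise the count follows Poisson(lam).
    """
    base = (1.0 - pi) * poisson_pmf(k, lam)
    return pi + base if k == 0 else base


def zip_mean(lam: float, pi: float) -> float:
    """Mean of the ZIP distribution: (1 - pi) * lam."""
    return (1.0 - pi) * lam


def zip_variance(lam: float, pi: float) -> float:
    """Variance of the ZIP distribution: (1 - pi) * lam * (1 + pi * lam)."""
    return (1.0 - pi) * lam * (1.0 + pi * lam)


# Hypothetical parameters chosen only for illustration.
lam, pi = 3.0, 0.4

print("P(0) Poisson:", poisson_pmf(0, lam))
print("P(0) ZIP:    ", zip_pmf(0, lam, pi))
print("ZIP mean:    ", zip_mean(lam, pi))
print("ZIP variance:", zip_variance(lam, pi))
```

A plain Poisson model forces the variance to equal the mean. In the ZIP sketch the variance exceeds the mean whenever `pi > 0`, and the probability of a zero count is higher than Poisson alone would give, which is the kind of behaviour the abstract says canonical machine learning models fail to capture.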

Suggested Citation

  • Irene Garcia-Marti & Raul Zurita-Milla & Arno Swart, 2019. "Modelling tick bite risk by combining random forests and count data regression models," PLOS ONE, Public Library of Science, vol. 14(12), pages 1-22, December.
  • Handle: RePEc:plo:pone00:0216511
    DOI: 10.1371/journal.pone.0216511

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0216511
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0216511&type=printable
    Download Restriction: no

