IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0071218.html
   My bibliography  Save this article

Novel Three-Step Pseudo-Absence Selection Technique for Improved Species Distribution Modelling

Author

Listed:
  • Senait D Senay
  • Susan P Worner
  • Takayoshi Ikeda

Abstract

Pseudo-absence selection for spatial distribution models (SDMs) is the subject of ongoing investigation. Numerous techniques continue to be developed, and reports of their effectiveness vary. Because the quality of presence and absence data is key for acceptable accuracy of correlative SDM predictions, determining an appropriate method to characterise pseudo-absences for SDM’s is vital. The main methods that are currently used to generate pseudo-absence points are: 1) randomly generated pseudo-absence locations from background data; 2) pseudo-absence locations generated within a delimited geographical distance from recorded presence points; and 3) pseudo-absence locations selected in areas that are environmentally dissimilar from presence points. There is a need for a method that considers both geographical extent and environmental requirements to produce pseudo-absence points that are spatially and ecologically balanced. We use a novel three-step approach that satisfies both spatial and ecological reasons why the target species is likely to find a particular geo-location unsuitable. Step 1 comprises establishing a geographical extent around species presence points from which pseudo-absence points are selected based on analyses of environmental variable importance at different distances. This step gives an ecologically meaningful explanation to the spatial range of background data, as opposed to using an arbitrary radius. Step 2 determines locations that are environmentally dissimilar to the presence points within the distance specified in step one. Step 3 performs K-means clustering to reduce the number of potential pseudo-absences to the desired set by taking the centroids of clusters in the most environmentally dissimilar class identified in step 2. By considering spatial, ecological and environmental aspects, the three-step method identifies appropriate pseudo-absence points for correlative SDMs. We illustrate this method by predicting the New Zealand potential distribution of the Asian tiger mosquito (Aedes albopictus) and the Western corn rootworm (Diabrotica virgifera virgifera).

Suggested Citation

  • Senait D Senay & Susan P Worner & Takayoshi Ikeda, 2013. "Novel Three-Step Pseudo-Absence Selection Technique for Improved Species Distribution Modelling," PLOS ONE, Public Library of Science, vol. 8(8), pages 1-16, August.
  • Handle: RePEc:plo:pone00:0071218
    DOI: 10.1371/journal.pone.0071218
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0071218
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0071218&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0071218?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. VanDerWal, Jeremy & Shoo, Luke P. & Graham, Catherine & Williams, Stephen E., 2009. "Selecting pseudo-absence data for presence-only distribution modeling: How far should you stray from what you know?," Ecological Modelling, Elsevier, vol. 220(4), pages 589-594.
    2. Chefaoui, Rosa M. & Lobo, Jorge M., 2008. "Assessing the effects of pseudo-absences on predictive distribution model performance," Ecological Modelling, Elsevier, vol. 210(4), pages 478-486.
    3. Karatzoglou, Alexandros & Smola, Alexandros & Hornik, Kurt & Zeileis, Achim, 2004. "kernlab - An S4 Package for Kernel Methods in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 11(i09).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. An T. N. Dang & Lalit Kumar & Michael Reid, 2020. "Modelling the Potential Impacts of Climate Change on Rice Cultivation in Mekong Delta, Vietnam," Sustainability, MDPI, vol. 12(22), pages 1-21, November.
    2. Chunrong Mi & Liang Ma & Mengyuan Yang & Xinhai Li & Shai Meiri & Uri Roll & Oleksandra Oskyrko & Daniel Pincheira-Donoso & Lilly P. Harvey & Daniel Jablonski & Barbod Safaei-Mahroo & Hanyeh Ghaffari , 2023. "Global Protected Areas as refuges for amphibians and reptiles under climate change," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    3. Ana Cristina Mosebo Fernandes & Rebeca Quintero Gonzalez & Marie Ann Lenihan-Clarke & Ezra Francis Leslie Trotter & Jamal Jokar Arsanjani, 2020. "Machine Learning for Conservation Planning in a Changing Climate," Sustainability, MDPI, vol. 12(18), pages 1-28, September.
    4. Iturbide, Maialen & Bedia, Joaquín & Herrera, Sixto & del Hierro, Oscar & Pinto, Miriam & Gutiérrez, Jose Manuel, 2015. "A framework for species distribution modelling with improved pseudo-absence generation," Ecological Modelling, Elsevier, vol. 312(C), pages 166-174.
    5. Steen, Bart & Broennimann, Olivier & Maiorano, Luigi & Guisan, Antoine, 2024. "How sensitive are species distribution models to different background point selection strategies? A test with species at various equilibrium levels," Ecological Modelling, Elsevier, vol. 493(C).
    6. Sillero, Neftalí & Arenas-Castro, Salvador & Enriquez‐Urzelai, Urtzi & Vale, Cândida Gomes & Sousa-Guedes, Diana & Martínez-Freiría, Fernando & Real, Raimundo & Barbosa, A.Márcia, 2021. "Want to model a species niche? A step-by-step guideline on correlative ecological niche modelling," Ecological Modelling, Elsevier, vol. 456(C).
    7. Wentao Yang & Huaxi He & Dongsheng Wei & Hao Chen, 2022. "Generating pseudo-absence samples of invasive species based on outlier detection in the geographical characteristic space," Journal of Geographical Systems, Springer, vol. 24(2), pages 261-279, April.
    8. Whitford, Anna M. & Shipley, Benjamin R. & McGuire, Jenny L., 2024. "The influence of the number and distribution of background points in presence-background species distribution models," Ecological Modelling, Elsevier, vol. 488(C).
    9. Rana, Divyashree & Sartor, Caroline Charão & Chiaverini, Luca & Cushman, Samuel Alan & Kaszta, Żaneta & Ramakrishnan, Uma & Macdonald, David W., 2024. "Differentially biased sampling strategies reveal the non-stationarity of species distribution models for Indian small felids," Ecological Modelling, Elsevier, vol. 493(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Coro, Gianpaolo & Pagano, Pasquale & Ellenbroek, Anton, 2013. "Combining simulated expert knowledge with Neural Networks to produce Ecological Niche Models for Latimeria chalumnae," Ecological Modelling, Elsevier, vol. 268(C), pages 55-63.
    2. Brice B Hanberry & Hong S He & Brian J Palik, 2012. "Pseudoabsence Generation Strategies for Species Distribution Models," PLOS ONE, Public Library of Science, vol. 7(8), pages 1-12, August.
    3. Whitford, Anna M. & Shipley, Benjamin R. & McGuire, Jenny L., 2024. "The influence of the number and distribution of background points in presence-background species distribution models," Ecological Modelling, Elsevier, vol. 488(C).
    4. Hengl, Tomislav & Sierdsema, Henk & Radović, Andreja & Dilo, Arta, 2009. "Spatial prediction of species’ distributions from occurrence-only records: combining point pattern analysis, ENFA and regression-kriging," Ecological Modelling, Elsevier, vol. 220(24), pages 3499-3511.
    5. Broussin, Joséphine & Mouchet, Maud & Goberville, Eric, 2024. "Generating pseudo-absences in the ecological space improves the biological relevance of response curves in species distribution models," Ecological Modelling, Elsevier, vol. 498(C).
    6. Sillero, Neftalí & Arenas-Castro, Salvador & Enriquez‐Urzelai, Urtzi & Vale, Cândida Gomes & Sousa-Guedes, Diana & Martínez-Freiría, Fernando & Real, Raimundo & Barbosa, A.Márcia, 2021. "Want to model a species niche? A step-by-step guideline on correlative ecological niche modelling," Ecological Modelling, Elsevier, vol. 456(C).
    7. Stokland, Jogeir N. & Halvorsen, Rune & Støa, Bente, 2011. "Species distribution modelling—Effect of design and sample size of pseudo-absence observations," Ecological Modelling, Elsevier, vol. 222(11), pages 1800-1809.
    8. Tsukioka, Yasutomo & Yanagi, Junya & Takada, Teruko, 2018. "Investor sentiment extracted from internet stock message boards and IPO puzzles," International Review of Economics & Finance, Elsevier, vol. 56(C), pages 205-217.
    9. Václavík, Tomáš & Meentemeyer, Ross K., 2009. "Invasive species distribution modeling (iSDM): Are absence data and dispersal constraints needed to predict actual distributions?," Ecological Modelling, Elsevier, vol. 220(23), pages 3248-3258.
    10. Iturbide, Maialen & Bedia, Joaquín & Herrera, Sixto & del Hierro, Oscar & Pinto, Miriam & Gutiérrez, Jose Manuel, 2015. "A framework for species distribution modelling with improved pseudo-absence generation," Ecological Modelling, Elsevier, vol. 312(C), pages 166-174.
    11. Jiang, Dong & Wang, Qian & Ding, Fangyu & Fu, Jingying & Hao, Mengmeng, 2019. "Potential marginal land resources of cassava worldwide: A data-driven analysis," Renewable and Sustainable Energy Reviews, Elsevier, vol. 104(C), pages 167-173.
    12. Marchetto, Elisa & Da Re, Daniele & Tordoni, Enrico & Bazzichetto, Manuele & Zannini, Piero & Celebrin, Simone & Chieffallo, Ludovico & Malavasi, Marco & Rocchini, Duccio, 2023. "Testing the effect of sample prevalence and sampling methods on probability- and favourability-based SDMs," Ecological Modelling, Elsevier, vol. 477(C).
    13. Rana, Divyashree & Sartor, Caroline Charão & Chiaverini, Luca & Cushman, Samuel Alan & Kaszta, Żaneta & Ramakrishnan, Uma & Macdonald, David W., 2024. "Differentially biased sampling strategies reveal the non-stationarity of species distribution models for Indian small felids," Ecological Modelling, Elsevier, vol. 493(C).
    14. Andrea S Martinez-Vernon & James A Covington & Ramesh P Arasaradnam & Siavash Esfahani & Nicola O’Connell & Ioannis Kyrou & Richard S Savage, 2018. "An improved machine learning pipeline for urinary volatiles disease detection: Diagnosing diabetes," PLOS ONE, Public Library of Science, vol. 13(9), pages 1-20, September.
    15. Madhumita Sahoo & Aman Kasot & Anirban Dhar & Amlanjyoti Kar, 2018. "On Predictability of Groundwater Level in Shallow Wells Using Satellite Observations," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 32(4), pages 1225-1244, March.
    16. Polina Bombina & Dwayne Tally & Zachary B Abrams & Kevin R Coombes, 2024. "SillyPutty: Improved clustering by optimizing the silhouette width," PLOS ONE, Public Library of Science, vol. 19(6), pages 1-17, June.
    17. P. J. Zarco-Tejada & T. Poblete & C. Camino & V. Gonzalez-Dugo & R. Calderon & A. Hornero & R. Hernandez-Clemente & M. Román-Écija & M. P. Velasco-Amo & B. B. Landa & P. S. A. Beck & M. Saponari & D. , 2021. "Divergent abiotic spectral pathways unravel pathogen stress signals across species," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    18. Grubinger, Thomas & Zeileis, Achim & Pfeiffer, Karl-Peter, 2014. "evtree: Evolutionary Learning of Globally Optimal Classification and Regression Trees in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 61(i01).
    19. Uwe Ligges & Sebastian Krey, 2011. "Feature clustering for instrument classification," Computational Statistics, Springer, vol. 26(2), pages 279-291, June.
    20. Arnout Van Messem & Andreas Christmann, 2010. "A review on consistency and robustness properties of support vector machines for heavy-tailed distributions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 4(2), pages 199-220, September.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0071218. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.