IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0326122.html
   My bibliography  Save this article

A novel hybrid model for species distribution prediction using probabilistic random forest, principal component analysis and genetic algorithm

Author

Listed:
  • Taiwo A Adekunle
  • Ibrahim K Ogundoyin
  • Caleb O Akanbi

Abstract

Probabilistic Random Forest is an extension of the traditional Random Forest machine learning algorithm that is one of the frequently used machine learning algorithms employed for species distribution modeling. However, with the use of complex dataset for predicting the presence or absence of the species, It is essential that feature extraction is important to generate optimal prediction that can affect the model accuracy and AUC score of the model simulation. In this paper, we integrated the Genetic Algorithm Optimization technique, which is popular for its excellent feature extraction technique, to enhance the predictive performance of the PRF Model. a novel hybrid algorithm the genetically optimized probabilistic random forest algorithm, designed for predicting the distribution of mastomys natalensis in Nigeria. The model was also compared with existing models for dimensionality reduction with other optimization techniques, such as Principal Component Analysis, Grey Wolf, Optimizer optimized backpropagation neural network algorithm (GNNA), Butterfly Optimization Algorithm. These models were evaluated using four performance metrics, accuracy, the areas under curve, sensitivity, specificity, F1_score and precision. We also examined the spatial predictive distribution of the models. The results generated that the predictive performance of PRFGA, significantly improved compared to PRFPCA, GNNA and PRFBOA in predicting the presence or absence of mastomys natalensis with a presence only and pseudo-absence sample set. the PRFGA demonstrated a high predictive power in predicting the spatial distribution of the presence or absence of mastomys natalensis in Nigeria. The integration of the Genetic Algorithm optimization technique, stems from its renowned ability to address the specific challenges of data uncertainty and high-dimensionality reduction in feature extraction sets of SDMs, to enhance the performance of the PRF model.

Suggested Citation

  • Taiwo A Adekunle & Ibrahim K Ogundoyin & Caleb O Akanbi, 2025. "A novel hybrid model for species distribution prediction using probabilistic random forest, principal component analysis and genetic algorithm," PLOS ONE, Public Library of Science, vol. 20(9), pages 1-21, September.
  • Handle: RePEc:plo:pone00:0326122
    DOI: 10.1371/journal.pone.0326122
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0326122
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0326122&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0326122?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0326122. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.