IDEAS home Printed from https://ideas.repec.org/a/gam/jijerp/v18y2021i19p10303-d646956.html
   My bibliography  Save this article

Prediction of Geosmin at Different Depths of Lake Using Machine Learning Techniques

Author

Listed:
  • Yong-Su Kwon

    (EcoBank Team, Division of Ecological Information, National Institute of Ecology, Seocheon, Busan 33657, Chungcheongnam-do, Korea
    Yong-Su Kwon and In-Hwan Cho contributed equally to this work.)

  • In-Hwan Cho

    (Department of Environmental Science, Hanyang University, Seoul 04763, Korea
    Yong-Su Kwon and In-Hwan Cho contributed equally to this work.)

  • Ha-Kyung Kim

    (Department of Environmental Science, Hanyang University, Seoul 04763, Korea)

  • Jeong-Hwan Byun

    (Han-River Environment Research Center, National Institute of Environmental Research, Yangpyeong-gun, Incheon 12585, Gyeonggi-do, Korea
    Department of Life Science, Hanyang University, Seoul 04763, Korea)

  • Mi-Jung Bae

    (Biodiversity Research Team, Freshwater Biodiversity Research Bureau, Nakdonggang National Institute of Biological Resources, Sangju 37242, Gyeongsangbuk-do, Korea)

  • Baik-Ho Kim

    (Department of Life Science, Hanyang University, Seoul 04763, Korea)

Abstract

Geosmin is a major concern in the management of water sources worldwide. Thus, we predicted concentration categories of geosmin at three different depths of lakes (i.e., surface, middle, and bottom), and analyzed relationships between geosmin concentration and factors such as phytoplankton abundance and environmental variables. Data were collected monthly from three major lakes (Uiam, Cheongpyeong, and Paldang lakes) in Korea from May 2014 to December 2015. Before predicting geosmin concentration, we categorized it into four groups based on the boxplot method, and multivariate adaptive regression splines, classification and regression trees, and random forest (RF) were applied to identify the most appropriate modelling to predict geosmin concentration. Overall, using environmental variables was more accurate than using phytoplankton abundance to predict the four categories of geosmin concentration based on AUC and accuracy in all three models as well as each layer. The RF model had the highest predictive power among the three SDMs. When predicting geosmin in the three water layers, the relative importance of environmental variables and phytoplankton abundance in the sensitivity analysis was different for each layer. Water temperature and abundance of Cyanophyceae were the most important factors for predicting geosmin concentration categories in the surface layer, whereas total abundance of phytoplankton exhibited relatively higher importance in the bottom layer.

Suggested Citation

  • Yong-Su Kwon & In-Hwan Cho & Ha-Kyung Kim & Jeong-Hwan Byun & Mi-Jung Bae & Baik-Ho Kim, 2021. "Prediction of Geosmin at Different Depths of Lake Using Machine Learning Techniques," IJERPH, MDPI, vol. 18(19), pages 1-13, September.
  • Handle: RePEc:gam:jijerp:v:18:y:2021:i:19:p:10303-:d:646956
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1660-4601/18/19/10303/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1660-4601/18/19/10303/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jijerp:v:18:y:2021:i:19:p:10303-:d:646956. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.