IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0253385.html

Hyperspectral band selection and modeling of soil organic matter content in a forest using the Ranger algorithm

Author

Listed:
  • Yuanyuan Shi
  • Junyu Zhao
  • Xianchong Song
  • Zuoyu Qin
  • Lichao Wu
  • Huili Wang
  • Jian Tang

Abstract

Effective soil spectral band selection and modeling methods can improve modeling accuracy. To establish a hyperspectral prediction model of soil organic matter (SOM) content, this study investigated a forested Eucalyptus plantation in Huangmian Forest Farm, Guangxi, China. The Ranger and Lasso algorithms were used to screen spectral bands. Subsequently, models were established using four algorithms: partial least squares regression, random forest (RF), a support vector machine, and an artificial neural network (ANN). The optimal model was then selected. The results showed that the modeling accuracy was higher when band selection was based on the Ranger algorithm than when it was based on the Lasso algorithm. ANN modeling had the best goodness of fit, and the model established by RF had the most stable modeling results. Based on the above results, a new method is proposed in this study for band selection in the early phase of soil hyperspectral modeling. The Ranger algorithm can be applied to screen the spectral bands, and ANN or RF can then be selected to construct the prediction model based on different datasets, which is applicable to establish the prediction model of SOM content in red soil plantations. This study provides a reference for the remote sensing of soil fertility in forests of different soil types and a theoretical basis for developing portable equipment for the hyperspectral measurement of SOM content in forest habitats.

Suggested Citation

  • Yuanyuan Shi & Junyu Zhao & Xianchong Song & Zuoyu Qin & Lichao Wu & Huili Wang & Jian Tang, 2021. "Hyperspectral band selection and modeling of soil organic matter content in a forest using the Ranger algorithm," PLOS ONE, Public Library of Science, vol. 16(6), pages 1-15, June.
  • Handle: RePEc:plo:pone00:0253385
    DOI: 10.1371/journal.pone.0253385
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0253385
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0253385&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0253385?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Wright, Marvin N. & Ziegler, Andreas, 2017. "ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 77(i01).
    2. Danial Jahed Armaghani & Panagiotis G. Asteris & Behnam Askarian & Mahdi Hasanipanah & Reza Tarinejad & Van Van Huynh, 2020. "Examining Hybrid and Single SVM Models with Different Kernels to Predict Rock Brittleness," Sustainability, MDPI, vol. 12(6), pages 1-17, March.
    3. Shan Luo & Zehua Chen, 2020. "Feature Selection by Canonical Correlation Search in High-Dimensional Multiresponse Models With Complex Group Structures," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(531), pages 1227-1235, July.
    4. Prashant K. Srivastava & Manika Gupta & Ujjwal Singh & Rajendra Prasad & Prem Chandra Pandey & A. S. Raghubanshi & George P. Petropoulos, 2021. "Sensitivity analysis of artificial neural network for chlorophyll prediction using hyperspectral data," Environment, Development and Sustainability: A Multidisciplinary Approach to the Theory and Practice of Sustainable Development, Springer, vol. 23(4), pages 5504-5519, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mingsong Zhao & Yingfeng Gao & Yuanyuan Lu & Shihang Wang, 2022. "Hyperspectral Modeling of Soil Organic Matter Based on Characteristic Wavelength in East China," Sustainability, MDPI, vol. 14(14), pages 1-18, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Backer, David & Billing, Trey, 2024. "Forecasting the prevalence of child acute malnutrition using environmental and conflict conditions as leading indicators," World Development, Elsevier, vol. 176(C).
    2. Luis A Barboza & Shu-Wei Chou-Chen & Paola Vásquez & Yury E García & Juan G Calvo & Hugo G Hidalgo & Fabio Sanchez, 2023. "Assessing dengue fever risk in Costa Rica by using climate variables and machine learning techniques," PLOS Neglected Tropical Diseases, Public Library of Science, vol. 17(1), pages 1-13, January.
    3. Mariana Oliveira & Luís Torgo & Vítor Santos Costa, 2021. "Evaluation Procedures for Forecasting with Spatiotemporal Data," Mathematics, MDPI, vol. 9(6), pages 1-27, March.
    4. Hausner, Ryan, 2026. "Genre and Temporal Dynamics in Spotify Popularity Prediction," SocArXiv vba8f_v1, Center for Open Science.
    5. Bokelmann, Björn & Lessmann, Stefan, 2024. "Improving uplift model evaluation on randomized controlled trial data," European Journal of Operational Research, Elsevier, vol. 313(2), pages 691-707.
    6. Joel Podgorski & Oliver Kracht & Luis Araguas-Araguas & Stefan Terzer-Wassmuth & Jodie Miller & Ralf Straub & Rolf Kipfer & Michael Berg, 2024. "Groundwater vulnerability to pollution in Africa’s Sahel region," Nature Sustainability, Nature, vol. 7(5), pages 558-567, May.
    7. Heinisch, Katja & Scaramella, Fabio & Schult, Christoph, 2025. "Assumption errors and forecast accuracy: A partial linear instrumental variable and double machine learning approach," IWH Discussion Papers 6/2025, Halle Institute for Economic Research (IWH).
    8. Nayiri Galestian Pour & Soudabeh Shemehsavar, 2024. "Learning from high dimensional data based on weighted feature importance in decision tree ensembles," Computational Statistics, Springer, vol. 39(1), pages 313-342, February.
    9. Bazyli Czyżewski & Jakub Staniszewski & Joanna Staniszewska & Marta Guth, 2025. "Does Increasing Agricultural Efficiency Contribute to Food Security—Trade‐Offs of Value Addition in Crop Production?," Sustainable Development, John Wiley & Sons, Ltd., vol. 33(S1), pages 939-970, November.
    10. Arjan S. Gosal & Janine A. McMahon & Katharine M. Bowgen & Catherine H. Hoppe & Guy Ziv, 2021. "Identifying and Mapping Groups of Protected Area Visitors by Environmental Awareness," Land, MDPI, vol. 10(6), pages 1-14, May.
    11. David Dorn & Florian Schoner & Moritz Seebacher & Lisa Simon & Ludger Woessmann, 2024. "Multidimensional Skills on LinkedIn Profiles: Measuring Human Capital and the Gender Skill Gap," Papers 2409.18638, arXiv.org, revised May 2025.
    12. Albert Stuart Reece & Gary Kenneth Hulse, 2022. "European Epidemiological Patterns of Cannabis- and Substance-Related Congenital Neurological Anomalies: Geospatiotemporal and Causal Inferential Study," IJERPH, MDPI, vol. 20(1), pages 1-35, December.
    13. Michael Parzinger & Lucia Hanfstaengl & Ferdinand Sigg & Uli Spindler & Ulrich Wellisch & Markus Wirnsberger, 2020. "Residual Analysis of Predictive Modelling Data for Automated Fault Detection in Building’s Heating, Ventilation and Air Conditioning Systems," Sustainability, MDPI, vol. 12(17), pages 1-18, August.
    14. Nance Nerissa & Mertens Andrew & Gerds Thomas Alexander & Wang Zeyi & Torp-Pedersen Christian & van der Laan Mark & Kvist Kajsa & Lange Theis & Zareini Bochra & Petersen Maya L., 2025. "Applying the Causal Roadmap to longitudinal national registry data in Denmark: A case study of second-line diabetes medication and dementia," Journal of Causal Inference, De Gruyter, vol. 13(1), pages 1-18.
    15. Chen, Jianbao & Shen, Jiamin & Ke, Nan, 2025. "Assessing the impact of new energy demonstration city policy on industrial carbon intensity using machine learning," Economic Analysis and Policy, Elsevier, vol. 87(C), pages 1690-1707.
    16. Van Belle, Jente & Guns, Tias & Verbeke, Wouter, 2021. "Using shared sell-through data to forecast wholesaler demand in multi-echelon supply chains," European Journal of Operational Research, Elsevier, vol. 288(2), pages 466-479.
    17. Tania L. Maxwell & Mark D. Spalding & Daniel A. Friess & Nicholas J. Murray & Kerrylee Rogers & Andre S. Rovai & Lindsey S. Smart & Lukas Weilguny & Maria Fernanda Adame & Janine B. Adams & William E., 2024. "Soil carbon in the world’s tidal marshes," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    18. Albert Stuart Reece & Gary Kenneth Hulse, 2022. "European Epidemiological Patterns of Cannabis- and Substance-Related Body Wall Congenital Anomalies: Geospatiotemporal and Causal Inferential Study," IJERPH, MDPI, vol. 19(15), pages 1-38, July.
    19. Andrew P. Wheeler & Wouter Steenbeek, 2021. "Mapping the Risk Terrain for Crime Using Machine Learning," Journal of Quantitative Criminology, Springer, vol. 37(2), pages 445-480, June.
    20. Philipp Bach & Victor Chernozhukov & Malte S. Kurz & Martin Spindler & Sven Klaassen, 2021. "DoubleML -- An Object-Oriented Implementation of Double Machine Learning in R," Papers 2103.09603, arXiv.org, revised Jun 2024.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0253385. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.