IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v9y2021i4p415-d502614.html
   My bibliography  Save this article

Machine Learning Data Imputation and Prediction of Foraging Group Size in a Kleptoparasitic Spider

Author

Listed:
  • Yong-Chao Su

    (Department of Biomedical Science and Environmental Biology, Kaohsiung Medical University, Kaohsiung 80708, Taiwan)

  • Cheng-Yu Wu

    (Department of Biomedical Science and Environmental Biology, Kaohsiung Medical University, Kaohsiung 80708, Taiwan)

  • Cheng-Hong Yang

    (Department of Electronic Engineering, National Kaohsiung University of Science and Technology, Kaohsiung 80778, Taiwan
    Ph. D. Program in Biomedical Engineering, Kaohsiung Medical University, Kaohsiung 80708, Taiwan
    Drug Development and Value Creation Research Center, Kaohsiung Medical University, Kaohsiung 80708, Taiwan)

  • Bo-Sheng Li

    (Department of Electronic Engineering, National Kaohsiung University of Science and Technology, Kaohsiung 80778, Taiwan)

  • Sin-Hua Moi

    (Center of Cancer Program Development, E-Da Cancer Hospital, I-Shou University, Kaohsiung 82445, Taiwan)

  • Yu-Da Lin

    (Department of Computer Science and Information Engineering, National Penghu University of Science and Technology, Penghu 880011, Taiwan)

Abstract

Cost–benefit analysis is widely used to elucidate the association between foraging group size and resource size. Despite advances in the development of theoretical frameworks, however, the empirical systems used for testing are hindered by the vagaries of field surveys and incomplete data. This study developed the three approaches to data imputation based on machine learning (ML) algorithms with the aim of rescuing valuable field data. Using 163 host spider webs (132 complete data and 31 incomplete data), our results indicated that the data imputation based on random forest algorithm outperformed classification and regression trees, the k -nearest neighbor, and other conventional approaches (Wilcoxon signed-rank test and correlation difference have p -value from < 0.001–0.030). We then used rescued data based on a natural system involving kleptoparasitic spiders from Taiwan and Vietnam ( Argyrodes miniaceus , Theridiidae) to test the occurrence and group size of kleptoparasites in natural populations. Our partial least-squares path modelling (PLS-PM) results demonstrated that the size of the host web ( T = 6.890, p = 0.000) is a significant feature affecting group size. The resource size ( T = 2.590, p = 0.010) and the microclimate ( T = 3.230, p = 0.001) are significant features affecting the presence of kleptoparasites. The test of conformation of group size distribution to the ideal free distribution (IFD) model revealed that predictions pertaining to per-capita resource size were underestimated (bootstrap resampling mean slopes

Suggested Citation

  • Yong-Chao Su & Cheng-Yu Wu & Cheng-Hong Yang & Bo-Sheng Li & Sin-Hua Moi & Yu-Da Lin, 2021. "Machine Learning Data Imputation and Prediction of Foraging Group Size in a Kleptoparasitic Spider," Mathematics, MDPI, vol. 9(4), pages 1-16, February.
  • Handle: RePEc:gam:jmathe:v:9:y:2021:i:4:p:415-:d:502614
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/9/4/415/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/9/4/415/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Křivan, Vlastimil & Cressman, Ross & Schneider, Candace, 2008. "The ideal free distribution: A review and synthesis of the game-theoretic perspective," Theoretical Population Biology, Elsevier, vol. 73(3), pages 403-425.
    2. Saeed Nosratabadi & Amir Mosavi & Puhong Duan & Pedram Ghamisi, 2020. "Data Science in Economics," Papers 2003.13422, arXiv.org.
    3. Nosratabadi, Saeed & Mosavi, Amir & Duan, Puhong & Ghamisi, Pedram & Filip, Ferdinand & Band, Shahab S. & Reuter, Uwe & Gama, Joao & Gandomi, Amir H., 2020. "Data science in economics: comprehensive review of advanced machine learning and deep learning methods," MetaArXiv haf2v, Center for Open Science.
    4. Gérard Biau & Erwan Scornet, 2016. "A random forest guided tour," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(2), pages 197-227, June.
    5. Nosratabadi, Saeed & Mosavi, Amir & Duan, Puhong & Ghamisi, Pedram & Filip, Ferdinand & Band, Shahab S. & Reuter, Uwe & Gama, Joao & Gandomi, Amir H., 2020. "Data science in economics: comprehensive review of advanced machine learning and deep learning methods," LawArXiv kczj5, Center for Open Science.
    6. Nosratabadi, Saeed & Mosavi, Amir & Duan, Puhong & Ghamisi, Pedram & Filip, Ferdinand & Band, Shahab S. & Reuter, Uwe & Gama, Joao & Gandomi, Amir H., 2020. "Data science in economics: comprehensive review of advanced machine learning and deep learning methods," SocArXiv 9vdwf, Center for Open Science.
    7. Nosratabadi, Saeed & Mosavi, Amir & Duan, Puhong & Ghamisi, Pedram & Filip, Ferdinand & Band, Shahab S. & Reuter, Uwe & Gama, Joao & Gandomi, Amir H., 2020. "Data science in economics: comprehensive review of advanced machine learning and deep learning methods," OSF Preprints yc6e2, Center for Open Science.
    8. Gérard Biau & Erwan Scornet, 2016. "Rejoinder on: A random forest guided tour," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(2), pages 264-268, June.
    9. Nosratabadi, Saeed & Mosavi, Amir & Duan, Puhong & Ghamisi, Pedram & Filip, Ferdinand & Band, Shahab S. & Reuter, Uwe & Gama, Joao & Gandomi, Amir H., 2020. "Data science in economics: comprehensive review of advanced machine learning and deep learning methods," Thesis Commons auyvc, Center for Open Science.
    10. Nosratabadi, Saeed & Mosavi, Amir & Duan, Puhong & Ghamisi, Pedram & Filip, Ferdinand & Band, Shahab S. & Reuter, Uwe & Gama, Joao & Gandomi, Amir H., 2020. "Data science in economics: comprehensive review of advanced machine learning and deep learning methods," EdArXiv 5dwrt, Center for Open Science.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Liangkun Fang & Zhangjie Wu & Yuan Tao & Jinfeng Gao, 2023. "Light Pollution Index System Model Based on Markov Random Field," Mathematics, MDPI, vol. 11(13), pages 1-18, July.
    2. Cheng-Hong Yang & Tshimologo Molefyane & Yu-Da Lin, 2023. "The Forecasting of a Leading Country’s Government Expenditure Using a Recurrent Neural Network with a Gated Recurrent Unit," Mathematics, MDPI, vol. 11(14), pages 1-17, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Urko Aguirre-Larracoechea & Cruz E. Borges, 2021. "Imputation for Repeated Bounded Outcome Data: Statistical and Machine-Learning Approaches," Mathematics, MDPI, vol. 9(17), pages 1-27, August.
    2. Lin, Yong & Wang, Renyu & Gong, Xingyue & Jia, Guozhu, 2022. "Cross-correlation and forecast impact of public attention on USD/CNY exchange rate: Evidence from Baidu Index," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 604(C).
    3. Oliver Hümbelin & Lukas Hobi & Robert Fluder, 2021. "Rich Cities, Poor Countryside? Social Structure of the Poor and Poverty Risks in Urban and Rural Places in an Affluent Country. An Administrative Data based Analysis using Random Forest," University of Bern Social Sciences Working Papers 40, University of Bern, Department of Social Sciences, revised 10 Nov 2021.
    4. Petr Suler & Zuzana Rowland & Tomas Krulicky, 2021. "Evaluation of the Accuracy of Machine Learning Predictions of the Czech Republic’s Exports to the China," JRFM, MDPI, vol. 14(2), pages 1-30, February.
    5. Saeed Nosratabadi & Nesrine Khazami & Marwa Ben Abdallah & Zoltan Lackner & Shahab S. Band & Amir Mosavi & Csaba Mako, 2020. "Social Capital Contributions to Food Security: A Comprehensive Literature Review," Papers 2012.03606, arXiv.org.
    6. Cheng Zhang & Nilam Nur Amir Sjarif & Roslina Ibrahim, 2023. "Deep learning models for price forecasting of financial time series: A review of recent advancements: 2020-2022," Papers 2305.04811, arXiv.org, revised Sep 2023.
    7. Xiaodong Zhang & Suhui Liu & Xin Zheng, 2021. "Stock Price Movement Prediction Based on a Deep Factorization Machine and the Attention Mechanism," Mathematics, MDPI, vol. 9(8), pages 1-21, April.
    8. Teddy Lazebnik & Tzach Fleischer & Amit Yaniv-Rosenfeld, 2023. "Benchmarking Biologically-Inspired Automatic Machine Learning for Economic Tasks," Sustainability, MDPI, vol. 15(14), pages 1-9, July.
    9. Mei-Li Shen & Cheng-Feng Lee & Hsiou-Hsiang Liu & Po-Yin Chang & Cheng-Hong Yang, 2021. "An Effective Hybrid Approach for Forecasting Currency Exchange Rates," Sustainability, MDPI, vol. 13(5), pages 1-29, March.
    10. Marcus Vinicius Santos & Fernando Morgado-Dias & Thiago C. Silva, 2023. "Oil Sector and Sentiment Analysis—A Review," Energies, MDPI, vol. 16(12), pages 1-29, June.
    11. ErLe Du & Meng Ji, 2021. "Analyzing the regional economic changes in a high-tech industrial development zone using machine learning algorithms," PLOS ONE, Public Library of Science, vol. 16(6), pages 1-18, June.
    12. David G. Green, 2023. "Emergence in complex networks of simple agents," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 18(3), pages 419-462, July.
    13. Steve J. Bickley & Benno Torgler, 2021. "Behavioural Economics, What Have we Missed? Exploring “Classical” Behavioural Economics Roots in AI, Cognitive Psychology, and Complexity Theory," CREMA Working Paper Series 2021-21, Center for Research in Economics, Management and the Arts (CREMA).
    14. Meir Russ, 2021. "Knowledge Management for Sustainable Development in the Era of Continuously Accelerating Technological Revolutions: A Framework and Models," Sustainability, MDPI, vol. 13(6), pages 1-32, March.
    15. Amir Masoud Rahmani & Efat Yousefpoor & Mohammad Sadegh Yousefpoor & Zahid Mehmood & Amir Haider & Mehdi Hosseinzadeh & Rizwan Ali Naqvi, 2021. "Machine Learning (ML) in Medicine: Review, Applications, and Challenges," Mathematics, MDPI, vol. 9(22), pages 1-52, November.
    16. Di Wu & Zhenning Xu & Seung Bach, 2023. "Using Google Trends to predict and forecast avocado sales," Journal of Marketing Analytics, Palgrave Macmillan, vol. 11(4), pages 629-641, December.
    17. Ren, Yi-Shuai & Ma, Chao-Qun & Kong, Xiao-Lin & Baltas, Konstantinos & Zureigat, Qasim, 2022. "Past, present, and future of the application of machine learning in cryptocurrency research," Research in International Business and Finance, Elsevier, vol. 63(C).
    18. Liang She & Jianyuan Wang & Yifan Bo & Yangyan Zeng, 2022. "MACA: Multi-Agent with Credit Assignment for Computation Offloading in Smart Parks Monitoring," Mathematics, MDPI, vol. 10(23), pages 1-18, December.
    19. Hou, Lei & Elsworth, Derek & Zhang, Fengshou & Wang, Zhiyuan & Zhang, Jianbo, 2023. "Evaluation of proppant injection based on a data-driven approach integrating numerical and ensemble learning models," Energy, Elsevier, vol. 264(C).
    20. Ma, Zhikai & Huo, Qian & Wang, Wei & Zhang, Tao, 2023. "Voltage-temperature aware thermal runaway alarming framework for electric vehicles via deep learning with attention mechanism in time-frequency domain," Energy, Elsevier, vol. 278(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:9:y:2021:i:4:p:415-:d:502614. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.