IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v13y2024i1p99-d1556015.html
   My bibliography  Save this article

Variables Selection from the Patterns of the Features Applied to Spectroscopic Data—An Application Case

Author

Listed:
  • José L. Romero-Béjar

    (Department of Statistics and Operations Research, University of Granada, 18011 Granada, Spain
    Instituto de Investigación Biosanitaria (ibs.GRANADA), 18014 Granada, Spain
    Institute of Mathematics, University of Granada (IMAG), Ventanilla 11, 18001 Granada, Spain)

  • Francisco Javier Esquivel

    (Department of Statistics and Operations Research, University of Granada, 18011 Granada, Spain
    Laboratory of 3D Archaeological Modelling, University of Granada, 18011 Granada, Spain)

  • José Antonio Esquivel

    (Laboratory of 3D Archaeological Modelling, University of Granada, 18011 Granada, Spain
    Department of Prehistory and Archaeology, University of Granada, 18011 Granada, Spain)

Abstract

Spectroscopic data allows for the obtaining of relevant information about the composition of samples and has been used for research in scientific disciplines such as chemistry, geology, archaeology, Mars research, pharmacy, and medicine, as well as important industrial use. In archaeology, it allows the characterization and classification of artifacts and ecofacts, the analysis of patterns, the characterization and study of the exchange of materials, etc. Spectrometers provide a large amount of data, the so-called “big data” type, which requires the use of multivariate statistical techniques, mainly principal component analysis, cluster analysis, and discriminant analysis. This work is focused on reducing the dimensionality of the data by selecting a small subset of variables to characterize the samples and presents a mathematical methodology for the selection of the most efficient variables. The objective is to identify a subset of variables based on spectral features that allow characterization of the samples under study with the least possible errors when performing quantitative analyses or discriminations between different samples. The subset is not predetermined and, in each case, is obtained for each set of samples based on the most important features of the samples under study, which allows for a good fit to the data. The reduction of the number of variables to an important performance based on the previously chosen difference between features, with a great fit to the raw data. Thus, instead of 2151 variables, a minimum optimal subset of 32 valleys and 31 peaks is obtained for a minimum difference between peaks or between valleys of 20 nm. This methodology has been applied to a sample of minerals and rocks extracted from the ECOSTRESS 1.0 spectral library.

Suggested Citation

  • José L. Romero-Béjar & Francisco Javier Esquivel & José Antonio Esquivel, 2024. "Variables Selection from the Patterns of the Features Applied to Spectroscopic Data—An Application Case," Mathematics, MDPI, vol. 13(1), pages 1-14, December.
  • Handle: RePEc:gam:jmathe:v:13:y:2024:i:1:p:99-:d:1556015
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/13/1/99/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/13/1/99/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Asa Gholizadeh & Luboš Borůvka & Mohammad Mehdi Saberioon & Josef Kozák & Radim Vašát & Karel Němeček, 2015. "Comparing different data preprocessing methods for monitoring soil heavy metals based on soil spectral features," Soil and Water Research, Czech Academy of Agricultural Sciences, vol. 10(4), pages 218-227.
    2. Theodora Angelopoulou & Athanasios Balafoutis & George Zalidis & Dionysis Bochtis, 2020. "From Laboratory to Proximal Sensing Spectroscopy for Soil Organic Carbon Estimation—A Review," Sustainability, MDPI, vol. 12(2), pages 1-24, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Snapp, Sieglinde, 2022. "Embracing variability in soils on smallholder farms: New tools and better science," Agricultural Systems, Elsevier, vol. 195(C).
    2. Lenka Demková & Tomáš Jezný & Lenka Bobuľská, 2017. "Assessment of soil heavy metal pollution in a former mining area - before and after the end of mining activities," Soil and Water Research, Czech Academy of Agricultural Sciences, vol. 12(4), pages 229-236.
    3. Nerea Ferrando Jorge & Joanna Clark & Macarena L. Cárdenas & Hilary Geoghegan & Vicky Shannon, 2021. "Measuring Soil Colour to Estimate Soil Organic Carbon Using a Large-Scale Citizen Science-Based Approach," Sustainability, MDPI, vol. 13(19), pages 1-17, October.
    4. Yi Liu & Tiezhu Shi & Zeying Lan & Kai Guo & Dachang Zhuang & Xiangyang Zhang & Xiaojin Liang & Tianqi Qiu & Shengfei Zhang & Yiyun Chen, 2024. "Estimating the Soil Copper Content of Urban Land in a Megacity Using Piecewise Spectral Pretreatment," Land, MDPI, vol. 13(4), pages 1-21, April.
    5. Adaugo O. Okoli & Athena Birkenberg, 2024. "Monitoring soil carbon in smallholder carbon projects: insights from Kenya," Climatic Change, Springer, vol. 177(9), pages 1-28, September.
    6. George Kyriakarakos & Theodoros Petropoulos & Vasso Marinoudi & Remigio Berruto & Dionysis Bochtis, 2024. "Carbon Farming: Bridging Technology Development with Policy Goals," Sustainability, MDPI, vol. 16(5), pages 1-18, February.
    7. Efthymios Rodias & Eirini Aivazidou & Charisios Achillas & Dimitrios Aidonis & Dionysis Bochtis, 2020. "Water-Energy-Nutrients Synergies in the Agrifood Sector: A Circular Economy Framework," Energies, MDPI, vol. 14(1), pages 1-17, December.
    8. Francisco Javier Esquivel & José Antonio Esquivel & Antonio Morgado & José L. Romero-Béjar & Luis F. García del Moral, 2022. "Preprocessing of Spectroscopic Data Using Affine Transformations to Improve Pattern-Recognition Analysis: An Application to Prehistoric Lithic Tools," Mathematics, MDPI, vol. 10(22), pages 1-14, November.
    9. Lwandile Nduku & Cilence Munghemezulu & Zinhle Mashaba-Munghemezulu & Wonga Masiza & Phathutshedzo Eugene Ratshiedana & Ahmed Mukalazi Kalumba & Johannes George Chirima, 2024. "Field-Scale Winter Wheat Growth Prediction Applying Machine Learning Methods with Unmanned Aerial Vehicle Imagery and Soil Properties," Land, MDPI, vol. 13(3), pages 1-26, February.
    10. Charisios Achillas & Dionysis Bochtis, 2020. "Toward a Green, Closed-Loop, Circular Bioeconomy: Boosting the Performance Efficiency of Circular Business Models," Sustainability, MDPI, vol. 12(23), pages 1-6, December.
    11. Stanisław Gruszczyński & Wojciech Gruszczyński, 2022. "Assessing the Information Potential of MIR Spectral Signatures for Prediction of Multiple Soil Properties Based on Data from the AfSIS Phase I Project," IJERPH, MDPI, vol. 19(22), pages 1-22, November.
    12. Yi Liu & Tiezhu Shi & Yiyun Chen & Zeying Lan & Kai Guo & Dachang Zhuang & Chao Yang & Wenyi Zhang, 2024. "Monitoring the Soil Copper of Urban Land with Visible and Near-Infrared Spectroscopy: Comparing Spectral, Compositional, and Spatial Similarities," Land, MDPI, vol. 13(8), pages 1-19, August.
    13. Chaoqun Chen & Qigang Jiang & Zhenchao Zhang & Pengfei Shi & Yan Xu & Bin Liu & Jing Xi & ShouZhi Chang, 2020. "Hyperspectral Inversion of Petroleum Hydrocarbon Contents in Soil Based on Continuum Removal and Wavelet Packet Decomposition," Sustainability, MDPI, vol. 12(10), pages 1-13, May.
    14. Javier Reyes & Mareike Ließ, 2023. "On-the-Go Vis-NIR Spectroscopy for Field-Scale Spatial-Temporal Monitoring of Soil Organic Carbon," Agriculture, MDPI, vol. 13(8), pages 1-15, August.
    15. Massimo Conforti & Gabriele Buttafuoco, 2022. "Insights into the Effects of Study Area Size and Soil Sampling Density in the Prediction of Soil Organic Carbon by Vis-NIR Diffuse Reflectance Spectroscopy in Two Forest Areas," Land, MDPI, vol. 12(1), pages 1-16, December.
    16. Konstantinos Karyotis & Theodora Angelopoulou & Nikolaos Tziolas & Evgenia Palaiologou & Nikiforos Samarinas & George Zalidis, 2021. "Evaluation of a Micro-Electro Mechanical Systems Spectral Sensor for Soil Properties Estimation," Land, MDPI, vol. 10(1), pages 1-16, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:13:y:2024:i:1:p:99-:d:1556015. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.