Author
Listed:
- Valls, Ricardo A. P. Geo.
(Valls Geoconsultant)
Abstract
Compositional data analysis (CoDA) has emerged as a critical methodology for enhancing the predictive capabilities of geochemical models in mineral exploration and environmental assessment applications. This systematic review examines the extent to which compositional data analysis techniques improve geochemical modeling performance in diverse geological settings and application domains. Through the analysis of 50 peer-reviewed studies, we identified 10 studies that met rigorous inclusion criteria, focusing on the quantitative evaluation of predictive modeling performance using compositional data analysis methods. The review reveals that log-ratio transformations, particularly centered log-ratio (CLR) and isometric log-ratio (ILR) transformations, consistently address the closure problem inherent in compositional geochemical data [1]. The integration of compositional data analysis with machine learning approaches, including random forests and principal component analysis, has demonstrated significant improvements in anomaly detection, geological class prediction, and mineralization identification. Quantitative results show classification accuracy improvements from 68.4% to 74.7% when maximum autocorrelation factor analysis is applied to compositionally transformed data compared to traditional principal component analysis (McKinley et al., 2018). The key findings indicate that compositional data analysis enhances predictive capabilities through improved data uniformity, more accurate anomaly identification, and better alignment with known geological processes. The methodology is particularly effective in diverse geological contexts, including stream sediment analysis, soil geochemistry, and regional mapping applications across multiple continents. However, this review identified limitations in reporting standards, with many studies lacking explicit accuracy metrics and baseline comparisons. This absence of standardized reporting significantly hinders the ability to compare findings across studies, assess the true impact of CoDA techniques, and draw robust and generalizable conclusions regarding their effectiveness. The inconsistent metrics make it challenging to synthesize quantitative results, potentially leading to an overestimation or underestimation of CoDA’s benefits in certain contexts. This systematic review also delves into a deeper analysis of these limitations, particularly the inconsistent reporting of quantitative performance metrics, and presents counterarguments to provide a more balanced perspective on the effectiveness of the compositional data analysis. This systematic review provides evidence that compositional data analysis significantly strengthens geochemical modeling capabilities, offering a robust framework for addressing the unique statistical challenges of geochemical datasets while improving practical outcomes in mineral exploration and environmental monitoring. However, this review also examines the limitations and potential counterarguments associated with the application of compositional data analysis in geochemical modeling, providing a more nuanced and balanced perspective. The author declares no conflicts of interest. The data supporting the conclusions of this article are available in the cited references. This study received no external funding.
Suggested Citation
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:osfxxx:3zhwv_v1. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://osf.io/preprints/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.