IDEAS home Printed from https://ideas.repec.org/p/osf/osfxxx/3h9yp_v1.html

A multi-model relationship detection method to assist with naïve exploration of high-dimensional data

Author

Listed:
  • Wiley, James Christopher
  • English, Simon
  • Church, Kinsey
  • Ward, Richard A
  • Flowers, James
  • Chamoun, Céline

Abstract

This paper presents a method for classifying predictors Xp based on their relationship to an outcome variable Y; as having main effects, interactions, collinearity, or no effects. The presented method operates by combining complimentary information from a multivariate model and a series of bivariate models. We demonstrate how the method works using simulated data. In addition, we experimentally vary the effect sizes in our data generation process to see if the proposed method can detect different relationships between predictors Xp and outcome Y at varied strengths. We also vary the sample size (n) and observe the impact on relationship classification. We find that the proposed method functions as desired within the constraints of this study. We propose future simulation designs for continued testing of said method. We conclude by providing broad instructions for applying this method. Our goal is to use this method to develop initial analytical profiles of high-dimensional data in naïve data exploration contexts. This work stems from trying to find an efficient alternative to scatterplot matrices when exploring data that contain thousands of variables.

Suggested Citation

  • Wiley, James Christopher & English, Simon & Church, Kinsey & Ward, Richard A & Flowers, James & Chamoun, Céline, 2025. "A multi-model relationship detection method to assist with naïve exploration of high-dimensional data," OSF Preprints 3h9yp_v1, Center for Open Science.
  • Handle: RePEc:osf:osfxxx:3h9yp_v1
    DOI: 10.31219/osf.io/3h9yp_v1
    as

    Download full text from publisher

    File URL: https://osf.io/download/67b353e9780d7136620c7eb5/
    Download Restriction: no

    File URL: https://libkey.io/10.31219/osf.io/3h9yp_v1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Bergmeir, Christoph & Benítez, José M., 2012. "Neural Networks in R Using the Stuttgart Neural Network Simulator: RSNNS," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 46(i07).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Riza, Lala Septem & Bergmeir, Christoph & Herrera, Francisco & Benítez, José M., 2015. "frbs: Fuzzy Rule-Based Systems for Classification and Regression in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 65(i06).
    2. Severinsen, A. & Myrland, Ø., 2022. "ShinyRBase: Near real-time energy saving models using reactive programming," Applied Energy, Elsevier, vol. 325(C).
    3. Kaliba, Aloyce R. & Mushi, Richard J. & Gongwe, Anne G. & Mazvimavi, Kizito, 2020. "A typology of adopters and nonadopters of improved sorghum seeds in Tanzania: A deep learning neural network approach," World Development, Elsevier, vol. 127(C).
    4. Sánchez Lasheras, Fernando & de Cos Juez, Francisco Javier & Suárez Sánchez, Ana & Krzemień, Alicja & Riesgo Fernández, Pedro, 2015. "Forecasting the COMEX copper spot price by means of neural networks and ARIMA models," Resources Policy, Elsevier, vol. 45(C), pages 37-43.
    5. Guallar, Carles & Delgado, Maximino & Diogène, Jorge & Fernández-Tejedor, Margarita, 2016. "Artificial neural network approach to population dynamics of harmful algal blooms in Alfacs Bay (NW Mediterranean): Case studies of Karlodinium and Pseudo-nitzschia," Ecological Modelling, Elsevier, vol. 338(C), pages 37-50.
    6. Suellen Teixeira Zavadzki de Pauli & Mariana Kleina & Wagner Hugo Bonat, 2020. "Comparing Artificial Neural Network Architectures for Brazilian Stock Market Prediction," Annals of Data Science, Springer, vol. 7(4), pages 613-628, December.
    7. Guopeng Jiang & Miles Grafton & Diane Pearson & Mike Bretherton & Allister Holmes, 2019. "Integration of Precision Farming Data and Spatial Statistical Modelling to Interpret Field-Scale Maize Productivity," Agriculture, MDPI, vol. 9(11), pages 1-22, November.
    8. Kmytiuk, Tetiana & Majore, Ginta & Bilyk, Tetiana, . "Time series forecasting of price of the agricultural products using data science," Agricultural and Resource Economics: International Scientific E-Journal, Agricultural and Resource Economics: International Scientific E-Journal, vol. 10(3).
    9. Andree,Bo Pieter Johannes & Chamorro Elizondo,Andres Fernando & Kraay,Aart C. & Spencer,Phoebe Girouard & Wang,Dieter, 2020. "Predicting Food Crises," Policy Research Working Paper Series 9412, The World Bank.
    10. D’Amato, Valeria & Levantesi, Susanna & Piscopo, Gabriella, 2022. "Deep learning in predicting cryptocurrency volatility," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 596(C).
    11. Khudhayr A. Rashedi & Mohd Tahir Ismail & Sadam Al Wadi & Abdeslam Serroukh & Tariq S. Alshammari & Jamil J. Jaber, 2024. "Multi-Layer Perceptron-Based Classification with Application to Outlier Detection in Saudi Arabia Stock Returns," JRFM, MDPI, vol. 17(2), pages 1-13, February.
    12. Björn Büdenbender & Tim T A Höfling & Antje B M Gerdes & Georg W Alpers, 2023. "Training machine learning algorithms for automatic facial coding: The role of emotional facial expressions’ prototypicality," PLOS ONE, Public Library of Science, vol. 18(2), pages 1-16, February.
    13. Misiunas, Nicholas & Oztekin, Asil & Chen, Yao & Chandra, Kavitha, 2016. "DEANN: A healthcare analytic methodology of data envelopment analysis and artificial neural networks for the prediction of organ recipient functional status," Omega, Elsevier, vol. 58(C), pages 46-54.
    14. Youngmin Seo & Sungwon Kim & Vijay Singh, 2015. "Estimating Spatial Precipitation Using Regression Kriging and Artificial Neural Network Residual Kriging (RKNNRK) Hybrid Approach," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 29(7), pages 2189-2204, May.
    15. Evangelos Spiliotis & Spyros Makridakis & Artemios-Anargyros Semenoglou & Vassilios Assimakopoulos, 2022. "Comparison of statistical and machine learning methods for daily SKU demand forecasting," Operational Research, Springer, vol. 22(3), pages 3037-3061, July.
    16. repec:jss:jstsof:46:i07 is not listed on IDEAS
    17. Lu Yuan & Yingshi Huang & Ping Chen, 2026. "Online Calibration for Multidimensional CAT With Polytomously Scored Items: A Neural Network–Based Approach," Journal of Educational and Behavioral Statistics, , vol. 51(1), pages 141-174, February.
    18. Feng, Cong & Zhang, Jie & Zhang, Wenqi & Hodge, Bri-Mathias, 2022. "Convolutional neural networks for intra-hour solar forecasting based on sky image sequences," Applied Energy, Elsevier, vol. 310(C).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:osfxxx:3h9yp_v1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://osf.io/preprints/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.