IDEAS home Printed from https://ideas.repec.org/a/spr/compst/v35y2020i3d10.1007_s00180-020-00954-8.html
   My bibliography  Save this article

Using tours to visually investigate properties of new projection pursuit indexes with application to problems in physics

Author

Listed:
  • Ursula Laa

    (Monash University
    Monash University)

  • Dianne Cook

    (Monash University)

Abstract

Projection pursuit is used to find interesting low-dimensional projections of high-dimensional data by optimizing an index over all possible projections. Most indexes have been developed to detect departure from known distributions, such as normality, or to find separations between known groups. Here, we are interested in finding projections revealing potentially complex bivariate patterns, using new indexes constructed from scagnostics and a maximum information coefficient, with a purpose to detect unusual relationships between model parameters describing physics phenomena. The performance of these indexes is examined with respect to ideal behaviour, using simulated data, and then applied to problems from gravitational wave astronomy. The implementation builds upon the projection pursuit tools available in the R package, tourr, with indexes constructed from code in the R packages, binostics, minerva and mbgraphic.

Suggested Citation

  • Ursula Laa & Dianne Cook, 2020. "Using tours to visually investigate properties of new projection pursuit indexes with application to problems in physics," Computational Statistics, Springer, vol. 35(3), pages 1171-1205, September.
  • Handle: RePEc:spr:compst:v:35:y:2020:i:3:d:10.1007_s00180-020-00954-8
    DOI: 10.1007/s00180-020-00954-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00180-020-00954-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00180-020-00954-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Ju Ahn & Heike Hofmann & Dianne Cook, 2003. "A Projection Pursuit Method on the multidimensional squared Contingency Table," Computational Statistics, Springer, vol. 18(3), pages 605-626, September.
    2. Huang, Bei & Cook, Dianne & Wickham, Hadley, 2012. "tourrGui: A gWidgets GUI for the Tour to Explore High-Dimensional Data Using Low-Dimensional Projections," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 49(i06).
    3. Loperfido, Nicola, 2018. "Skewness-based projection pursuit: A computational approach," Computational Statistics & Data Analysis, Elsevier, vol. 120(C), pages 42-57.
    4. F. Ferraty & A. Goia & E. Salinelli & P. Vieu, 2013. "Functional projection pursuit regression," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 22(2), pages 293-320, June.
    5. Simon N. Wood & Natalya Pya & Benjamin Säfken, 2016. "Smoothing Parameter and Model Selection for General Smooth Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(516), pages 1548-1563, October.
    6. Posse, Christian, 1995. "Projection pursuit exploratory data analysis," Computational Statistics & Data Analysis, Elsevier, vol. 20(6), pages 669-687, December.
    7. Eun-Kyung Lee & Dianne Cook & Sigbert Klinke & Thomas Lumley, 2005. "Projection Pursuit for Exploratory Supervised Classification," SFB 649 Discussion Papers SFB649DP2005-026, Sonderforschungsbereich 649, Humboldt University, Berlin, Germany.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ursula Laa & Dianne Cook & Andreas Buja & German Valencia, 2020. "Hole or grain? A Section Pursuit Index for Finding Hidden Structure in Multiple Dimensions," Monash Econometrics and Business Statistics Working Papers 17/20, Monash University, Department of Econometrics and Business Statistics.
    2. Nicola Loperfido, 2023. "Kurtosis removal for data pre-processing," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 17(1), pages 239-267, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chunming Zhang & Jimin Ye & Xiaomei Wang, 2023. "A Computational Perspective on Projection Pursuit in High Dimensions: Feasible or Infeasible Feature Extraction," International Statistical Review, International Statistical Institute, vol. 91(1), pages 140-161, April.
    2. Georgios Gioldasis & Antonio Musolesi & Michel Simioni, 2020. "Model uncertainty, nonlinearities and out-of-sample comparison: evidence from international technology diffusion," Working Papers hal-02790523, HAL.
    3. Zhang, Tao & Zhang, Qingzhao & Wang, Qihua, 2014. "Model detection for functional polynomial regression," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 183-197.
    4. Loperfido, Nicola, 2021. "Some theoretical properties of two kurtosis matrices, with application to invariant coordinate selection," Journal of Multivariate Analysis, Elsevier, vol. 186(C).
    5. E. Zanini & E. Eastoe & M. J. Jones & D. Randell & P. Jonathan, 2020. "Flexible covariate representations for extremes," Environmetrics, John Wiley & Sons, Ltd., vol. 31(5), August.
    6. Lee, Sharon X. & McLachlan, Geoffrey J., 2022. "An overview of skew distributions in model-based clustering," Journal of Multivariate Analysis, Elsevier, vol. 188(C).
    7. Fischer, Daniel & Berro, Alain & Nordhausen, Klaus & Ruiz-Gazen, Anne, 2019. "REPPlab: An R package for detecting clusters and outliers using exploratory projection pursuit," TSE Working Papers 19-1001, Toulouse School of Economics (TSE).
    8. Benjamin Säfken & Thomas Kneib, 2020. "Conditional covariance penalties for mixed models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 47(3), pages 990-1010, September.
    9. Loperfido, Nicola, 2018. "Skewness-based projection pursuit: A computational approach," Computational Statistics & Data Analysis, Elsevier, vol. 120(C), pages 42-57.
    10. Khamma, Thulasi Ram & Zhang, Yuming & Guerrier, Stéphane & Boubekri, Mohamed, 2020. "Generalized additive models: An efficient method for short-term energy prediction in office buildings," Energy, Elsevier, vol. 213(C).
    11. Guochang Wang & Jianjun Zhou & Wuqing Wu & Min Chen, 2017. "Robust functional sliced inverse regression," Statistical Papers, Springer, vol. 58(1), pages 227-245, March.
    12. Massimiliano Mazzanti & Antonio Musolesi, 2020. "Modeling Green Knowledge Production and Environmental Policies with Semiparametric Panel Data Regression models," SEEDS Working Papers 1420, SEEDS, Sustainability Environmental Economics and Dynamics Studies, revised Sep 2020.
    13. Andrew Leroux & Junrui Di & Ekaterina Smirnova & Elizabeth J Mcguffey & Quy Cao & Elham Bayatmokhtari & Lucia Tabacu & Vadim Zipunnikov & Jacek K Urbanek & Ciprian Crainiceanu, 2019. "Organizing and Analyzing the Activity Data in NHANES," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 11(2), pages 262-287, July.
    14. Yong Liu & Alan P. Ker, 2021. "Simultaneous borrowing of information across space and time for pricing insurance contracts: An application to rating crop insurance policies," Journal of Risk & Insurance, The American Risk and Insurance Association, vol. 88(1), pages 231-257, March.
    15. Stefano Cabras & J. D. Tena, 2023. "Implicit institutional incentives and individual decisions: Causal inference with deep learning models," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 44(6), pages 3739-3754, September.
    16. Hervé Cardot & Antonio Musolesi, 2018. "Modeling temporal treatment effects with zero inflated semi-parametric regression models: the case of local development policies in France," SEEDS Working Papers 0718, SEEDS, Sustainability Environmental Economics and Dynamics Studies, revised Mar 2018.
    17. Gioldasis, Georgios & Musolesi, Antonio & Simioni, Michel, 2023. "Interactive R&D spillovers: An estimation strategy based on forecasting-driven model selection," International Journal of Forecasting, Elsevier, vol. 39(1), pages 144-169.
    18. Shushi, Tomer, 2018. "A proof for the existence of multivariate singular generalized skew-elliptical density functions," Statistics & Probability Letters, Elsevier, vol. 141(C), pages 50-55.
    19. Sun, Tianyu & Chand, Satish & Sharpe, Keiran, 2018. "Effect of aging on housing prices: evidence from a panel data," MPRA Paper 94418, University Library of Munich, Germany, revised 01 Mar 2019.
    20. Daniel Fischer & Alain Berro & Klaus Nordhausen & Anne Ruiz-Gazen, 2021. "REPPlab: An R package for detecting clusters and outliers using exploratory projection pursuit," Post-Print hal-03548865, HAL.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:compst:v:35:y:2020:i:3:d:10.1007_s00180-020-00954-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.