IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v171y2022ics0167947322000378.html
   My bibliography  Save this article

Parameter estimation and model-based clustering with spherical normal distribution on the unit hypersphere

Author

Listed:
  • You, Kisung
  • Suh, Changhee

Abstract

In directional statistics, the von Mises-Fisher (vMF) distribution is one of the most basic and popular probability distributions for data on the unit hypersphere. Recently, the spherical normal (SN) distribution was proposed as an intrinsic counterpart to the vMF distribution by replacing the standard Euclidean norm with the great-circle distance, which is length of the shortest path joining two points on the unit sphere. Focusing on an isotropic version of SN distribution, it is shown that maximum likelihood estimators uniquely exist under mild support conditions. Since no analytic formula are available for the estimation, efficient numerical routines are proposed for parameter estimation. The estimation is considered in a general setting where non-negative weights are assigned to observations. This leads to a more interesting contribution for model-based clustering on the unit hypersphere by finite mixture model with SN distributions. Efficiency of optimization-based estimation procedures and effectiveness of SN mixture model are validated using simulated and real data examples.

Suggested Citation

  • You, Kisung & Suh, Changhee, 2022. "Parameter estimation and model-based clustering with spherical normal distribution on the unit hypersphere," Computational Statistics & Data Analysis, Elsevier, vol. 171(C).
  • Handle: RePEc:eee:csdana:v:171:y:2022:i:c:s0167947322000378
    DOI: 10.1016/j.csda.2022.107457
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947322000378
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2022.107457?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hornik, Kurt & Grün, Bettina, 2014. "movMF: An R Package for Fitting Mixtures of von Mises-Fisher Distributions," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 58(i10).
    2. Mullen, Katharine M. & Ardia, David & Gil, David L. & Windover, Donald & Cline, James, 2011. "DEoptim: An R Package for Global Optimization by Differential Evolution," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 40(i06).
    3. Suvrit Sra, 2012. "A short note on parameter approximation for von Mises-Fisher distributions: and a fast implementation of I s (x)," Computational Statistics, Springer, vol. 27(1), pages 177-190, March.
    4. Cavanaugh, Joseph E., 1997. "Unifying the derivations for the Akaike and corrected Akaike information criteria," Statistics & Probability Letters, Elsevier, vol. 33(2), pages 201-208, April.
    5. Bhattacharya, Abhishek & Dunson, David, 2012. "Nonparametric Bayes classification and hypothesis testing on manifolds," Journal of Multivariate Analysis, Elsevier, vol. 111(C), pages 1-19.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Arthur Pewsey & Eduardo García-Portugués, 2021. "Recent advances in directional statistics," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 30(1), pages 1-58, March.
    2. Xu, Hang & Alvo, Mayer & Yu, Philip L.H., 2018. "Angle-based models for ranking data," Computational Statistics & Data Analysis, Elsevier, vol. 121(C), pages 113-136.
    3. Fernando Rojas & Peter Wanke & Víctor Leiva & Mauricio Huerta & Carlos Martin-Barreiro, 2022. "Modeling Inventory Cost Savings and Supply Chain Success Factors: A Hybrid Robust Compromise Multi-Criteria Approach," Mathematics, MDPI, vol. 10(16), pages 1-18, August.
    4. Felix Mbuga & Cristina Tortora, 2021. "Spectral Clustering of Mixed-Type Data," Stats, MDPI, vol. 5(1), pages 1-11, December.
    5. Emil Cornea & Hongtu Zhu & Peter Kim & Joseph G. Ibrahim, 2017. "Regression models on Riemannian symmetric spaces," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(2), pages 463-482, March.
    6. Carlos A. Medel, 2015. "Probabilidad Clásica de Sobreajuste con Criterios de Información: Estimaciones con Series Macroeconómicas Chilenas," Revista de Analisis Economico – Economic Analysis Review, Universidad Alberto Hurtado/School of Economics and Business, vol. 30(1), pages 57-72, Abril.
    7. Villacorta, Pablo J. & Verdegay, José L., 2016. "FuzzyStatProb: An R Package for the Estimation of Fuzzy Stationary Probabilities from a Sequence of Observations of an Unknown Markov Chain," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 71(i08).
    8. Olgun Aydin & Bartłomiej Igliński & Krzysztof Krukowski & Marek Siemiński, 2022. "Analyzing Wind Energy Potential Using Efficient Global Optimization: A Case Study for the City Gdańsk in Poland," Energies, MDPI, vol. 15(9), pages 1-22, April.
    9. Scrucca, Luca, 2013. "GA: A Package for Genetic Algorithms in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 53(i04).
    10. William Lim & Gaurav Khemka & David Pitt & Bridget Browne, 2019. "A method for calculating the implied no-recovery three-state transition matrix using observable population mortality incidence and disability prevalence rates among the elderly," Journal of Population Research, Springer, vol. 36(3), pages 245-282, September.
    11. Jed Armstrong & Özer Karagedikli, 2017. "The role of non-participants in labour market dynamics," Reserve Bank of New Zealand Analytical Notes series AN2017/01, Reserve Bank of New Zealand.
    12. Giuseppe Brandi & Ruggero Gramatica & Tiziana Di Matteo, 2019. "Unveil stock correlation via a new tensor-based decomposition method," Papers 1911.06126, arXiv.org, revised Apr 2020.
    13. Kelter, Riko, 2022. "Power analysis and type I and type II error rates of Bayesian nonparametric two-sample tests for location-shifts based on the Bayes factor under Cauchy priors," Computational Statistics & Data Analysis, Elsevier, vol. 165(C).
    14. Shi, Yanlin, 2022. "A closed-form estimator for the Markov switching in mean model," Finance Research Letters, Elsevier, vol. 44(C).
    15. Olschewski, Sebastian & Diao, Linan & Rieskamp, Jörg, 2021. "Reinforcement learning about asset variability and correlation in repeated portfolio decisions," Journal of Behavioral and Experimental Finance, Elsevier, vol. 32(C).
    16. Gao, Guangyuan & Ho, Kin-Yip & Shi, Yanlin, 2020. "Long memory or regime switching in volatility? Evidence from high-frequency returns on the U.S. stock indices," Pacific-Basin Finance Journal, Elsevier, vol. 61(C).
    17. Moraes, Ricardo Kalil & Wanke, Peter Fernandes & Faria, João Ricardo, 2021. "Unveiling the endogeneity between social-welfare and labor efficiency: Two-stage NDEA neural network approach," Socio-Economic Planning Sciences, Elsevier, vol. 77(C).
    18. Hornik, Kurt & Grün, Bettina, 2014. "movMF: An R Package for Fitting Mixtures of von Mises-Fisher Distributions," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 58(i10).
    19. Detering, Nils & Packham, Natalie, 2018. "Model risk of contingent claims," IRTG 1792 Discussion Papers 2018-036, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    20. Hafidi, B. & Mkhadri, A., 2006. "A corrected Akaike criterion based on Kullback's symmetric divergence: applications in time series, multiple and multivariate regression," Computational Statistics & Data Analysis, Elsevier, vol. 50(6), pages 1524-1550, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:171:y:2022:i:c:s0167947322000378. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.