摘要(英) |
Principal component analysis (PCA) is a widely used statistical tool for reducing the dimensionality of large data sets while retaining most of the information. A key area of interest is applying PCA to cluster analysis, which is crucial in fields such as anomaly detection, biology, and medicine. Traditional model-based approaches are sensitive to model mis-specification and require a predefined number of clusters, potentially leading to biased results or unstable inferences. In this article, we propose a novel PCA method, γ-SUP PCA, which combines the γ-SUP approach with PCA. This method circumvents the need to specify the number of clusters and model selection, while effectively extracting important features. Numerical studies will demonstrate the robust performance of the proposed method. |
參考文獻 |
1.Amari, S.-i. and Ohara, A. (2011). Geometry of q-exponential family of probability distributions. Entropy, 13(6):1170–1185.
2.Chen, T.-L., Hsieh, D.-N., Hung, H., Tu, I.-P., Wu, P.-S., Wu, Y.-M., Chang, W.-H., and Huang, S.-Y. (2014). gamma-sup: A clustering algorithm for cryo-electron microscopy images of asymmetric particles. Annals of Applied Statistics,8(1):259–285.
3.Chen, T.-L. and Shiu, S.-Y. (2007). A new clustering algorithm based on self-updating process. JSM proceedings, statistical computing section, Salt Lake City, Utah, pages
2034–2038.
4.Cichocki, A. and Amari, S.-i. (2010). Families of alpha- beta- and gamma-divergences:Flexible and robust measures of similarities. Entropy, 12(6):1532–1568.
5.Eguchi, S., Komori, O., and Kato, S. (2011). Projective power entropy and maximum tsallis entropy distributions. Entropy, 13(10):1746–1764.
6.Fujisawa, H. and Eguchi, S. (2008). Robust parameter estimation with a small bias against heavy contamination. Journal of Multivariate Analysis, 99(9):2053–2081.
7.Maćkiewicz, A. and Ratajczak, W. (1993). Principal components analysis (pca). Computers & Geosciences, 19(3):303–342.
8.Morissette, L. and Chartier, S. (2013). The k-means clustering technique: General considerations and implementation in mathematica. Tutorials in Quantitative Methods for Psychology, 9(1):15–24.
9.Nakagawa, T. and Hashimoto, S. (2020). Robust bayesian inference via γ-divergence.Communications in Statistics-Theory and Methods, 49(2):343–360. |