博碩士論文 108225016 完整後設資料紀錄

DC 欄位 語言
DC.contributor統計研究所zh_TW
DC.creator陳奕儒zh_TW
DC.creatorYi-Ju Chenen_US
dc.date.accessioned2022-8-1T07:39:07Z
dc.date.available2022-8-1T07:39:07Z
dc.date.issued2022
dc.identifier.urihttp://ir.lib.ncu.edu.tw:88/thesis/view_etd.asp?URN=108225016
dc.contributor.department統計研究所zh_TW
DC.description國立中央大學zh_TW
DC.descriptionNational Central Universityen_US
dc.description.abstract主成分分析(PCA)是一種常用的線性降維方法,在降維過程中保留了數據之間變數的變異性。PCA 通常用於可視化單個數據集;對比成分分析 (CPCA) 是傳統 PCA的推廣。CPCA 可用於存在多個數據集(如實驗組和對照組)的情況,CPCA 可以在參考其他數據集的前提下探索特定數據集獨特的低維結構。然而,雖然 CPCA 已在許多領域被證明可以找到 PCA 忽略的重要數據模式(Abubakar Abid,2017),但CPCA 缺乏一個統計模型來告訴我們為什麼 CPCA 可以識別我們感興趣的那些變化。在本文中,我們提出 CPCA 的模型假設。我們將目標數據劃分為我們感興趣的信號 矩陣和我們不感興趣的滋擾矩陣,並試圖說明我們不感興趣的滋擾矩陣對目標數據的影響可以通過 CPCA 移除。另一方面,我們通過模擬分析說明 CPCA 還原信號矩陣的優勢。除此之外,我們根據我們對 CPCA 的模型假設提出了一種新方法,用以幫助我們選取對執行 CPCA 很重要的對比參數。最後,我們通過調整對比參數在合成圖像示例中找到了感興趣的數據模式,並驗證了我們選擇對比參數的新方法可以達到相同的效果。zh_TW
dc.description.abstractPrincipal Component Analysis (PCA) is a commonly used linear dimensionality reduction method and is often used to visualize a single dataset; Contrastive Component Analysis (CPCA) can be used in situations where there are multiple datasets, and CPCA can explore the unique low-dimensional structure of a specific dataset on the premise of referring to other datasets. However, while CPCA has been shown in many fields to find important data patterns that PCA ignores (Abubakar Abid, 2017), CPCA lacks a statistical model to tell us why CPCA can identify those changes that we are interested in. In this paper, we propose a statistical model for CPCA. We divide the target data into the signal matrix that we are interested in and the nuisance matrix that we are not interested in, and try to explain that the influence of the nuisance matrix on the target data can be removed by CPCA. On the other hand, we illustrate the advantages of CPCA in restoring the signal matrix using simulation analysis. Furthermore, we propose a new method based on our model to help us decide on the contrast parameter that is important to perform CPCA. Finally, we found data patterns of interest in the synthetic image example by adjusting the contrast parameter, and verified that our new method of choosing the contrast parameter can achieve the same effect.en_US
DC.subject子組發現zh_TW
DC.subject可視化zh_TW
DC.subject特徵選取zh_TW
DC.subject去噪zh_TW
DC.subjectsubgroup discoveryen_US
DC.subjectvisualizingen_US
DC.subjectfeature selectionen_US
DC.subjectdenoisingen_US
DC.titleContrastive Principal Component Analysis for High Dimension, Low Sample Size Dataen_US
dc.language.isoen_USen_US
DC.type博碩士論文zh_TW
DC.typethesisen_US
DC.publisherNational Central Universityen_US

若有論文相關問題,請聯絡國立中央大學圖書館推廣服務組 TEL:(03)422-7151轉57407,或E-mail聯絡  - 隱私權政策聲明