群集分析(Clustering Analysis)為資料探勘(Data Mining)技術中所廣泛被人們所應用的技術之一,目前已被大量應用在商業顧客分析、醫學、病理學、自然科學等領域,近年來更大量用在基因資料分析或影像分析等領域。目前人們使用群集分析所探討的資料種類絕大部分為類別性資料(Qualitative data)、數量型資料(Quantitative data)或兩種資料混和型資料。較少有人討論到是否可以用群集分析的理論來探討流程型資料(Process Data or Flow Data)。本研究旨在建立能夠分析具有流程型資料之物件其相似度模型,發現將模型建立在群集分析之匯聚型階層式群集分析法(Agglomerative Hierarchical Method)之上,並使用Average Linkage計算兩物件之距離,可得到較佳之分群(Grouping)效果。但流程順序之相似度(similarity)判定需視不同研究標的給予不同的相似度得分,使用者須端看研究環境選擇較適合之判定準則。 本研究所挑選之個案資料為某發動機修護工廠執行GE CF6-80C2 Engine 翻修(Overhaul)工作時,維修過程中所產生之自修工件,希望能得到自修工件之間彼此在維修流程上之相似程度,以利管理階層於做工作站安排工作時或是其他管理決策之依據,例如修理站人員專長規劃、新維修能量開發標的甚至是未來維修站新購機台之考量。 最後得到的結果發現此分群模型可將發動機翻修時所產生的205項自修工件在經過流程相似度分群之後可得到8個較明顯之工件群組。在經過與現場工作人員討論後,可給予各個工件群組依維修流程上之相似意義,可見集群分析概念亦可替流程性類型資料做相似度分群,達到物件分群效果。 Cluster analysis is one of the data mining methods that are commonly applied in the fields including business analysis, medicine, pathology, and natural science. In recent years, it has also been largely applied to the genetic data analysis or image analysis fields. Nowadays, the majority of people who apply the cluster analysis technique are on qualitative data, quantitative data or mixed data types and it’s noted that fewer people discuss whether the cluster analysis theory can also be applied on process data or flow data. The aim of this thesis is to establish a similarity model that contains the objects of process data when applying cluster analysis. It’s found that a better grouping effect can be achieved when the model is based on the agglomerative hierarchical method of the cluster analysis and that the average linkage is used when calculating distance between objects. Nevertheless, the assessment of similarity of process order requires different scores depending on individual study subject therefore users need to set appropriate criteria that cater for their specific research environments. The case study used in this thesis is based on the data collected from an engine maintenance plant during the overhaul process on the GE CF6-80C2 Engine. Several in-house repair parts are being identified in this process and by studying the of similarity degree of these in-house repair parts, it’s hoped that the findings can be useful to the management in planning shifts or making other managerial decisions including competency planning for maintenance technicians, determining new maintenance capacity or acquisition of maintenance machinery. In conclusion, the study shows that after applying this similarity model, there are 205 in-house repairing parts identified in the engine overhaul process and those can be grouped into 8 major clusters. Having discussed further with production staff, each group can be given a similar meaning that corresponds to its overhaul process. As such, it shows that cluster analysis theory can also be applied in the process data type when carrying out grouping analysis based on their similarities.