Thesis and Dissertation Record 964203027: Details




Author 范凱翔 (Kai-shiang Fan)   Graduating department Information Management
Thesis title 應用序列樣式探勘於軟體版本歷史之研究
(A study of applying sequential-pattern mining to software version histories)
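Related theses
★ A web-based collaborative team-teaching design platform: the case of the Grade 1-9 Curriculum in junior high schools
★ Applying content management mechanisms to frequently asked questions (FAQ)
★ Applying mobile multi-agent technology to course scheduling systems
★ A study of access control mechanisms and domestic information security regulations
★ An exploration of introducing NFC mobile phone transaction mechanisms into credit card systems
★ App-based recommendation services in e-commerce: the case of Company P
★ Building a service-oriented system to improve production processes: the case of Company W's PMS system
★ Planning and introducing a TSM platform for NFC mobile payment
★ Keyword marketing at a semiconductor distributor: the case of Company G
★ A study of domestic track and field competition information systems: the case of the information system for the ROC year 103 (2014) National Intercollegiate Track and Field Open
★ Evaluating the effectiveness of introducing a pallet and container tracking management system for airline ground-handling ramp operations: the case of Company F
★ A study of information security management maturity after introducing an information security management system: the case of Company B
★ Applying data mining techniques to movie recommendation: the case of online video platform F
★ Using BI visualization tools for security log analysis: the case of Company S
★ An empirical study of a real-time analysis system for privileged account login behavior
★ Detecting and handling anomalous usage behavior in mail systems: the case of Company T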
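  1. This electronic thesis is authorized for immediate open access.
  2. Once open, the electronic full text is licensed to users only for personal, non-profit searching, reading, and printing for the purpose of academic research.
  3. Please comply with the relevant provisions of the Copyright Act of the Republic of China. Do not reproduce, distribute, adapt, repost, or broadcast the work without authorization, to avoid breaking the law.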

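Abstract (Chinese) Modern software evolves continuously. Although version control systems
record detailed information about changes between software versions, understanding of
increasingly complex software structures remains limited. At the same time, a single
project often runs to thousands or even tens of thousands of lines of code, which makes
software maintenance difficult. Existing data mining research on version control systems
mostly uses association rules and often overlooks the ordering relationships that may
exist among program changes. This study therefore adds the time dimension: it obtains
data from CVS repositories that are publicly available on the Internet and analyzes them
with sequential mining techniques to find latent sequential patterns. Compared with
earlier similar studies, it can indicate more precisely the "entities" in a software
project that need to change, and it uses a classification rule to examine the value of
the sequential patterns to users, providing a reference for future software maintenance.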
Abstract (English) Modern software evolves continually. Although version control
systems (VCS) store detailed information about how software versions change, our
understanding of increasingly complex software structures remains limited. Moreover,
software projects usually contain hundreds of thousands of lines of code, which makes
software maintenance difficult. Research applying data mining techniques to VCS data
is usually based on association rules, which tend to overlook ordering information.
This study therefore takes the time dimension into account: it applies
sequential-pattern mining to data from Concurrent Versions System (CVS) repositories
to find potential sequential patterns. Compared with previous studies, we expect to
identify more precisely the "entities" that need to be changed, and we evaluate the
value of the sequential patterns to users by means of a classification rule, in order
to provide a reference for future software maintenance.
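
The contrast the abstract draws between association rules and sequential patterns can be illustrated with a minimal Python sketch. It is not the thesis's implementation: the file names, the `histories` data, and the function names are hypothetical, and each history is assumed to be a list of changed files already ordered by commit time. Co-occurrence support ignores order, whereas sequential support counts a history only when the files change in the given order.

```python
from itertools import permutations

# Hypothetical change histories: each list holds the files touched
# in one maintenance task, ordered by commit time (an assumption).
histories = [
    ["Parser.java", "Lexer.java", "ParserTest.java"],
    ["Parser.java", "ParserTest.java"],
    ["ParserTest.java", "Parser.java"],
    ["Lexer.java", "Parser.java", "ParserTest.java"],
]

def is_subsequence(pattern, sequence):
    """True if the items of `pattern` occur in `sequence` in order (gaps allowed)."""
    remaining = iter(sequence)
    # `item in remaining` consumes the iterator up to the first match,
    # so later items must appear after earlier ones.
    return all(item in remaining for item in pattern)

def sequential_support(pattern, sequences):
    """Number of sequences that contain `pattern` as an ordered subsequence."""
    return sum(is_subsequence(pattern, s) for s in sequences)

def cooccurrence_support(items, sequences):
    """Number of sequences that contain all `items`, in any order."""
    return sum(set(items) <= set(s) for s in sequences)

def frequent_ordered_pairs(sequences, min_support):
    """All ordered file pairs (a, b) whose sequential support reaches `min_support`."""
    files = sorted({f for s in sequences for f in s})
    result = {}
    for a, b in permutations(files, 2):
        support = sequential_support((a, b), sequences)
        if support >= min_support:
            result[(a, b)] = support
    return result

pair = ("Parser.java", "ParserTest.java")
print(cooccurrence_support(pair, histories))        # order ignored
print(sequential_support(pair, histories))          # Parser before ParserTest
print(sequential_support(pair[::-1], histories))    # ParserTest before Parser
print(frequent_ordered_pairs(histories, min_support=3))
```

In this toy data the two files change together in all four histories, but only three of them have `Parser.java` changing first. An unordered rule cannot express that difference, which is the kind of ordering information the abstract says association rules tend to overlook.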
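Keywords (Chinese) ★ 版本控制系統 (version control system)
★ 循序探勘 (sequential mining)
★ 軟體演進 (software evolution)
★ 軟體工程 (software engineering)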
Keywords (English) ★ Software Engineering
★ Version Control System (VCS)
★ Sequential-pattern mining
★ Software evolution
Table of contents
Abstract (Chinese)
Abstract
Acknowledgments
Table of contents
List of figures
List of tables
Chapter 1 Introduction
1.1 Research background
1.2 Research motivation
1.3 Research objectives
1.4 Research method
1.5 Thesis organization
Chapter 2 Literature review
2.1 Types of software engineering data
2.2 Applications of data mining in software engineering
2.2.1 Association rules
2.2.2 Sequential rules
2.2.3 Comparison of association rules and sequential rules
2.2.4 Other methods
2.3 Sequential-pattern mining
2.3.1 Apriori-like
2.3.2 Pattern-growth
2.3.3 Sequence clustering
2.4 Summary
Chapter 3 System platform design
3.1 System architecture
3.2 Data collection
3.3 Data preprocessing
3.4 Sequential-pattern mining
3.5 Discussion of sequential patterns
Chapter 4 Experimental results and discussion
4.1 Experimental environment
4.2 Experimental subjects
4.3 Experimental design
4.3.1 Data collection
4.3.2 Data preprocessing
4.3.3 Sequential-pattern mining and classification analysis
4.4 Experimental results and discussion
4.4.1 XAMJ project
4.4.2 Sqlexplorer project
4.4.3 Findbugs project
4.4.4 OpenXava project
Chapter 5 Conclusions and future research directions
5.1 Conclusions
5.2 Future research directions
References
Chinese references
English references
Web resources
Advisor 林熙禎 (Shi-jen Lin)   Approval date 2009-7-9