![]() |
以作者查詢圖書館館藏 、以作者查詢臺灣博碩士 、以作者查詢全國書目 、勘誤回報 、線上人數:25 、訪客IP:18.118.162.243
姓名 劉柏廷(Po-Ting Liu) 查詢紙本館藏 畢業系所 資訊管理學系 論文名稱 關聯規則應用於證券交易相對關係規則之挖掘
(Association Rule Based Relational Mining for Stock Trading)相關論文 檔案 [Endnote RIS 格式]
[Bibtex 格式]
[相關文章]
[文章引用]
[完整記錄]
[館藏目錄]
[檢視]
[下載]
- 本電子論文使用權限為同意立即開放。
- 已達開放權限電子全文僅授權使用者為學術研究之目的,進行個人非營利性質之檢索、閱讀、列印。
- 請遵守中華民國著作權法之相關規定,切勿任意重製、散佈、改作、轉貼、播送,以免觸法。
摘要(中) 傳統上當關聯規則在分析數值型資料(Numerical Data)時,常見的處理方式就是先將資料進行離散化之後再進行運算,也就是將不同的數值分類到各個長度相同的區間中,故離散化的分類方式是屬於所謂的「絕對數值」比較。但實際上某些數值型資料在本質上並不適於這種離散化處理。以證券市場的資料為例,有許多常見且知名的技術指標的使用方式是屬於「相對數值」的比較,而非「絕對數值」的比較方式。因此,如果我們單純地對數值型資料進行等距或等量離散化而不考慮資料在本質上存在有「相對比較」的關係時,將造成資訊流失的問題。
因此,本研究將提出「相對數值」的比較關係,讓數值型資料的處理方式不只侷限於等距劃分或等量劃分這種「絕對比較」的資料處理方式,而藉由「相對數值」的比較關係能讓資料的應用方式使得更符合該資料原本的意義,並使關聯規則能夠更加合適地應用在數值型資料上。
在運用了「相對數值比較」的概念於關聯規則中,並進行資料挖掘之後,本研究將以「分類型關聯規則」的方式,對目標欄位進行分類預測,而分類型關聯規則包括了「規則精簡」與「整體預測」二個步驟,「規則精簡」將利用子集合的概念從所有的規則之中,篩選出較為一般化的規則,以對規則進行簡化與整合的動作,並改善關聯規則產生過多的規則的缺點,最後利用「整體預測」對目標資料進行預測,並利用總合信心水準門檻的機制來提升預測的準確率。
本研究將針對2003年1月1日到2006年12月31日的美國股市交易資料,進行上述方法的實驗驗證,相較於只運用「絕對比較」的資料處理方式,在加入了「相對數值」的比較關係之後,無論是在訓練期以及測試期,其預測的準確率都有顯著的提升,而在本研究進一步加入「規則精簡」與「整體預測」的方法之後,也能有效提高預測的準確率,足作為投資決策參考之用。摘要(英) When it comes to analyzing numerical data by Association Rule, we have to disperse those numerical data before we start to use them as a data mining source data. The common data dispersed methods are “equal width interval” and “equal frequency interval”. We categorize these two methods into “absolute”, because both of them classify different values into each interval with the same length. In practice, equal width interval and equal frequency interval are not necessary the suitable way to deal with all kinds of data. For example, the usage of many popular and famous technical analysis indicators is considered “relative-comparison”, rather than “absolute- comparison”. Therefore, if we simply treat all kinds of data as “absolute-comparison” data without thinking about whether those data have “relative-comparison” characteristics in nature, we may lead to information loss because we ignore some important features in those data.
For this reason, we propose a concept of “relative-type comparative relation” which is an alternative to “equal width interval” and “equal frequency interval” for data preprocessing. Through “relative-comparison” we can transfer numerical data to data mining source data in a more appropriate way that make the source data more similar into the numerical data in meaning, so that we can reduce information loss and enhance the result of data mining.
After applying “relative-comparison” to association rule data mining, we use CBA(Classification Based on Associations) to classify and predict the target data. CBA can be divided in two steps which are “rule simplification” and “collective evaluation.” “Rule simplification” eliminates those redundant rules and integrates those general rules for classification. “Collective evaluation” uses the total confidence of screened rules to classify and predict the target data and enhance the accuracy of classification and prediction.
The experimental data is extracted from American stock trading data form 2003 to 2006. The results of the experiments show that the application of “relative-comparison” does improve the precision of stock price estimation. After we implement “rule simplification” and “collective evaluation” in the experiments, we improve the precision rate to a higher level.關鍵字(中) ★ 資料挖掘
★ 關聯規則
★ 相對關係
★ 技術分析關鍵字(英) ★ Technical Analysis
★ Relative Relation
★ Association Rules
★ Data Mining論文目次 中文摘要..........i
Abstract..........ii
致謝..............iv
目錄..............v
圖目錄............vii
表目錄............viii
第1章 緒論..........1
1.1 研究背景與動機..........1
1.2 研究目的..........2
1.3 論文架構..........3
第2章 文獻探討..........4
2.1 證券投資理論..........4
2.1.1 效率市場假說..........4
2.1.2 基本分析..........5
2.1.3 技術分析..........6
2.2 資料挖掘方法論..........14
2.2.1 資料庫知識發掘流程..........15
2.2.2 關聯規則定義..........17
2.2.3 關聯規則演算法..........19
2.2.4 數值型關聯規則..........22
2.2.5 以關聯規則為基礎之分類..........23
第3章 研究架構..........25
3.1 選擇股市資料與資料期間..........26
3.2 資料處理..........26
3.3 資料轉換..........28
3.4 分類型關聯規則資料挖掘..........29
3.4.1 Apriori關聯規則分析..........29
3.4.2 精簡關聯規則..........32
3.4.3 整合關聯規則分析..........34
3.5 形成投資決策分析依據..........36
第4章 實驗結果與分析..........38
4.1 實驗資料說明..........38
4.1.1 技術指標與其應用方式之選擇..........38
4.1.2 金融產品之選擇..........39
4.2 實驗參數設定..........40
4.3 實驗設計..........42
4.4 實驗結果..........45
4.4.1 實驗一:預測目標值為上漲..........45
4.4.1.1 「絕對數值」型與加入「相對數值」型對提高精確率之比較..........47
4.4.1.2 提高信心水準門檻值對提高準確率之比較..........52
4.4.2 實驗二:預測目標值為下跌..........54
4.4.2.1 「絕對數值」型與加入「相對數值」型對提高精確率之比較..........54
4.4.2.2 提高信心水準門檻值對提高準確率之比較..........58
第5章 結論與未來研究方向..........60
5.1 結論與研究貢獻..........60
5.2 未來研究方向..........62
參考文獻..........63參考文獻 [1] 陳共、周升業、吳曉求,證券投資分析,台北:五南圖書出版公司,2001。
[2] 陳伯仁,證券交易策略發掘,中央大學資訊管理研究所碩士論文,2002。
[3] Achelis and Steven B., Technical Analysis from A to Z, McGraw-Hill, New York, 2000.
[4] Agrawal, Rakesh, and R. Srikant, “Fast algorithm for mining association rules,” In Proc. 1994 Int. Conf. Very Large Data Bases (VLDB’94), 1994, pp. 487-499.
[5] Agrawal, Rakesh, T. Imielinski and A. Swami, “Mining Association Rules between Sets of Items in Large Databases,” In Proc. 1993 ACM-SIGMOD Int. Conf. Management of Data (SIGMOD’93), 1993, pp. 207-216.
[6] Berry, Michael J.A. and G.S. Linoff, Data Mining Technique: For Marketing, Sales, and Customer Relationship Management, Wiley Computer Publishing, New York, 1997.
[7] Brealey, Richard, Stewart Myers, and Alan Marcus, Fundamentals of Corporate Finance, 4th ed., McGraw-Hill, 2003.
[8] de Graaf, Jeannette M., Walter A. Kosters and Jeroen J.W. Witteman, “Interesting Fuzzy Association Rules in Quantitative Databases,” In Principles of Data Mining and Knowledge Discovery-LNAI 2168, Berlin, Springer-Verlag, pp. 140-151.
[9] Fama, E. F “The behavior of stock market prices,” Journal of Business, Vol. 38, 1965, pp. 34-106.
[10] Fama, E. F. “Efficient Capital Markets: A Review of Theory and Empirical Work,” Journal of Finance, Vol. 25, 1970, pp. 383-417.
[11] Fama, E. F. “Efficient capital markets II,” Journal of Finance, Vol. 46, 1991, pp. 1575-1643.
[12] Fayyad, Usama M, G. Piatetsky-Shapiro and P. Symth, “From data mining to knowledge discovery in databases”, AI Magazine, 1996, pp.37-54.
[13] Fayyad, Usama M, G. Piatetsky-Shapiro and P. Symth, “Overview of Data Mining and Knowledge Discovery,” Knowledge Discovery and Data Mining, 1996, AAAI press, pp. 1-36.
[14] Granville, Joseph E., Granville's New Strategy of Daily Stock Market Timing for Maximum Profit, Prentice Hall, 1976.
[15] Han, Jiawei and Yongjian Fu, “Discovery of Multiple-Level Association Rules from Large Databases,” In Proc. 1995 Int. Conf. Very Large Data Base (VLDB’95), 1995, pp. 420-431.
[16] Hong, Tzung-Pei, Chan-Sheng Kuo and Sheng-Chai Chi, “A fuzzy data mining algorithm for quantitative values,” Knowledge-Based Intelligent Information Engineering Systems, 1999, pp480-482.
[17] Jobman, Darrell R., The Handbook of Technical Analysis, Irwin, New York, 1995.
[18] Kuok, Chan Man, Ada Fu and Man Hon Wong, “Mining fuzzy association rules in databases,” ACM SIGMOD Record, 1998, p.41-46.
[19] Liu, Bing, Yiming Ma, and Ching-Kian Wong, “Classification using association rules: weaknesses and enhancements,” In Vipin Kumar, et al. (Ed.), Data mining for scientific and engineering applications, 2001, pp. 591.
[20] Liu, Bing, Yiming Ma, and Ching-Kian Wong, “Integrating Classification and Association Rule Mining,” In Proceedings of KDD-98, 1998.
[21] Liu, Bing and Yiming Ma, “Mining Association Rules with Multiple Minimum Supports,” In Proceedings of KDD-99, 1999.
[22] Miller, R. J. and Y. Yang, “Association rules over interval data,” Proc. 1997, ACM-SIGMOD Int. Conf. Management of Data (SIGMOD’97), 1997, pp 452-461.
[23] Pan, Heping, “A Joint Review of Technical and Quantitative Analysis of Financial Markets Towards A Unified Science of Intelligent Finance,” Hawaii International Conference on Statistics and Related Fields, 2003.
[24] Schwager, Jack D., The New Market Wizards, Harper Business, 1994.
[25] Schwager, Jack D., Getting Started in Technical Analysis, John Wiley & Sons, 1999.
[26] Shaw, M. J., C. Subramaniam, G.W. Tan and M.E. Welge, “Knowledge Management and Data Mining for Marketing,” Decision Support Systems, Vol. 31, 2001, pp.127-137.
[27] Srikant, Ramakrishnan and Agrawal Rakesh, “Mining quantitative association rules in large relational tables,” Proc. 1996, ACM-SIGMOD Int. Conf. Management of Data (SIGMOD’96), 1996, pp 1-12.
[28] Suraphan, Thawornwong, David Enke and Cihan Dagli “Neural Networks as a Decision Maker for Stock Trading: A Technical Analysis Approach,” Journal of Smart Engineering Systems Design, Vol. 5, 2003, pp. 1-13.指導教授 陳稼興(Jiah-Shing Chen) 審核日期 2007-6-27 推文 plurk
funp
live
udn
HD
myshare
netvibes
friend
youpush
delicious
baidu
網路書籤 Google bookmarks
del.icio.us
hemidemi
myshare