摘要(英) |
Multiple function devices are a type of office machines which combines E-mail, fax,copy, printing, and scanning functions. It was designed to provide users with easy and promptoperation and usage. In the literature of data mining applications, very few focus on B2B selling forecast in Taiwan. Moreover, there is no a comparative study for the applicability of data mining techniques to different types of forecasting results, which are continuous and discrete prediction outputs. Therefore, in this thesis the research objective is to compare different supervised learning techniques for the sale forecast of multiple function devices. The contributions of this thesis are able to provide some guidelines for the case company to conduct sales forecast and can give academics a reference on B2B industry.
In the experiments, the attributes relate to sales from historical data are collected, and the data completeness in each attribute is also taken into account. Next, the historical selling quantity (i.e. continuous values) is used as the prediction output. In addition, the selling quantity is further divided into 3 classes by normal distribution for comparison. On the other hand, in order to find out the effect of performing feature selection on the forecasting result,PCA (principle components analysis) is used to select more representative attributes from the original data set. For model construction, different single and multiple classification techniques are compared.
The experimental results show that performing feature selection does not significantly affect the final prediction results no matter for continuous or discrete prediction output. For continuous prediction without PCA, the support vector machine (SVM) performs the best in terms of MAE (Mean Absolute Error). For discrete prediction without PCA, the SVM outperforms the other models in terms of prediction accuracy. |
參考文獻 |
60
參考文獻
【英文文獻】
1. Berry, M. J. A. and Linoff, G.(1997), Data Mining Technique for Marketing, Sale, and Customer Support, Wiley Computer.
2. Cabena, P.P.O. Hadjinian, R. Stadler, DR. J. Verhees, and A. Zanasi(1997), Discovering Data Mining from Concept to Implementation, Prentice Hall.
3. Dorian Pyle(1999), Data Preparation for Data Ming, Morgan Kaufmann.
4. Dunham, M. H.(2003), Data Mining Introductory and Advanced Topics, N. J. , Pearson Education Inc.
5. Fayyad, M.U(1996), ”Data Mining and Knowledge Discovery: Making Sense Out of Data, ”IEEE Expect, 11(10), 20-25
6. Frawley, W.J., Sharpiro, G. P. and Matheus C. J.(1992), “knowledge Discovery in Database: An Overview,” AI Magazine, 13(3), 57-10.
7. Grupe, G. H. and Owrang(1995), M. M “Database Mining Discovering New Knowledge and Cooperative Advantage,” Information System Management, l(12), 26-31.
8. Han, J. and Kamber M.(2001), Data Mining: Concepts and Techniques, Academic Press, San Diego.
9. Kleissner, C., & Technol, A. (1998). Data mining for the enterprise. System Sciences,1998., Proceedings of the Thirty-First Hawaii International Conference on, 7.
10. Rakesh Agrawal, Tomasz Imielinskim and Arun Swami(1993), Database Mining: A Performance Perspective, IEEE Trans on Knowledge and Data Engineering, 5(6),914-925.
11. Shaw, M. J., Subramaniam C., Tan, G. W. and Welge, M. E.(2001), Knowledge Management and Data Mining for Marketing,Decision Support Systems, 31, 127-137.
12. S. K. Shevade, S. S. Keerthi, C. Bhattacharyya, and K. R. K. Murthy(2000),Improvements to the SMO Algorithm for SVM Regression, IEEE TRANSACTIONS ON NEURAL NETWORKS, 11(5),1188-1193.
13. Smola, AJ. (1998), Learning with kernels, Unpublisheddoctoral dissertation. Department of ComputerScience, Technical University Berlin, Gemany.61
14. X. Wu, V. Kumar, J. R. Quinlan, J. Ghosh, Q. Yang, H. Motoda, et al.,(2008) "Top 10 algorithms in data mining," Knowledge and Information Systems, 14,1-37.
【中文文獻】
1. 行政院經濟建設委員會經濟研究處(2013),台灣經濟景氣指標月刊,第 37 卷,第 3期,44-51。
2. 沙永傑 (2004) 演化式演算法應用於資料探勘之研究,行政院國家科學委員會專題研究計畫成果報告。
3. 呂金河譯(2005),多變量分析,台北:約翰威立。
4. 呂學慶(2002),「台灣地區汽車市場銷售預測之探討」,國立中央大學管理學院高階主管企管碩士班碩士論文。
5. 范姜士燻(2008),「使用資料探勘輔助學習者探索大型資料庫-學習者經驗之研究」,中央大學網路學習科技研究所碩士論文。
6. 張崇賢(2010),「應用資料探勘於流通業智慧型行動電話銷售預測」,大同大學資訊經營學系(所)碩士論文。
7. 許郁卿(2011),「整合資料探勘及紅綠配行銷策略於商品銷售之研究」,靜宜大學資訊管理學系碩士論文。
8. 陳彥琴(2005),「應用灰色理論預測新上市之生技保健食品銷售量」,國立成功大學工業與資訊管理學系碩士在職專班論文。
9. 彭文正譯(2001),資料採礦:顧客關係管理暨電子行銷之應用,台北:數博網資訊。
10. 黃朋祥(2013),「運用資料採礦技術建立客戶流失預測模型以聯成電腦為例」,致理技術學院服務業經營管理研究所碩士論文。
11. 黃俊英(1999),「企業研究方法」,台北:東華書局。
12. 謝邦昌(2003),類神經網路概述及實例,輔仁大學統計資訊系。
13. 謝瑞南(2012),「港口吞吐量與景氣相關性之研究- 以台灣地區為例」,國立臺灣海洋大學商船學系所碩士論文。
【網路資料】
1. 教育部數位教學資源入口網(accessed 2013/9/19, available at:
http://content.edu.tw/senior/life_tech/tc_t2/inform/data2.htm)
2. MBA 智庫百科,(accessed 2013/9/19, available at:
http://wiki.mbalib.com/zh-tw/%E9%94%80%E5%94%AE%E6%94%B6%E5%85%A5
%E9%A2%84%E6%B5%8B)
3. 行政院經濟建設委員會(accessed 2013/9/19, available at:
http://www.cepd.gov.tw/PowerUp/m1.aspx?sNo=0009674)
4. 全人教育百寶箱(accessed 2013/9/19, available at:
http://hep.ccic.ntnu.edu.tw/browse2.php?s=242) |