利用資料探勘技術建立商用複合機銷售預測模型

以作者查詢圖書館館藏

、以作者查詢臺灣博碩士

、以作者查詢全國書目

、勘誤回報

、線上人數：6

、訪客IP：18.218.172.249

姓名

洪彥群(Yen-Chun Hung) 查詢紙本館藏

畢業系所

資訊管理學系在職專班

論文名稱

利用資料探勘技術建立商用複合機銷售預測模型
(Applying Data Mining Techniques to Construct the Sale Forecast Model for Multiple Function Devices)

相關論文

★ 應用資料探勘技術於資源配置預測之研究-以某電腦代工支援單位為例	★ 資料探勘技術應用於航空業航班延誤分析-以C公司為例
★ 全球供應鏈下新產品的安全控管-以C公司為例	★ 資料探勘應用於半導體雷射產業-以A公司為例
★ 應用資料探勘技術於空運出口貨物存倉時間預測-以A公司為例	★ 使用資料探勘分類技術優化YouBike運補作業
★ 特徵屬性篩選對於不同資料類型之影響	★ 資料探勘應用於B2B網路型態之企業官網研究-以T公司為例
★ 衍生性金融商品之客戶投資分析與建議-整合分群與關聯法則技術	★ 應用卷積式神經網路建立肝臟超音波影像輔助判別模型
★ 基於卷積神經網路之身分識別系統	★ 能源管理系統電能補值方法誤差率比較分析
★ 企業員工情感分析與管理系統之研發	★ 資料淨化於類別不平衡問題: 機器學習觀點
★ 資料探勘技術應用於旅客自助報到之分析—以C航空公司為例	★ 應用機器學習建立單位健保欠費催繳後繳納預測模型

檔案

[Endnote RIS 格式]

[Bibtex 格式]

[相關文章]

[文章引用]

[完整記錄]

[館藏目錄]

至系統瀏覽論文 ( 永不開放)

摘要(中)

商用多功能複合機是將影印、列印、傳真以及掃描等多項功能配載於單一裝置中，
透過簡易與直覺化的操作，提供使用者一站式的服務，藉此以提升工作效率；對於辦公室採購者而言，可以減少其他裝置的採購與佈署，讓商辦空間坪效獲得更靈活運用。
國內目前針對銷售預測的論文題目數量相當多，但鮮少有文獻嘗試進行「連續型與
離散型資料」及「單一與多重分類器比較」的銷售預測效能進行比較，因此本研究以個案公司的真實銷售資料，試圖找出符合個案公司需求之最適工具並期許本研究結果能提供學術界參考。
本研究針對資料來源逐步進行維度篩選、無效資料刪除、維度整理、資料前處理等
動作。在實驗流程上，將資料分成連續型資料與離散型資料，並分別透過資料探勘工具Weka3.6.9 版本，進行不同分類器實驗，以試圖獲得最佳銷售預測模型。其中離散型資料是根據個案公司每月銷售數量，以常態分配法劃分為 3 類。
為能找出個案公司資料中具備影響力的維度，本研究更進一步比PCA(principle
components analysis)篩選後的維度，其連續型與離散型的預測結果。在連續型資料的預測工具上，本研究分別採用 Linear Regression、MultilayerPerceptron、SMOreg 與 kNN 等4 種單一分類器，並搭配 Additive Regression 與 Bagging 多重分類器加以驗證；在離散型資料則採用 MultilayerPerceptron、SMO、LibSVM、kNN、CART 與 BayNet 等 6 種單一分類器，並搭配 Adaboost 與 Bagging 多重分類器加以驗證。
經過實驗結果得知，PCA 對於連續型或離散型資料的預測結果影響都不大，而在連
續型資料上，以 SMOreg 的表現最佳，錯誤率整體來說最低；而在離散型資料，則以LibSVM 的正確率較高。

摘要(英)

Multiple function devices are a type of office machines which combines E-mail, fax,copy, printing, and scanning functions. It was designed to provide users with easy and promptoperation and usage. In the literature of data mining applications, very few focus on B2B selling forecast in Taiwan. Moreover, there is no a comparative study for the applicability of data mining techniques to different types of forecasting results, which are continuous and discrete prediction outputs. Therefore, in this thesis the research objective is to compare different supervised learning techniques for the sale forecast of multiple function devices. The contributions of this thesis are able to provide some guidelines for the case company to conduct sales forecast and can give academics a reference on B2B industry.
In the experiments, the attributes relate to sales from historical data are collected, and the data completeness in each attribute is also taken into account. Next, the historical selling quantity (i.e. continuous values) is used as the prediction output. In addition, the selling quantity is further divided into 3 classes by normal distribution for comparison. On the other hand, in order to find out the effect of performing feature selection on the forecasting result,PCA (principle components analysis) is used to select more representative attributes from the original data set. For model construction, different single and multiple classification techniques are compared.
The experimental results show that performing feature selection does not significantly affect the final prediction results no matter for continuous or discrete prediction output. For continuous prediction without PCA, the support vector machine (SVM) performs the best in terms of MAE (Mean Absolute Error). For discrete prediction without PCA, the SVM outperforms the other models in terms of prediction accuracy.

關鍵字(中)

★ 資料探勘
★ 銷售預測
★ 單一分類器與多重分類器

關鍵字(英)

★ Data Mining
★ Sales Forecast
★ Single Classifiers
★ Multiple Classifiers

論文目次

iv
目錄
摘要 ..............................................i
Abstract ..........................................ii
目錄 ..............................................iv
圖目錄 ............................................vi
表目錄 ............................................vii
第一章前言 .......................................1
1.1 研究背景 ......................................1
1.2 研究動機 ......................................2
1.3 研究目的 ......................................3
1.4 研究對象與範圍 ................................4
1.4.1 個案公司介紹 ................................4
1.4.2 資料蒐集範圍 ................................4
1.4.3 資料蒐集的限制 ..............................4
第二章文獻探討 ...................................5
2.1 資料探勘介紹 ..................................5
2.1.1 資料探勘定義 ................................5
2.1.2 資料探勘常見功能 ............................7
2.1.3 資料探勘的程序 ..............................9
2.2 銷售預測目的與方法 ............................11
2.2.1 銷售預測目的 ................................11
2.2.2 銷售預測的方法 ..............................14
2.3 景氣指標介紹 ..................................19
2.3.1 景氣對策信號(Monitoring Indicator)...........21
2.3.2 景氣燈號(Monitoring Lights) .................22
2.4 相關銷售預估論文回顧 ..........................23
第三章研究方法 ...................................26
3.1 監督式學習技術 ................................26
3.1.1 Linear Regression............................26
3.1.2 Multilayer Perceptron........................27
3.1.3 SVM..........................................27
3.1.4 kNN..........................................28
3.1.5 CART.........................................29
3.1.6 Bayes Network................................30
3.1.7 Adaboost.....................................30
3.1.8 Bagging......................................31
3.2 實驗流程 ......................................32
3.2.1 資料來源 ....................................32
3.2.2 主成分分析 Principle Components Analysis.....36
3.2.3 模型建立流程 ................................40
3.2.4 連續型資料 ..................................41
3.2.5 離散型資料 ..................................41
3.2.6 K 折交叉驗證 K-Fold Cross-Validation.........42
第四章研究結果 ...................................43
4.1 連續型資料研究結果 ............................43
4.1.1 Linear Regression............................44
4.1.2 MultilayerPerceptron.........................45
4.1.3 SMOreg.......................................46
4.1.4 kNN .........................................47
4.1.5 連續型資料結果小結 ..........................48
4.2 離散型資料研究結果 ............................49
4.2.1 MultilayerPerceptron.........................50
4.2.2 CART.........................................50
4.2.3 LibSVM.......................................51
4.2.4 SMO .........................................51
4.2.5 kNN .........................................52
4.2.6 BayesNetwork.................................53
4.2.7 離散型資料結果小結..............................54
4.3 討論 ..........................................55
第五章結論 .......................................56
5.1 研究結論 ......................................56
5.2 研究貢獻 ......................................58
5.3 研究限制及未來研究方向 ........................59
5.3.1 研究限制 ....................................59
5.3.2 建議未來研究的方向 ..........................59
參考文獻 ..........................................60

參考文獻

60
參考文獻
【英文文獻】
1. Berry, M. J. A. and Linoff, G.(1997), Data Mining Technique for Marketing, Sale, and Customer Support, Wiley Computer.
2. Cabena, P.P.O. Hadjinian, R. Stadler, DR. J. Verhees, and A. Zanasi(1997), Discovering Data Mining from Concept to Implementation, Prentice Hall.
3. Dorian Pyle(1999), Data Preparation for Data Ming, Morgan Kaufmann.
4. Dunham, M. H.(2003), Data Mining Introductory and Advanced Topics, N. J. , Pearson Education Inc.
5. Fayyad, M.U(1996), ”Data Mining and Knowledge Discovery: Making Sense Out of Data, ”IEEE Expect, 11(10), 20-25
6. Frawley, W.J., Sharpiro, G. P. and Matheus C. J.(1992), “knowledge Discovery in Database: An Overview,” AI Magazine, 13(3), 57-10.
7. Grupe, G. H. and Owrang(1995), M. M “Database Mining Discovering New Knowledge and Cooperative Advantage,” Information System Management, l(12), 26-31.
8. Han, J. and Kamber M.(2001), Data Mining: Concepts and Techniques, Academic Press, San Diego.
9. Kleissner, C., & Technol, A. (1998). Data mining for the enterprise. System Sciences,1998., Proceedings of the Thirty-First Hawaii International Conference on, 7.
10. Rakesh Agrawal, Tomasz Imielinskim and Arun Swami(1993), Database Mining: A Performance Perspective, IEEE Trans on Knowledge and Data Engineering, 5(6),914-925.
11. Shaw, M. J., Subramaniam C., Tan, G. W. and Welge, M. E.(2001), Knowledge Management and Data Mining for Marketing,Decision Support Systems, 31, 127-137.
12. S. K. Shevade, S. S. Keerthi, C. Bhattacharyya, and K. R. K. Murthy(2000),Improvements to the SMO Algorithm for SVM Regression, IEEE TRANSACTIONS ON NEURAL NETWORKS, 11(5),1188-1193.
13. Smola, AJ. (1998), Learning with kernels, Unpublisheddoctoral dissertation. Department of ComputerScience, Technical University Berlin, Gemany.61
14. X. Wu, V. Kumar, J. R. Quinlan, J. Ghosh, Q. Yang, H. Motoda, et al.,(2008) "Top 10 algorithms in data mining," Knowledge and Information Systems, 14,1-37.
【中文文獻】
1. 行政院經濟建設委員會經濟研究處(2013)，台灣經濟景氣指標月刊，第 37 卷，第 3期，44-51。
2. 沙永傑 (2004) 演化式演算法應用於資料探勘之研究，行政院國家科學委員會專題研究計畫成果報告。
3. 呂金河譯(2005)，多變量分析，台北：約翰威立。
4. 呂學慶(2002)，「台灣地區汽車市場銷售預測之探討」，國立中央大學管理學院高階主管企管碩士班碩士論文。
5. 范姜士燻(2008)，「使用資料探勘輔助學習者探索大型資料庫-學習者經驗之研究」，中央大學網路學習科技研究所碩士論文。
6. 張崇賢(2010)，「應用資料探勘於流通業智慧型行動電話銷售預測」，大同大學資訊經營學系(所)碩士論文。
7. 許郁卿(2011)，「整合資料探勘及紅綠配行銷策略於商品銷售之研究」，靜宜大學資訊管理學系碩士論文。
8. 陳彥琴(2005)，「應用灰色理論預測新上市之生技保健食品銷售量」，國立成功大學工業與資訊管理學系碩士在職專班論文。
9. 彭文正譯(2001)，資料採礦：顧客關係管理暨電子行銷之應用，台北：數博網資訊。
10. 黃朋祥(2013)，「運用資料採礦技術建立客戶流失預測模型以聯成電腦為例」，致理技術學院服務業經營管理研究所碩士論文。
11. 黃俊英(1999)，「企業研究方法」，台北：東華書局。
12. 謝邦昌(2003)，類神經網路概述及實例，輔仁大學統計資訊系。
13. 謝瑞南(2012)，「港口吞吐量與景氣相關性之研究- 以台灣地區為例」，國立臺灣海洋大學商船學系所碩士論文。
【網路資料】
1. 教育部數位教學資源入口網(accessed 2013/9/19, available at:
http://content.edu.tw/senior/life_tech/tc_t2/inform/data2.htm)
2. MBA 智庫百科，(accessed 2013/9/19, available at:
http://wiki.mbalib.com/zh-tw/%E9%94%80%E5%94%AE%E6%94%B6%E5%85%A5
%E9%A2%84%E6%B5%8B)
3. 行政院經濟建設委員會(accessed 2013/9/19, available at:
http://www.cepd.gov.tw/PowerUp/m1.aspx?sNo=0009674)
4. 全人教育百寶箱(accessed 2013/9/19, available at:
http://hep.ccic.ntnu.edu.tw/browse2.php?s=242)

指導教授

蔡志豐(Chih-Fong Tsai)

審核日期

2014-5-15

推文