Name 陳冠吟 (Guan-Yin Chen)   Department Graduate Institute of Human Resource Management
Thesis Title A Comparative Study of Decision Trees, Logistic Regression, and Neural Networks for Predicting Employee Performance
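Abstract (Chinese, translated) In the human resources field, applications of data-mining classification techniques are still uncommon. This study uses records from the personnel database provided by a case company as its research sample. After the data were collected and compiled, they were split into a training sample and a validation sample, and three data-mining techniques (decision tree, logistic regression, and neural network) were used to build models that predict whether an employee's performance is high or low.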
Abstract (English) Classification algorithms from data mining are rarely used to predict employee performance. This study uses personnel data from a case company as its research sample. After cleaning and compiling, the data are divided into a training dataset and a validation dataset. Three data-mining techniques (decision tree, logistic regression, and neural network) are then used to build employee performance prediction models on the training dataset.
  The results show that the decision tree and neural network models predict employee performance best on the validation dataset. Both models reach an accuracy of 90%, with AUCs of 0.907 and 0.914, respectively. This indicates that the decision tree and neural network models have better predictive ability than logistic regression.
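The evaluation described in the abstract (hold out a validation set, then compare models by accuracy and AUC) can be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the thesis's own code. The function names, the 30% validation fraction, the 0.5 decision threshold, and the sample labels and scores are all assumptions made for the example; the abstract does not state them.

```python
import random

def train_validation_split(rows, validation_fraction=0.3, seed=0):
    """Shuffle a copy of rows and return (training, validation) lists."""
    shuffled = rows[:]
    random.Random(seed).shuffle(shuffled)
    n_validation = int(round(len(shuffled) * validation_fraction))
    return shuffled[n_validation:], shuffled[:n_validation]

def accuracy(labels, scores, threshold=0.5):
    """Share of cases where the thresholded score matches the 0/1 label."""
    hits = sum((s >= threshold) == bool(y) for y, s in zip(labels, scores))
    return hits / len(labels)

def auc(labels, scores):
    """Probability that a random positive outscores a random negative.

    Ties count as half a win (the rank-based definition of ROC AUC).
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical validation labels (1 = high performer) and the scores
# one fitted model assigned to the same employees.
labels = [1, 1, 1, 0, 0, 0, 1, 0]
scores = [0.9, 0.8, 0.4, 0.3, 0.2, 0.6, 0.7, 0.1]

print("accuracy:", accuracy(labels, scores))
print("AUC:", auc(labels, scores))
```

Running the same two functions on each model's validation scores gives directly comparable figures. Accuracy depends on the chosen threshold, while AUC summarizes ranking quality across all thresholds, which is why reporting both is informative.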
Keywords (Chinese, translated) ★ data mining
★ human resource management
★ decision tree
★ logistic regression
★ neural network
Keywords (English) ★ data mining
★ human resource management
★ decision tree
★ logistic regression
★ neural network
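Chapter 1 Introduction
Section 1 Research Background and Motivation
Section 2 Research Objectives
Section 3 Research Process
Chapter 2 Literature Review
Section 1 Theories of Employee Performance
Section 2 Data Analytics in Human Resources
Section 3 Data-Mining Algorithms
Chapter 3 Research Method
Section 1 Research Framework
Section 2 Data Preprocessing
Section 3 Research Tools
Chapter 4 Analysis of Results
Section 1 Descriptive Statistics
Section 2 Model Building
Section 3 Model Comparison
Chapter 5 Conclusion
Section 1 Conclusions
Section 2 Research Limitations and Suggestions
References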
Advisors 鄭晉昌, 林俊宏 (Jihn-Chang Jehng, Chun-Hung Lin)   Approval Date 2017-6-13
