English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 78818/78818 (100%)
造訪人次 : 34469962      線上人數 : 2051
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋


    請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/84065


    題名: 在破產預測與信用評估領域對資料正規化與離散化的比較分析;Comparative Analysis of Data Normalization and Discretization for Bankruptcy Prediction and Credit Scoring
    作者: 羅聖明;Lo, Sheng-Ming
    貢獻者: 資訊管理學系
    關鍵詞: 正規化;離散化;破產預測;信用評估;機器學習;Normalization;Discretization;Bankruptcy Prediction;Credit Scoring;Machine Learning
    日期: 2020-07-29
    上傳時間: 2020-09-02 18:00:00 (UTC+8)
    出版者: 國立中央大學
    摘要: 在過往的破產預測以及信用評估領域中,有許多研究在前處理時進行正規化,然而大多研究僅採用單一種正規化方法進行實驗。為了瞭解正規化在破產預測與信用評估領域的適用性,本研究蒐集了四個信用資料集(Australia、Japan、Germany、Kaggle)與四個破產資料集(Bankruptcy、Japan、TEJ-Taiwan、USA),搭配四種正規化方法,minMAX、MaxAbs、Standard、Robust,並以三種不同的分類器,K-Nearest Neighbor、Logistic Regression、Support Vector Machine進行分類,期望能了解不同正規化方法對於結果的影響。另外,有鑑於近年也有研究在正規化後進行離散化,因此本研究也進一步探討是否正規化搭配離散化能夠更提升準確率並改善效能,主要採用三種離散化方法,最小化描述長度原則(Minimum Description Length Principle,MDLP)、卡方分箱法(ChiMerge)、CAIM(Class-Attribute Interdependence Maximization)。本研究發現在整體平均下,正規化方法(MaxAbs、Standard、Robust)對於AUC及Type II具有正面影響。而正規化若進一步搭配CAIM或MDLP,對於AUC及Type II會有更進一步的提升。在所有實驗組合中,Robust搭配MDLP在三種分類器都會達到最佳的AUC,而Standard搭配MDLP則會有最佳的Type II結果。;In the field of bankruptcy prediction and credit evaluation, many studies implemented normalization in data pre-processing, but most studies only conducted with a single normalization method. So, in order to understand the applicability of normalization in the field of bankruptcy prediction and credit evaluation. We collected four credit datasets (Australia, Japan, Germany, Kaggle) and four bankrupt datasets (Bankruptcy, Japan, TEJ-Taiwan, USA), with four normalization methods (minMAX, MaxAbs, Standard, Robust) and using three kinds of prediction models (K-Nearest Neighbor, Logistic Regression, Support Vector Machine) to examine the prediction performance of normalization. In addition, some studies have performed discretization after normalization in recent years, so this study further explores whether discretization after normalization can improve accuracy and performance, we use three differents kinds of discretization methods, Minimum Description Length Principle (MDLP), ChiMerge and Class-Attribute Interdependence Maximization (CAIM).
    This study founds that under the overall average, the normalization methods (MaxAbs, Standard and Robust) has a positive effect on AUC and Type II. Moreover, if the normalization is further combine with CAIM or MDLP, the AUC and Type II will effectively improve. In all experimental combinations, Robust with MDLP will achieve the best AUC in three classifiers, and Standard with MDLP will have the best Type II result.
    顯示於類別:[資訊管理研究所] 博碩士論文

    文件中的檔案:

    檔案 描述 大小格式瀏覽次數
    index.html0KbHTML193檢視/開啟


    在NCUIR中所有的資料項目都受到原著作權保護.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明