

    Please use this permanent URL to cite or link to this item: http://ir.lib.ncu.edu.tw/handle/987654321/74662


    Title: Improving Performance of Prediction Model with Outlier Detection and Feature Selection (結合離群值偵測與特徵選取改善預測模型性能)
    Author: LIN, JUN-QING (林俊慶)
    Contributor: Department of Computer Science and Information Engineering
    Keywords: Outlier Detection; Feature Selection; Multiple Linear Regression; Learning Performance Prediction
    Date: 2017-07-19
    Upload time: 2017-10-27 14:35:28 (UTC+8)
    Publisher: National Central University
    Abstract: Identifying at-risk students early and accurately, so that teachers can intervene in time and improve students' learning performance, is a focus of much related research.
    A blended course combines online and offline learning: unlike a traditional offline course, students can also learn through an online learning platform. In doing so, they leave many records of the learning process, such as homework grades, video-viewing behavior, online activity frequency, and online test grades. This thesis therefore applies data mining and machine learning techniques to learning-activity data collected from a blended calculus course, and uses multiple linear regression to predict students' final grades.
    Related research indicates that the accuracy of a prediction model is easily affected by outliers. This thesis therefore uses the RANSAC algorithm for outlier detection and removes the detected outliers from the data. To further improve accuracy after outlier removal, it uses the t-test as a feature selection method, retaining only the key features that have a significant effect on the final grade.
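    The three steps described above (RANSAC outlier removal, t-test feature selection, multiple linear regression) can be sketched as follows. This is a minimal illustration under stated assumptions, not the thesis's implementation: the data are synthetic, the function names, sample size, residual threshold, and the |t| > 2 cutoff are all hypothetical choices, and mean absolute error is used only because the abstract does not name its error metric.

```python
import numpy as np


def design(X):
    """Prepend an intercept column to the feature matrix."""
    return np.column_stack([np.ones(len(X)), X])


def fit_ols(X, y):
    """Multiple linear regression by least squares: [intercept, slopes...]."""
    beta, *_ = np.linalg.lstsq(design(X), y, rcond=None)
    return beta


def predict(beta, X):
    return design(X) @ beta


def ransac_inliers(X, y, sample_size=8, threshold=6.0, n_trials=200, seed=0):
    """Return a boolean mask for the largest consensus set found.

    Each trial fits the model on a small random sample and counts the points
    whose absolute residual is below `threshold` (hypothetical settings).
    """
    rng = np.random.default_rng(seed)
    best = np.zeros(len(y), dtype=bool)
    for _ in range(n_trials):
        idx = rng.choice(len(y), size=sample_size, replace=False)
        beta = fit_ols(X[idx], y[idx])
        mask = np.abs(y - predict(beta, X)) < threshold
        if mask.sum() > best.sum():
            best = mask
    return best


def t_statistics(X, y):
    """t statistic of every slope coefficient (intercept excluded)."""
    A = design(X)
    beta = fit_ols(X, y)
    resid = y - A @ beta
    dof = len(y) - A.shape[1]
    cov = (resid @ resid / dof) * np.linalg.inv(A.T @ A)
    return beta[1:] / np.sqrt(np.diag(cov)[1:])


def select_features(X, y, t_crit=2.0):
    """Keep features with |t| > t_crit (roughly the two-sided 5% level
    for large samples; an assumed cutoff)."""
    return np.abs(t_statistics(X, y)) > t_crit


def mean_abs_error(beta, X, y):
    return float(np.mean(np.abs(y - predict(beta, X))))


def demo(seed=0):
    # Synthetic stand-in data, NOT the thesis data: three activity features,
    # of which only the first two influence the grade.
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(120, 3))
    y = 60 + 8 * X[:, 0] + 5 * X[:, 1] + rng.normal(scale=1.5, size=120)
    y[:10] += 40  # ten artificial outliers

    mae_raw = mean_abs_error(fit_ols(X, y), X, y)

    inliers = ransac_inliers(X, y)                   # 1. outlier detection
    keep = select_features(X[inliers], y[inliers])   # 2. feature selection
    X_clean, y_clean = X[inliers][:, keep], y[inliers]
    beta = fit_ols(X_clean, y_clean)                 # 3. final regression
    mae_clean = mean_abs_error(beta, X_clean, y_clean)
    return {"inliers": inliers, "keep": keep,
            "mae_raw": mae_raw, "mae_clean": mae_clean}


if __name__ == "__main__":
    result = demo()
    print("points kept:", int(result["inliers"].sum()))
    print("features kept:", result["keep"])
    print("MAE before / after:", result["mae_raw"], result["mae_clean"])
```

    Both errors in this sketch are measured in-sample, so they only show the direction of the effect; a real evaluation would hold out data or cross-validate.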
    The results show that the proposed outlier detection and feature selection process reduces the prediction error from 15.516 points to 4.571 points, an improvement of about 70 percent.
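    The "about 70 percent" figure is consistent with reading it as the relative reduction between the two reported errors (an interpretation; the abstract does not spell out the calculation):

```latex
\[
\frac{15.516 - 4.571}{15.516} = \frac{10.945}{15.516} \approx 0.705
\]
```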
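    Appears in collections: [Graduate Institute of Computer Science and Information Engineering] Master's and Doctoral Theses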

    Files in this item:

    File        Description  Size  Format  Views
    index.html               0Kb   HTML    220


    All items in NCUIR are protected by the original copyright.
