中大機構典藏-NCU Institutional Repository-提供博碩士論文、考古題、期刊論文、研究計畫等下載:Item 987654321/77634
English  |  正體中文  |  简体中文  |  全文笔数/总笔数 : 80990/80990 (100%)
造访人次 : 41649975      在线人数 : 1440
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜寻范围 查询小技巧:
  • 您可在西文检索词汇前后加上"双引号",以获取较精准的检索结果
  • 若欲以作者姓名搜寻,建议至进阶搜寻限定作者字段,可获得较完整数据
  • 进阶搜寻


    jsp.display-item.identifier=請使用永久網址來引用或連結此文件: http://ir.lib.ncu.edu.tw/handle/987654321/77634


    题名: 以word2vec擴展關鍵字詞應用於商品名稱自動化分類
    作者: 石秀媖;Shih, Hsiu-Ying
    贡献者: 資訊管理學系
    关键词: 文件分類;深度學習;詞向量;自動化分類;word2vec;Text Classification;Word Embedding;Deep Learning;Automatic Classification
    日期: 2018-07-27
    上传时间: 2018-08-31 14:51:02 (UTC+8)
    出版者: 國立中央大學
    摘要: 隨著網際網路及資訊技術普及應用,人們經由網路所獲得資訊的時間越來越短,但 資訊量急劇增加,過量的資訊逐漸形成資訊爆炸的問題,各個企業與機構的數位文件也 不斷快速累積,數量大到難以有效的管理與利用,文件分類 (Text Classification) 因應而 生,利用自動化的技術協助人工分類,來應付大量暴增的分類需求。傳統文件分類的做 法是以人工方式進行,近年來,深度學習 (Deep Learning) 已被廣泛討論並應用在多種 研究上,許多文獻顯示利用深度學習的技術,可以幫助結果更加完善或增進效能。
    本研究利用消費者於實體店面購買的商品名稱資料,透過新興的深度學習技術,應 用 word2vec 詞向量模型於文件自動化分類,藉由其自行學習語義間關係的技術,將商 品自動分類到正確的類別,並透過四個實驗探討不同的因素下訓練出的 word2vec 詞向 量模型,會影響其成效,最後也證實以 word2vec 擴展關鍵字詞能提高分類成效。;With the popularization of Internet and technology, people get more information through the Internet, but the amount of information has increased dramatically. Excessive information has gradually formed the problem of information explosion. Digital documents of various enterprises and organizations are also constantly increasing. The amount of digital documents is large to be difficult to manage and utilize effectively. Text Classification is created in response to deal with the massive surge in classification needs. Traditional text classification is done manually. In recent years, Deep Learning has been widely discussed and applied in a variety of studies. Many literatures show that deep learning techniques can help improve results or improve performance.
    This study uses the data of product names purchased by consumers in physical store to apply the word2vec word embedding model to the automatic classification of documents through the deep learning technology. By self-learning the semantic relationship, the products are automatically classified into the correct category. And through a number of experiments to explore the word2vec word embedding model trained under different factors, will affect its effectiveness. Finally, this study confirmed that applied word2vec to expand keywords can improve the effect of classification.
    显示于类别:[資訊管理研究所] 博碩士論文

    文件中的档案:

    档案 描述 大小格式浏览次数
    index.html0KbHTML271检视/开启


    在NCUIR中所有的数据项都受到原著作权保护.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明