English  |  正體中文  |  简体中文  |  全文筆數/總筆數 : 94201/94201 (100%)
造訪人次 : 80415553      線上人數 : 138
RC Version 7.0 © Powered By DSPACE, MIT. Enhanced by NTU Library IR team.
搜尋範圍 查詢小技巧:
  • 您可在西文檢索詞彙前後加上"雙引號",以獲取較精準的檢索結果
  • 若欲以作者姓名搜尋,建議至進階搜尋限定作者欄位,可獲得較完整資料
  • 進階搜尋


    請使用永久網址來引用或連結此文件: https://ir.lib.ncu.edu.tw/handle/987654321/106883


    題名: On efficiently mining high utility sequential patterns
    作者: 陳以錚;Wang, Jun-Zhe;Huang, Jiun-Long;Chen, Yi-Cheng
    貢獻者: 管理學院資訊管理學系
    關鍵詞: Algorithms;Analysis;Candidates;Computer Science;Data mining;Data Mining and Knowledge Discovery;Database Management;Efficiency;Information Storage and Retrieval;Information Systems and Communication Service;Information Systems Applications (incl.Internet);IT in Business;Pattern analysis;Pattern recognition;Performance evaluation;Pruning;Regular Paper;Searching;Strategy;Studies;Television sets;Utilities
    日期: 2016-11-01
    上傳時間: 2026-04-23 13:47:47 (UTC+8)
    出版者: Springer London;London: Springer Science and Business Media LLC
    摘要: 摘要: High utility sequential pattern mining is an emerging topic in pattern mining, which refers to identify sequences with high utilities (e.g., profits) but probably with low frequencies. To identify high utility sequential patterns, due to lack of downward closure property in this problem, most existing algorithms first generate candidate sequences with high sequence-weighted utilities (SWUs), which is an upper bound of the utilities of a sequence and all its supersequences, and then calculate the actual utilities of these candidates. This causes a large number of candidates since SWU is usually much larger than the real utilities of a sequence and all its supersequences. In view of this, we propose two tight utility upper bounds, prefix extension utility and reduced sequence utility, as well as two companion pruning strategies, and devise HUS-Span algorithm to identify high utility sequential patterns by employing these two pruning strategies. In addition, since setting a proper utility threshold is usually difficult for users, we also propose algorithm TKHUS-Span to identify top- k high utility sequential patterns by using these two pruning strategies. Three searching strategies, guided depth-first search (GDFS), best-first search (BFS) and hybrid search of BFS and GDFS, are also proposed to improve the efficiency of TKHUS-Span. Experimental results on some real and synthetic datasets show that HUS-Span and TKHUS-Span with strategy BFS are able to generate less candidate sequences and thus outperform other prior algorithms in terms of mining efficiency.
    其他題名: Knowl Inf Syst
    出版者: London: Springer Science and Business Media LLC
    出版日期: 2016-11-01
    出處: Knowledge and Information Systems, 2016-11, Vol.49 (2), p.597-627
    資源來源: ABI/INFORM Collection
    版權: Springer-Verlag London 2016
    識別號: ISSN: 0219-1377
    識別號: EISSN: 0219-3116
    識別號: DOI: 10.1007/s10115-015-0914-8
    識別號: CODEN: KISNCR
    顯示於類別:[資訊管理學系] 期刊論文

    文件中的檔案:

    檔案 描述 大小格式瀏覽次數
    index.html0KbHTML29檢視/開啟


    在NCUIR中所有的資料項目都受到原著作權保護.

    社群 sharing

    ::: Copyright National Central University. | 國立中央大學圖書館版權所有 | 收藏本站 | 設為首頁 | 最佳瀏覽畫面: 1024*768 | 建站日期:8-24-2009 :::
    DSpace Software Copyright © 2002-2004  MIT &  Hewlett-Packard  /   Enhanced by   NTU Library IR team Copyright ©   - 隱私權政策聲明