Hierarchical text classification with novelty detection for evolving taxonomies (用於不斷發展的分類法之具備新穎性檢測的分層文本分類技術)


Name

Jo-Han Yang (楊若函)

Department

Department of Computer Science and Information Engineering

Thesis title

用於不斷發展的分類法之具備新穎性檢測的分層文本分類技術
(Hierarchical text classification with novelty detection for evolving taxonomies)

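Related theses

★ A Real-time Embedding Increasing for Session-based Recommendation with Graph Neural Networks
★ Modifying the training objective based on the primary diagnosis for ICD-10 coding of discharge summaries
★ A hybrid approach to identifying heart disease risk factors and their progression in electronic medical records
★ A rapid adoption method for requirements-analysis output based on PowerDesigner conventions
★ Question retrieval in community forums
★ Unsupervised event type identification in historical texts: the case of Weisuo events in the Ming Shilu (《明實錄》)
★ Analyzing character relationships in literary fiction with natural language processing, presented through interactive visualization
★ Extracting function-level Biological Expression Language statements from biomedical text: a k-nearest-neighbor algorithm inspired by principal component analysis
★ Building article representation vectors from category systems for cross-lingual online encyclopedia linking
★ Code-Mixing Language Model for Sentiment Analysis in Code-Mixing Data
★ Improving dialogue state tracking by incorporating multiple speech recognition results
★ A dialogue system for a Chinese online customer service assistant: the telecommunications domain as a case study
★ Applying recurrent neural networks to answer questions at the appropriate time
★ Improving user intent classification with multi-task learning
★ Using transfer learning to improve pivot-language approaches to named entity transliteration
★ Finding experts on community question answering sites using history information vectors and topic expertise vectors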

Files


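This electronic thesis is authorized for immediate open access.
Electronic full texts that have reached their open-access date are licensed to users only for personal, non-profit searching, reading, and printing for academic research purposes.
Please comply with the Copyright Act of the Republic of China. Do not reproduce, distribute, adapt, repost, or broadcast the work without authorization, to avoid violating the law.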

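Abstract (Chinese)

In deep learning, techniques for classification tasks have become increasingly mature, and in recent years researchers have turned to hierarchical classification methods with novelty detection. In this thesis, we propose a hierarchical classification model with novelty detection that decomposes confidence scores and concatenates conditional probabilities, and that needs no additional novel-class data during training. The baseline models include the top-down method and the flat method. Comparing our model with the baselines, the results show that it not only improves accuracy on known classes but also identifies new classes more precisely. In addition, for the hierarchical novelty detection task, we propose a new scoring method that accounts for both novelty detection and hierarchical classification, so that it reflects model performance more accurately.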

Abstract (English)

With the development of deep-learning-based classification methods, hierarchical classification tasks with new-class detection have begun to attract researchers' attention. In this paper, we propose a hierarchical classification model with novelty detection that decomposes confidence and concatenates conditional probabilities, and that can be trained without labeled novelty data. We compare it with baseline models that include the top-down method and the flat method. The results show that our model improves classification accuracy on known categories and finds instances belonging to new categories more effectively. We also propose a new evaluation metric for the hierarchical novelty detection task. It considers both novelty detection and hierarchical classification, so it reflects model performance more clearly.
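The abstract gives no implementation details, so the following is only a minimal sketch of the general idea of top-down hierarchical prediction with a novelty fallback, not the thesis's model. The toy taxonomy, the `predict_path` function, the per-child confidence scores, and the 0.5 threshold are all hypothetical and chosen for illustration. The scores stand in for whatever conditional confidence a trained classifier would output at each node and are not assumed to sum to 1.

```python
from typing import Dict, List, Tuple

# Hypothetical toy taxonomy (parent -> children); not taken from the thesis.
TAXONOMY: Dict[str, List[str]] = {
    "root": ["science", "sports"],
    "science": ["physics", "biology"],
    "sports": ["soccer", "tennis"],
}


def predict_path(
    scores: Dict[str, Dict[str, float]], threshold: float = 0.5
) -> Tuple[List[str], float]:
    """Walk the taxonomy top-down from the root.

    scores maps each visited parent to {child: confidence of child given parent}.
    At every level the highest-scoring child is chosen. If even that score is
    below the threshold, the instance is flagged as novel under the current
    node and the descent stops. The returned float is the product of the
    confidences along the accepted path.
    """
    path: List[str] = []
    node = "root"
    joint = 1.0
    while node in TAXONOMY:  # leaves are not keys, so the loop ends there
        child_scores = scores[node]
        best = max(child_scores, key=child_scores.get)
        if child_scores[best] < threshold:
            path.append(f"novel({node})")
            break
        joint *= child_scores[best]
        path.append(best)
        node = best
    return path, joint


if __name__ == "__main__":
    # Confident at both levels: the path reaches a known leaf.
    known = {
        "root": {"science": 0.9, "sports": 0.1},
        "science": {"physics": 0.8, "biology": 0.2},
    }
    print(predict_path(known))

    # Confident about "science" but about none of its children:
    # the instance is reported as a novel class under "science".
    unknown = {
        "root": {"science": 0.9, "sports": 0.1},
        "science": {"physics": 0.4, "biology": 0.35},
    }
    print(predict_path(unknown))
```

Stopping at an internal node is what lets a novel instance still receive partial credit for the known part of its path, which is the kind of behavior a metric combining novelty detection and hierarchical classification would need to score.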

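Keywords (Chinese)

★ Natural language processing (自然語言處理)
★ Hierarchical text classification (階層式文字分類)
★ Hierarchical novelty detection (階層式新類偵測)

Keywords (English)

★ Natural Language Processing
★ Hierarchical Text Classification
★ Hierarchical Novelty Detection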

Table of Contents

Chinese Abstract
Abstract
Acknowledgements
Contents
List of Figures
List of Tables
1 Introduction
2 Related Work
2.1 Pretrained Language Model
2.2 Hierarchical Classification
2.3 Novelty Detection
2.4 Hierarchical Evaluation Measure
3 Method
3.1 Encoder
3.2 Hierarchical Decomposed Network
3.3 Concatenating Conditional Probability
4 Experiment
4.1 Dataset
4.2 Evaluation Metric
4.3 Evaluation of Our Model and Baseline
4.4 Ablation Experiment
5 Conclusion
Bibliography


Advisor

Richard Tzong-Han Tsai (蔡宗翰)

Approval date

2021-10-19
