參考文獻 |
[1] J. Devlin, M.W. Chang, K. Lee, and K. Toutanova, “Bert: Pretraining of deep
bidirectional transformers for language understanding,” 2019.
[2] Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, “Gradientbased learning applied to
document recognition,” Proceedings of the IEEE, vol. 86, no. 11, pp. 2278–2324,
1998.
[3] L. S. Larkey and W. B. Croft, “Combining classifiers in text categorization,” in
Proceedings of the 19th Annual International ACM SIGIR Conference on Research
and Development in Information Retrieval, ser. SIGIR ’96. New York, NY, USA:
Association for Computing Machinery, 1996, p. 289–297. [Online]. Available:
https://doi.org/10.1145/243199.243276
[4] D. Gao, W. Yang, H. Zhou, Y. Wei, Y. Hu, and H. Wang, “Deep hierarchical classification for category prediction in ecommerce system,” 2020.
[5] G.R. Xue, D. Xing, Q. Yang, and Y. Yu, “Deep classification in largescale
text hierarchies,” in Proceedings of the 31st Annual International ACM SIGIR
Conference on Research and Development in Information Retrieval, ser. SIGIR ’08.
24
New York, NY, USA: Association for Computing Machinery, 2008, p. 619–626.
[Online]. Available: https://doi.org/10.1145/1390334.1390440
[6] D. Hendrycks and K. Gimpel, “A baseline for detecting misclassified and outofdistribution examples in neural networks,” 2018.
[7] P. F. Brown, P. V. deSouza, R. L. Mercer, V. J. D. Pietra, and J. C. Lai, “Classbased
ngram models of natural language,” Comput. Linguist., vol. 18, no. 4, p. 467–479,
Dec. 1992.
[8] T. Mikolov, M. Karafiát, L. Burget, J. H. Cernocký, and S. Khudanpur, “Recurrent
neural network based language model,” in INTERSPEECH, 2010.
[9] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez,
L. u. Kaiser, and I. Polosukhin, “Attention is all you need,” in Advances in
Neural Information Processing Systems, I. Guyon, U. V. Luxburg, S. Bengio,
H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, Eds., vol. 30. Curran
Associates, Inc., 2017. [Online]. Available: https://proceedings.neurips.cc/paper/
2017/file/3f5ee243547dee91fbd053c1c4a845aaPaper.pdf
[10] C. N. Silla and A. A. Freitas, “A survey of hierarchical classification across different
application domains,” Data Mining and Knowledge Discovery, vol. 22, pp. 31–72,
2010.
[11] S. Kumar, J. Ghosh, and M. Crawford, “Hierarchical fusion of multiple classifiers
for hyperspectral data analysis,” Pattern Anal. Appl., vol. 5, pp. 210–220, 06 2002.
25
[12] W. Liu, X. Wang, J. D. Owens, and Y. Li, “Energybased outofdistribution detection,” 2021.
[13] Y.C. Hsu, Y. Shen, H. Jin, and Z. Kira, “Generalized odin: Detecting outofdistribution image without learning from outofdistribution data,” 06 2020, pp.
10 948–10 957.
[14] S. Kiritchenko and F. Famili, “Functional annotation of genes using hierarchical text
categorization,” Proceedings of BioLink SIG, ISMB, 01 2005.
[15] Y. Wu, M. Schuster, Z. Chen, Q. V. Le, M. Norouzi, W. Macherey, M. Krikun, Y. Cao,
Q. Gao, K. Macherey, J. Klingner, A. Shah, M. Johnson, X. Liu, Łukasz Kaiser,
S. Gouws, Y. Kato, T. Kudo, H. Kazawa, K. Stevens, G. Kurian, N. Patil, W. Wang,
C. Young, J. Smith, J. Riesa, A. Rudnick, O. Vinyals, G. Corrado, M. Hughes, and
J. Dean, “Google’s neural machine translation system: Bridging the gap between
human and machine translation,” 2016.
[16] K. Lee, K. Lee, K. Min, Y. Zhang, J. Shin, and H. Lee, “Hierarchical novelty detection for visual object recognition,” 2018 |