||Basheer, I. A., and Hajmeer, M. (2000). “Artificial neural networks:Fundamentals, computing, design, and application.” Journal of Microbiological Methods, 43(1), pp. 3–31.|
Cortes, C. and Vapnik, V. (1995). “Support-Vector Networks.” Machine Learning, 20(3), pp. 273–297.
Drucker, H., Wu, D., and Vapnik, V.N. (1999). “Support Vector Machines for Spam cate- gorization.” IEEE Transactions on Neural Networks, 10(5), pp. 1048–1054.
Enrquez, F., Troyano, J.A., Lpez-Solaz, T. (2016). “An approach to the Use of Word Embeddings in an Opinion Classification Task.” Expert Systems with Applications, 66(12), pp. 1–6.
Fürnkranz, J. (1998). “A Study Using N-Gram Features for Text Categorization.” Austrian Research Institute for Artifical Intelligence, 3(1998), pp. 1–10.
Greff, K., Srivastava, R. K., Koutn´ık, J., Steunebrink, B. R., and Schmidhuber, J. (2015). “LSTM: A Search Space Odyssey.” CoRR, abs/1503.04069.
Guggilla, C., Miller, T.,and Gurevych, I. (2016) “CNN-and LSTM-based claim classification in online user comments.” In Proceedings of the 26th International Conference on Computational Linguistics: Technical Papers (COLING 2016), pp. 2740–2751.
Hinton, G. E. (1986). “Learning distributed representations of concepts.” In Proceedings of the eighth annual conference of the cognitive science society, pp. 1–12.
Hochreiter, S., and Schmidhuber, J. (1997). “Long short-term memory,” Neural computation, 9(8), pp. 1735–1780.
Ikonomakis, M., Kotsiantis, S., and Tampakas, V. (2005). “Text Classification Using Machine Learning Techniques.” WSEAS Transactions on Computers, 4(8), pp. 966–974.
Joachims, T. (1998.) “Text Categorization with Support Vector Machines: Learning with Many Relevant Features.” In Proceedings of the European Conference on Machine Learning (ECML), pp. 137–142.
Lilleberg, J., Zhu, Y., and Zhang, Y. (2015). “Support Vector Machines and Word2vec for Text Classification with Semantic Features.” In 2015 IEEE 14th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC), pp. 136–140.
Medlock, B. (2003). “A Language Model Approach to Spam Filtering.” http://www.benmedlock.co.uk/medlock-03.pdf [accessed on Apr. 1, 2008], 7 pages.
Mikolov, T., Chen, K., Corrado, G., and Dean, J. (2013). “Efficient Estimation of Word Representations in Vector Space.” CoRR, abs/1301.3781.
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., and Dean, J. (2013). “Distributed Representations of Words and Phrases and Their Compositionality.” In NIPS, pp. 3111–3119.
Mikolov, T., Deoras, A. Povey, D., Burget, L., and Cernocky, J. (2011). “Strategies for Training Large Scale Neural Network Language Models.” In Proceedings of Automatic Speech Recognition and Understanding(ASRU), pp. 196–201.
Masand, B., Linoff, G., and Waltz, D. (1992). “Classifying news stories using memorybased reasoning.” In Proceedings of SIGIR-92, 15th ACM International Conference on Research and Development in Information Retrieval (Kobenhavn, DK, 1992), pp. 59–65.
Olah, C. (2015). “Understanding LSTM Networks.”, colah′ blog, 27 August. Available at
https://colah.github.io/posts/2015-08-Understanding-LSTMs. [Accessed 25 Apr. 2019].
Pennington, J., Socher, R., and Manning, C.D. (2014). “Glove: Global vectors for word representation,” In Proceedings of the Empirical Methods in Natural Language Processing, pp. 1532–1543.
Pradhan, L., Taneja, N.A., Dixit, C., and Suhag, M. (2017) “Comparison of Text Classifiers on News Articles.” Int. Res. J. Eng. Technol., 4(3), pp. 2513–2517.
Salton, G., and Buckley, C. (1988). “Term weighting approaches in automatic text retrieval.” Information Processing and Management, 24(5), pp. 513-523.
Sebastiani, F. (2002). “Machine learning in automated text categorization.” ACM Computing Surveys, 34(1), pp. 1−47.
Sak, H., Senior, A., and Beaufays, F. (2014). “Long short-term memory recurrent neural network architectures for large scale acoustic modeling.” In Proceedings of the Annual Conference of International Speech Communication Association (INTERSPEECH).
Shen, D., Sun, J., Yang, Q. and Chen, Z. (2006). “Text Classification Improved Through Multigram Models.” In Proceedings of the 15th ACM International Conference on Information and Knowledge Management, pp. 672–681.
Su, Z., Xu, H., Zhang, D., and Xu, Y. (2014). “Chinese sentiment classification using a neural network tool- Word2vec” In 2014 International Conference on Multisensor Fusion and Information Integration for Intelligent Systems (MFI), pp. 1–6.
Sundermeyer, M., Schluter, R., and Ney, H. (2010). “Lstm neural networks for language modeling.” In INTERSPEECH.
Spärck Jones, K. (1972). “A statistical interpretation of term specificity and its application
in retrieval.” Journal of Documentation, 28 (1), pp. 11–21.
van Aken, B., Risch, J., Krestel, R., L¨oser, A. (2018). “Challenges for toxic comment classification: An in-depth error analysis.” In Proceedings of the Workshop on Abusive Language Online (ALW@EMNLP), pp. 33–42.
Weinberger, K.Q., Blitzer, J., and Saul, L.K. (2006). “Distance metric learning for large margin nearest neighbor classification.” In Advances NIPS.
Zhang, D., Xu, H., Su, Z., and Xu, Y. (2015). “Chinese Comments Sentiment Classification Based on Word2vec and SVMperf.” Expert Systems with Applications, 42(4), pp. 1857–1863.
Zhu, Z., Zhang, W., Li, G-Z., He, C.,and Zhang, L. (2016) "A study of damp-heat syndrome classification using Word2vec and TF-IDF." In Proceedings of 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 15-18.