 Chinea-Rios, Mara, Germán Sanchis-Trilles, and Francisco Casacuberta. "Sentence clustering using continuous vector space representation." Iberian Conference on Pattern Recognition and Image Analysis. Springer International Publishing, 2015.
 Le, Quoc V., and Tomas Mikolov. "Distributed Representations of Sentences and Documents." ICML. Vol. 14. 2014.
 Chang, Yung-Chun, et al. "Linguistic Template Extraction for Recognizing Reader-Emotion and Emotional Resonance Writing Assistance." ACL-IJCNLP (2015): 775-780.
 Wang, Li. Hanyu Shigao. Vol. 2. Science Press, 1958.
 Huang, Hen-Hsen, Chuen-Tsai Sun, and Hsin-Hsi Chen. "Classical Chinese sentence segmentation." Proceedings of CIPS-SIGHAN Joint Conference on Chinese Language Processing. 2010.
 Shi, Min, X. H. Chen, and B. Li. "CRF Based Research on a Unified Approach to Word Segmentation and POS Tagging for Pre-Qin Chinese." Journal of Chinese Information Processing 2.24 (2010): 39-45.
 Liu, Shih-Gang. "Automated Annotation of Person Name of the Veritable Records of the Qing Dynasty." Master Thesis, Department of Computer Science and Information Engineering, National Taiwan University (2012): 1-50.
 Kao, Shin-Kai. "Automated Annotation of Geo-information of Historical Documents: A Case Study with the Veritable Records of the Qing Dynasty." Master Thesis, Department of Computer Science and Information Engineering, National Taiwan University (2013): 1-40.
 Pang, Wai-him, et al. "Automated Name-extraction in Chinese Classics: Applying PMI (Pointwise Mutual Information) Segmentation to Zizhi Tongjian." Digital Humanities and Craft: Technological Change. (2014): 232.
 Tang, Yafen. "Research of Automatically Recognizing Name in Pre-Qin Ancient Chinese Classics." XIANDAI TUSHU QINGBAO JISHU 29.7/8 (2013): 63-68.
 Li, Qi, Heng Ji, and Liang Huang. "Joint Event Extraction via Structured Prediction with Global Features." ACL (1). 2013.
 Aliguliyev, Ramiz M. "A new sentence similarity measure and sentence based extractive technique for automatic text summarization." Expert Systems with Applications 36.4 (2009): 7764-7772.
 Wang, Dingding, et al. "Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization." Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2008.
 Sarkar, Kamal. "Sentence clustering-based summarization of multiple text documents." International Journal of Computing Science and Communication Technologies 2.1 (2009): 325-335.
 Han, Jiawei, Jian Pei, and Micheline Kamber. Data mining: concepts and techniques. Elsevier, 2011.
 Wei, Furu, et al. "Query-sensitive mutual reinforcement chain and its application in query-oriented multi-document summarization." Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2008.
 Kumaran, Giridhar, and James Allan. "Text classification and named entities for new event detection." Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 2004.
 Hammouda, Khaled M., and Mohamed S. Kamel. "Efficient phrase-based document indexing for web document clustering." IEEE Transactions on knowledge and data engineering 16.10 (2004): 1279-1296.
 Zhao, Lin, Xuanjing Huang, and Lide Wu. "Fudan University at DUC 2005." Proceedings of DUC. Vol. 2005. 2005.
 Kotlerman, Lili, et al. "Sentence clustering via projection over term clusters." Proceedings of the First Joint Conference on Lexical and Computational Semantics-Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation. Association for Computational Linguistics, 2012.
 Mikolov, Tomas, et al. "Efficient estimation of word representations in vector space." arXiv preprint arXiv:1301.3781 (2013).
 MacQueen, James. "Some methods for classification and analysis of multivariate observations." Proceedings of the fifth Berkeley symposium on mathematical statistics and probability. Vol. 1. No. 14. 1967.
 Qian, Gang, et al. "Similarity between Euclidean and cosine angle distance for nearest neighbor queries." Proceedings of the 2004 ACM symposium on Applied computing. ACM, 2004.
 Chang, Chih-Chung, and Chih-Jen Lin. "LIBSVM: a library for support vector machines." ACM Transactions on Intelligent Systems and Technology (TIST) 2.3 (2011): 27.
 Dai, Andrew M., Christopher Olah, and Quoc V. Le. "Document embedding with paragraph vectors." arXiv preprint arXiv:1507.07998 (2015).
 Andrés-Ferrer, Jesús, Germán Sanchis-Trilles, and Francisco Casacuberta. "Similarity word-sequence kernels for sentence clustering." Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR). Springer Berlin Heidelberg, 2010.
 Yue, Chih-chia. "The Evolution of the Military System in Chiang-hsi during the Ming Dynasty." Bulletin of the Institute of History and Philology (BIHP) Vol. 66-4 (1995.12).
 Wikipedia. "Hundred Family Surnames." https://en.wikipedia.org/wiki/Hundred_Family_Surnames