|| A. Graves, “Supervised Sequence Labelling,” Studies in Computational Intelligence Supervised Sequence Labelling with Recurrent Neural Networks, pp. 5–13, 2012.|
 A. Monreale, F. Pinelli, R. Trasarti, and F. Giannotti, “WhereNext: a location predictor on trajectory pattern mining,” Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD 2009), pp.637-646, 2009.
 B. Zhang and Y. Feng, “Improving Temporal Recommendation Accuracy and Diversity via Long and Short-Term Preference Transfer and Fusion Models,” Web Technologies and Applications Lecture Notes in Computer Science, pp. 174–185, 2016.
 Berjani, B., & Strufe, T. (2011, April). A recommendation system for spots in location-based online social networks. In Proceedings of the 4th workshop on social network systems (p. 4). ACM.
 C. Aggarwal, “Context-Sensitive Recommender Systems,” Recommender Systems, pp. 255–281, 2016.
 C. Cheng, H. Yang, I. King, and M. Lyu. “Fused matrix factorization with geographical and social influence in location-based social networks,” Proceedings of the 26th AAAI Conference on Artificial Intelligence (AAAI 2012), pp. 17-23, 2012.
 C. Cheng, H. Yang, M. Lyu, and I. King, “Where you like to go next: successive point-of-interest recommendation,” Proceedings of the 23th international joint conference on Artificial Intelligence (IJCAI 2013), pp. 2605-2611, 2013.
 C. Zhang, K. Zhang, Q. Yuan, L. Zhang, T. Hanratty, and J. Han, “GMove: Group-Level Mobility Modeling Using Geo-Tagged Social Media,” Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2016), pp. 1305-1314, 2016.
 Chen, T., Chen, Y., Guo, H., & Luo, J. (2018, April). When E-commerce Meets Social Media: Identifying Business on WeChat Moment Using Bilateral-Attention LSTM. In Companion Proceedings of the The Web Conference 2018 (pp. 343-350). International World Wide Web Conferences Steering Committee.
 Cheng, C., Yang, H., King, I., & Lyu, M. R. (2012, July). Fused matrix factorization with geographical and social influence in location-based social networks. In Twenty-Sixth AAAI Conference on Artificial Intelligence.
 Cho, K., Van Merriënboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., & Bengio, Y. (2014). Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078.
 D. Graur, R. Mariş, R. Potolea, M. Dînşoreanu, and C. Lemnaru, “Complex Localization in the Multiple Instance Learning Context,” New Frontiers in Mining Complex Patterns Lecture Notes in Computer Science, pp. 93–106, 2018.
 D. Yao, C. Zhang, J. Huang, and J. Bi, “SERM: A Recurrent Model for Next Location Prediction in Semantic Trajectories,” Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (CIKM 2017), pp. 2411-2414, 2017.
 Gambs, S., Killijian, M. O., & del Prado Cortez, M. N. (2012, April). Next place prediction using mobility markov chains. In Proceedings of the First Workshop on Measurement, Privacy, and Mobility (p. 3). ACM.
 Ghosh, S., Vinyals, O., Strope, B., Roy, S., Dean, T., & Heck, L. (2016). Contextual lstm (clstm) models for large scale nlp tasks. arXiv preprint arXiv:1602.06291.
 H. Cheng, M. Ispir, R. Anil, Z. Haque, L. Hong, V. Jain, X. Liu, H. Shah, L. Koc, J. Harmsen, T. Shaked, T. Chandra, H. Aradhye, G. Anderson, G. Corrado, and W. Chai, “Wide & Deep Learning for Recommender Systems,” Proceedings of the 1st Workshop on Deep Learning for Recommender Systems (DLRS 2016), pp. 7-10, 2016.
 H. Dyvik, “Exploiting structural similarities in machine translation,” Computers and the Humanities, vol. 28, no. 4-5, pp. 225–234, 1994.
 H. Li, “Learning to Rank,” Encyclopedia of Machine Learning and Data Mining, pp. 1–6, 2016.
 H. Zhang, Y. Yang, H. Luan, S. Yang, and T. Chua, “Start from Scratch: Towards Automatically Identifying, Modeling, and Naming Visual Attributes,” Proceedings of the ACM International Conference on Multimedia (MM 2014), pp. 187-196, 2014.
 I. Baytas, C. Xiao, X. Zhang, F. Wang, A. Jain, and J. Zhou, “Patient Subtyping via Time-Aware LSTM Networks,” Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2017), pp. 65-74, 2017.
 J. Feng, Y. Li, C. Zhang, F. Sun, F. Meng, A. Guo, and D. Jin, “DeepMove: Predicting Human Mobility with Attentional Recurrent Networks,” Proceedings of the 2018 World Wide Web Conference on World Wide Web (WWW 2018), pp. 1459-1468, 2018.
 J. Manotumruksa, C. Macdonald, and I. Ounis, “A Contextual Attention Recurrent Architecture for Context-Aware Venue Recommendation,” Proceedings of the 41st International ACM SIGIR Conference on Research & Development in Information Retrieval (SIGIR 2018), pp. 555-564, 2018.
 J. Manotumruksa, C. Macdonald, and I. Ounis, “A Deep Recurrent Collaborative Filtering Framework for Venue Recommendation,” Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (CIKM 2017), pp. 1429-1438, 2017.
 J. Yoo and S. Choi, “Probabilistic matrix tri-factorization,” Proceeding of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2009), pp. 1553-1556, 2009.
 J. Zhang and C. Chow, “CRATS: An LDA-Based Model for Jointly Mining Latent Communities, Regions, Activities, Topics, and Sentiments from Geosocial Network Data,” IEEE Transactions on Knowledge and Data Engineering (TKDE 2016), vol. 28, no. 11, pp. 2895–2909, 2016.
 L. Xiong, X. Chen, T. Huang, J. Schneider, and J. Carbonell, “Temporal Collaborative Filtering with Bayesian Probabilistic Tensor Factorization,” Proceedings of the 2010 SIAM International Conference on Data Mining (SDM 2010), pp. 211-222, 2010.
 Li, P., Li, J., Sun, F., & Wang, P. (2017, August). Short Text Emotion Analysis Based on Recurrent Neural Network. In Proceedings of the 6th International Conference on Information Engineering (p. 6). ACM.
 M. Chen, Y. Liu, and X. Yu, “NLPMM: A Next Location Predictor with Markov Modeling,” Procedding of Advances in Knowledge Discovery and Data Mining Lecture Notes in Computer Science (PAKDD 2014), pp. 186–197, 2014.
 M. Ye, P. Yin, W. Lee, and D. Lee, “Exploiting geographical influence for collaborative point-of-interest recommendation,” Proceedings of the 34th international ACM SIGIR conference on Research and development in Information (SIGIR 2011), pp. 325-334, 2011.
 N. Du, H. Dai, R. Trivedi, U. Upadhyay, M. Gomez-Rodriguez, and L. Song, “Recurrent Marked Temporal Point Processes,” Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2016), pp. 1555-1564, 2016.
 Nallapati, R., Zhou, B., Gulcehre, C., & Xiang, B. (2016). Abstractive text summarization using sequence-to-sequence rnns and beyond. arXiv preprint arXiv:1602.06023.
 Nguyen, D. D., Le Van, C., & Ali, M. I. (2018, June). Vessel destination and arrival time prediction with sequence-to-sequence models over spatial grid. In Proceedings of the 12th ACM International Conference on Distributed and Event-based Systems (pp. 217-220).
 Park, S. H., Kim, B., Kang, C. M., Chung, C. C., & Choi, J. W. (2018, June). Sequence-to-sequence prediction of vehicle trajectory via LSTM encoder-decoder architecture. In 2018 IEEE Intelligent Vehicles Symposium (IV) (pp. 1672-1678). IEEE.
 Q. Liu, S. Wu, L. Wang, and T. Tan, “Predicting the next location: a recurrent model with spatial and temporal contexts,” Proceedings of the 13th AAAI Conference on Artificial Intelligence (AAAI 2016), pp.194-200, 2016.
 Q. Wu and C. Pu, “Modeling and implementing collaborative editing systems with transactional techniques,” Proceedings of the 6th International ICST Conference on Collaborative Computing: Networking, Applications, Worksharing (CollaborateCom 2010), pp. 1-10, 2010.
 R. Mehta and K. Rana, “A review on matrix factorization techniques in recommender systems,” The 2nd International Conference on Communication Systems, Computing and IT Applications (CSCITA 2017), pp. 269-274, 2017.
 R. Rosenkrantz, “Information Theory and Statistical Mechanics I (1957),” E. T. Jaynes: Papers on Probability, Statistics and Statistical Physics, pp. 4–16, 1989.
 S. Gambs, M. Killijian, and M. Cortez, “Next place prediction using mobility Markov chains,” Proceedings of the First Workshop on Measurement, Privacy, and Mobility (MPM 2012), no. 3, pp. 1-6, 2012.
 S. Rendle, C. Freudenthaler, and L. Schmidt-Thieme, “Factorizing personalized Markov chains for next-basket recommendation,” Proceedings of the 19th international conference on World Wide Web (WWW 2010), pp. 811-820, 2010.
 S. Zhao, M. Lyu, and I. King, “Geo-Teaser: Geo-Temporal Sequential Embedding Rank for POI Recommendation,” SpringerBriefs in Computer Science Point-of-Interest Recommendation in Location-Based Social Networks, pp. 57–78, 2018.
 S. Zhao, M. Lyu, and I. King, “STELLAR: Spatial-Temporal Latent Ranking Model for Successive POI Recommendation,” SpringerBriefs in Computer Science Point-of-Interest Recommendation in Location-Based Social Networks, pp. 79–94, 2018.
 Shi, Y., Serdyukov, P., Hanjalic, A., & Larson, M. (2011, July). Personalized landmark recommendation based on geotags from photo sharing sites. In Fifth International AAAI Conference on Weblogs and Social Media.
 Sutskever, I., Vinyals, O., & Le, Q. V. (2014). Sequence to sequence learning with neural networks. In Advances in neural information processing systems (pp. 3104-3112).
 V. Sridhar, “Unsupervised Text Normalization Using Distributed Representations of Words and Phrases,” Proceedings of the 1st Workshop on Vector Space Modeling for Natural Language Processing, pp. 8-16, 2015.
 Venugopalan, S., Rohrbach, M., Donahue, J., Mooney, R., Darrell, T., & Saenko, K. (2015). Sequence to sequence-video to text. In Proceedings of the IEEE international conference on computer vision (pp. 4534-4542).
 W. Mathew, R. Raposo, and B. Martins, “Predicting future locations with hidden Markov models,” Proceedings of the 2012 ACM Conference on Ubiquitous Computing (UbiComp 2012), pp. 911-918, 2012.
 Y. Zhang, H. Dai, C. Xu, J. Feng, T. Wang, J. Bian, B. Wang, and T. Liu, “Sequential click prediction for sponsored search with recurrent neural networks,” Proceedings of the 28th AAAI Conference on Artificial Intelligence (AAAI 2014), pp. 1369-1375, 2014.
 Y. Zhu, H. Li, Y. Liao, B. Wang, Z. Guan, H. Liu, and D. Cai, “What to Do Next: Modeling User Behaviors by Time-LSTM,” Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI 2017), pp. 3602-3608, 2017.
 Yang, Q., He, Z., Ge, F., & Zhang, Y. (2017, May). Sequence-to-sequence prediction of personal computer software by recurrent neural network. In 2017 International Joint Conference on Neural Networks (IJCNN) (pp. 934-940). IEEE.
 Zaremba, W., Sutskever, I., & Vinyals, O. (2014). Recurrent neural network regularization. arXiv preprint arXiv:1409.2329.