dc.description.abstract |
The development of dialogue systems has become a popular research topic in recent years, and many companies have a demand for them. Dialogue systems can be divided into two categories according to their purpose. The first is the task-oriented dialogue system, such as a customer service agent that answers customer questions in a specific domain, or the personal assistant Siri, which integrates information (phone address book, weather, calendar, time, and other related information) and answers queries. The second is the non-task-oriented dialogue system, such as the chatbot Alice, whose main purpose is companionship through simple chat. Our research focuses on the latter. The goal is to respond to the user's utterance, which may be a question, a complaint, a sigh, a statement of fact, and so on. How a chatbot should respond is the key issue addressed in this paper.
Short text conversation (STC) systems can be divided into two categories: retrieval-based and generation-based. The former depends on the quality of the database, while the latter requires a grammar-checking module. In this paper, we aim to address the generation-based STC problem but adopt a retrieval-based approach as the foundation, using the web as the database and retrieving candidate sentences from Google search result abstracts, so we do not need to collect a large text corpus in advance. Our method consists of three steps: first, query keyword generation; second, pre-processing of punctuation and candidate sentences; third, ranking the sentences with SVMrank.
We use the NTCIR STC2 response evaluation criteria. Three non-expert annotators evaluated the responses to 100 posts. The average score is 0.713. | en_US |