论文标题
后臂:通过后生成对抗网络提供信息和连贯的响应产生
Posterior-GAN: Towards Informative and Coherent Response Generation with Posterior Generative Adversarial Network
论文作者
论文摘要
神经对话模型学会通过考虑对话历史记录来产生响应。这些模型通常在查询响应对上进行优化,并具有最大似然估计目标。但是,查询响应元素是自然松散耦合的,并且存在多种响应,可以响应给定的查询,从而导致对话模型学习繁重。此外,当模型面对毫无意义的响应训练实例时,总体沉闷的响应问题甚至会恶化。从直觉上讲,高质量的响应不仅对给定的查询做出响应,而且还链接到未来的对话中,在本文中,我们利用查询响应意见转弯三元三元,以诱导同时考虑给定上下文和未来对话的生成的响应。为了促进这些三元组的建模,我们进一步提出了一种新型的基于编码器的生成对抗学习框架,后生成对抗网络(后验),该框架由向前和向后的生成歧视器组成,以合作地鼓励通过互补的评估来提供供电的响应,并具有互补的评估。实验结果表明,我们的方法有效地提高了对自动和人类评估的响应的信息性和连贯性,这验证了考虑两个评估观点的优势。
Neural conversational models learn to generate responses by taking into account the dialog history. These models are typically optimized over the query-response pairs with a maximum likelihood estimation objective. However, the query-response tuples are naturally loosely coupled, and there exist multiple responses that can respond to a given query, which leads the conversational model learning burdensome. Besides, the general dull response problem is even worsened when the model is confronted with meaningless response training instances. Intuitively, a high-quality response not only responds to the given query but also links up to the future conversations, in this paper, we leverage the query-response-future turn triples to induce the generated responses that consider both the given context and the future conversations. To facilitate the modeling of these triples, we further propose a novel encoder-decoder based generative adversarial learning framework, Posterior Generative Adversarial Network (Posterior-GAN), which consists of a forward and a backward generative discriminator to cooperatively encourage the generated response to be informative and coherent by two complementary assessment perspectives. Experimental results demonstrate that our method effectively boosts the informativeness and coherence of the generated response on both automatic and human evaluation, which verifies the advantages of considering two assessment perspectives.