Mitigating the Negative Impact of Over-association for Conversational Query Production
Ante Wang, Linfeng Song, Zijun Min, Ge Xu, Xiaoli Wang, Junfeng Yao, and Jinsong Su

TL;DR
This paper identifies the over-association problem in conversational query generation, which causes models to miss important concepts, and proposes instance-level weighting strategies to improve performance and data efficiency.
Contribution
It introduces novel weighting strategies to mitigate over-association effects in pretrained Seq2seq models for conversational query production.
Findings
Significant performance improvements on Wizard-of-Internet and DuSinc benchmarks.
Models select better concepts from dialogue histories.
Achieves 10 times more data efficiency than baseline.
Abstract
Conversational query generation aims at producing search queries from dialogue histories, which are then used to retrieve relevant knowledge from a search engine to help knowledge-based dialogue systems. Trained to maximize the likelihood of gold queries, previous models suffer from the data hunger issue, and they tend to both drop important concepts from dialogue histories and generate irrelevant concepts at inference time. We attribute these issues to the over-association phenomenon where a large number of gold queries are indirectly related to the dialogue topics, because annotators may unconsciously perform reasoning with their background knowledge when generating these gold queries. We carefully analyze the negative effects of this phenomenon on pretrained Seq2seq query producers and then propose effective instance-level weighting strategies for training to mitigate these issues…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Expert finding and Q&A systems · Information Retrieval and Search Behavior
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory · Sequence to Sequence
