Using Large Pretrained Language Models for Answering User Queries from Product Specifications
Kalyani Roy (1), Smit Shah (1), Nithish Pai (2), Jaidam Ramtej (2),, Prajit Prashant Nadkarn (2), Jyotirmoy Banerjee (2), Pawan Goyal (1), and, Surender Kumar (2) ((1) Indian Institute of Technology Kharagpur, (2), Flipkart)

TL;DR
This paper explores using large pretrained language models like XLNet and BERT to automatically identify relevant product specifications for answering user queries on e-commerce sites, improving response accuracy across different product categories.
Contribution
It introduces a method to create training data automatically and demonstrates that pretrained models outperform Siamese models in this task, with cross-vertical generalization.
Findings
Pretrained models outperform Siamese models in accuracy.
Models trained on one vertical generalize well to others.
Automatic dataset creation is effective for this task.
Abstract
While buying a product from the e-commerce websites, customers generally have a plethora of questions. From the perspective of both the e-commerce service provider as well as the customers, there must be an effective question answering system to provide immediate answers to the user queries. While certain questions can only be answered after using the product, there are many questions which can be answered from the product specification itself. Our work takes a first step in this direction by finding out the relevant product specifications, that can help answering the user questions. We propose an approach to automatically create a training dataset for this problem. We utilize recently proposed XLNet and BERT architectures for this problem and find that they provide much better performance than the Siamese model, previously applied for this problem. Our model gives a good performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Weight Decay · Softmax · Attention Dropout · Byte Pair Encoding · Layer Normalization · Attention Is All You Need · WordPiece · Residual Connection · BERT
