A Syntax Aware BERT for Identifying Well-Formed Queries in a Curriculum Framework
Avinash Madasu, Anvesh Rao Vijjini

TL;DR
This paper introduces a syntax-aware BERT model enhanced with parts-of-speech information and curriculum learning techniques to effectively identify well-formed queries, significantly outperforming previous methods and approaching human performance.
Contribution
It proposes a novel syntax-aware BERT model with curriculum learning for query validation, achieving state-of-the-art accuracy in the task.
Findings
Achieved 83.93% accuracy, surpassing previous 75%.
Incorporating POS info improves model performance.
Curriculum learning techniques enhance the model's effectiveness.
Abstract
A well formed query is defined as a query which is formulated in the manner of an inquiry, and with correct interrogatives, spelling and grammar. While identifying well formed queries is an important task, few works have attempted to address it. In this paper we propose transformer based language model - Bidirectional Encoder Representations from Transformers (BERT) to this task. We further imbibe BERT with parts-of-speech information inspired from earlier works. Furthermore, we also train the model in multiple curriculum settings for improvement in performance. Curriculum Learning over the task is experimented with Baby Steps and One Pass techniques. Proposed architecture performs exceedingly well on the task. The best approach achieves accuracy of 83.93%, outperforming previous state-of-the-art at 75.0% and reaching close to the approximate human upper bound of 88.4%.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Online Learning and Analytics
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Refunds@Expedia|||How do I get a full refund from Expedia? · Residual Connection · WordPiece · Linear Warmup With Linear Decay · Attention Dropout · Dropout · Softmax
