Improving Text Auto-Completion with Next Phrase Prediction
Dong-Ho Lee, Zhiqiang Hu, Roy Ka-Wei Lee

TL;DR
This paper introduces Next Phrase Prediction, a self-supervised training method to improve domain-specific text auto-completion by enhancing pre-trained language models, demonstrated through experiments in email and academic writing.
Contribution
Proposes a novel self-supervised Next Phrase Prediction objective to adapt language models for domain-specific auto-completion tasks efficiently.
Findings
Outperforms baseline models in email auto-completion
Enhances academic writing auto-completion accuracy
Enables faster domain adaptation of language models
Abstract
Language models such as GPT-2 have performed well on constructing syntactically sound sentences for text auto-completion task. However, such models often require considerable training effort to adapt to specific writing domains (e.g., medical). In this paper, we propose an intermediate training strategy to enhance pre-trained language models' performance in the text auto-completion task and fastly adapt them to specific domains. Our strategy includes a novel self-supervised training objective called Next Phrase Prediction (NPP), which encourages a language model to complete the partial query with enriched phrases and eventually improve the model's text auto-completion performance. Preliminary experiments have shown that our approach is able to outperform the baselines in auto-completion for email and academic writing domains.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Adam · Dense Connections · Cosine Annealing · Dropout · Weight Decay · Attention Dropout
