Domain-oriented Language Pre-training with Adaptive Hybrid Masking and Optimal Transport Alignment
Denghui Zhang, Zixuan Yuan, Yanchi Liu, Hao Liu, Fuzhen Zhuang, Hui, Xiong, Haifeng Chen

TL;DR
This paper proposes a domain-oriented pre-training framework that enhances language models by incorporating adaptive hybrid masking for phrase knowledge and optimal transport-based entity alignment, leading to improved domain-specific NLP performance.
Contribution
It introduces a generalized approach combining adaptive hybrid masking and optimal transport alignment to better capture phrase and entity-level knowledge in domain pre-training.
Findings
Outperforms baseline models on four domain-specific NLP tasks.
Effectively preserves phrase-level knowledge with the hybrid masking strategy.
Enhances semantic learning through entity alignment guided by optimal transport.
Abstract
Motivated by the success of pre-trained language models such as BERT in a broad range of natural language processing (NLP) tasks, recent research efforts have been made for adapting these models for different application domains. Along this line, existing domain-oriented models have primarily followed the vanilla BERT architecture and have a straightforward use of the domain corpus. However, domain-oriented tasks usually require accurate understanding of domain phrases, and such fine-grained phrase-level knowledge is hard to be captured by existing pre-training scheme. Also, the word co-occurrences guided semantic learning of pre-training models can be largely augmented by entity-level association knowledge. But meanwhile, by doing so there is a risk of introducing noise due to the lack of groundtruth word-level alignment. To address the above issues, we provide a generalized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Attention Dropout · WordPiece · Weight Decay · Softmax · Residual Connection · Adam · Dropout
