TL;DR
This paper introduces robust local and global context-based pairwise models for sentence ordering, utilizing novel transformer architectures and pre-training strategies, achieving state-of-the-art accuracy and better understanding of pairwise model behavior.
Contribution
It proposes new pairwise ordering strategies with transformer-based encoding and decoding, improving accuracy and understanding over previous global and local context methods.
Findings
Our models outperform previous state-of-the-art methods.
Pre-training ALBERT significantly improves performance over BERT.
Analysis reveals error propagation mechanisms in pairwise models.
Abstract
Sentence Ordering refers to the task of rearranging a set of sentences into the appropriate coherent order. For this task, most previous approaches have explored global context-based end-to-end methods using Sequence Generation techniques. In this paper, we put forward a set of robust local and global context-based pairwise ordering strategies, leveraging which our prediction strategies outperform all previous works in this domain. Our proposed encoding method utilizes the paragraph's rich global contextual information to predict the pairwise order using novel transformer architectures. Analysis of the two proposed decoding strategies helps better explain error propagation in pairwise models. This approach is the most accurate pure pairwise model and our encoding strategy also significantly improves the performance of other recent approaches that use pairwise models, including the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Linear Layer · Weight Decay · Adam · Multi-Head Attention · Residual Connection · Dropout · WordPiece · Layer Normalization · Attention Dropout
