Peek Across: Improving Multi-Document Modeling via Cross-Document Question-Answering
Avi Caciularu, Matthew E. Peters, Jacob Goldberger, Ido Dagan, Arman, Cohan

TL;DR
This paper introduces a novel cross-document question answering pre-training objective that enhances multi-document language models' ability to understand and generate across related texts, improving performance on various tasks.
Contribution
The paper proposes a new pre-training method using cross-document QA to improve multi-document modeling, enabling better informational relation recovery and task versatility.
Findings
Up to 7% performance improvement on multi-document tasks
Significantly outperforms zero-shot GPT-3.5 and GPT-4
Enhances both short and long text generation capabilities
Abstract
The integration of multi-document pre-training objectives into language models has resulted in remarkable improvements in multi-document downstream tasks. In this work, we propose extending this idea by pre-training a generic multi-document model from a novel cross-document question answering pre-training objective. To that end, given a set (or cluster) of topically-related documents, we systematically generate semantically-oriented questions from a salient sentence in one document and challenge the model, during pre-training, to answer these questions while "peeking" into other topically-related documents. In a similar manner, the model is also challenged to recover the sentence from which the question was generated, again while leveraging cross-document information. This novel multi-document QA formulation directs the model to better recover cross-text informational relations, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning in Healthcare
Methods{Dispute@FaQ-s}How to file a dispute with Expedia? · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Label Smoothing · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Cosine Annealing · Transformer · Softmax · Layer Normalization
