Mixed-modality Representation Learning and Pre-training for Joint Table-and-Text Retrieval in OpenQA
Junjie Huang, Wanjun Zhong, Qian Liu, Ming Gong, Daxin Jiang, Nan, Duan

TL;DR
This paper introduces OTTeR, a novel joint table-and-text retriever for open-domain QA that leverages mixed-modality representation learning and synthetic pre-training to improve evidence retrieval accuracy.
Contribution
The paper proposes a new retriever model with enhanced mixed-modality learning and pre-training strategies, addressing data sparsity and modality discrepancy issues.
Findings
OTTeR significantly outperforms previous methods on OTT-QA dataset.
The proposed mechanisms effectively improve retrieval performance.
The system achieves state-of-the-art results in open-domain QA.
Abstract
Retrieving evidences from tabular and textual resources is essential for open-domain question answering (OpenQA), which provides more comprehensive information. However, training an effective dense table-text retriever is difficult due to the challenges of table-text discrepancy and data sparsity problem. To address the above challenges, we introduce an optimized OpenQA Table-Text Retriever (OTTeR) to jointly retrieve tabular and textual evidences. Firstly, we propose to enhance mixed-modality representation learning via two mechanisms: modality-enhanced representation and mixed-modality negative sampling strategy. Secondly, to alleviate data sparsity problem and enhance the general retrieval ability, we conduct retrieval-centric mixed-modality synthetic pre-training. Experimental results demonstrate that OTTeR substantially improves the performance of table-and-text retrieval on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Expert finding and Q&A systems
