Hybrid Autoregressive Inference for Scalable Multi-hop Explanation Regeneration
Marco Valentino, Mokanarangan Thayaparan, Deborah Ferreira, Andr\'e, Freitas

TL;DR
This paper introduces SCAR, a hybrid inference framework combining bi-encoders and sparse models to efficiently regenerate multi-hop scientific explanations at scale, outperforming previous models in speed and quality.
Contribution
The paper presents SCAR, a novel hybrid bi-encoder and sparse model framework that enables scalable, efficient, and high-quality multi-hop explanation regeneration.
Findings
SCAR outperforms previous sparse models in explanation quality.
SCAR achieves comparable performance to state-of-the-art cross-encoders.
SCAR is approximately 50 times faster and scalable to millions of facts.
Abstract
Regenerating natural language explanations in the scientific domain has been proposed as a benchmark to evaluate complex multi-hop and explainable inference. In this context, large language models can achieve state-of-the-art performance when employed as cross-encoder architectures and fine-tuned on human-annotated explanations. However, while much attention has been devoted to the quality of the explanations, the problem of performing inference efficiently is largely under-studied. Cross-encoders, in fact, are intrinsically not scalable, possessing limited applicability to real-world scenarios that require inference on massive facts banks. To enable complex multi-hop reasoning at scale, this paper focuses on bi-encoder architectures, investigating the problem of scientific explanation regeneration at the intersection of dense and sparse models. Specifically, we present SCAR (for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Machine Learning in Healthcare
