T-SciQ: Teaching Multimodal Chain-of-Thought Reasoning via Mixed Large Language Model Signals for Science Question Answering

Lei Wang; Yi Hu; Jiabang He; Xing Xu; Ning Liu; Hui Liu; Heng Tao Shen

arXiv:2305.03453 · cs.CL · December 19, 2023 · 1 citation

TL;DR

T-SciQ introduces a novel method that leverages large language model signals to generate high-quality reasoning rationales, enabling smaller models to excel in complex multimodal science question answering tasks with state-of-the-art accuracy.

Contribution

The paper presents T-SciQ, a new approach that uses LLM signals to teach reasoning, reducing reliance on costly human annotations and improving performance on science QA benchmarks.

Findings

01 Achieves 96.18% accuracy on the ScienceQA benchmark.

02 Outperforms the strongest fine-tuned baseline by 4.5%.

03 Shows that LLM-generated rationales can teach much smaller models to perform CoT reasoning in complex modalities.

Abstract

Large Language Models (LLMs) have recently demonstrated exceptional performance in various Natural Language Processing (NLP) tasks. They have also shown the ability to perform chain-of-thought (CoT) reasoning to solve complex problems. Recent studies have explored CoT reasoning in complex multimodal scenarios, such as the science question answering task, by fine-tuning multimodal models with high-quality human-annotated CoT rationales. However, collecting high-quality CoT rationales is usually time-consuming and costly. Moreover, the annotated rationales are often inaccurate because they omit essential external information. To address these issues, we propose a novel method termed T-SciQ that aims at teaching science question answering with LLM signals. The T-SciQ approach generates high-quality CoT rationales as teaching signals, which are then used to train much smaller models to perform CoT…

Code & Models

Repositories

t-sciq/t-sciq
PyTorch · Official

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques