TL;DR
AdaMCOT introduces an adaptive reasoning framework that improves multilingual factual reasoning and cross-lingual consistency in large language models without additional pretraining.
Contribution
It proposes a dynamic routing mechanism through intermediary 'thinking languages' to enhance multilingual reasoning without extra pretraining.
Findings
Significant improvements in factual reasoning across multiple benchmarks.
Enhanced cross-lingual consistency, especially in low-resource languages.
Deeper understanding of reasoning pathways through analysis of hidden states.
Abstract
Large language models (LLMs) have shown impressive multilingual capabilities through pretraining on diverse corpora. Although these models show strong reasoning abilities, their performance varies significantly between languages due to the imbalanced distribution of training data. Existing approaches using sample-level translation for extensive multilingual pretraining and cross-lingual tuning face scalability challenges and often fail to capture nuanced reasoning processes across languages. In this paper, we introduce AdaMCOT (Adaptive Multilingual Chain-of-Thought), a framework that enhances multilingual factual reasoning by dynamically routing thought processes in intermediary "thinking languages" before generating target-language responses. AdaMCOT leverages a language-agnostic core and incorporates an adaptive, reward-based mechanism for selecting optimal reasoning pathways without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
