TL;DR
ARMOR is a novel framework that adaptively combines multiple AI tools for reaction feasibility prediction, significantly improving accuracy especially on conflicting cases by modeling tool utilities and resolving conflicts.
Contribution
It introduces an agentic, utility-aware hierarchical approach to effectively leverage multiple tools for reaction prediction, outperforming existing aggregation methods.
Findings
ARMOR outperforms single-tool and baseline aggregation methods.
The framework shows significant improvements on reactions with conflicting tool predictions.
Extensive experiments validate the effectiveness of the hierarchical, utility-aware approach.
Abstract
Reaction feasibility prediction, as a fundamental problem in computational chemistry, has benefited from diverse tools enabled by recent advances in artificial intelligence, particularly large language models. However, the performance of individual tools varies substantially across reactions, making it difficult for any single tool to consistently perform well across all cases. This raises a critical challenge: how to effectively leverage multiple tools to obtain more accurate feasibility predictions. To address this, we propose ARMOR, an agentic framework that explicitly models tool-specific utilities, adaptively prioritizes tools, and further resolves the potential tool conflicts to produce the final prediction for each reaction. Unlike existing approaches that rely on simple aggregation or heuristic assignment over various tools, ARMOR organizes tools into a hierarchy that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
