GENIUS: Generative Fluid Intelligence Evaluation Suite

Ruichuan An; Sihan Yang; Ziyu Guo; Wei Dai; Zijun Shen; Haodong Li; Renrui Zhang; Xinyu Wei; Guopeng Li; Wenshan Wu; Wentao Zhang

arXiv:2602.11144·cs.LG·February 12, 2026

GENIUS: Generative Fluid Intelligence Evaluation Suite

Ruichuan An, Sihan Yang, Ziyu Guo, Wei Dai, Zijun Shen, Haodong Li, Renrui Zhang, Xinyu Wei, Guopeng Li, Wenshan Wu, Wentao Zhang

PDF

Open Access

TL;DR

GENIUS introduces a new benchmark suite to evaluate Generative Fluid Intelligence in multimodal models, focusing on their ability to induce patterns, execute constraints, and adapt to new contexts, revealing current limitations and proposing diagnostic strategies.

Contribution

This paper formalizes GFI as three primitives, creates the GENIUS benchmark for assessment, and provides diagnostic insights and interventions to improve models' dynamic reasoning capabilities.

Findings

01

Models show significant deficits in GFI tasks.

02

Performance issues are due to limited context understanding, not generative ability.

03

Proposed attention intervention improves model reasoning without additional training.

Abstract

Unified Multimodal Models (UMMs) have shown remarkable progress in visual generation. Yet, existing benchmarks predominantly assess $Crystallized Intelligence$ , which relies on recalling accumulated knowledge and learned schemas. This focus overlooks $Generative Fluid Intelligence (GFI)$ : the capacity to induce patterns, reason through constraints, and adapt to novel scenarios on the fly. To rigorously assess this capability, we introduce $GENIUS$ ( $GEN$ Fluid $I$ ntelligence Eval $U$ ation $S$ uite). We formalize $GFI$ as a synthesis of three primitives. These include $Inducing Implicit Patterns$ (e.g., inferring personalized visual preferences), $Executing Ad-hoc Constraints$ (e.g., visualizing abstract metaphors), and $Adapting to Contextual Knowledge$ (e.g., simulating counter-intuitive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Artificial Intelligence in Games