Visual Evolutionary Optimization on Graph-Structured Combinatorial Problems with MLLMs: A Case Study of Influence Maximization
Jie Zhao, Kang Hao Cheong

TL;DR
This paper introduces visual evolutionary optimization (VEO), a novel framework that uses multimodal large language models to solve complex graph-structured problems by representing solutions as images, improving scalability and effectiveness.
Contribution
The work presents a new VEO framework leveraging MLLMs with a context-aware image encoding scheme for graph problems, enhancing structural understanding and scalability.
Findings
VEO outperforms traditional evolutionary algorithms in influence maximization tasks.
Graph sparsification improves VEO's scalability on large networks.
VEO demonstrates high reliability across diverse real-world networks.
Abstract
Graph-structured combinatorial problems in complex networks are prevalent in many domains, and are computationally demanding due to their complexity and non-linear nature. Traditional evolutionary algorithms (EAs), while robust, often face obstacles due to content-shallow encoding limitations and lack of structural awareness, necessitating hand-crafted modifications for effective application. In this work, we introduce an original framework, visual evolutionary optimization (VEO), leveraging multimodal large language models (MLLMs) as the backbone evolutionary optimizer in this context. Specifically, we propose a context-aware encoding scheme, representing the solution of the network as an image. In this manner, we can utilize MLLMs' image processing capabilities to intuitively comprehend network configurations, thus enabling machines to solve these problems in a human-like way. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
