Leveraging Discrete Function Decomposability for Scientific Design
James C. Bowden, Sergey Levine, Jennifer Listgarten

TL;DR
This paper introduces DADO, a novel distributional optimization algorithm that leverages the decomposability of property predictors in scientific design tasks to improve efficiency in discrete object optimization.
Contribution
The paper presents DADO, a new algorithm that exploits decomposability via junction trees and graph message-passing to enhance optimization of discrete designs.
Findings
DADO outperforms existing methods in efficiency and effectiveness.
Leveraging decomposability improves optimization in complex design spaces.
The approach is applicable to various scientific design problems.
Abstract
In the era of AI-driven science and engineering, we often want to design discrete objects in silico according to user-specified properties. For example, we may wish to design a protein to bind its target, arrange components within a circuit to minimize latency, or find materials with certain properties. Given a property predictive model, in silico design typically involves training a generative model over the design space (e.g., protein sequence space) to concentrate on designs with the desired properties. Distributional optimizationwhich can be formalized as an estimation of distribution algorithm or as reinforcement learning policy optimizationfinds the generative model that maximizes an objective function in expectation. Optimizing a distribution over discrete-valued designs is in general challenging because of the combinatorial nature of the design…
Peer Reviews
Decision·ICLR 2026 Poster
- DADO is well-motivated and extensively derived - DADO clearly converges faster than standard EDA - DADO is a novel, more efficient algorithm than fills a gap in the literature
- This paper is hard to read due to large amounts of text, in-line math, and few subsections. - The evaluation is limited to three synthetic functions and four learned protein property functions - Standard EDA that is unaware of function decomposability can outperform DADO in the absence of ad hoc hyperparameter tuning. - The GB1 evaluation appears prematurely ended, as the EDA does not appear to converge by the final training iteration Typo:263 shaing (shaping)
The authors are focused on an important and challenging problem in scientific design, as the design space in such settings tends to be discrete or combinatorial and therefore is challenging to explore or optimize over. Their focus on reducing the combinatorial complexity of this search problem to one that is significantly more manageable is a worthwhile pursuit and has real applications. The authors motivated the problem well and provided strong justification for an approach such as theirs for m
The requirement that the decompositional form be known is a very strong one in practice. Indeed, in the protein property prediction experiments, the authors resorted to a very specifically designed predictor to make use of this decomposition--a design which limits the accuracy of the proposed property predictor. Hence, it is unclear to me whether this method has much utility to the community. It would be helpful if the authors could better profile the impact of the decomposition on the accuracy
1. Unlike standard Estimation of Distribution Algorithms (EDAs) or reinforcement learning (RL) policy optimization, DADO explicitly leverages the decomposability of objective functions (via junction trees) to avoid optimizing over intractable full combinatorial spaces. This is a departure from "black-box" optimization approaches that treat the objective as monolithic. 2. It generalizes classical max-plus message passing (used for exact global optimization) to distributional optimization, replac
1. The paper relies heavily on structure-based decomposability (AlphaFold3 contact graphs) for proteins but does not explore other practical sources of decomposability: For example, in protein design, decomposability could also be derived from sequence homology (conserved vs. variable regions) or functional annotations (binding sites vs. structural loops). Similarly, in circuit design, decomposability might come from modular components. 2. The paper empirically shows that "loose" decomposability
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Advanced Multi-Objective Optimization Algorithms · Protein Structure and Dynamics
