Leveraging Discrete Function Decomposability for Scientific Design

James C. Bowden; Sergey Levine; Jennifer Listgarten

arXiv:2511.03032·cs.LG·March 3, 2026

Leveraging Discrete Function Decomposability for Scientific Design

James C. Bowden, Sergey Levine, Jennifer Listgarten

PDF

Open Access 3 Reviews

TL;DR

This paper introduces DADO, a novel distributional optimization algorithm that leverages the decomposability of property predictors in scientific design tasks to improve efficiency in discrete object optimization.

Contribution

The paper presents DADO, a new algorithm that exploits decomposability via junction trees and graph message-passing to enhance optimization of discrete designs.

Findings

01

DADO outperforms existing methods in efficiency and effectiveness.

02

Leveraging decomposability improves optimization in complex design spaces.

03

The approach is applicable to various scientific design problems.

Abstract

In the era of AI-driven science and engineering, we often want to design discrete objects in silico according to user-specified properties. For example, we may wish to design a protein to bind its target, arrange components within a circuit to minimize latency, or find materials with certain properties. Given a property predictive model, in silico design typically involves training a generative model over the design space (e.g., protein sequence space) to concentrate on designs with the desired properties. Distributional optimization $\unicode x 2013$ which can be formalized as an estimation of distribution algorithm or as reinforcement learning policy optimization $\unicode x 2013$ finds the generative model that maximizes an objective function in expectation. Optimizing a distribution over discrete-valued designs is in general challenging because of the combinatorial nature of the design…

Peer Reviews

Decision·ICLR 2026 Poster

Reviewer 01Rating 6Confidence 2

Strengths

- DADO is well-motivated and extensively derived - DADO clearly converges faster than standard EDA - DADO is a novel, more efficient algorithm than fills a gap in the literature

Weaknesses

- This paper is hard to read due to large amounts of text, in-line math, and few subsections. - The evaluation is limited to three synthetic functions and four learned protein property functions - Standard EDA that is unaware of function decomposability can outperform DADO in the absence of ad hoc hyperparameter tuning. - The GB1 evaluation appears prematurely ended, as the EDA does not appear to converge by the final training iteration Typo:263 shaing (shaping)

Reviewer 02Rating 2Confidence 3

Strengths

The authors are focused on an important and challenging problem in scientific design, as the design space in such settings tends to be discrete or combinatorial and therefore is challenging to explore or optimize over. Their focus on reducing the combinatorial complexity of this search problem to one that is significantly more manageable is a worthwhile pursuit and has real applications. The authors motivated the problem well and provided strong justification for an approach such as theirs for m

Weaknesses

The requirement that the decompositional form be known is a very strong one in practice. Indeed, in the protein property prediction experiments, the authors resorted to a very specifically designed predictor to make use of this decomposition--a design which limits the accuracy of the proposed property predictor. Hence, it is unclear to me whether this method has much utility to the community. It would be helpful if the authors could better profile the impact of the decomposition on the accuracy

Reviewer 03Rating 4Confidence 3

Strengths

1. Unlike standard Estimation of Distribution Algorithms (EDAs) or reinforcement learning (RL) policy optimization, DADO explicitly leverages the decomposability of objective functions (via junction trees) to avoid optimizing over intractable full combinatorial spaces. This is a departure from "black-box" optimization approaches that treat the objective as monolithic. 2. It generalizes classical max-plus message passing (used for exact global optimization) to distributional optimization, replac

Weaknesses

1. The paper relies heavily on structure-based decomposability (AlphaFold3 contact graphs) for proteins but does not explore other practical sources of decomposability: For example, in protein design, decomposability could also be derived from sequence homology (conserved vs. variable regions) or functional annotations (binding sites vs. structural loops). Similarly, in circuit design, decomposability might come from modular components. 2. The paper empirically shows that "loose" decomposability

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science · Advanced Multi-Objective Optimization Algorithms · Protein Structure and Dynamics