Optimal Clustering with Dependent Costs in Bayesian Networks
Paul Pao-Yen Wu, Fabrizio Ruggeri, Kerrie Mengersen

TL;DR
This paper introduces DCMAP, a novel algorithm for optimal clustering in Bayesian Networks that efficiently finds near-optimal cluster mappings considering dependent costs, significantly improving computational efficiency in complex models.
Contribution
We develop DCMAP, an algorithm that efficiently identifies all optimal cluster mappings with dependent costs in Bayesian Networks, addressing a critical gap in existing clustering methods.
Findings
DCMAP finds all least cost cluster solutions analytically.
It performs efficiently on large, complex Bayesian Network configurations.
Demonstrated on a seagrass DBN with billions of possible cluster mappings.
Abstract
Background: Clustering of nodes in Bayesian Networks (BNs) and related graphical models such as Dynamic BNs (DBNs) has been demonstrated to enhance computational efficiency and improve model learning. It typically involves partitioning the underlying Directed Acyclic Graph (DAG) into cliques or optimising for some cost or criteria. Objectives: We focus on a critical but understudied aspect of optimal clustering involving cost dependency. This is where inference outcomes and hence clustering costs depend on both nodes within a cluster and the mapping of clusters that are connected by at least one arc. Methods: We propose a novel algorithm called Dependent Cluster MAPping (DCMAP) which can, given an arbitrary, positive cost function, iteratively and rapidly find near-optimal, then optimal cluster mappings. Results: DCMAP is shown analytically to be optimal in terms of finding all of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Bayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference
