# Learning in Probabilistic Boolean Networks via Structural Policy Gradients

**Authors:** Pedro Juan Rivera Torres

PMC · DOI: 10.3390/e27111150 · 2025-11-13

## TL;DR

This paper introduces a new way to train interpretable rule-based models that perform as well as neural networks on various tasks.

## Contribution

The novel approach uses structural policy gradients to train Probabilistic Boolean Networks with interpretable logic.

## Key findings

- Learning PBNs achieve ANN-level performance on classification, regression, and reinforcement learning tasks.
- The models maintain interpretability while achieving high accuracy and low error rates.
- Learned logic becomes stable during training and performs well in noisy/tabular data regimes.

## Abstract

We revisit Probabilistic Boolean Networks as trainable function approximators. The key obstacle, non-differentiable structural choices (which predictors to read and which Boolean operators to apply), is addressed by casting the PBN’s structure as a stochastic policy whose parameters are optimized with score-function (REINFORCE) gradients. Continuous output heads (logistic/linear/softmax or policy logits) are trained with ordinary gradients. We call the resulting model a Learning PBN. We formalize the Learning Probabilistic Boolean Network, derive unbiased structural gradients with variance reduction, and prove a universal approximation property over discretized inputs. Empirically, Learning Probabilistic Boolean Networks approach ANN performance across classification (accuracy ↑), regression (RMSE ↓), representation quality via clustering (ARI ↑), and reinforcement learning (return ↑) while yielding interpretable, rule-like internal units. We analyze the effect of binning resolution, operator sets, and unit counts, and show how the learned logic stabilizes as training progresses. Our results indicate that PBNs can serve as general-purpose learners, competitive with ANNs in tabular/noisy regimes, without sacrificing interpretability.

## Full-text entities

- **Genes:** CDH1 (cadherin 1) [NCBI Gene 999] {aka Arc-1, BCDS1, CD324, CDHE, ECAD, LCAM}, RB1 (RB transcriptional corepressor 1) [NCBI Gene 5925] {aka OSRC, PPP1R130, RB, p105-Rb, p110-RB1, pRb}, PCNA (proliferating cell nuclear antigen) [NCBI Gene 5111] {aka ATLD2}, XDH (xanthine dehydrogenase) [NCBI Gene 7498] {aka XAN1, XDH/XO, XO, XOR}, DCTN6 (dynactin subunit 6) [NCBI Gene 10671] {aka WS-3, WS3, p27}, UBE2C (ubiquitin conjugating enzyme E2 C) [NCBI Gene 11065] {aka UBCH10, dJ447F3.2}, CDC20 (cell division cycle 20) [NCBI Gene 991] {aka CDC20A, OOMD14, OZEMA14, bA276H19.3, p55CDC}, CCNA2 (cyclin A2) [NCBI Gene 890] {aka CCN1, CCNA}
- **Diseases:** Binning Granularity B (MESH:D006509), injury to (MESH:D014947), ARI (MESH:D000275), LPBN (MESH:D007859)
- **Chemicals:** CartPole (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12650854/full.md

---
Source: https://tomesphere.com/paper/PMC12650854