# Deriving equivalent symbol-based decision models from feedforward neural networks

**Authors:** Sebastian Seidel, Uwe M. Borghoff

PMC · DOI: 10.3389/frai.2025.1618149 · 2025-07-24

## TL;DR

This paper introduces a method to convert opaque neural networks into interpretable decision trees, making AI systems more transparent and trustworthy.

## Contribution

A systematic approach to derive symbolic decision models from feedforward neural networks, enhancing interpretability and trust in AI.

## Key findings

- Symbolic models like decision trees can be derived from FNNs while preserving functionality.
- A prototype using Keras and TensorFlow demonstrates the feasibility of the proposed method.
- The approach enables scalability to deeper networks through iterative refinement of subpaths.

## Abstract

Artificial intelligence (AI) has emerged as a transformative force across industries, driven by advances in deep learning and natural language processing, and fueled by large-scale data and computing resources. Despite its rapid adoption, the opacity of AI systems poses significant challenges to trust and acceptance. This work explores the intersection of connectionist and symbolic approaches to artificial intelligence, focusing on the derivation of interpretable symbolic models, such as decision trees, from feedforward neural networks (FNNs). Decision trees provide a transparent framework for elucidating the operations of neural networks while preserving their functionality. The derivation is presented in a step-by-step approach and illustrated with several examples. A systematic methodology is proposed to bridge neural and symbolic paradigms by exploiting distributed representations in FNNs to identify symbolic components, including fillers, roles, and their interrelationships. The process traces neuron activation values and input configurations across network layers, mapping activations and their underlying inputs to decision tree edges. The resulting symbolic structures effectively capture FNN decision processes and enable scalability to deeper networks through iterative refinement of subpaths for each hidden layer. To validate the theoretical framework, a prototype was developed using Keras .h5-data and emulating TensorFlow within the Java JDK/JavaFX environment. This prototype demonstrates the feasibility of extracting symbolic representations from neural networks, enhancing trust in AI systems, and promoting accountability.

## Full-text entities

- **Diseases:** XAI (MESH:C538243), AI (MESH:C538142)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12329772/full.md

---
Source: https://tomesphere.com/paper/PMC12329772