# eXCube2: Explainable Brain-Inspired Spiking Neural Network Framework for Emotion Recognition from Audio, Visual and Multimodal Audio–Visual Data

**Authors:** N. K. Kasabov, A. Yang, Z. Wang, I. Abouhassan, A. Kassabova, T. Lappas

PMC · DOI: 10.3390/biomimetics11030208 · Biomimetics · 2026-03-14

## TL;DR

eXCube2 is a brain-inspired AI framework using spiking neural networks to recognize emotions from audio, visual, and combined data, offering better explainability and adaptability.

## Contribution

eXCube2 introduces a novel brain-inspired spiking neural network framework for emotion recognition with explainability and adaptability.

## Key findings

- eXCube2 achieved above 80% accuracy on single-modality data and 88.9% on multimodal data.
- The framework enables brain-inspired mapping of audio and visual inputs for emotion recognition.
- It supports neuromorphic hardware for reduced power consumption and improved performance.

## Abstract

This paper introduces a biomimetic framework and novel brain-inspired AI (BIAI) models based on spiking neural networks (SNNs) for emotional state recognition from audio (speech), visual (face), and integrated multimodal audio–visual data. The developed framework, named eXCube2, uses a three-dimensional SNN architecture NeuCube that is spatially structured according to a human brain template. The BIAI models developed in eXCube2 are trainable on spatio- and spectro-temporal data using brain-inspired learning rules. Such models are explainable in terms of revealing patterns in data and are adaptable to new data. The eXCube2 models are implemented as software systems and tested on speech and video data of subjects expressing emotional states. The use of a brain template for the SNN structure enables brain-inspired tonotopic and stereo mapping of audio inputs, topographic mapping of visual data, and the combined use of both modalities. This novel approach brings AI-based emotional state recognition closer to human perception, provides a better explainability and adaptability than existing AI systems. It also results in a higher or competitive accuracy, even though this was not the main goal here. This is demonstrated through experiments on benchmark datasets, achieving classification accuracy above 80% on single-modality data and 88.9% when multimodal audio–visual data are used, and a “don’t know” output is introduced. The paper further discusses possible applications of the proposed eXCube2 framework to other audio, visual, and audio–visual data for solving challenging problems, such as recognizing emotional states of people from different origins; brain state diagnosis (e.g., Parkinson’s disease, Alzheimer’s disease, ADHD, dementia); measuring response to treatment over time; evaluating satisfaction responses from online clients; cognitive robotics; human–robot interaction; chatbots; and interactive computer games. The SNN-based implementation of BIAI also enables the use of neuromorphic chips and platforms, leading to reduced power consumption, smaller device size, higher performance accuracy, and improved adaptability and explainability. This research shows a step toward building brain-inspired AI systems.

## Linked entities

- **Diseases:** Parkinson’s disease (MONDO:0005180), Alzheimer’s disease (MONDO:0004975), ADHD (MONDO:0007743), dementia (MONDO:0001627)

## Full-text entities

- **Genes:** PTPN6 (protein tyrosine phosphatase non-receptor type 6) [NCBI Gene 5777] {aka HCP, HCPH, HPTP1C, PTP-1C, SH-PTP1, SHP-1}, MMP1 (matrix metallopeptidase 1) [NCBI Gene 4312] {aka CLG}, LIF (LIF interleukin 6 family cytokine) [NCBI Gene 3976] {aka CDF, DIA, HILDA, MLPLI}, PCSK1 (proprotein convertase subtilisin/kexin type 1) [NCBI Gene 5122] {aka BMIQ12, NEC1, PC1, PC1/3, PC3, SPC3}
- **Diseases:** depressed (MESH:D003866), Bipolar Disorder (MESH:D001714), brain diseases (MESH:D001927), dementia (MESH:D003704), injury to (MESH:D014947), ADHD (MESH:D001289), Alzheimer's disease (MESH:D000544), Parkinson's disease (MESH:D010300), SNN (MESH:D031261)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** A 7T
- **Cell lines:** MNI152 — Homo sapiens (Human), Huntington's disease, Induced pluripotent stem cell (CVCL_WR61)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13023482/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13023482/full.md

## References

88 references — full list in the complete paper: https://tomesphere.com/paper/PMC13023482/full.md

---
Source: https://tomesphere.com/paper/PMC13023482