# Emergence of number sense through the integration of multimodal information: developmental learning insights from neural network models

**Authors:** Kamma Noda, Takafumi Soda, Yuichi Yamashita

PMC · DOI: 10.3389/fnins.2024.1330512 · Frontiers in Neuroscience · 2024-01-17

## TL;DR

This study shows that combining different types of information improves number sense and math skills, using neural networks to model how the brain learns.

## Contribution

The study experimentally links multimodal learning to enhanced mathematical abilities through neural network models.

## Key findings

- Multimodal training improves latent representation quality compared to single-modal methods.
- Multimodal representations lead to better performance in arithmetic tasks.
- The study connects multimodal learning to cognitive functions like math skills.

## Abstract

Associating multimodal information is essential for human cognitive abilities including mathematical skills. Multimodal learning has also attracted attention in the field of machine learning, and it has been suggested that the acquisition of better latent representation plays an important role in enhancing task performance. This study aimed to explore the impact of multimodal learning on representation, and to understand the relationship between multimodal representation and the development of mathematical skills.

We employed a multimodal deep neural network as the computational model for multimodal associations in the brain. We compared the representations of numerical information, that is, handwritten digits and images containing a variable number of geometric figures learned through single- and multimodal methods. Next, we evaluated whether these representations were beneficial for downstream arithmetic tasks.

Multimodal training produced better latent representation in terms of clustering quality, which is consistent with previous findings on multimodal learning in deep neural networks. Moreover, the representations learned using multimodal information exhibited superior performance in arithmetic tasks.

Our novel findings experimentally demonstrate that changes in acquired latent representations through multimodal association learning are directly related to cognitive functions, including mathematical skills. This supports the possibility that multimodal learning using deep neural network models may offer novel insights into higher cognitive functions.

## Full-text entities

- **Genes:** CLIP1 (CAP-Gly domain containing linker protein 1) [NCBI Gene 6249] {aka CLIP, CLIP-170, CLIP170, CYLN1, RSN}
- **Diseases:** Confusions (MESH:D003221), neurodegeneration (MESH:D019636), Synesthesia (MESH:D000080311), mental disorders (MESH:D001523), semantic dementia (MESH:D057180), MMA (MESH:D018886), prosopagnosia (MESH:D020238), savant syndrome (MESH:C000721847), neurodevelopmental disorders (MESH:D002658), calculation disorder (MESH:D009358)
- **Species:** Homo sapiens (human, species) [taxon 9606], Cercopithecidae (monkey, family) [taxon 9527]
- **Cell lines:** S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC10828047/full.md

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/PMC10828047/full.md

## References

55 references — full list in the complete paper: https://tomesphere.com/paper/PMC10828047/full.md

---
Source: https://tomesphere.com/paper/PMC10828047