# Nucleus Neural Network: A Data-driven Self-organized Architecture

**Authors:** Jia Liu, Maoguo Gong, Haibo He

arXiv: 1904.04036 · 2019-05-15

## TL;DR

This paper introduces the nucleus neural network (NNN), a flexible architecture that removes layer constraints, and proposes an efficient learning method, demonstrating robustness and improved performance on a modified MNIST dataset.

## Contribution

The paper presents a novel self-organized neural architecture called NNN and an architecture learning method, enhancing learning capability and robustness to irrelevant data.

## Key findings

- NNN architecture is more powerful than traditional layered networks.
- The proposed learning method effectively determines connection weights and biases.
- NNN shows significant improvement on a reconstructed MNIST dataset with background variations.

## Abstract

Artificial neural networks which are inspired from the learning mechanism of brain have achieved great successes in many problems, especially those with deep layers. In this paper, we propose a nucleus neural network (NNN) and corresponding connecting architecture learning method. In a nucleus, there are no regular layers, i.e., a neuron may connect to all the neurons in the nucleus. This type of architecture gets rid of layer limitation and may lead to more powerful learning capability. It is crucial to determine the connections between them given numerous neurons. Based on the principle that more relevant input and output neuron pair deserves higher connecting density, we propose an efficient architecture learning model for the nucleus. Moreover, we improve the learning method for connecting weights and biases given the optimized architecture. We find that this novel architecture is robust to irrelevant components in test data. So we reconstruct a new dataset based on the MNIST dataset where the types of digital backgrounds in training and test sets are different. Experiments demonstrate that the proposed learner achieves significant improvement over traditional learners on the reconstructed data set.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.04036/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/1904.04036/full.md

## References

32 references — full list in the complete paper: https://tomesphere.com/paper/1904.04036/full.md

---
Source: https://tomesphere.com/paper/1904.04036