Exploring possible vector systems for faster training of neural networks with preconfigured latent spaces
Nikita Gabdullin

TL;DR
This paper investigates various vector systems for configuring latent spaces in neural networks, demonstrating that optimal configurations can accelerate training and reduce storage needs, especially for large-scale datasets.
Contribution
It provides a comprehensive overview of vector systems for latent space configuration and shows how they can be used to speed up training of neural networks on large datasets.
Findings
Predefined vector systems can significantly speed up neural network training.
Using minimal latent space dimensions accelerates convergence.
Optimized vector configurations reduce storage for neural network embeddings.
Abstract
The overall neural network (NN) performance is closely related to the properties of its embedding distribution in latent space (LS). It has recently been shown that predefined vector systems, specifically An root system vectors, can be used as targets for latent space configurations (LSC) to ensure the desired LS structure. One of the main LSC advantage is the possibility of training classifier NNs without classification layers, which facilitates training NNs on datasets with extremely large numbers of classes. This paper provides a more general overview of possible vector systems for NN training along with their properties and methods for vector system construction. These systems are used to configure LS of encoders and visual transformers to significantly speed up ImageNet-1K and 50k-600k classes LSC training. It is also shown that using the minimum number of LS dimensions for a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Neural Networks and Applications · Advanced Statistical Modeling Techniques
