Separability is not the best goal for machine learning
Wlodzislaw Duch

TL;DR
This paper argues that shifting the learning goal from linear separability to k-separability simplifies the transformation of complex data distributions in neural networks, potentially reducing the need for deep architectures.
Contribution
It introduces the concept of k-separability as an alternative learning goal, enabling simpler solutions for complex data and Boolean problems like parity.
Findings
k-separability simplifies learning complex data distributions
Linear projection combined with k-separability solves Boolean problems efficiently
Replacing deep layers with k-separability targets can streamline neural network training
Abstract
Neural networks use their hidden layers to transform input data into linearly separable data clusters, with a linear or a perceptron type output layer making the final projection on the line perpendicular to the discriminating hyperplane. For complex data with multimodal distributions this transformation is difficult to learn. Projection on line segments is the simplest extension of linear separability, defining much easier goal for the learning process. Simple problems are 2-separable, but problems with inherent complex logic may be solved in a simple way by -separable projections. The difficulty of learning non-linear data distributions is shifted to separation of line intervals, simplifying the transformation of data by hidden network layers. For classification of difficult Boolean problems, such as the parity problem, linear projection combined with \ksep is sufficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Rough Sets and Fuzzy Logic · Blind Source Separation Techniques
