Bilinear Models for Machine Learning

Tayssir Doghri; Leszek Szczecinski; Jacob Benesty; Amar Mitiche

arXiv:1912.03354 · cs.CV · December 10, 2019

TL;DR

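This paper introduces bilinear models for machine learning that exploit the structure of the data they process, such as the spatial layout of monochromatic images. As a result, they need significantly fewer parameters to reach the same performance, which the authors illustrate with MNIST classification.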

Contribution

The paper defines and analyzes bilinear operations as a replacement for the conventional linear operation used in many ML building blocks, so that a model exploits the structure of its input and needs fewer parameters.

Findings

01 Bilinear models outperform linear models on MNIST classification.

02 Fewer parameters are needed for similar or better performance.

03 Bilinear operations better capture spatial relationships in images.

Abstract

In this work we define and analyze bilinear models, which replace the conventional linear operation used in many building blocks of machine learning (ML). The main idea is to devise ML algorithms that are adapted to the objects they treat. In the case of monochromatic images, we show that the bilinear operation exploits the structure of the image better than the conventional linear operation, which ignores the spatial relationship between the pixels. This translates into a significantly smaller number of parameters required to yield the same performance. We show numerical examples of classification on the MNIST data set.
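To make the parameter saving concrete, here is a minimal sketch, not the authors' code. It assumes the bilinear operation takes the form a^T X b for an M x N image X, with one weight vector per image dimension. The names `linear_score`, `bilinear_score`, `a`, `b`, and `w` are illustrative and do not come from the paper.

```python
import numpy as np


def linear_score(X, w):
    """Conventional linear operation: flatten the image, then one inner product."""
    return float(w @ X.reshape(-1))


def bilinear_score(X, a, b):
    """Assumed bilinear form a^T X b: linear in a for fixed b, and in b for fixed a."""
    return float(a @ X @ b)


rng = np.random.default_rng(0)
M, N = 28, 28                      # MNIST image size
X = rng.standard_normal((M, N))    # stand-in for one monochromatic image
a = rng.standard_normal(M)         # weights acting on the rows
b = rng.standard_normal(N)         # weights acting on the columns

# The bilinear form equals a linear operation whose weight matrix is the
# rank-one outer product of a and b.
w_equiv = np.outer(a, b).reshape(-1)
assert np.isclose(bilinear_score(X, a, b), linear_score(X, w_equiv))

n_linear_params = M * N            # one weight per pixel
n_bilinear_params = M + N          # one weight per row plus one per column
```

For a 28 x 28 image, this single bilinear unit uses 56 weights where the flattened linear unit uses 784. The check in the sketch also shows the trade-off: the bilinear form is the linear operation restricted to a rank-one weight matrix.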
