Exponential Machines

Alexander Novikov; Mikhail Trofimov; Ivan Oseledets

arXiv:1605.03795·stat.ML·December 11, 2017

Exponential Machines

Alexander Novikov, Mikhail Trofimov, Ivan Oseledets

PDF

3 Repos

TL;DR

Exponential Machines (ExM) introduce a novel tensor factorization approach to model all feature interactions of any order, enabling efficient training and state-of-the-art performance in high-order interaction tasks.

Contribution

The paper presents Exponential Machines, a new model that uses Tensor Train factorization and Riemannian optimization to efficiently capture all feature interactions.

Findings

01

Achieves state-of-the-art results on synthetic high-order interaction data.

02

Performs comparably to high-order factorization machines on MovieLens 100K.

03

Successfully models tensors with up to 2^160 entries.

Abstract

Modeling interactions between features improves the performance of machine learning solutions in many domains (e.g. recommender systems or sentiment analysis). In this paper, we introduce Exponential Machines (ExM), a predictor that models all interactions of every order. The key idea is to represent an exponentially large tensor of parameters in a factorized format called Tensor Train (TT). The Tensor Train format regularizes the model and lets you control the number of underlying parameters. To train the model, we develop a stochastic Riemannian optimization procedure, which allows us to fit tensors with 2^160 entries. We show that the model achieves state-of-the-art performance on synthetic data with high-order interactions and that it works on par with high-order factorization machines on a recommender system dataset MovieLens 100K.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.