Generative Maximum Entropy Learning for Multiclass Classification

Ambedkar Dukkipati; Gaurav Pandey; Debarghya Ghoshdastidar; Paramita; Koley; D. M. V. Satya Sriram

arXiv:1205.0651·cs.IT·December 31, 2013

Generative Maximum Entropy Learning for Multiclass Classification

Ambedkar Dukkipati, Gaurav Pandey, Debarghya Ghoshdastidar, Paramita, Koley, D. M. V. Satya Sriram

PDF

TL;DR

This paper introduces a generative maximum entropy classification method with feature selection for high-dimensional data, utilizing divergence measures to discriminate class densities and extending to multi-class problems with computational efficiency.

Contribution

It proposes a novel generative maximum entropy classification approach that employs divergence-based feature selection and extends to multi-class cases using Jensen-Shannon divergence.

Findings

01

Effective on large text datasets

02

Scales well with high-dimensional data

03

Outperforms existing methods in accuracy

Abstract

Maximum entropy approach to classification is very well studied in applied statistics and machine learning and almost all the methods that exists in literature are discriminative in nature. In this paper, we introduce a maximum entropy classification method with feature selection for large dimensional data such as text datasets that is generative in nature. To tackle the curse of dimensionality of large data sets, we employ conditional independence assumption (Naive Bayes) and we perform feature selection simultaneously, by enforcing a `maximum discrimination' between estimated class conditional densities. For two class problems, in the proposed method, we use Jeffreys ( $J$ ) divergence to discriminate the class conditional densities. To extend our method to the multi-class case, we propose a completely new approach by considering a multi-distribution divergence: we replace Jeffreys…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.