Generative Maximum Entropy Learning for Multiclass Classification
Ambedkar Dukkipati, Gaurav Pandey, Debarghya Ghoshdastidar, Paramita, Koley, D. M. V. Satya Sriram

TL;DR
This paper introduces a generative maximum entropy classification method with feature selection for high-dimensional data, utilizing divergence measures to discriminate class densities and extending to multi-class problems with computational efficiency.
Contribution
It proposes a novel generative maximum entropy classification approach that employs divergence-based feature selection and extends to multi-class cases using Jensen-Shannon divergence.
Findings
Effective on large text datasets
Scales well with high-dimensional data
Outperforms existing methods in accuracy
Abstract
Maximum entropy approach to classification is very well studied in applied statistics and machine learning and almost all the methods that exists in literature are discriminative in nature. In this paper, we introduce a maximum entropy classification method with feature selection for large dimensional data such as text datasets that is generative in nature. To tackle the curse of dimensionality of large data sets, we employ conditional independence assumption (Naive Bayes) and we perform feature selection simultaneously, by enforcing a `maximum discrimination' between estimated class conditional densities. For two class problems, in the proposed method, we use Jeffreys () divergence to discriminate the class conditional densities. To extend our method to the multi-class case, we propose a completely new approach by considering a multi-distribution divergence: we replace Jeffreys…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
