Deep Direct Likelihood Knockoffs

Mukund Sudarshan; Wesley Tansey; Rajesh Ranganath

arXiv:2007.15835·stat.ML·August 3, 2020

Deep Direct Likelihood Knockoffs

Mukund Sudarshan, Wesley Tansey, Rajesh Ranganath

PDF

Open Access 1 Repo 1 Video

TL;DR

Deep Direct Likelihood Knockoffs (DDLK) is a novel method that directly minimizes the KL divergence to generate valid knockoffs, improving feature discovery accuracy while controlling false discovery rate in complex models.

Contribution

DDLK introduces a new approach that directly optimizes the likelihood and swap properties of knockoffs using KL divergence and Gumbel-Softmax, enhancing validity and power.

Findings

01

Higher power than baseline methods in synthetic benchmarks

02

Effective FDR control across diverse datasets

03

Successful application to COVID-19 dataset

Abstract

Predictive modeling often uses black box machine learning methods, such as deep neural networks, to achieve state-of-the-art performance. In scientific domains, the scientist often wishes to discover which features are actually important for making the predictions. These discoveries may lead to costly follow-up experiments and as such it is important that the error rate on discoveries is not too high. Model-X knockoffs enable important features to be discovered with control of the FDR. However, knockoffs require rich generative models capable of accurately modeling the knockoff features while ensuring they obey the so-called "swap" property. We develop Deep Direct Likelihood Knockoffs (DDLK), which directly minimizes the KL divergence implied by the knockoff swap property. DDLK consists of two stages: it first maximizes the explicit likelihood of the features, then minimizes the KL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rajesh-lab/ddlk
pytorchOfficial

Videos

Deep Direct Likelihood Knockoffs· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Topic Modeling