# Gene Hunting with Knockoffs for Hidden Markov Models

**Authors:** Matteo Sesia, Chiara Sabatti, Emmanuel J. Cand\`es

arXiv: 1706.04677 · 2019-05-14

## TL;DR

This paper extends model-free knockoff methods to hidden Markov models, enabling rigorous variable selection with FDR control in complex genomic data analysis.

## Contribution

It introduces an efficient algorithm for generating knockoffs for HMMs and demonstrates their application in genome-wide association studies.

## Key findings

- Effective knockoff sampling for HMMs developed
- Controlled FDR in genomic variable selection demonstrated
- Applied to Crohn's disease and cholesterol datasets

## Abstract

Modern scientific studies often require the identification of a subset of relevant explanatory variables, in the attempt to understand an interesting phenomenon. Several statistical methods have been developed to automate this task, but only recently has the framework of model-free knockoffs proposed a general solution that can perform variable selection under rigorous type-I error control, without relying on strong modeling assumptions. In this paper, we extend the methodology of model-free knockoffs to a rich family of problems where the distribution of the covariates can be described by a hidden Markov model (HMM). We develop an exact and efficient algorithm to sample knockoff copies of an HMM. We then argue that combined with the knockoffs selective framework, they provide a natural and powerful tool for performing principled inference in genome-wide association studies with guaranteed FDR control. Finally, we apply our methodology to several datasets aimed at studying the Crohn's disease and several continuous phenotypes, e.g. levels of cholesterol.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1706.04677/full.md

## Figures

29 figures with captions in the complete paper: https://tomesphere.com/paper/1706.04677/full.md

## References

61 references — full list in the complete paper: https://tomesphere.com/paper/1706.04677/full.md

---
Source: https://tomesphere.com/paper/1706.04677