Bayesian variable selection using Knockoffs with applications to genomics

Jurel K. Yap*, Iris Ivy M. Gauran

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Given the costliness of HIV drug therapy research, it is important not only to maximize true positive rate (TPR) by identifying which genetic markers are related to drug resistance, but also to minimize false discovery rate (FDR) by reducing the number of incorrect markers unrelated to drug resistance. In this study, we propose a multiple testing procedure that unifies key concepts in computational statistics, namely Model-free Knockoffs, Bayesian variable selection, and the local false discovery rate. We develop an algorithm that utilizes the augmented data-Knockoff matrix and implement Bayesian Lasso. We then identify signals using test statistics based on Markov Chain Monte Carlo outputs and local false discovery rate. We test our proposed methods against non-bayesian methods such as Benjamini–Hochberg (BHq) and Lasso regression in terms TPR and FDR. Using numerical studies, we show the proposed method yields lower FDR compared to BHq and Lasso for certain cases, such as for low and equi-dimensional cases. We also discuss an application to an HIV-1 data set, which aims to be applied analyzing genetic markers linked to drug resistant HIV in the Philippines in future work.

Original languageEnglish (US)
Pages (from-to)1771-1790
Number of pages20
JournalComputational Statistics
Volume38
Issue number4
DOIs
StateAccepted/In press - 2022

Bibliographical note

Publisher Copyright:
© 2022, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.

Keywords

  • Bayesian variable selection
  • Drug resistant HIV-1
  • False discovery control
  • Model-free Knockoffs

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty
  • Computational Mathematics

Fingerprint

Dive into the research topics of 'Bayesian variable selection using Knockoffs with applications to genomics'. Together they form a unique fingerprint.

Cite this