Predicting viral infection from high-dimensional biomarker trajectories

Minhua Chen, Aimee Zaas, Christopher Woods, Geoffrey S. Ginsburg, Joseph Lucas, David Dunson, Lawrence Carin

Research output: Contribution to journalArticlepeer-review

22 Scopus citations

Abstract

There is often interest in predicting an individual's latent health status based on high-dimensional biomarkers that vary over time. Motivated by time-course gene expression array data that we have collected in two influenza challenge studies performed with healthy human volunteers, we develop a novel time-aligned Bayesian dynamic factor analysis methodology. The time course trajectories in the gene expressions are related to a relatively low-dimensional vector of latent factors, which vary dynamically starting at the latent initiation time of infection. Using a nonparametric cure rate model for the latent initiation times, we allow selection of the genes in the viral response pathway, variability among individuals in infection times, and a subset of individuals who are not infected. As we demonstrate using held-out data, this statistical framework allows accurate predictions of infected individuals in advance of the development of clinical symptoms, without labeled data and even when the number of biomarkers vastly exceeds the number of individuals under study. Biological interpretation of several of the inferred pathways (factors) is provided. © 2011 American Statistical Association.
Original languageEnglish (US)
Pages (from-to)1259-1279
Number of pages21
JournalJournal of the American Statistical Association
Volume106
Issue number496
DOIs
StatePublished - Dec 1 2011
Externally publishedYes

Bibliographical note

Generated from Scopus record by KAUST IRTS on 2021-02-09

Fingerprint

Dive into the research topics of 'Predicting viral infection from high-dimensional biomarker trajectories'. Together they form a unique fingerprint.

Cite this