Abstract
Although the leave-subject-out cross-validation (CV) has been widely used in practice for tuning parameter selection for various nonparametric and semiparametric models of longitudinal data, its theoretical property is unknown and solving the associated optimization problem is computationally expensive, especially when there are multiple tuning parameters. In this paper, by focusing on the penalized spline method, we show that the leave-subject-out CV is optimal in the sense that it is asymptotically equivalent to the empirical squared error loss function minimization. An efficient Newton-type algorithm is developed to compute the penalty parameters that optimize the CV criterion. Simulated and real data are used to demonstrate the effectiveness of the leave-subject-out CV in selecting both the penalty parameters and the working correlation matrix. © 2012 Institute of Mathematical Statistics.
Original language | English (US) |
---|---|
Pages (from-to) | 3003-3030 |
Number of pages | 28 |
Journal | The Annals of Statistics |
Volume | 40 |
Issue number | 6 |
DOIs | |
State | Published - Dec 2012 |
Externally published | Yes |
Bibliographical note
KAUST Repository Item: Exported on 2020-10-01Acknowledged KAUST grant number(s): KUS-CI-016-04
Acknowledgements: Supported in part by Award Number KUS-CI-016-04, made by King Abdullah University of Science and Technology (KAUST).Supported in part by NSF Grants DMS-09-07170, DMS-10-07618, DMS-12-08952 and NCI Grant CA57030.
This publication acknowledges KAUST support, but has no KAUST affiliated authors.