Nonparametric independence screening and structure identification for ultra-high dimensional longitudinal data

Ming-Yen Cheng; Toshio Honda; Jialiang Li; Heng Peng

doi:10.1214/14-aos1236

JOURNAL ARTICLE

Year: 2014; Journal: The Annals of Statistics; Vol: 42 (5); Publisher: Institute of Mathematical Statistics

Abstract

Ultra-high dimensional longitudinal data are increasingly common and the analysis is challenging both theoretically and methodologically. We offer a new automatic procedure for finding a sparse semivarying coefficient model, which is widely accepted for longitudinal data analysis. Our proposed method first reduces the number of covariates to a moderate order by employing a screening procedure, and then identifies both the varying and constant coefficients using a group SCAD estimator, which is subsequently refined by accounting for the within-subject correlation. The screening procedure is based on working independence and B-spline marginal models. Under weaker conditions than those in the literature, we show that with high probability only irrelevant variables will be screened out, and the number of selected variables can be bounded by a moderate order. This allows the desirable sparsity and oracle properties of the subsequent structure identification step. Note that existing methods require some kind of iterative screening in order to achieve this, thus they demand heavy computational effort and consistency is not guaranteed. The refined semivarying coefficient model employs profile least squares, local linear smoothing and nonparametric covariance estimation, and is semiparametric efficient. We also suggest ways to implement the proposed methods, and to select the tuning parameters. An extensive simulation study is summarized to demonstrate its finite sample performance and the yeast cell cycle data is analyzed.
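The screening step described in the abstract (working independence plus B-spline marginal models) can be illustrated with a small sketch. This is not the authors' implementation: the function names `bspline_basis` and `marginal_utilities`, the simulated data, the choice of six cubic basis functions, the use of the reduction in residual sum of squares as the marginal utility, and the cutoff of ten retained covariates are all assumptions made for illustration. The group SCAD and refinement steps are not shown.

```python
import numpy as np

def bspline_basis(t, n_basis=6, degree=3):
    """Clamped B-spline basis on [0, 1] via the Cox-de Boor recursion (hypothetical helper)."""
    t = np.asarray(t, dtype=float)
    interior = np.linspace(0.0, 1.0, n_basis - degree + 1)[1:-1]
    knots = np.concatenate([np.zeros(degree + 1), interior, np.ones(degree + 1)])
    n0 = len(knots) - 1
    # Degree-0 basis: indicators of the half-open knot intervals.
    B = np.zeros((t.size, n0))
    for i in range(n0):
        if knots[i + 1] > knots[i]:
            B[:, i] = (t >= knots[i]) & (t < knots[i + 1])
    B[t == 1.0, n_basis - 1] = 1.0  # close the last nonempty interval at t = 1
    # Raise the degree one step at a time, skipping zero-width knot spans.
    for d in range(1, degree + 1):
        B_new = np.zeros((t.size, B.shape[1] - 1))
        for i in range(B.shape[1] - 1):
            den1 = knots[i + d] - knots[i]
            den2 = knots[i + d + 1] - knots[i + 1]
            if den1 > 0:
                B_new[:, i] += (t - knots[i]) / den1 * B[:, i]
            if den2 > 0:
                B_new[:, i] += (knots[i + d + 1] - t) / den2 * B[:, i + 1]
        B = B_new
    return B

def rss(design, y):
    """Residual sum of squares of an ordinary least squares fit."""
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    resid = y - design @ coef
    return float(resid @ resid)

def marginal_utilities(y, X, t, n_basis=6):
    """Working-independence marginal fits: y ~ a(t) + b_j(t) * x_j, one covariate at a time.

    All observations are pooled and within-subject correlation is ignored.
    The utility (RSS reduction over the baseline a(t)) is an assumed criterion.
    """
    B = bspline_basis(t, n_basis)
    base = rss(B, y)
    utils = np.empty(X.shape[1])
    for j in range(X.shape[1]):
        design = np.hstack([B, B * X[:, [j]]])
        utils[j] = base - rss(design, y)
    return utils

# Simulated longitudinal data (assumed setup): 100 subjects, 5 visits each, 200 covariates.
rng = np.random.default_rng(0)
n_subj, n_obs, p = 100, 5, 200
N = n_subj * n_obs
t = rng.uniform(0.0, 1.0, size=N)
X = rng.standard_normal((N, p))
subject_effect = np.repeat(rng.normal(0.0, 0.5, size=n_subj), n_obs)
# Covariate 0 has a varying coefficient, covariate 1 a constant one; the rest are irrelevant.
y = 2.0 * np.sin(2 * np.pi * t) * X[:, 0] + 1.5 * X[:, 1] + subject_effect \
    + rng.normal(0.0, 0.5, size=N)

utils = marginal_utilities(y, X, t)
selected = set(int(j) for j in np.argsort(utils)[::-1][:10])
print(sorted(selected))
```

In this sketch the retained set would then be passed to a structure identification step that decides which coefficients are varying and which are constant; the abstract states that the paper does this with a group SCAD estimator.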

Metrics

FWCI (Field Weighted Citation Impact): 5.82

Citation Normalized Percentile: 0.96

Topics

Statistical Methods and Inference

Physical Sciences → Mathematics → Statistics and Probability

Bayesian Methods and Mixture Models

Physical Sciences → Computer Science → Artificial Intelligence

Gene expression and cancer classification

Life Sciences → Biochemistry, Genetics and Molecular Biology → Molecular Biology

Related Documents

Nonparametric independence screening for ultra-high-dimensional longitudinal data under additive models

Nonparametric independence screening for ultra-high dimensional generalized varying coefficient models with longitudinal data

Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Additive Models

Nonparametric Independence Screening in Sparse Ultra-High-Dimensional Varying Coefficient Models

Robust conditional nonparametric independence screening for ultrahigh-dimensional data