JOURNAL ARTICLE

Superpopulation model inference for non probability samples under informative sampling with high-dimensional data

Zhan LiuDianni WangYingli Pan

Year: 2024 Journal:   Communication in Statistics- Theory and Methods Vol: 54 (5)Pages: 1370-1390   Publisher: Taylor & Francis

Abstract

Non probability samples have been widely used in various fields. However, non probability samples suffer from selection biases due to the unknown selection probabilities. Superpopulation model inference methods have been discussed to solve this problem, but these approaches require the non informative sampling assumption. When the sampling mechanism is informative sampling, that is, selection probabilities are related to the outcome variable, the previous inference methods may be invalid. Moreover, we may encounter a large number of covariates in practice, which poses a new challenge for inference from non probability samples under informative sampling. In this article, the superpopulation model approaches under informative sampling with high-dimensional data are developed to perform valid inferences from non probability samples. Specifically, a semiparametric exponential tilting model is established to estimate selection probabilities, and the sample distribution is derived for estimating the superpopulation model parameters. Moreover, SCAD, adaptive LASSO, and Model-X knockoffs are employed to select variables, and estimate parameters in superpopulation modeling. Asymptotic properties of the proposed estimators are established. Results from simulation studies are presented to compare the performance of the proposed estimators with the naive estimator, which ignores informative sampling. The proposed methods are further applied to the National Health and Nutrition Examination Survey data.

Keywords:
Statistics Inference Sampling (signal processing) Statistical inference Probability sampling Computer science Sampling design Probability model Econometrics Mathematics Data mining Artificial intelligence Medicine Environmental health

Metrics

1
Cited By
1.53
FWCI (Field Weighted Citation Impact)
40
Refs
0.70
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Statistical Methods and Inference
Physical Sciences →  Mathematics →  Statistics and Probability
Advanced Statistical Methods and Models
Physical Sciences →  Mathematics →  Statistics and Probability
Statistical Methods and Bayesian Inference
Physical Sciences →  Mathematics →  Statistics and Probability

Related Documents

JOURNAL ARTICLE

Superpopulation model inference for non-probability samples under informative sampling

Zhan LiuDianni WangYingli Pan

Journal:   Communications in Statistics - Simulation and Computation Year: 2024 Vol: 54 (10)Pages: 4213-4234
JOURNAL ARTICLE

Inference for non-probability samples under high-dimensional covariate-adjusted superpopulation model

Yingli PanWen CaiZhan Liu

Journal:   Statistical Methods & Applications Year: 2022 Vol: 31 (4)Pages: 955-979
BOOK-CHAPTER

Inference Under Informative Probability Sampling

Michail Sverchkov

International Encyclopedia of Statistical Science Year: 2025 Pages: 1193-1196
BOOK-CHAPTER

Inference Under Informative Probability Sampling

Michail Sverchkov

International Encyclopedia of Statistical Science Year: 2011 Pages: 662-664
JOURNAL ARTICLE

Nonlinear Superpopulation Model Inference for Non‐Probability Samples With Nonignorable Missingness

Zhan LiuJiajing XuRuohan LiYingli Pan

Journal:   Statistical Analysis and Data Mining The ASA Data Science Journal Year: 2025 Vol: 18 (5)
© 2026 ScienceGate Book Chapters — All rights reserved.