DISSERTATION

Model choice and variable selection in mixed & semiparametric models

Abstract

Semiparametric and mixed models allow different kinds of data structures and \ndata types to be considered in regression models. Spatial and temporal \nstructures of discrete or spatial data can be treated as flexibly as, for \ninstance, functional data. This growing flexibility increasingly requires a \nstatistician to make choices between competing models. \nIn model selection the degrees of freedom play an important role as a measure of \nmodel complexity. In this thesis three approaches for the estimation of the \ndegrees of freedom in mixed and semiparametric models are developed, each for \ndifferent distributions of the (conditional) responses. The interpretation of \nsemiparametric models as mixed models justifies using the same model selection \ntechniques for both model classes. \nBy using Steinian methods, the degrees of freedom can be determined for a \ngroup of distributions belonging to the exponential family. The developed \nmethods for determining the degrees of freedom are illustrated by an example of \ntree growth data. \nFor a larger class of distributions the degrees of freedom can be determined by \ncross-validation and bootstrap methods. Additionally, an approximate Steinian \nmethod can be adapted for further distributions. \nBased on the implicit function theorem the degrees of freedom of a variance or \nsmoothing parameter can de derived analytically if the response is normally \ndistributed. Failure to take these degrees of freedom into account can lead to \nbiased model selection. In addition to the methodological derivation, the \ngeometrical properties of the degrees of freedom of the variance and smoothing \nparameters are analysed. Furthermore, numerical problems in the computation of \nthe degrees of freedom are considered.

Keywords:
Degrees of freedom (physics and chemistry) Smoothing Mathematics Model selection Conditional probability distribution Interpretation (philosophy) Applied mathematics Computer science Mathematical optimization Econometrics Statistics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
102
Refs
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Soil Geostatistics and Mapping
Physical Sciences →  Environmental Science →  Environmental Engineering
Statistical Methods and Inference
Physical Sciences →  Mathematics →  Statistics and Probability
Forest ecology and management
Physical Sciences →  Environmental Science →  Nature and Landscape Conservation

Related Documents

JOURNAL ARTICLE

Variable Selection for Semiparametric Mixed Models in Longitudinal Studies

Xiao NiDaowen ZhangHao Helen Zhang

Journal:   Biometrics Year: 2009 Vol: 66 (1)Pages: 79-88
JOURNAL ARTICLE

Robust variable selection in semiparametric mixed effects longitudinal data models

Huihui SunQiang Liu

Journal:   Communication in Statistics- Theory and Methods Year: 2022 Vol: 53 (3)Pages: 1049-1064
BOOK-CHAPTER

Variable Selection in Semiparametric Bi-functional Models

Silvia NovoGermán AneirosPhilippe Vieu

Contributions to statistics Year: 2020 Pages: 197-204
JOURNAL ARTICLE

Bayes Variable Selection in Semiparametric Linear Models

Suprateek KunduDavid B. Dunson

Journal:   Journal of the American Statistical Association Year: 2014 Vol: 109 (505)Pages: 437-447
BOOK-CHAPTER

Variable Selection in Generalized Semiparametric Longitudinal Models

‎M‎ohammad ArashiSamuel Manda

Emerging topics in statistics and biostatistics Year: 2024 Pages: 221-230
© 2026 ScienceGate Book Chapters — All rights reserved.