Some Discussions On Model Selection Criteria

Posted on:2017-04-21

Degree:Master

Type:Thesis

Country:China

Candidate:F F Liu

Full Text:PDF

GTID:2270330488492155

Subject:Probability theory and mathematical statistics

Abstract/Summary:

PDF Full Text Request

Model selection has become a hot topic in statistic analysis, which usually contains the selections of model types and the independent variables. Since the 20th century, statistical scholars have got a lot of results on model selection, such as criterion based on information theory and Bayesian methods. However, these formulas aren’t suitable for high-dimensional models completely. The model selection methods for the high-dimensional data are the hot topics in statistics.Firstly, this dissertation introduces the basic concepts of linear regression model, as well as the Bayesian statistical inference of multiple linear regression. Meanwhile, the Deviation Information Criterion (DIC) is introduced, and we use it to the linear regression model selection. The results show that the use of DIC and MCMC sampling can lead to "best" model. Compared with the existing criteria to select the optimal model, DIC criterion is almost consistency.Secondly, the dissertation introduces the Poisson model with random effects (REP) and fixed effects (FEP), and the relationships between them are also discussed. The cross-validation log score (LScv) and full-sample log score (LSFS) are defined. We analyze the relationships between DIC and Log score function by using the data generation mechanism. Simulation results illustrate that, they have a significant negative correlation in FEP model. However, they don’t have a significant correlation in the REF model. In the end, we research the advantages and disadvantages between of Log scores and DIC with the small sample size data. The results show that DIC is slightly better than LScv, the advantages is gradually decreasing as n is increasing. But in the REP model, LSFS is more accurate than LScv and DIC.

Keywords/Search Tags:

DIC, cross-validation log score, full-sample log score, prediction, model selection

PDF Full Text Request

Related items

1	Corrections To The Score Tests On Large Dimensional Sample Covariance Matrices Structure
2	A volumetric score function for computational protein structure modeling and protein structure validation
3	Application Of Over Cut-off Line Percentile Regression Model In University Admission Score Prediction Engineering
4	Linear Model Selection By Cross-validation
5	Transformation Of Exon Scores In Gene Prediction
6	Study On Application Of Normal Distribution In Score Ranking And Adjusting
7	A Summary Of Cross-Validation In Model Selection
8	Identifying Local Dependence with a Score Test Statistic Based on the Bifactor 2-Parameter Logistic Model
9	Research On Financial Distress Prediction Of Chinese Real Estate Listed Companies Based On Z-Score Model
10	An Empirical Study On The Application Of Z-score Model In Domestic Credit Bond Issuers