Robust variable selection in linear regression models / Shokrya Saleha A. Alshqaq

This study looks at two problems related to the robust variable selection in linear regression models with six objectives in mind. The first three objectives are concerned with the problem of selection variables in small data sets in a linear regression model. The first is the investigation of th...

Full description

Saved in:
Bibliographic Details
Main Author: Shokrya Saleha, A. Alshqaq
Format: Thesis
Published: 2015
Subjects:
Online Access:http://studentsrepo.um.edu.my/9970/1/Alshqaq_Shorya_Saleh_A.pdf
http://studentsrepo.um.edu.my/9970/2/Alshqaq%2C_Shokrya_Saleh_A_%E2%80%93_Thesis.pdf
http://studentsrepo.um.edu.my/9970/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This study looks at two problems related to the robust variable selection in linear regression models with six objectives in mind. The first three objectives are concerned with the problem of selection variables in small data sets in a linear regression model. The first is the investigation of the robustness of various best variable selection criteria in the presence of outliers and leverage points in the data set. The second derives the influence function of AIC, Cp, and SIC criteria and discussed the properties of these functions. The third is to explore the role of two robust methods for selecting the best variable in the linear regression. The first approach considered is a modified version of AIC, Cp, and SIC statistics by utilizing the high breakdown point estimators of the regression model. The other methods are based on diagnostic regression approach using outliers and leverage diagnostics in regression model procedures. For each method, the power of performance is compared with classical non-robust criteria and the existing criteria, based on M-estimation. In general, our findings show that these criteria are capable of selecting the appropriate models in the presence of outliers. The following three objectives look at the development of LASSO variable selection regression to solve the problem of multicollinearity and large data in variable selection procedure. The fourth is to investigate the sensitivity of non-robust LASSO (LASSO and adaptive-LASSO) and robust LASSO (LAD-LASSO and Huber-LASSO) toward the existence of outliers and leverage points in the data. The fifth looks at extending the Huber-LASSO to include more robust estimators. We present the GM-LASSO and MM-LASSO methods. If the multicollinearity does exist, we use the idea of the LASSO regression analysis to find the best variable in the model. The performance of these methods has also been compared with classical non-robust LASSO, and the existing robust LAD-LASSO and Huber-LASSO are generally good. The final objective is to prepare a new LASSO method based on diagnostic regression approach.