Comparisons of Penalized Regression Methods under High-Dimensional Sparse Data with Correlated Variables
Abstract
Regression models are widely used in statistics to explain a response variable in terms of independent variables. However, it is common to encounter situations in which the number of independent variables exceeds the number of observations and the predictors are correlated. In such settings, many predictors are statistically insignificant, a situation known as sparse data. Standard statistical methods do not always apply to such data; problems include interpretability, estimation inefficiency, and computational cost. Penalized regression methods, including Ridge, the least absolute shrinkage and selection operator (LASSO), the elastic net (Enet), the adaptive LASSO (ALASSO), and the adaptive elastic net (AEnet), are frequently employed for estimation and variable selection. The purpose of this paper was to assess the prediction and variable selection performance of the Ridge, LASSO, Enet, ALASSO, and AEnet methods in multiple linear regression with normal or positively skewed error terms, sparse data, and correlated independent variables; Poisson and logistic regression models are also studied. The adaptive weights for ALASSO and AEnet are constructed from the Ridge, LASSO, and Enet estimators. The results indicate that the Ridge estimator is a viable initial estimator for the adaptive weights of ALASSO and AEnet. In terms of prediction, AEnet and ALASSO typically outperform the other methods. Depending on the objective, different strategies are needed to achieve the lowest false positive rate (FPR) or false negative rate (FNR): Enet or AEnet attains the lowest FPR, while LASSO or ALASSO yields the lowest FNR.
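The setting described above can be illustrated with a small simulation. The following is a minimal sketch (not the authors' code or simulation design): it generates sparse, high-dimensional data with AR(1)-correlated predictors, fits LASSO with scikit-learn, and implements an adaptive LASSO by rescaling the design matrix with weights from an initial Ridge fit, following the abstract's suggestion that Ridge is a viable initial weight estimator. All tuning parameters (alpha, rho, gamma) are illustrative choices, not values from the study.

```python
import numpy as np
from sklearn.linear_model import Ridge, Lasso

rng = np.random.default_rng(0)
n, p = 50, 100                        # n < p: high-dimensional setting
rho = 0.5                             # correlation strength (illustrative)
# AR(1) correlation structure: corr(X_i, X_j) = rho^|i-j|
cov = rho ** np.abs(np.subtract.outer(np.arange(p), np.arange(p)))
X = rng.multivariate_normal(np.zeros(p), cov, size=n)

beta = np.zeros(p)                    # sparse truth: only 5 active predictors
beta[:5] = [3.0, 1.5, 2.0, 1.0, 0.5]
y = X @ beta + rng.normal(scale=1.0, size=n)

# Plain LASSO
lasso = Lasso(alpha=0.1, max_iter=10_000).fit(X, y)

# Adaptive LASSO with Ridge-based weights w_j = |b_ridge_j|^gamma:
# solved via a standard LASSO on the rescaled design X * w
gamma = 1.0
ridge = Ridge(alpha=1.0).fit(X, y)
w = np.abs(ridge.coef_) ** gamma + 1e-8   # small offset avoids zero weights
alasso_fit = Lasso(alpha=0.1, max_iter=10_000).fit(X * w, y)
alasso_coef = alasso_fit.coef_ * w        # map back to original scale

# Variable-selection error rates, as compared in the paper
true_support = beta != 0
for name, coef in [("LASSO", lasso.coef_), ("ALASSO", alasso_coef)]:
    sel = coef != 0
    fpr = np.mean(sel[~true_support])     # inactive predictors selected
    fnr = np.mean(~sel[true_support])     # active predictors missed
    print(f"{name}: FPR={fpr:.2f}, FNR={fnr:.2f}")
```

The same rescaling trick gives the adaptive elastic net when `Lasso` is replaced by `ElasticNet`; averaging FPR and FNR over many simulated replications is how such methods are typically compared.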
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.