Comparisons of Penalized Regression Methods under High-Dimensional Sparse Data with Correlated Variables

Authors

  • Supranee Lisawadi Department of Mathematics and Statistic, Faculty of Science and Technology, Thammasat University, Pathum Thani 12120, Thailand
  • Apisara Sripanich Department of Mathematics and Statistic, Faculty of Science and Technology, Thammasat University, Pathum Thani 12120, Thailand
  • Patchanok Srisuradetchai Department of Mathematics and Statistic, Faculty of Science and Technology, Thammasat University, Pathum Thani 12120, Thailand

Keywords:

Adaptive elastic net, Adaptive LASSO, Adaptive weights, Penalized regression

Abstract

Regression models are frequently used to explain a response variable using independent variables in statistics. However, it is common to encounter situations in which the number of independent variables exceeds the number of observations and predictors are correlated. In this instance, a large number of predictors are statistically insignificant, also known as sparse data. Standard statistical methods do not always apply to such data. Problematic aspects include interpretation, estimation inefficiency, and computation. The penalized regression method, which consists of Ridge, least absolute shrinkage and selection operator (LASSO), elastic net (Enet), adaptive LASSO (ALASSO), and adaptive elastic net (AEnet), is frequently employed during the estimation and variable selection phases. The purpose of this paper was to assess the prediction and variable selection performances of Ridge, Enet, LASSO, ALASSO, and AEnet methods in multiple linear regression with normal or positively skewed error terms, sparse data, and correlated independent variables. In addition, Poisson and logistic regression models are studied. The adaptive weights are created using the remaining three estimators: Ridge, Enet, and LASSO. The results indicate that the Ridge estimator is a viable initial adaptive weight estimator for ALASSO and AEnet. In terms of prediction, AEnet and ALASSO typically outperform the competition. Given the objectives, different tactics are necessary to achieve the lowest false positive rate (FPR) and false negative rate (FNR). Enet or AEnet is essential to attain the lowest FPR, while LASSO or ALASSO will yield the lowest FNR.

Downloads

Published

2023-06-14

How to Cite

Supranee Lisawadi, Apisara Sripanich, & Patchanok Srisuradetchai. (2023). Comparisons of Penalized Regression Methods under High-Dimensional Sparse Data with Correlated Variables. Science & Technology Asia, 28(2), 31–42. Retrieved from https://ph02.tci-thaijo.org/index.php/SciTechAsia/article/view/249855

Issue

Section

Physical sciences