Performance Comparison of the Quantile Regression Coefficient Estimation with Outliers
Keywords:
Regression coefficient, simple regression, quantile regression, interquartile range, outliersAbstract
This research aims to compare the coefficients estimation performance of quantile regression (QR) at different percentiles and simple regression (SR) for dataset with outliers. Five quantile positions were considered comprising QR(10)th, QR(25)th, QR(50)th, QR(75)th and QR(90)th. These were obtained by modifying the probability density functions for the kernel function adjustment under errors. The results were then compared with the simple regression coefficient estimation. After applying variety situations of the simulation, the mean absolute error (MAE) was used as criteria for consideration. The results showed that although the best performance model for large sample size was the SR, it still gave the best performance for small sample size as well as QR(25)th and QR(50)th models. Furthermore, the QR(25)th and the QR(50)th models were the most efficient estimation of the quantile regression coefficients with outliers for moderate sample size. They also indicated that there were changes scattered around zero value when the sample size was small. Comparing between the SR model and the QR(50)th model for every sample size, it was found that their model coefficients are slightly different. Considering kurtosis and skewness for the SR and all QR models, the results revealed that both values increased with small sample size. Then, they decreased with moderate sample size and increased again with large sample size. Therefore, the quantile regression coefficient estimation is effective in relationship analysis and provides estimates that are more accurate than one answer and suitable as an alternative analysis for skewed data.
References
Ali HA, Nae'l AA. Comparative study on outliers-detection procedures in binary logistic regression model. Thail Stat. 2024; 22(1): 180-191.
Ampanthong P, Suwattee P. Robust estimation of regression coefficients with outliers. Thail Stat. 2010; 8(2): 183-205.
Araveeporn A, Ghosh SK, Budsaba K. Forecasting the stock exchange rate of Thailand Index by conditional heteroscedastic autoregressive nonlinear model with autocorrelated errors. Thail Stat. 2010; 8(2): 109-122.
Choosawat C, Reangsephet O, Srisuradetchai P, Lisawadi S. Performance comparison of penalized regression methods in poisson regression under high-dimensional sparse data with multicollinearity. Thail Stat. 2020; 18(3): 306-318.
Dawoud I. Modified two parameter regression estimators for solving the multicollinearity. Thail Stat. 2022; 20(4): 842-859.
Devroye L, Gyorfi L. Nonparametric density estimation: the L1 view. New York: John Wiley Sons; 1985.
Fiscal Policy Office, 2023, [cited 2023 Jul 1]; Available from: https://govspending.data.go.th/
dashboard/6?language=en.
Härdle W, Hall P, Ichimura H. Optimal smoothing in single-index models. Ann Stat. 1993; 21(1): 157-178.
Hao L, Naiman D. Quantile regression. London: Sage; 2007.
Kleinbaum DG, Kupper LL, Nizam A, Muller KE. Applied regression analysis and other multivariable methods. 4th ed. Belmont, CA: Duxbury; 2008.
Koenker R, Bassett G. Regression quantiles. Econometrica. 1978; 46(1): 33-50.
Kumari SN, Tan A. Modeling and forecasting volatility series: with reference to gold price. Thail Stat. 2018; 16(1): 79-93.
Manoj J, Suresh KK. Application of time-varying coefficient regression model for forecasting financial data. Thail Stat. 2023; 21(1): 180-195.
Ninbai T. Quantile regression. Economics, Ramkhamhaeng University. 2019 [cite 2023 May 8]. Available from: http://www.eco.ru.ac.th/images/gallery/km/KMecon.pdf.
Pagano M, Gauvreau K. Principles of biostatistics. 2nd ed. Duxbury: Pacific Grove; 2000.
Rosenblatt M. Remarks on some nonparametric estimates of a density function. Ann Math Stat. [serial on the internet]. 1956 [cited 2023 May 8]; 27(3): 832-837. Available from https://doi:10.1214/
aoms/1177728190.
Yimnak K, Piampholphan K. A Comparison of MM-estimation and fuzzy robust regression for multiple regression model with outliers. Thail Stat. 2021; 19(2): 411-419.
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.