Efficiency Comparison of the Population Standard Deviation Estimation Methods for Data set with Normal Distribution and Containing Outlier
Main Article Content
Abstract
The objective of this research was to compare the efficiency of four population standard deviation estimation methods, mean absolute deviation, adjusted range, adjusted standard deviation and sample standard deviation – for a normal distribution when data set containing outlier when the lowest absolute bias and the lowest mean square error were used as criteria. Under 90 simulation scenarios that normal distribution with mean equals 30 and the population standard deviation equals 1, 5, 10, 15 and 20, sample size equals 10, 20, 30, 50, 70 and 100 with the percentage of outlier equals 0%, 10% and 20%. The results found that sample standard deviation was the most efficient when the data set not contain outlier and it was more efficient if the sample size was larger, but the efficiency would decrease when standard deviation was increase and found that adjusted standard deviation has more efficient than adjusted range in all situation which the data set not contain outlier and found that mean absolute deviation was the most efficient when the percentage of outlier equals 10% and 20%.
Article Details
References
Aggarwal, C. (2016). An Introduction to Outlier Analysis. 1st ed. New York: Springer, Cham.
Anscombe, F.J. (1960). Rejection of Outliers. American Statistical Association and American Society for Quality, 2(2), 123-147.
Barnett, V. and Lewis, T. (1995). Outlier in Statistical Data. 3th ed. New York: John Wiley.
Beckman, R. J. and Cook, R. D. (1983). Outliers. American Statistical Association and American Society for Quality, 2(25), 119-149.
Boonyavaha, P., Pratchayalamlert, N., Boonpet, B., and Sinsomboonthong, J. (2018). Efficiency Comparison of the Standard Deviation Estimation Methods for a Normal Distribution When Data Set Containing Outliers. Science and Technology, 27(2), 213-225. (in Thai)
Chelishchev, P., Popov, A., and Sorby, K. (2018). An investigation of Outlier Detection Procedures for CMM Measurement Data. The Proceedings of the 2022 The 6th International Conference on Mechanical, System and Control Engineering, 20-22 April 2022 at Moscow, 1-6.
Christos, C. (2020). Statistical techniques to identify and handle outliers in multivariate data [Online]. Retrieved December 5, 2022, from: https://idus.us.es/bitstream/handle/11441/115316/Grentzelos%2C%20Christos.pdf?se quence=1&isAllowed=y
Hubert, M., and Vandervieren, E. (2008). An adjusted boxplot for skewed distributions. Comput Stat Data Anal, 52(3), 5186-5201.
Junsawang, P., Promwongsa, M. and Srisodaphol, M. (2021). Robust Outliers Detection Method for Skewed Distribution. Thailand Statistician, 19(3), 450-471.
Leon, H. (1960). Tables of range and studentized range, Ann. Math. Stat, 31(4), 1122-1147.
Ley, C., Klein, O., Bernard, P. and Licata., L. (2013). Detecting outliers: Do not use standard deviation around the mean, use absolute deviation around the median. Experimental Social Psychology, 49(4), 764-766.
Montgomery, C. (2012). Introduction to Statistical Quality Control. 7th ed. New York: John
Wiley.
NIST/SEMATECH (2012). Engineering Statistics [Online]. Retrieved December 5, 2022, from: https://www.itl.nist.gov/div898/handbook/prc/section1/prc16.htm.
Tukey, J. W. (1977). Exploratory data analysis. Massachusetts: Addison-Wesley.