A Non-Parametric Estimator of the Probability Weighted Moments for Large Datasets

Authors

  • Toufik Guermah LMA Laboratory/Departement of Mathematics/Hassiba Benbouali University, Chlef, Algeria
  • Abdelaziz Rassoul GEE Laboratory/National Higher School for Hydraulics, Blida, Algeria

Keywords:

PWM, median-of-means, large datasets, empirical likelihood, hypothesis test

Abstract

In this paper, we introduces a nonparametric median-of-means (MoM) estimator for Probability Weighted Moments (PWM) specifically designed for large datasets. Our approach draws inspiration from the data grouping method, a widely utilized technique in various domains including economics, hydrology, finance, and insurance. We establish the consistency and asymptotic normality of the
proposed estimator under reasonable assumptions regarding the increasing number of subgroups. Additionally, we present a novel approach for testing hypotheses related to Probability Weighted Moments (PWM) using the Empirical Likelihood method (EL) specifically tailored for the median. Notably, our method circumvents the need for prior estimation of the variance structure associated
with the estimator, a task that can be challenging and prone to inaccuracies. We conducted numerical simulations to assess the performance of our proposed estimator. The results clearly indicate that our estimator showcases remarkable robustness, particularly when confronted with outliers.

References

Akaike H. Information theory and an extension of the maximum likelihood principle. In Selected

papers of hirotugu akaike. New York: Springer; 1998. pp. 199-213.

Alon N, Matias Y, Szegedy M. The space complexity of approximating the frequency moments. In

Proceedings of the twenty-eighth annual ACM symposium on Theory of computing; 1996. p.

-29.

Audibert JY, Catoni, O. Robust linear least squares regression. Ann Stat. 2011; 39(5): 2766-2794.

Bhati D, Kattumannil SK, Sreelakshmi N. Jackknife empirical likelihood based inference for probability weighted moments. J Korean Stat Soc. 2021; 50: 98-116.

Blair C. (1985). Problem complexity and method efficiency in optimization (as nemirovsky and db

yudin). Siam Review. 1985; 27(2): 264.

Bubeck S, Cesa-Bianchi N, Lugosi G. Bandits with heavy tail. IEEE Trans Inf Theory. 2013; 59(11):

-7717.

Chen J, Variyath AM, Abraham B. Adjusted empirical likelihood and its properties. J Comput Graph

Stat. 2008; 17(2): 426-443.

David HA, Nagaraja HN. Order Statistics. 3rd ed. New York: John Wiley and Sons; 2003.

Devroye L, Lerasle M, Lugosi G, Oliveira RI. Sub-Gaussian mean estimators. Ann Stat. 2016; 44(6):

-2725.

Greenwood JA, Landwehr JM, Matalas NC, Wallis JR. Probability weighted moments: definition

and relation to parameters of several distributions expressable in inverse form. Water Resour

Res. 1979; 15(5): 1049-1054.

Hoeffding W. A class of statistics with asymptotically normal distribution. In: Breakthroughs in

Statistics: Foundations and Basic Theory. Springer; 1992.

Hosking JRM, Wallis JR, Wood EF. Estimation of the generalized extreme-value distribution by the

method of probability-weighted moments. Technometrics. 1985; 27(3): 251-261.

Hsu D, Sabato S. (2016). Loss minimization and parameter estimation with heavy tails. J Mach Learn

Res. 2016; 17(1): 543-582.

Jerrum MR, Valiant LG, Vazirani VV. Random generation of combinatorial structures from a uniform

distribution. Theor Comput Sci. 1986; 43: 169-188.

Jiang H, Zhao Y. Transformed jackknife empirical likelihood for probability weighted moments. J

Stat Comput Simul. 2022; 92(8): 1618-1639.

Jing BY, Yuan J, Zhou W. Jackknife empirical likelihood. J Am Stat Assoc. 2009; 104(487): 1224-

Mohamed L, Abdelaziz R. A robust estimator of the S-Gini index for massive data. Commun Stat

Simulat. 2023; 52(8): 3502-3519.

Lerasle M, Oliveira RI. Robust empirical mean estimators. arXiv preprint arXiv:1112.3914. 2011.

Lugosi G, Mendelson S. (2019). Sub-Gaussian estimators of the mean of a random vector. Ann Stat.

; 47(2): 783-794.

Ma X, Wang S, Zhou W. (2022). Statistical inference in massive datasets by empirical likelihood.

Comput Stat. 2021; 37: 1143-1164.

Minsker S. Geometric median and robust estimation in Banach spaces. Bernoulli. 2015; 21(4): 2308-

Owen A. Empirical likelihood ratio confidence regions. Ann Stat. 1990; 18(1): 90-120.

Owen AB. (2001). Empirical likelihood. Chapman & Hall: CRC Press; 2001.

Vexler A, Zou L, Hutson AD. An extension to empirical likelihood for evaluating probability

weighted moments. J Stat Plan Infer. 2017; 182: 50-60.

Vexler A, Zou L. Empirical likelihood ratio tests with power one. Stat Probab Lett. 2018; 140: 160-

Zhao Y, Meng X, Yang H. Jackknife empirical likelihood inference for the mean absolute deviation.

Comput Stat Data Anal. 2015; 91: 92-101.

Downloads

Published

2025-09-27

How to Cite

Guermah, T. ., & Rassoul, A. . (2025). A Non-Parametric Estimator of the Probability Weighted Moments for Large Datasets. Thailand Statistician, 23(4), 771–785. retrieved from https://ph02.tci-thaijo.org/index.php/thaistat/article/view/261560

Issue

Section

Articles