Overdispersed Count Data Using Negative Binomial-Quasi XGamma Distribution
Keywords:
negative binomial distribution, quasi-xgamma distribution, mixed distribution, count data, overdispersionAbstract
This paper introduces the negative binomial-quasi xgamma (NB-QX) distribution, a three-parameter model derived by mixing the negative binomial and quasi xgamma distributions. The NB-QX distribution offers significant flexibility in analyzing overdispersed count data and subsumes the negative binomial-gamma distribution as a special case. We derive its fundamental properties, including the probability mass function, factorial moments, mean, and variance. Parameter estimation is conducted via the maximum likelihood method. The model’s performance is evaluated using three real-world count datasets, with goodness-of-fit assessed through the Kolmogorov–Smirnov (KS) test for discrete distributions, AIC, and BIC. The empirical results demonstrate that the NB-QX distribution outperforms Poisson, negative binomial, and negative binomial-gamma distributions, particularly in capturing overdispersion within a unified framework.
References
Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19(6), 716–723. https://doi.org/10.1109/TAC.1974.1100705.
Bugallo, M., Esteban, M. D., Marey Pérez, M. F., & Morales, D. (2023). Zero inflated negative binomial mixed models for predicting number of wildfires in Spain. Journal of Environmental Management, 330, Article 116788. https://doi.org/10.1016/j.jenvman.2022.116788.
Dzinyela, R., Shirazi, M., Das, S., & Lord, D. (2024). The negative Binomial Lindley model with time dependent parameters: Accounting for temporal variations and excess zero observations in crash data. Accident Analysis & Prevention, 207, Article 107711. https://doi.org/10.1016/j.aap.2024.107711.
Gençtürk, M., & Yiğiter, E. (2015). Negative binomial-gamma distribution for overdispersed count data. Communications in Statistics – Theory and Methods, 44(12), 2565–2576. https://doi.org/10.1080/03610926.2013.791373.
İyit, N. (2024). The global effects of digestive system cancers worldwide on the COVID 19 pandemic by negative binomial (mixed) regression models. Journal of Radiation Research and Applied Sciences, 17(3), 100944. https://doi.org/10.1016/j.jrras.2024.100944.
Klugman, S. A., Panjer, H. H., & Willmot, G. E. (2019). Loss models: From data to decisions (5th ed.). Wiley.
Marsaglia, G., Tsang, W. W., & Wang, J. (2003). Evaluating Kolmogorov’s distribution. Journal of Statistical Software, 8(18), 1–4.
Pudprommarat, C., Bodhisuwan, W., & Zeephongsekul, P. (2012). A new mixed negative binomial distribution. Journal of Applied Sciences, 12, 1853–1858. https://doi.org/10.3923/jas.2012.1853.1858.
R Core Team. (2025). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/.
Schwarz, G. E. (1978). Estimating the dimension of a model. The Annals of Statistics, 6(2), 461–464. https://doi.org/10.1214/aos/1176344136.
Sen, P., & Chandra, P. (2017). The quasi xgamma distribution: A flexible lifetime model. Journal of Statistical Theory and Practice, 11(4), 755–774. https://doi.org/10.1007/s41060-017-0056-5.
Sen, S., Maiti, S. S., & Chandra, N. (2016). The xgamma distribution: Statistical properties and application. Journal of Modern Applied Statistical Methods, 15(1), Article 38. https://doi.org/10.22237/jmasm/1461851580.
Shanker, R., & Fesshaye, H. (2015). On Poisson Lindley distribution and its applications to biological sciences. Biometrics & Biostatistics International Journal, 2(4), 103 107. https://doi.org/10.15406/bbij.2015.02.00036.
Simmachan, T., & Boonkrong, P. (2025). A comparison of count and zero-inflated regression models for predicting claim frequencies in Thai automobile insurance. Lobachevskii Journal of Mathematics, 45, 6400–6414. https://doi.org/10.1134/S1995080224607604.
Tin, A. (2008). Modeling zero-inflated count data with underdispersion and overdispersion [Conference paper]. Research Foundation for Mental Hygiene, New York, NY.
Tzougas, G., Hoon, W. L., & Lim, J. M. (2019). The negative binomial–inverse Gaussian regression model with an application to insurance ratemaking. LSE Research Online Documents on Economics, 101728. London School of Economics and Political Science, LSE Library. https://lse.ac.uk/lse-library.
Zamani, H., & Ismail, N. (2010). Negative Binomial-Lindley distribution for insurance claims. Journal of Mathematics and Statistics, 6(4), 309–314. https://thescipub.com/abstract/jmssp.2010.4.9.
Zhang, X., Mallick, H., Tang, Z., Zhang, L., Cui, X., Benson, A. K., … Yi, N. (2017). Negative binomial mixed models for analyzing microbiome count data. BMC Bioinformatics, 18, Article 4. https://doi.org/10.1186/s12859-016-1441-7.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Journal of Applied Statistics and Information Technology

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
เนื้อหาและข้อมูลที่ปรากฏในบทความที่ตีพิมพ์ในวารสารสถิติประยุกต์และเทคโนโลยีสารสนเทศถือเป็นความคิดเห็นส่วนบุคคลของผู้เขียนแต่ละท่าน ความผิดพลาดของข้อความและผลที่อาจเกิดจากนำข้อความเหล่านั้นไปใช้ผู้เขียนบทความจะเป็นผู้รับผิดชอบแต่เพียงผู้เดียว บทความ ข้อมูล เนื้อหา รูปภาพ ฯลฯ ที่ได้รับการตีพิมพ์ในวารสารถือเป็นลิขสิทธิ์ของวารสาร หากบุคคลหรือหน่วยงานใดต้องการนำทั้งหมดหรือส่วนหนึ่งส่วนใดไปเผยแพร่ต่อหรือเพื่อกระทำการใดๆ จะต้องได้รับอนุญาตเป็นลายลักอักษรณ์จากวารสาร ก่อนเท่านั้น