Abstract
It is a compound discrete probability distribution which has two parameters and whose particular case is Poisson-New Linear-Exponential Distribution (PNLED) [1]. The statistical characteristics required for this distribution such as probability mass function (pmf), statistical moments, estimation of parameters and goodness of fit have been derived and explained nicely. To test theoretical reliability of this distribution, goodness of fit has been applied to some over-dispersed data which were used by other researchers and further, we found that this distribution looks more appropriate for statistical modelling than PNLED, generalised Poisson-Lindley distribution (GPLD) [1, 2] and Poisson-Lindley distribution (PLD) [3].
1. Introduction
A generalisation of any probability distribution is considered more appropriate when it is more reliable than its particular case as well as previously obtained generalised distribution with similar conditions in all aspects.
The proposed distribution is a compound distribution. For compounding process, at least two distributions are required. We have needed Generalised Poisson distribution (GPD) [4] and New Linear-exponential distribution (NLED) [5] to conduct compounding process. GPD [4] distribution is a discrete distribution whereas NLED [5] is a continuous distribution. GPD [4] has two parameters and in mixing process it plays role of an original distribution and its original parameter follows NLED [5]. The pmf of GPD [4] has been given by:
where ; ; .
The probability density function (pdf) of NLED [5] was given by:
Hence, Generalised Poisson-New Linear-exponential distribution (GPNLED) has been obtained by compounding GPD [4] with NLED [5]. The study of this papers mentioned in the following references [1, 2], [4], [6] have very much contributed to increase quality of the paper.
In this paper, the following works have been done. Construction and derivation of:
1) Probability Mass Function (pmf) of GPNLED.
2) Statistical Moments of GPNLED.
3) Estimation of Parameters of GPNLED.
4) Goodness of Fit and applications of GPNLED.
We have been studied the following papers to improve quality of this paper which have been mentioned in the following references [2], [7-14].
2. Results
The results obtained for this paper have been placed under the following sub-headings.
2.1. Probability mass function of GPNLED
GPD has two parameters and . is an original parameter of GPD which follows NLED. is an additional parameter of GPD which is versatile in nature and hence, GPD. Here, and || < 1. NLED has a single parameter . The Probability mass of function of GPNLED is obtained by mixing GPD with NLED. This is how the pmf of GPNLED have been extracted:
Probability mass function of GPNLED can be obtained for each value such as can be obtained by using the Eq. (3) as follows:
2.2. Statistical moments of GPNLED
Let denotes th moment about origin of GPNLED:
Fig. 1Showing the mean of GPNLED
Central moments of GPNLED:
Fig. 2Showing the variance of GPNLED
Proof. Variance > Mean.
Or:
Or:
Or
Or:
Hence the statement.
The third moment about the mean of this distribution can be obtained as:
The third central moment () gives positive value, this distribution is positively skewed in shape.
The Eq. (18) is obtained to know about nature of distribution according to size and it is the fourth central moment. Co-efficient of skewness and kurtosis based on moments can be obtained as follows.
From Eq. (19), it has been found that range of is . Hence, it is positively skewed:
2.3. Estimation of parameters
This distribution has two parameters and which can be obtained as
By using and . We have:
Solving it, we get:
The Polynomial Eq. (21) can be solved by using Regula-Falshi method or Newton Rapson method, where denote .Substituting the estimated value of in the following Eq. (22), we get an estimated value of :
and : substituting the value of obtained from Eq. (22) in the expression of , we get Polynomial Eq. (23) in which can be solve by Newton-Rapson or Regla-Falsi method:
2.4. Goodness of fit and applications of GPNLED
This distribution can be used to test goodness of fit and to get possible inferential Statistics in the field related to accident proneness, risk management of production engineering, ecological sciences, agricultural fields related to events about insects, error per page, biological sciences, and other fields [15-17].
Applications of GPNLE model in the field of engineering. This distribution is applicable to fit better where Poisson, Generalised Poisson and Poisson mixtures of continuous distributions are applicable such as:
1) The count of -particles emitted per unit of time is useful in analysis of any radio-active substances.
2) Number of telephone calls received on a given switch board per small unit of time.
3) In industrial production to find the proportion of defects per unit length, per unit area, etc.
4) In the field of reliability engineering.
In the following examples Chi-square goodness of fit have been applied by using GPNLE model.
Table 1Example (1)
Z | 0 | 1 | 2 | 3 | |
Observed frequency | 35 | 11 | 8 | 4 | 2 |
Table 2Example (2)
Number of accidents | 0 | 1 | 2 | 3 | 4 | 5+ |
Observed frequency | 447 | 132 | 42 | 21 | 3 | 2 |
Table 3Example (3)
Z | 0 | 1 | 2 | 3 | 4 | 5 | 6 |
Observed frequency | 200 | 57 | 30 | 7 | 4 | 0 | 2 |
The first example is related to the number of errors per page which was given by Kemp and Kemp [18]. The second example is related to the data of accidents to 647 women working on H. E. Shells in 5 weeks was given by Greenwood and Yule [19]. The data related to Class per exposure (μg/kg) which is included in example (3) was given by Catcheside et al. [20].
The first and second examples have been used in the Doctoral Thesis. The theoretical frequencies obtained by using PLD, GPLD and GPNLED have been placed in Table 4 and 5 to make comparison. In Table 6, the theoretical frequencies obtained by using PLD, PNLED and GPNLED have been placed for comparison [1-3].
Table 4Observed versus expected frequency of example (1)
Number of errors per page | Observed frequency | Expected frequency | ||
PLD | GPLD | GPNLED | ||
0 | 35 | 33.0 | 35.0 | 35.0 |
1 | 11 | 15.3 | 13.3 | 13.3 |
2 | 8 | 6.8 | 6.3 | 6.4 |
3 | 4 | 2.9 | 3.0 | 2.9 |
4+ | 2 | 2.0 | 2.4 | 2.4 |
60.0 | 60.0 | 60.0 | 647.0 | |
0.7833333 | – | – | – | |
1.85 | – | – | – | |
– | 1.7434 | 1.9387733 | 1.642188744 | |
– | – | 0.1174867 | 0.09640955 |
Table 5Observed versus expected frequency of example (2)
Number of accidents | Observed frequency | Expected frequency | ||
PLD | GPLD | GPNLED | ||
0 | 447 | 441 | 447.0 | 447.0 |
1 | 132 | 143 | 1132.7 | 132.9 |
2 | 42 | 45 | 44.8 | 45.0 |
3 | 21 | 14 | 15.1 | 15.0 |
4 | 3 | 4 | 5.0 | 5.0 |
5+ | 2 | 1 | 2.4 | 2.1 |
647 | 647 | 647.0 | 647.0 | |
0.4652241 | – | – | – | |
0.9100646 | – | – | – | |
– | – | 2.8563455 | 2.496217 | |
– | – | 0.0523222 | 0.0415098 |
Table 6Observed versus expected frequency of example (3)
Class per exposure (μg/kg) | Observed frequency | Expected frequency | ||
PLD | PNLED | GPNLED | ||
0 | 200 | 191.8 | 192.8 | 200.0 |
1 | 57 | 70.3 | 69.2 | 60.5 |
2 | 30 | 24.9 | 24.6 | 23.7 |
3 | 7 | 8.6 | 8.7 | 9.5 |
4 | 4 | 2.9 | 3.1 | 3.8 |
5 | 0 | 1.0 | 1.1 | 1.5 |
6 | 2 | 0.5 | 0.5 | 1.0 |
300 | 300.0 | 300.0 | 300.0 | |
0.55333333 | – | – | – | |
1.2533333 | – | – | – | |
– | 2.353339 | 2.0501155 | 2.256957115 | |
– | – | – | 0.1002898421 |
3. Conclusions
1) This distribution will be over-dispersed if .
2) The range of is . So, it is positively skewed in shape.
3) The range of is . So, it is leptokurtic by size.
4) From the Tables 4 and 5, it is found that P-value obtained by using GPNLED is greater than those obtained by using GPLD [2] and PLD [3]
5) From Table 6, what we have observed that the P-value obtained by using GPNLED is bigger than those of PNLED [1] and PLD [3]
6) Hence, it is suggested to apply GPNLED instead of PLD, GPLD and PNLED in similar situation in all aspects.
References
-
B. K. Sah and S. K. Sahani, “Poisson-new quadratic-exponential distribution,” Mikailalsys Journal of Mathematics and Statistics, Vol. 2, No. 2, pp. 27–45, Apr. 2024, https://doi.org/10.58578/mjms.v2i2.2862
-
B. K. Sah, “Generalisations of some countable and continuous mixtures of Poisson distribution and their applications,” Patna University, Patna, India, 2012.
-
M. Sankaran, “The discrete Poisson-Lindley distribution,” Biometrics, Vol. 26, pp. 145–149, Mar. 1970, https://doi.org/10.2307/2529053
-
B. K. Sah, “New linear-exponential distribution,” Applied Science Periodical, Vol. 26, No. 2, pp. 1–16, May 2022.
-
P. C. Consul and G. C. Jain, “A generalization of the Poisson distribution,” Technometrics, Vol. 15, No. 4, pp. 791–799, Nov. 1973, https://doi.org/10.1080/00401706.1973.10489112
-
M. Ahsanullah, “Two characterizations of the generalized Poisson distribution,” Pakistan Journal of Statistics, Vol. 7, pp. 15–19, 1971.
-
E. L. Lehmann and J. P. Romano, Testing of Statistical Hypothesis. New York: Springer, 2005.
-
A. Mishra, “Generalization of some discrete distributions,” Patna University, Patna, India, 1979.
-
B. K. Sah, “A generalised Neyman type – a distribution,” International Journal for Research in Applied Science and Engineering Technology, Vol. 9, No. 1, pp. 465–470, Jan. 2021, https://doi.org/10.22214/ijraset.2021.32839
-
B. K. Sah, “A two-parameter quasi-Lindley mixture of generalised Poisson distribution,” International Journal of Mathematics and Statistics Invention, Vol. 3, No. 7, 2015.
-
B. K. Sah and S. K. Sahani, “Polynomial-exponential mixture of Poisson distribution,” Turkish Journal of Computer and Mathematics Education (TURCOMAT), Vol. 14, No. 3, Sep. 2023, https://doi.org/10.48047/tukomat/14.03.31
-
M. M. Shoukri, “Estimation problems for some generalized discrete distributions,” Ph.D. Thesis, University of Calgary, Calgary, Canada, 1980.
-
M. J. Bollen, Understanding Power Quality Problems Voltages sags and Interruptions. New Jersey: IEEE Press, 2000.
-
B. K. Sah and A. Mishra, “On a generalised exponential-Lindley mixture of generalised Poisson distribution,” Nepalese Journal of Statistics, Vol. 4, pp. 33–42, Dec. 2020, https://doi.org/10.3126/njs.v4i0.33449
-
I. R. Miller et al., Probability and Statistics for Engineers. New Jersey: Prentice Hall, 1990.
-
M. Sulaimon, “Application of linear regression model for Poisson distribution in forecasting,” Imperial Journal of Interdisciplinary Research, Vol. 1, No. 5, pp. 17–24, 2015.
-
D. R. Yates and D. J. Goodman, “Probability and stochastic processes: a friendly introduction for electrical and computer engineers,” Computers and Mathematics with Applications, Vol. 37, No. 6, p. 136, Mar. 1999, https://doi.org/10.1016/s0898-1221(99)90299-7
-
C. D. Kemp and A. W. Kemp, “Some properties of the ’Hermite’ distribution,” Biometrika, Vol. 52, No. 3-4, pp. 381–394, Dec. 1965, https://doi.org/10.1093/biomet/52.3-4.381
-
M. Greenwood and G. U. Yule, “An inquiry into the nature of frequency distributions representative of multiple happenings with particular reference to the occurrence of multiple attacks of disease or of repeated accidents,” Journal of the Royal Statistical Society, Vol. 83, No. 2, p. 255, Mar. 1920, https://doi.org/10.2307/2341080
-
D. G. Catcheside, D. E. Lea, and J. M. Thoday, “The production of chromosome structural changes inTradescantia microspores in relation to dosage, intensity and temperature,” Journal of Genetics, Vol. 47, No. 2, pp. 137–149, Jan. 1946, https://doi.org/10.1007/bf02986783
About this article
The authors have not disclosed any funding.
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
Binod Kumar Sah: conceptualization, data curation, formal analysis, investigation, writing – original draft preparation. Suresh Kumar Sahani: conceptualization, data curation, formal analysis, investigation, supervision, writing – original draft preparation, writing – review and editing.
The authors declare that they have no conflict of interest.