ON THE PROPERTIES AND APPLICATIONS OF TOPPLEONE GOMPERTZ INVERSE RAYLEIGH DISTRIBUTION
Sule Omeiza Bashiru1 & O.Y. Halid2
department of Mathematical Sciences, Prince Abubakar Audu University, Anyigba, Kogi State,
Nigeria
^Department of Statistics, Ekiti State University, Ado-Ekiti, Nigeria Email: [email protected], [email protected]
Abstract
In this study, we introduce a new four-parameter continuous probability distribution known as the Topp-Loene Gompertz Inverse Rayleigh (TLGoIRa) distribution. This novel model extends the Gompertz Inverse Rayleigh distribution. We present various mathematical properties of the distribution, including moments, moment generating functions, quantile functions, survival functions, hazard functions, reversed hazard functions, and odd functions. We also derive the distribution of order statistics, yielding both the maximum and minimum order statistics. This process of parameter estimation using the maximum likelihood estimation method is discussed. Furthermore, we present two real-life applications that illustrate the effectiveness and robustness of the TLGoIRa distribution when compared to several considered lifetime models. Our analysis reveals that the TLGoIRa distribution demonstrates superior robustness in comparison to the competing lifetime models. Additionally, the study highlights the distribution's efficacy in fitting biomedical datasets.
Keywords: Bladder cancer patients, goodness of fit, continuous probability, Gompertz Inverse Rayleigh, adequacy model.
I. Introduction
The Inverse Rayleigh distribution, originally introduced by [1], stands as a continuous probability distribution with significant utility in modeling the time until failure of a system. It emerges as a specialized variant within the broader framework of the inverse Weibull (IW) distribution, a potent tool for modeling lifetime data. The realm of statistical research has witnessed a fervent exploration of the inverse Rayleigh distribution, with scholars such as [2], [3], [4], [5], [6], [7], [8], and [9] delving into its intricacies, unraveling its nuances, and investigating its various generalizations and extensions. This study is inherently driven by a paramount objective: to fortify the existing model by incorporating additional shape parameters into the GoIRa distribution, as initially proposed by [9]. This augmentation is sought to yield a heightened degree of robustness, amplifying its aptness in fitting real-world datasets that abound within the realm of medical science. The proposed enhancement is envisaged to concomitantly bolster the goodness-of-fit of the model to such complex and diverse datasets, rendering it a more potent tool in the hands of researchers, clinicians, and decision-makers tasked with extracting meaningful insights and informed decisions from the intricate fabric of medical data. By infusing the GoIRa distribution with further shape parameters, the study seeks to attain a higher degree of flexibility, enabling the model to more accurately capture the multifaceted variability present in medical datasets. This endeavor is underscored by a firm statistical foundation, harnessing robust quantitative techniques to ensure the viability and efficacy of the proposed enhancement. The profound implications of such an augmented model resonate across medical research, aiding in predictive modeling, risk assessment, and optimal resource
Sule Omeiza Bashiru and O.Y. Halid RT&A, No 4 (76) Topp-Leone Gompertz Inverse Rayleigh Distribution_Volume 18, Deœmber 2023
allocation, ultimately advancing the frontiers of knowledge and facilitating improved healthcare
outcomes. The GoIRa distribution will be combined with the Toppleone family of distributions,
introducing additional skewness to the GoIRa distribution. This augmentation aims to enhance the
baseline distribution's capacity to accurately model datasets that demonstrate a significant degree of
skewness.
II. Methods
2.1 Topp-Leone Gompertz Inverse Rayleigh (TLGoIRa) distribution
In this section a new continuous probability distribution function (pdf) known as TLGoIRa distribution is derived. Also, some plots of its pdf and hazard rate function (hrf) were plotted in order to assess the shape of the new distribution.
The cdf and pdf of the family of distribution proposed by [10] are given as:
f(x;0,\) = [l-[ 1 -G(xt )]2
f {x;e,\) = 20g(x;Ç )[l)] 1 -[1 -G{x* )!
(1) (2)
where £ is the vector of parameters of the baseline distribution.
where G(x;Ç) is the cumulative distribution function (cdf) of the baseline distribution with vector of parameter £. for 0,e,Ç> 0,
where equations (1) and (2) are the cdf and pdf of the family of distributions proposed by [10]. The cdf and pdf of the GoIRa distribution are given a
G(x;a,X,p) = 1 -t
1 - e
-f
-iif
1- 1-e 1 -J
g(x-,a,X,P) = 2aX2 x 3 To obtain the cdf of the new model, equation (3) is inserted into equation (1) as
(3)
(4)
F[x;0,a,X,p) =
1 -
iÀî -f
1-
(5)
On differentiating equation (5), the pdf of TLGoIRa distribution is obtained which is given as
f {x\d,a,X,p) = 4adk
2x"3e 1 x
1 -e
-P-1 ia 1-
(6)
0-1
a
0
-1
Figure 1: Plots of cdf of TLGoIRa distribution for different parameter value
Figure 2: Plots of pdf of TLGoIRa distribution for different parameter values Where x > 0 ,A> 0 is the scale parameter and a,0, fi > 0 are the shape parameters respectively
2.1.1 Expansion of density
In this section the pdf in equation (6) is expanded using binomial expansion. Expanding the last term in equation (6), we have
■ frf -f
1- 1-e!-J
=z
2a
f
1 -
1 -e
1 -
r <2 -f
1- 1-e UJ
=n-i 1
-ß
1- 1-e -
/=0
/
2ja
/ -2 -f
1- 1-e UJ
=1
ß
k!
/ \2 -ß'
-i-1
1 - 1 -e ^-
_ _ -ß k+1 f k + 1 Ï r _
-fAl2 -fAl2
1 - 1 -Z A = I(-1)m m 1 -Z A1
m=0
- V
-ßm
1 -e
-i(-i r
-ß(l + m)-1
On substituting all the expansions into equation (6), we have
<X)
f (x) = 4aûÂ2 £
i ,j ,k ,m ,w=0
2a) i 2 ja
ZiLZ. i! k!
k+l)f-z(l + m)-1 m w
\ -M21
-3 Ixl
x e v 1
Now
f (x-,e,aXß)= X ^
i ,j ,k ,m ,w=0
where
2a) ( 2 ja
y = 4a0A2--
q i! k!
(q-1 V/„ . -iV
-(-1 )j
k +1 )(-r(l + m)-1
m w
\
(7)
(8)
Equation (7) is the expansion of equation (6) which will be used to derive some of the properties of the distribution.
Also, equation (5) is expanded as
[F^d.aXß)^
1 -
-ß
-i-1
i- 1-e W
k
w- 0
1 -
r ^2 -f
1- 1-e W
eh
=1 (-1 )'
/ \2 -f
1- 1-e W
/ \2 -f
1- 1-e 1 'J
=Z
t\
1 -
1 -e
r m2! -p' t ' t I r r*\2~\
1 - 1 -,- =z(-i)d d=0 d i-,-
l J - K J
pd
-J 1 -ßd r -T1
1 tJ =±(-1 y P ^ TJ
p=a
V y
Substituting all the expansions into equation (5), we have
h _ h [F{x-,e,aXP)] = XI
2za
» h \ ß
z,d,p=0 t=0
t!
z+d+p
f t Y-ßdYeh
d
Now
[F^-e.a^ß)]h = £ cc
f=0
where
C = Z
2za
ß ) / \z+d+p
( t Y-ßdY eh
z,d,p=0
t!
(9)
(10)
Equation (9) is the expansion of equation (5) which will be used to derive some of the properties of the distribution.
2.1.2 Properties of the TLGoIRa distribution
In this section, some of the mathematical and statistical properties of TLGoIRa distribution such as the quantile function, moments, moment generating function, reliability measure, odds function, reversed hazard function and order statistics are derived.
2z
/
p
P
z
2.1.2.1 Moments
) = lxf X)
dx
(11)
= I ^ K
i ,j ,k,m,w=0 o
dx
0
ArT 1 -
dx = -
(w +1)1-2
ArT
1 --
i,j,k,m,w=0 ( , m X1 9
IW + 1) 2
(12)
Equation (12) is the moments of TLGoIRa distribution. To obtain mean, we set r = 1 in equation (12).
2.1.2.2 Moment generating function (mgf)
= jeaf (x)dx
the series expansion for etx is given as
=1
AT
TO i-v TO
"«cO—ZOr Z r.
i--
v—0 i,j,k,m,w—0
[w +1) -2
(13)
(14)
(15)
2.1.2.3 Quantile function
Quantile function has a significant position in probability theory and it is the inverse of the cdf. The quantile function is obtained using
Q(u) = F\u) (16)
Using the inverse of equation (5), we have the quantile function of TLGoIRa distribution given as
x
=Q(u)=-
- log
1 -
1 -
I
a
log
1 -UB
The median is obtained by setting u = 0.5 in equation (17) given as
x
= Q(0.5) =
2
- log
1 -
1 -
P
log
1 - 0.5e
(17)
(18)
2.1.2.4 Hazard function
Hazard function is given as
f(x\a,p,A,d)
T(x-,a,p,A,d) = —----
R(x-,a,p,A,0)
The hazard function of the TLGoIRa distribution is given as
u
tx
e
v\
r-0
2
\aßl2x^e
1 -e
-ß-1 2l - I 1- 1-e
A*)-
1 -
1 - e
-ß]
1- 1-e 1 xJ
(20)
-1
e
Figure 3: Plots of hazard function of the TLGoIRa distribution for different parameter values
2.1.2.5 Survival function
The reliability function is also known as survival function, which is the probability of an item not failing prior to some time. It can be defined as R(x\a,p, X.ff) = 1 -F(x;a,p,X,ff)
(21)
The survival function of the TLGoIRa distribution is given as
R W =1 -
1 -
M2 -f
1-
(22)
e
Figure 4: Plots of survival function of the TLGoIRa distribution for different parameter values
2.1.2.6 Reversed hazard function Reversed hazard function of a random variable x is given as
%x-,a,P,X,0) = --——
F(x\a,P,X,d)
The reverse hazard rate function of the TLGoIRa distribution is given as
(23)
4a012x-3e 1"
1 -e
-P-1 2l
*(x) = .
1-
(24)
1-
2.1.2.7 Odds function
The odds function of the TLGoIRa distribution is given as
-10-1
6
tt
a
2.2 Order Statistics
Let Xx,X2,...,Xn be n independent random variable from the TLGoIRa distributions and let X(1) < X(2) < ... < X(n) be their corresponding order statistic. Let Fr.n(x) and f.n(x), r = 1,2,3,...n denote the cdf and pdf of the rh order statistics Xr7! respectively. The pdf of the rh order statistics of Xr7! is given as
(n- h
f (x) =
rn v J
f(x)
B(r,n- r +1)
n-r
ZK
F(x)v
(26)
The pdf of rth order statistic for distribution is obtained also by replacing h with v+r-1 in cdf expansion. We have
1
n-r œ v+r-1
f (x) = rc-1-y y y
D(1 ,n 1 + 1) v=0i,j,k,m,w=0 f=0
x
w+^+1
(27)
where
2a ^ ( 2ja
y - 4aA2 ^ZiLZ q i! k!
(e-1
fk +1 Y-ß(l + m)-1
m w
and
2za
( 1-\(
i. = (-1)
z,d,p=0 L •
\z+d+p
d
-ßd
?(v + r-1)
The pdf of minimum order statistic of the distribution is obtained by setting r=1 in equation (27).
n-1 & v
f jx=rqcnX X Ex-
v=0 i ,j ,k ,m ,w=0 t=0
w+p+1
(28)
Now ,
2za
T
( t\(-ßd\(dv^
c - (-1)
\z+d+p
z,d,p-0
d
Also, the pdf of maximum order statistic of the distribution is obtained by setting r = n in equation (27)
œ v+n-1
fnnycn ^^ ^^ X
i ,j ,k ,m ,w=0 t=0
w+p+1
(29)
where
2za
T
i (-1)
z,d,p=0 L •
\z+d+p
d
-ßd
?(v + n-1)
r=0
3
t
t
z
t
2.3 Estimation method
The method of maximum likelihood estimation (MLE) is used in this section to estimate the parameters of the TLGoIRa distribution. For a random sample, XltX2,...,Xn of size n from the
TLGoIRa(a,p,e,X), the log-likelihood function L (a,(,0, A) of (6) is given as
n n
logL = nlog (4) + nlog (0) + nlog (a) + 2nlog (l)-3£log (x, )-l2 X
-(ß + l)Xlog
1 -e
+-ßl
1-
1 -e
>- i)Xnlog
x
(30)
Differentiating the log-likelihood with respect to l,a,0,p and equating the result to zero, we have
^ = ^ + flog
de eh
1 -
= 0
d logl _ n 2 n
da aß
-ß"
1 - 1 -e (xJ
-2(0-l)t-
M2 -ß M2 -ß
i- iJxl (ß) i- iJxl
e -
1 -e
= 0
(31)
(32)
dlogl _ 2n
r ^2
n ^
dl 2
le
i-i v
evß'e v 'y e
x
flf -ß
1-
+2ß(8- 1)J-
1 -e
1-e
1-
1
+ 4«2^e vx
1-1
ß-1
1-e
- 0
(33)
a
1-1
i-1
d log/
dp
=-£ log
1 -e
p2 5
1 -e
-p
log
1 -e
log
n
+2(0-1)5:-
1 -e
= 0
(34)
1 -e
Now, equations (31), (32), (33) and (34) do not have a simple analytical form and are therefore not tractable. As a result, we have to resort to non-linear estimation of the parameters using iterative method.
III. Results
p
1=1
3.1 Applications
In this section, we present two applications of the TLGoIRa distribution using different datasets from the biomedical field. These applications are intended to demonstrate the flexibility of the distribution in modeling real-life datasets. The data are fitted to the TLGoIRa distribution, as well as four other comparator distributions: Gompertz Inverse Rayleigh (GoIRa) distribution, Generalized Gompertz (GGo) distribution, Exponentiated Exponential (EtEx) distribution, and Inverse Rayleigh (IRa) distribution. This fitting process is carried out to test the new distribution's flexibility against these comparators. We utilized the Adequacy Model package within the R software to perform the analysis and produce the results. To evaluate the performance of the distribution, we employed the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC). These criteria were used to compare the performance of the TLGoIRa distribution with other existing distributions that align with the baseline distribution in terms of providing a good parametric fit to the dataset. AIC = -211 + 2k (30)
BIC = -211 + k1og(n) (31)
The model selection is carried out using the AIC and the BIC. Where ll denotes the log-likelihood function evaluated at the maximum likelihood estimates, k is the number of parameters, and n is the sample size from the data. The model with minimum value of AIC and BIC is chosen as the best model to fit the data set.
Data set 1 has been used by [11] and represents the sum of skin folds in 202 athletes collected at the Australian Institute of Sports as.
28.0, 98, 89.0, 68.9, 69.9, 109.0, 52.3, 52.8, 46.7, 82.7, 42.3, 109.1, 96.8, 98.3, 103.6, 110.2, 98.1, 57.0, 43.1,
71.1, 29.7, 96.3, 102.8, 80.3, 122.1, 71.3, 200.8, 80.6, 65.3, 78.0, 65.9, 38.9, 56.5, 104.6, 74.9, 90.4, 54.6, 131.9, 68.3, 52.0, 40.8, 34.3, 44.8, 105.7, 126.4, 83.0, 106.9, 88.2, 33.8, 47.6, 42.7, 41.5, 34.6, 30.9, 100.7, 80.3, 91.0, 156.6, 95.4, 43.5, 61.9, 35.2, 50.9, 31.8, 44.0, 56.8, 75.2, 76.2,101.1, 47.5, 46.2, 38.2, 49.2, 49.6,
34.5, 37.5, 75.9, 87.2, 52.6, 126.4, 55.6, 73.9, 43.5, 61.8, 88.9, 31.0, 37.6,52.8, 97.9, 111.1, 114.0, 62.9, 36.8, 56.8, 46.5, 48.3, 32.6, 31.7, 47.8, 75.1, 110.7, 70.0, 52.5, 67, 41.6, 34.8, 61.8, 31.5, 36.6, 76.0, 65.1, 74.7, 77.0,
62.6, 41.1, 58.9, 60.2, 43.0, 32.6, 48, 61.2, 171.1, 113.5, 148.9, 49.9, 59.4, 44.5, 48.1, 61.1, 31.0, 41.9, 75.6, 76.8, 99.8, 80.1, 57.9, 48.4, 41.8, 44.5, 43.8, 33.7, 30.9, 43.3, 117.8, 80.3, 156.6, 109.6, 50.0, 33.7, 54.0, 54.2, 30.3, 52.8, 49.5, 90.2, 109.5, 115.9, 98.5, 54.6, 50.9, 44.7, 41.8, 38.0, 43.2, 70.0, 97.2, 123.6, 181.7, 136.3, 42.3, 40.5, 64.9, 34.1, 55.7, 113.5, 75.7, 99.9, 91.2, 71.6, 103.6, 46.1, 51.2, 43.8, 30.5, 37.5, 96.9, 57.7, 125.9, 49.0, 143.5, 102.8, 46.3, 54.4, 58.3, 34.0, 112.5, 49.3, 67.2, 56.5, 47.6, 60.4, 34.9.
Data set 2 has been used by [12] and it represents the remission times (in months) of a random sample
Sule Omeiza Bashiru and O.Y. Halid RT&A, No 4 (76) Topp-Leone Gompertz Inverse Rayleigh Distribution_Volume 18, December 2023
of one hundred and twenty-eight (128) bladder cancer patients.
0.08, 2.09, 3.48, 4.87, 6.94 , 8.66, 13.11, 23.63, 0.20, 2.23, 3.52, 4.98, 6.97, 9.02, 13.29, 0.40, 2.26, 3.57, 5.06, 7.09, 9.22, 13.80, 25.74, 0.50, 2.46 , 3.64, 5.09, 7.26, 9.47, 14.24, 25.82, 0.51, 2.54, 3.70, 5.17, 7.28, 9.74, 14.76, 26.31, 0.81, 2.62, 3.82, 5.32, 7.32, 10.06, 14.77, 32.15, 2.64, 3.88, 5.32, 7.39, 10.34, 14.83, 34.26, 0.90, 2.69, 4.18, 5.34, 7.59, 10.66, 15.96, 36.66, 1.05, 2.69, 4.23, 5.41, 7.62, 10.75, 16.62, 43.01, 1.19, 2.75, 4.26, 5.41, 7.63, 17.12, 46.12, 1.26, 2.83, 4.33, 5.49, 7.66, 11.25, 17.14, 79.05, 1.35, 2.87, 5.62, 7.87, 11.64, 17.36, 1.40, 3.02, 4.34, 5.71, 7.93, 11.79, 18.10, 1.46, 4.40, 5.85, 8.26, 11.98, 19.13, 1.76, 3.25, 4.50, 6.25, 8.37, 12.02, 2.02, 3.31, 4.51, 6.54, 8.53, 12.03, 20.28, 2.02, 3.36, 6.76, 12.07, 21.73, 2.07, 3.36, 6.93, 8.65, 12.63, 22.69.
Table 1: The models' MLEs and performance requirements based on data set 1
Models â 9 i ß II AIC BIC
TLGoIR 0.8992 0.2125 121.0713 1.2501 -949.7464 1907.4930 1920.7260
GoIR 0.0031 - 0.0000 0.8601 -987.5204 1981.0410 1990.9660
GGo -0.0052 15.4031 - 0.0597 -956.0865 1918.1730 1928.9200
EtEx 0.0406 8.5786 - - -958.0065 1920.0130 1926.6300
IR 52.6054 - - - -966.4625 1934.9250 1938.2330
Empirical arid theoretical dens. O-O plot
SO 100 ISO 200 0.0 0.2 0.4^ 0.6 0.8 1.0
Data Theoretical probabilities
Figure 5: Density plots for data set 1
Table 2: The models' MLEs and performance requirements based on data set 2
Models â 9 i ß ll AIC BIC
TLGoIR 0.0002 2.6653 0.0001 0.3345 -410.6935 829.3871 834.7952
GoIR 0.0839 - 0.0041 0.5129 -413.5753 833.1505 836.1377
GGo -0.0224 1.5034 - 0.1678 -413.1834 832.3668 835.3539
EtEx 0.1213 1.2180 - - -413.0776 830.1552 834.8592
IR 2.2612 - - - -774.3416 1550.6830 1553.5350
Figure 6: Density plots for data set 2
IV. Discussion
In this study, we have introduced and thoroughly examined a novel probability distribution, coined the Topp Leone Gompertz Inverse Rayleigh distribution. Drawing from the framework of distributions proposed by [10], we have both theoretically established its foundations and conducted practical investigations. A comprehensive exploration of the distribution's mathematical properties has been undertaken, encompassing crucial characteristics such as moments, moment generating function, quantile function, odd function, reverse hazard function, order statistics, and reliability analysis. These analyses have contributed to a robust comprehension of the distribution's behaviors and traits. The method of maximum likelihood estimation has been employed to effectively determine the distribution's parameters, lending statistical rigor to our subsequent analyses. In a bid to assess the distribution's practical utility, it has been subjected to rigorous testing using two distinct datasets from the realm of medical sciences. This assessment has entailed a comparative study against several competing distributions, including the GoIRa, Generalized Gompertz, Exponentiated Exponential, and Inverse Rayleigh distributions, all sharing common baseline distributions. The outcomes of our meticulous analyses indicate that the Topp-Leone Gompertz Inverse Rayleigh distribution stands out as the most adept candidate for fitting both datasets. The distribution's apparent superior performance among the alternatives highlights its potential to adeptly capture the underlying data-generation mechanisms. This research introduces not only a novel distribution but also offers practical insights into its capacity to effectively represent intricate real-world phenomena.
References
[1] Trayer, V. N. Doklady Acad, Nauk, Belorus, U.S.S.R, 1964.
[2] Gharraph, M.K. (1993). Comparison of Estimators of Location Measures of an Inverse Rayleigh Distribution. The Egyptian Statistical Journal, 37:295-309.
[3] Mukarjee, S.P. and Maitim, S.S. (1996) A Percentile Estimator of the Inverse Rayleigh Parameter, IAPQR Transactions, 21, 63-65.
[4] Voda, V. G. (1972). On the Inverse Rayleigh Random Variable, Pep. Statist. App. Res., JUSE, 19, 13-21.
[5] Mohsin, M. and Shahbaz, M.Q (2005). Comparison of Negative Moment Estimator with Maximum Likelihood Estimator of Inverse Rayleigh Distribution, PJSOR, 1(1): 45-48.
[6] Khan, M.S. (2014). Modified Inverse Rayleigh Distribution. International Journal of Computer Applications, 87(13) : 28-33.
[7] Khan, M. S. and King, R. (2015). Transmuted Modified Inverse Rayleigh Distribution, Austrian Journal of Statistics, 44(3): 17-29
[8] Malik, A. S. and Ahmad, S. P. (2018). A New Inverse Rayleigh Distribution: Properties and Application, International Journal of Scientific Research in Mathematical and Statistical Sciences, 5(5): 9296.
[9] Halid, O.Y. and Sule, O.B. (2022). A classical and Bayesian estimation techniques for Gompertz inverse Rayleigh distribution: properties and application, Pakistan Journal of Statistics, 38(1): 49-76
[10] Al-Shomrani, A., Arif, O., Shawky, A., Hanif, S. and Shahbaz, M. Q., (2016). Topp-Leone family of distributions: Some properties and application, Pakistan Journal of Statistics and Operation Research. XII, 3, 443-451.
[11] Hosseini, B., Afshari, M. and Alizadeh M. (2018). The generalized odd gamma-G family of distributions: properties and applications. Austrian Journal of Statistics, 47(2):69-89
[12] Shabeer, A.M., Krishnana, B. and Jayakumar, K. (2023). A Generalization of Lindley Distribution: Characterizations, Methods of Estimation and Applications. Reliability: Theory & Applications, 1(72): 575-588.