Generalized Truncated Fréchet Generated Family Distributions and Their Applications

Ramadan ZeinEldin; Christophe Chesneau; Farrukh Jamal; Mohammed Elgarhy; Abdullah Almarashi; Sanaa Al-Marzouki

doi:10.32604/cmes.2021.012169

Ramadan A. ZeinEldin1,2, Christophe Chesneau3,*, Farrukh Jamal4, Mohammed Elgarhy5, Abdullah M. Almarashi6 and Sanaa Al-Marzouki6

1Deanship of Scientific Research, King AbdulAziz University, Jeddah, 21589, Saudi Arabia
2Faculty of Graduate Studies for Statistical Research, Cairo University, Al Orman, Giza Governorate, 12613, Egypt
3Université de Caen, LMNO, Campus II, Science 3, Caen, 14032, France
4Department of Statistics, The Islamia University of Bahawalpur, Punjab, 63100, Pakistan
5The Higher Institute of Commercial Sciences, Al Mahalla Al Kubra, Algarbia, 31951, Egypt
6Statistics Department, Faculty of Science, King AbdulAziz University, Jeddah, 21551, Saudi Arabia
*Corresponding Author: Christophe Chesneau. Email: christophe.chesneau@unicaen.fr
Received: 16 June 2020; Accepted: 16 October 2020

Abstract: Understanding a phenomenon from observed data requires contextual and efficient statistical models. Such models are based on probability distributions having sufficiently flexible statistical properties to adapt to a maximum of situations. Modern examples include the distributions of the truncated Fréchet generated family. In this paper, we go even further by introducing a more general family, based on a truncated version of the generalized Fréchet distribution. This generalization involves a new shape parameter modulating to the extreme some central and dispersion parameters, as well as the skewness and weight of the tails. We also investigate the main functions of the new family, stress-strength parameter, diverse functional series expansions, incomplete moments, various entropy measures, theoretical and practical parameters estimation, bivariate extensions through the use of copulas, and the estimation of the model parameters. By considering a special member of the family having the Weibull distribution as the parent, we fit two data sets of interest, one about waiting times and the other about precipitation. Solid statistical criteria attest that the proposed model is superior over other extended Weibull models, including the one derived to the former truncated Fréchet generated family.

Keywords: Truncated distribution; general family of distributions; incomplete moments; entropy; copula; data analysis

Determining the underlying distribution of data is a crucial topic in many applied fields, such as medicine, reliability, finance, economics, engineering and environmental sciences. Among the possible approaches, one can define general families of continuous distributions from well-established parental distributions, having enough interesting properties to offer statistical models that adapt to all possible situations. The constructions of such families are based on specific mathematical techniques which may depend on one or several tunable parameters. For an overview on classic families of distributions and the associated techniques, we refer the reader to the surveys of [1–3].

In recent studies, the composition-truncation technique by [4] has been used to develop families of distributions achieving the goals of simplicity and efficiency. Among them, there are the truncated exponential-G family by [5], truncated Fréchet-G family by [6], truncated inverted Kumaraswamy-G family by [7], truncated Weibull-G family by [8], truncated Cauchy power-G family by [9], truncated Burr-G family by [10], type II truncated Fréchet-G family by [11], truncated log-logistic-G family by [12], right truncated T-X family by [13] and truncated Lomax-G family by [14]. The functions defining these families have the advantages of being simple, with a reasonable number of parameters, and having original monotonic and non-monotonic forms, which makes them attractive for statistical applications.

Especially, the truncated Fréchet-G family innovates in the following aspects: (i) Its functions are quite manageable, with a corresponding cumulative distribution function (CDF) having a simple exponential expression, (ii) It has a reasonable number of parameters: two plus those of the parental distribution, and (iii) Provides distributions with original monotonic and non-monotonic shapes, as shown in [6] with the gamma distribution as the parent. The combination of these qualities makes this family unique compared to others, and also attractive for statistical purposes. However, the price of the simplicity is that the nice flexibility of these distributions depends strongly on the choice of the parental distribution. And, to our knowledge, only the special distribution based on the gamma distribution has been explored in detail.

In this paper, we take one more step in this direction, by proposing a generalization of the truncated Fréchet-G family. It is also based on the composition-truncation technique, but uses a generalized version of the truncated Fréchet distribution called generalized Fréchet (GFr) distribution. First, the GFr distribution is defined by the following CDF:

where

, (and

otherwise). This distribution is also known under the names of exponentiated Fréchet distribution and exponentiated Gumbel type-2 distribution pioneered by [15,16]. As an alpha property, the GFr distribution is connected with the famous exponentiated exponential (EE) distribution introduced by [17] in the following sense: if X denotes a random variable (RV) following the GFr distribution with parameters images

and

, then

follows the EE distribution with parameters images

and

. The GFr distribution contains the former Fréchet distribution, obtained by taking images

. Also, it is proved in [15,16] that the parameter images

makes the GFr model really more pliant than the former Fréchet model. This has motivated the study of some of its extensions, as the successful one proposed in [18]. Here, we exploit the features of the GFr distribution to define a new general family of distributions. Following the spirit of [4], we first derive the truncated generalized Fréchet distribution over the interval images

, specified by the following CDF:

We complete this definition by assuming that images

for

and

for

. As far as we know, this truncated distribution is unlisted in the literature, and can be of independent interest. Here, we use it to define the truncated generalized Fréchet generated (TGFr-G) family of (continuous) distributions by considering the CDF obtained as

where

denotes the CDF of a parent (continuous) distribution and images

. Note that we have put images

in the definition of (2) to avoid the over-parameterization phenomenon; if necessary, one may re-introduce it easily by replacing images

, where

is a continuous CDF. One can observe that the TGFr-G and truncated Fréchet-G families coincide by taking images

. The main innovation of the TGFr-G family remains in its definition involving the shape parameter images

which opens new modelling perspectives, in the same spirit as the GFr distribution extends those of the classic Fréchet distribution. In this study, we formalize this claim by pointing out the desirable mathematical properties and applicability of the TGFr-G family. In particular, we investigate the precise role of images

in the features of the main functions, stress-strength parameter, incomplete moments and various entropy measures. The parameters estimation and bivariate extensions are also discussed, as well as a complete estimation work on the parameters. The applicable aspect of the new family is mainly highlighted by a special three-parameter distribution, defined with the Weibull distribution as the parent. It is called the truncated generalized Fréchet Weibull (TGFrW) distribution. For the related model, the maximum likelihood estimates of the parameters are derived and a simulation study is also made to check their accuracy. Then, two data sets are considered to evaluate how good the fit of the proposed model is. Diverse criteria are used in this regard, pointing out that the fit of the TGFrW model is better to those of comparable Weibull type models, with possible more parameters. In particular, the proposed model surpasses the analogous truncated Fréchet model, attesting to the importance of the findings.

The following organization is adopted. The TGFr-G family is defined in Section 2. Diverse properties are discussed in Section 3, including the analytical study of the main functions, stress-strength parameter, series expansions, incomplete moments with derivations, various entropy measures, theoretical and practical parameters estimation and various bivariate extensions of the proposed family through the use of copulas. Section 4 is devoted to the TGFrW distribution, with an emphasis on its applicability in simulated and concrete statistical settings. Section 5 contains some concluding notes.

The basics of the TGFr-G family are proposed in this section, exhibiting its main functions of interest, as well as a short list of special distributions.

First of all, we recall that the CDF given as (3) defines the TGFr-G family. Hereafter, a RV X having the CDF given as (3) is denoted by images

. By taking images

, it corresponds to the special case of the truncated Fréchet-G family by [6].

The analytical properties of these functions are very informative on the data fitting possibilities of the associated models. This aspect will be the subject of further discussions. Also, the quantile function (QF), obtained by inverting the CDF in (3), is given as

where

denotes the QF of the parental distribution. The fact that images

has a closed-form expression is a plus for the TGFr-G family. In particular, we can simply determine the median as images

, derive several functions related to this QF and generate random values through the inverse transform sampling method.

In order to illustrate the heterogeneity of the TGFr-G family, Tab. 1 lists several of its members based on standard parental distributions, with various supports and numbers of parameters.

In our applications, a focus will be put on the TGFrW distribution defined with images

. This choice is motivated by upstream numerical and graphical investigations.

In this section, we develop some notable properties of the TGFr-G family, and discuss some new motivations.

Here, some analytical results on the functions of the TGFr-G family are studied. Firstly, we investigate the equivalences of images

and

. Mathematical facts force us to distinguish the cases: images

and

. It is assumed that images

for these four last cases, but images

and

are not excluded.

Let us mention that images

is equivalent to say that x tends to the lower limit of the adherence of the set images

, and

is equivalent to say that x tends to the upper limit of the adherence of the set images

. The obtained equivalences for images

and

are described in Tab. 2.

From Tab. 2, the following remarks hold. When images

, we see that images

has a significant impact on the limit of images

. In particular, the term images

can dominate images

and thus

with an exponential decay. When images

, for the limit of images

, both

and

influence the proportionality constant, but the limit comportment of images

remains determinant. When images

with

and fix

, we have

. When

, the limiting function of images

is obtained as

and one can remark that images

is a valid CDF. As far as we know, it is unlisted in the literature, offering a new and original “logarithmic-exponential-G family”. This finding also reveals the richness of the proposed TGFr-G family.

From Tab. 3, when images

, we see that the limit of images

truly depends on images

, which is not the case when images

, where the limiting function correspond to the HRF of the parental distribution. In the case where images

is excluded and images

, we have

showing the importance of the parameter images

in this regard. Note that, when both images

and

, with a fix images

, we have

. Also, when images

is excluded, with fix images

and

, and

, we have

. The obtained limit when images

is a complex function with respect to x, and, when images

is excluded, with fix images

and

, and

, we have

The number and definition(s) of the mode(s) depend on the parental distribution, images

and

. However, even though all of these quantities are known, the complexity of the above equations constitutes an obstacle to get an analytical expression of the mode(s). Thus, mathematical software seems necessary for any numerical appreciation.

The stress-strength parameter provides one of the most important measurements in reliability analysis. From two independent RVs X and Y, the stress-strength parameter is defined by R = P(Y < X). As a common application, it is a measure of performance of a system; it evaluates the probability that a random strength modeled by X exceeds an independent random stress modeled by Y. For the theory and applications on this probabilistic object, we may refer the reader to [19,20].

The following result shows that, under a certain scenario on the parameters, a stress-strength parameter associated to the TGFr-G family has a tractable analytical expression.

Proposition 3.1. Let images

, with X1 and X2 independent, and R = P(X2 < X1). Then, we have

From Proposition 3.1, we can note that R is finally independent of the chosen parental distribution. Also, when images

, X1 and X2 are identically distributed and R takes the value 1/2 as expected in this simple case. The manageable expression of R is useful for estimation purposes; with the plug-in approach, images

and

can be substituted by adequate estimates to derive an estimate for R. Further developments in this regard are however out the scope of this study.

The following proposition proves that the “possibly complex” exponentiated PDF images

can be simply expressed as a series depending on parental exponentiated functions. Such expansion is useful for diverse algebraic manipulations of images

involving differentiation or integration, as discussed in full generality in [21].

A1: In terms of images

and exponentiated survival functions of the parental distribution, i.e., images

, we have

To obtain A1: One can express images

in terms of exponentiated images

via the generalized binomial theorem as

To obtain A2: One can express images

in terms of exponentiated images

via the generalized and standard binomial theorems as

The incomplete moments of images

are useful to derive crucial measures and functions of the TGFr-G family, with a high potential of applicability. Mathematically, the rth incomplete moment of images

at any

can be expressed as

For some special parental distributions, the calculus of this integral by usual integration techniques is not excluded. However, for further analytical manipulations or evaluation, a series expression is sometimes preferable. In this regard, several possibilities are presented below, depending on the level of complexity in the definition of images

B1: From (6), by applying the change of variable images

, i.e.,

, and the generalized binomial expansion, assuming that the integral and sum signs are interchangeable, we get

If the QF of the parental distribution is not too complex, the integral term can be made explicit.

B2: For more universal series developments, Proposition 3.2 applied with images

gives series expansions of images

that can be injected into (6). For instance, by considering the expression A1, assuming that the integral and sum signs are interchangeable, we get

For a wide panel of parental distributions, the integrals images

and

are available in the literature or easily calculable. Also, for practical aims, one can truncate the infinite sums by any large integer to have suitable approximation functions for images

. Further detail on the interest of such series expansions in the treatment of various probabilistic measures can be found in [21].

As example of applications, from the incomplete moments of images

, we can derive the rth raw moments of X defined by images

, the rth central moment of X specified by the following relation: images

, the variance of X given as images

, the general coefficient of X defined by images

allowing to define the skewness coefficient corresponding to S = C3 and the kurtosis coefficient obtained as K = C4, among others.

Also, from the mean incomplete moment images

, that is

taken with r = 1, one can express the mean deviation of X about images

, the mean deviation about M as images

, the mean residual life as images

, the mean waiting time as images

, the Bonferroni curve as images

, and the Lorenz curve as L(u) = uB(u), images

The entropy is a fundamental concept in information theory, with applications in statistical inference, neurobiology, linguistics, cryptography, quantum computer science and bioinformatics. In the literature, there exists several entropy measures to determine the randomness of a distribution. Most of them are discussed in the survey of [22]. By considering a generic (continuous) distribution with PDF denoted by f(x), some of them are presented in Tab. 4. In this table, it is supposed that images

and

From Tab. 4, we see that the main term in the definitions of the entropy measures is the following integral term: images

. We now investigate it in the context of the TGFr-G family. So, we set

For some special parental distributions, we can inspect the calculus of this integral by standard techniques. A more universal approach consists in expressing it as a tractable series expansion. Hence, once can apply Proposition 3.2 with the choice images

to obtain series expansions of images

and use it into (8). Thus, assuming that the integral and sum signs are interchangeable, from A1, we get

For most of the standard parental distributions, the integrals images

and

can be determined with mathematical efforts. Thus, one can deduce expansions of all the entropy measures presented in Tab. 4. In particular, the Tsallis entropy of the TGFr-G family can be expanded as

One can deduce a precise approximation of it by truncating the infinite sum by any large integer.

The main objective of the TGFr-G family is to provide pliant semi-parametric models for statistical applications. To reach this aim, the estimation of the model parameters is a crucial step, and several methods of estimation are possible. Here, we provide the essential theory on the maximum likelihood (ML) method of estimation in the context of the TGFr-G family. The generalities can be found in [28].

First of all, let images

be n independent and identically distributed RVs from images

and

. Then, assuming that they are unique, the ML estimators of the parameters images

and

, say

and

, respectively, are the RVs obtained as

Assuming that images

is differentiable with respect to images

, the ML estimators are the solutions of the following equations: images

and

, where

. In most of the cases, there are no analytical expressions for these estimators, but practical solutions exist and will be discussed later. Then, under some regularity conditions, the ML estimators satisfy remarkable convergence properties, including the asymptotically normal property presented below. Let m be the number of components in images

(which can be numerous since images

is itself a vector of components) and images

be the uth component of images

. Then, the asymptotic distribution of images

is the multivariate normal distribution images

, where

denotes the images

covariance matrix defined by images

In a concrete statistical scenario, we deal with data corresponding to observations of images

. Let us denoted them by images

. Then, the ML vector of estimates of images

, say

, is defined by the corresponding observation of images

. Thanks to the argmax definition, it can be obtained numerically by optimization via the use of any Newton-Raphson type algorithm. With the R software, this numerical work can be done via the functions of the package AdequacyModel.

For the practice of the asymptotic normality, the covariance matrix images

is often difficult to determine analytically and depends on the unknown parameters. A standard approach consists in using the following approximation: images

, where

. Thus, the asymptotic distribution of images

can be considered as the multivariate normal distribution images

, where

. This result is useful to construct asymptotic two-sided confidence intervals (CIs) of the parameters. More precisely, for any images

and

, the

CI of

is obtained as

where

and

are the lower and upper bounds of the interval, defined by images

and

, respectively, where du is the uth component in the diagonal of I−1 and images

is the quantile of the normal distribution images

taken at

. As the main interpretation, there is images

of chances that images

belongs to CI, which is of interest by taking images

small enough. The typical values for images

are 0.01, 0.05 or 0.1.

Finally, by the invariance property of the ML estimates, we can deduce ML estimates of several measures of the TGFr-G family. For instance, we can inspect the estimation of the Tsallis entropy of the TGFr-G family as defined in (9); the ML estimate of images

is naturally obtained as images

The ML estimates, CIs and estimate of the Tsallis entropy will be the object of a numerical study later, by the consideration of a special distribution of the TGFr-G family.

Bivariate families of distributions are of interest to model distributions behind two dimensional phenomena or measures, observed via bivariate data. This remains an actual demand in regression or clustering analysis, among others. The univariate TGFr-G family can be extended to the bivariate case via several approaches. The most natural one is to use a bivariate parental distribution characterized by a bivariate CDF, say images

, where

is the vector of parameters. Thus, based on (3), we can define the 2TGFr-G family by the following bivariate CDF:

where

. Then, it is clear that, if images

, then

and

. However, the structure of dependence between X and Y remains unmanageable. A more technical approach but with a clear dependence structure consists in employing special functions called copulas.

• By using the Farlie-Gumbel-Morgenstern copula, a bivariate extension of the TGFr-G family, called FGMTGFr-G family, is defined by the bivariate CDF given as

where

and

are defined as (3) with possibly different parental CDFs, say images

and

, respectively. Note that the independence copula corresponds to the case images

• By using the Clayton copula, a bivariate extension of the TGFr-G family, called CTGFr-G family, is defined by the bivariate CDF specified by

Other interesting bivariate extensions can be derived from other notorious copulas. A complete list of them, with more theoretical elements, can be found in [29].

The TGFr-G family contains a plethora of potential interesting distributions. Here, we emphasize with the truncated generalized Fréchet Weibull (TGFrW) distribution as presented in Tab. 1, discussing its numerous qualities.

Let us recall that the TGFrW distribution as described in Tab. 1 with images

corresponds the following configuration: images

, x > 0, ( images

otherwise), and images

, x > 0. Concretely, it is defined by the following CDF:

The pliancy of the curvatures of images

and

is illustrated in Figs. 1 and 2, respectively.

In Fig. 1, various degrees of skewness (asymmetry) and kurtosis are observed for images

, showing decreasing and bell shapes, as well various weights on the right tail mainly. In Fig. 2, we see that images

possesses reversed J, bathtub decreasing and increasing shapes, with possibly several critical points.

Hence, quartiles and random generations numbers from the TGFrW distribution can be easily investigated.

The general properties studied for the TGFr-G family in Section 2 can be applied to the TGFrW distribution. A selection of them are presented below. First of all, in order to complete the observations made on Figs. 1 and 2, let us investigate the equivalences and limits of images

and

. When

, we have

In particular, we note that images

plays the major role in these convergence, images

in all cases, and, when images

has the same comportment to the HRF of the parental distribution, i.e., images

when

, and

when

Also, by the Riemann integral criteria, the equivalence results for images

ensure that the raw moments of all orders of images

exist, for all the values of the parameters. In this setting, let us now discuss the rth incomplete moment of X, rth raw moment of X with related measures, and the Tsallis entropy.

As usual, the rth incomplete moment of X can be expressed as its principal integral form. Alternatively, owing to (7) and the equality: images

, where

denotes the lower incomplete gamma function, we have

We can manipulate this expansion to derive approximations of the measures and functions presented in Subsection 3.5. Also, by applying images

, we get the rth raw moment of X, i.e.,

where

. As numerical works, Tabs. 5 and 6 collected the numerical values of some measures of the TGFrW distribution derived to the raw moments.

Table 5: Values of some measures of the TGFrW distribution for several values of images

and at

Table 6: Values of some measures of the TGFrW distribution for several values of images

and at

and

Among others, Tabs. 5 and 6 show how the values of some moments measures of images

can vary according to the values of the parameters. Here, a great variation of the values on the mean and kurtosis are mainly observed.

As described in Subsection 3.6, the Tsallis entropy of the TGFrW distribution is initially defined by an integral expression. A tractable series expansion can be deduced from (9). Indeed, since images

provided that images

, we have

Table 7: Values of the Tsallis-entropy of the TGFrW distribution for several values of the parameters

Tab. 7 reveals that the amount of randomness of the TGFrW distribution measured by the Tsallis entropy is versatile. Indeed, it can take negative values, as well as small or large positive values. The rest of the study focuses on the statistical usefulness of the TGFrW model in a statistical framework.

The ML estimates of the parameters of the TGFrW model, the corresponding CIs and the estimate of the Tsallis entropy can be obtained via the approach described in Subsection 3.7. Here, we provide a numerical study on these statistical objects through the simple random sampling scheme. This scheme is based on the QF defined by (10). A performance study of the estimates is conducted relatively to the mean square errors (MSEs), (average) LBs and UBs of the corresponding 90% and 95% CIs, as well as the corresponding average lengths (ALs), i.e., images

. The software Mathematica 9 is used in this regard. The following steps are followed.

Step 1: A random sample of values of size n = 100, 200, 300, 1000 and 3000 is generated from the TGFrW distribution.

Step 2: We consider the following sets of parameters: set1: ( images

), set2: ( images

), set3: ( images

) and set4: ( images

Step 3: For each of the above sets and each sample of size n, the ML estimates are computed.

Step 4: We repeat the previous steps N times, dealing with different samples, where N = 5000. Then, the MSEs of the estimates are computed.

Table 8: Values of ML estimates and IC measures related to the TGFrW model for set1: ( images

)

Table 9: Values of ML estimates and IC measures related to the TGFrW model for set2: ( images

)

Table 10: Values of ML estimates and IC measures related to the TGFrW model for set3: ( images

)

Table 11: Values of ML estimates and IC measures related to the TGFrW model for set4: ( images

)

For all the considered sets of parameters, the values in Tabs. 8–11, indicate that the ML estimates stabilize to the right values as n increases. Also, the MSEs and ALs decrease and tend to 0 as n becomes large as expected.

Now, we check the numerical performance of the estimate of the Tsallis entropy of the TGFrW model as described in Subsection 3.7. In this regard, Tabs. 12–15 list the values of this estimate under the simulation scenario described above. We adopt the criteria of the relative bias (RB), defined as images

Table 12: Values of the Tsallis entropy estimates related to the TGFrW model for set 1: ( images

)

Table 13: Values of the Tsallis entropy estimates related to the TGFrW model for set 2: ( images

)

Table 14: Values of the Tsallis entropy estimates related to the TGFrW model for set 3: ( images

)

Table 15: Values of the Tsallis entropy estimates related to the TGFrW model for set 4: ( images

)

For all the considered sets of parameters, the values in Tabs. 8–11, indicate that the estimates of the Tsallis entropy stabilize to the exact values as n increases. Also, the RBs decrease and tend to 0 as n becomes large, which is a consistent observation with the expected theoretical convergence.

Here, we show that the TGFrW model is ideal to fit practical data of various kinds, with better results in comparison to solid extended Weibull models. More specifically, the two following data sets are considered.

The first data set, called datasetI, contains 100 observations on minutes waiting time before a client receives the desired service in a bank. It is: datasetI = {0.8, 0.8, 1.3, 1.5, 1.8, 1.9, 1.9, 2.1, 2.6, 2.7, 2.9, 3.1, 3.2, 3.3, 3.5, 3.6, 4, 4.1, 4.2, 4.2, 4.3, 4.3, 4.4, 4.4, 4.6, 4.7, 4.7, 4.8, 4.9, 4.9, 5.0, 5.3, 5.5, 5.7, 5.7, 6.1, 6.2, 6.2, 6.2, 6.3, 6.7, 6.9, 7.1, 7.1, 7.1, 7.1, 7.4, 7.6, 7.7, 8, 8.2, 8.6, 8.6, 8.6, 8.8, 8.8, 8.9, 8.9, 9.5, 9.6, 9.7, 9.8, 10.7, 10.9, 11.0, 11.0, 11.1, 11.2, 11.2, 11.5, 11.9, 12.4, 12.5, 2.9, 13.0, 13.1, 13.3, 13.6, 13.7, 13.9, 14.1, 15.4, 15.4, 17.3, 17.3, 18.1, 18.2, 18.4, 18.9, 19.0, 19.9, 20.6, 21.3, 21.4, 21.9, 23, 27, 31.6, 33.1, 38.5}. The reference for this data is [30].

The second data set, called datasetII, represents 30 successive values of precipitation (in inches), in one month, in Minneapolis. It is: datasetII = {0.77, 1.74, 0.81, 1.20, 1.95, 1.20, 0.47, 1.43, 3.37, 2.20, 3.00, 3.09, 1.51, 2.10, 0.52, 1.62, 1.31, 0.32, 0.59, 0.81, 2.81, 1.87, 1.18, 1.35, 4.75, 2.48, 0.96, 1.89, 0.90, 2.05}. The reference for this data is [31].

The following competitors are taken into account: truncated Fréchet-Weibull (TFrW) model proposed by [6], odd log-logistic Weibull (OLLW) model introduced by [32], beta Weibull (BW) model by [33], exponentiated Weibull (EW) model introduced by [34], and gamma-exponentiated exponential (GE) model studied by [35].

For all the models, the estimation of the parameters are performed via the ML method. We refer to Subsection 3.7 concerning the ML estimates of the TGFrW model. As standard criteria of comparison, the following measures are taken into account: images

, AIC, BIC, W, A, KS and p-value (KS), corresponding to the minus estimated log-likelihood function at the data, Akaike information criterion, Bayesian information criterion, Anderson-Darling statistic, Cramer–von Mises statistic, Kolmogorov–Smirnov statistic and the p-value of the Kolmogorov–Smirnov test, respectively. The corresponding mathematical formulas are described below.

where n is the number of observations, p is the number of parameters of the considered model, images

are the ordered observations, images

, where

denotes the estimated CDF of the model involving the ML estimates for the parameters and Fn(x) denotes the random empirical CDF. The details on these statistical measures can be found in [36,37].

It is admitted that the smaller the values of AIC, BIC, W, A and KS and the greater the values of p-value (KS), the better the model is to fit to the considered data. The software R is used for all the calculations.

For the considered models, the ML estimates with their related standard errors (SEs) are reported in Tabs. 16 and 17 for datasetI and datasetII, respectively.

In particular, for datasetI, the parameters images

and

of the TGFrW model are estimated by images

and

, respectively, and for datasetII, they are estimated by images

and

, respectively. We remark that the novel parameter images

is estimated far from 1, making a strong difference between the estimated TGFrW model and the former estimated TFrW model.

From Tabs. 18 and 19, it is clear that the TGFrW model is the best of all, with respect to the considered criteria. In particular, it has p-values (KS) closed to 1. As an important remark, the TGFrW model surpasses the former TFrW model, justifying the importance of the generalization.

Several kinds of fits of the TGFrW model are shown in Figs. 3 and 4 for datasetI and datasetII, respectively. Specifically, the estimated PDFs of the TGFrW distribution are plotted over the corresponding histograms and the estimated CDFs are plotted over the empirical CDFs. The empirical probabilities versus estimated probabilities (P-P) plots and the empirical quantiles versus estimated quantiles (Q-Q) plots are also shown. In all the cases, a near perfect fit is observed, validating the remarkable performance of the TGFrW model.

Figure 3: Various fits of the TGFrW model for datasetI: (a) Estimated PDF, (b) estimated CDF, (c) P-P plot and (d) Q-Q plot

Figure 4: Various fits of the TGFrW model for datasetII: (a) estimated PDF, (b) estimated CDF, (c) P-P plot and (d) Q-Q plot

We have motivated the use of the truncated generalized Fréchet distribution to define a new generalized family of continuous distributions, called the truncated generalized Fréchet generated (TGFr-G) family. Diverse mathematical and practical investigations show the full potential of the new family, supported by detailed graphical and numerical evidences. A focus is put on the truncated generalized Fréchet Weibull (TGFrW) distribution, with a complete statistical treatment of the related model. Comparative fitting are performed through the use of two practical data sets, with favorable results to the new model in comparison to other popular extended Weibull models. In particular, under a comparable setting, the new model surpasses the former truncated Fréchet model. As perspectives of future work, other special models of the TGFr-G family may be the subjects of further investigation, specially those with support on images

. Also, the bivariate extensions of the TGFr-G family can be explored more, with applications in the fields of regression and clustering, for instance. Also, applications in physics remain an interesting challenge, exploring the possible randomness of various networks [38] and various differential equations [39].

Acknowledgement: We thank the reviewers for their thorough comments and remarks which contributed to improve the quality of the paper. This work was funded by the Deanship of Scientific Research (DSR), King AbdulAziz University, Jeddah, under Grant No. (G:531-305-1441). The authors gratefully acknowledge the DSR technical and financial support. The authors, therefore, acknowledge with thanks to DSR technical and financial support.

Funding Statement: This work was funded by the Deanship of Scientific Research (DSR), King AbdulAziz University, Jeddah, under Grant No. G:531-305-1441. The authors gratefully acknowledge the DSR technical and financial support.

References

1. Tahir, M. H., Cordeiro, G. M. (2016). Compounding of distributions: A survey and new generalized classes. Journal of Statistical Distributions and Applications, 3(1), 37. DOI 10.1186/s40488-016-0052-1. [Google Scholar] [CrossRef]

2. Brito, C., Rêgo, L., Oliveira, W., Gomes-Silva, F. (2019). Method for generating distributions and classes of probability distributions: The univariate case. Hacettepe Journal of Mathematics and Statistics, 48(3), 897–930. [Google Scholar]

3. Ahmad, Z., Hamedani, G. G., Butt, N. S. (2019). Recent developments in distribution theory: A brief survey and some new generalized classes of distributions. Pakistan Journal of Statistics and Operation Research, 15(1), 87–110. DOI 10.18187/pjsor.v15i1.2803. [Google Scholar] [CrossRef]

4. Mahdavi, A., Silva, G. (2016). A method to expand family of continuous distributions based on truncated. Journal of Statistical Research Iran, 13, 231–247. [Google Scholar]

5. Barreto-Souza, W., Simas, A. B. (2013). The exp-G family of probability distributions. Brazilian Journal of Probability and Statistics, 27(1), 84–109. DOI 10.1214/11-BJPS157. [Google Scholar] [CrossRef]

6. Abid, A., Abdulrazak, R. (2017). [0,1] truncated Fréchet-G generator of distributions. Applied Mathematics, 7, 51–66. [Google Scholar]

7. Bantan, R., Jamal, F., Chesneau, C., Elgarhy, M. (2019). Truncated inverted Kumaraswamy generated family of distributions with applications. Entropy, 21(1089), 1–22. [Google Scholar]

8. Najarzadegan, H., Alamatsaz, M. H., Hayati, S. (2017). Truncated Weibull-G more flexible and more reliable than beta-G distribution. International Journal of Statistics and Probability, 6(5), 1–17. DOI 10.5539/ijsp.v6n5p1. [Google Scholar] [CrossRef]

9. Aldahlan, M., Jamal, F., Chesneau, C., Elgarhy, M., Elbatal, I. (2019). The truncated Cauchy power family of distributions with inference and applications. Entropy, 22(346), 1–24. DOI 10.3390/e22010001. [Google Scholar] [CrossRef]

10. Jamal, F., Bakouch, H., Nasir, M. (2020). A truncated general-G class of distributions with application to truncated burr-g family. (in press). [Google Scholar]

11. Aldahlan, M. (2019). Type II truncated Fréchet generated family of distributions. International Journal of Applied Mathematics, 7, 221–228. [Google Scholar]

12. Akbarinasab, M., Arabpour, A., Mahdavi, A. (2019). Truncated log-logistic family of distributions. Journal of Biostatistics and Epidemiology, 5(2), 137–147. [Google Scholar]

13. Alzaatreh, A., Aljarrah, M. A., Smithson, M., Shahbaz, S. H. Shahbaz, M. Q. et al. (2020). Truncated family of distributions with applications to time and cost to start a business. Methodology and Computing in Applied Probability, 42(6), 547. DOI 10.1007/s11009-020-09801-1. [Google Scholar] [CrossRef]

14. Hassan, A., Sabry, M., Elsehetry, A. (2020). A new family of upper-truncated distributions: Properties and estimation. Thailand Statistician, 18(2), 196–214. [Google Scholar]

15. Nadarajah, S., Kotz, S. (2003). The exponentiated Fréchet distribution. Interstat Electronic Journal, 1–7. [Google Scholar]

16. Okorie, I. E., Akpanta, A. C., Ohakwe, J. (2016). The exponentiated Gumbel type-2 distribution: Properties and application. International Journal of Mathematics and Mathematical Sciences, 2016(2), 1–10. DOI 10.1155/2016/5898356. [Google Scholar] [CrossRef]

17. Gupta, R., Kundu, D. (2001). Exponentiated exponential family: An alternative to Gamma and Weibull distributions. Biometrical Journal, 43, 117–130. [Google Scholar]

18. Mansour, M., Aryal, G., Afify, A., Ahmad, M. (2018). The Kumaraswamy exponentiated Fréchet distribution. Pakistan Journal of Statistics, 34(3), 177–193. [Google Scholar]

19. Surles, J. G., Padgett, W. J. (2001). Inference for reliability and stress-strength for a scaled Burr-type X distribution. Lifetime Data Analysis, 7(2), 187–200. DOI 10.1023/A:1011352923990. [Google Scholar] [CrossRef]

20. Kotz, S., Lumelskii, Y., Pensky, M. (2003). The stress-strength model and its generalization: Theory and applications. Singapore: World Scientific. [Google Scholar]

21. Cordeiro, G., Silva, R., Nascimento, A. (2020). Recent advances in lifetime and reliability models. Sharjah, UAE: Bentham Books. [Google Scholar]

22. Amigo, J., Balogh, S., Hernandez, S. (2018). A brief review of generalized entropies. Entropy, 20(11), 813. DOI 10.3390/e20110813. [Google Scholar] [CrossRef]

23. Rényi, A. (1961). On measures of entropy and information. Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 547–561, Univ. of California Press. [Google Scholar]

24. Havrda, J., Charvát, F. (1967). Quantification method of classification processes, concept of structural -entropy. Kybernetika, 3(1), 30–35. [Google Scholar]

25. Arimoto, S. (1971). Information-theoretical considerations on estimation problems. Information and Control, 19(3), 181–194. DOI 10.1016/S0019-9958(71)90065-9. [Google Scholar] [CrossRef]

26. Awad, A., Alawneh, A. (1987). Application of entropy to a life-time model. IMA Journal of Mathematical Control and Information, 4(2), 143–148. DOI 10.1093/imamci/4.2.143. [Google Scholar] [CrossRef]

27. Tsallis, C. (1988). Possible generalization of Boltzmann-Gibbs statistics. Journal of Statistical Physics, 52(1–2), 479–487. DOI 10.1007/BF01016429. [Google Scholar] [CrossRef]

28. Casella, G., Berger, R. (1990). Statistical inference. Bel Air, CA, USA: Brooks/Cole Publishing Company. [Google Scholar]

29. Nelsen, R. (2006). An introduction to copulas. 2nd edition, New York: Springer-Verlag. [Google Scholar]

30. Ghitany, M. E., Atieh, B., Nadarajah, S. (2008). Lindley distribution and its application. Mathematics and Computers in Simulation, 78(4), 493–506. DOI 10.1016/j.matcom.2007.06.007. [Google Scholar] [CrossRef]

31. Hinkley, D. (1977). On quick choice of power transformations. Journal of the Royal Statistical Society, Series C, Applied Statistics, 26, 67–69. [Google Scholar]

32. Cordeiro, G. M., Alizadeh, M., Ozel, G., Hosseini, B., Ortega, E. M. M. et al. (2016). The generalized odd log-logistic family of distributions: Properties, regression models and applications. Journal of Statistical Computation and Simulation, 87(5), 908–932. DOI 10.1080/00949655.2016.1238088. [Google Scholar] [CrossRef]

33. Lee, C., Famoye, F., Olumolade, O. (2007). Beta-weibull distribution: Some properties and applications to censored data. Journal of Modern Applied Statistical Methods, 6(1), 173–186. DOI 10.22237/jmasm/1177992960. [Google Scholar] [CrossRef]

34. Pal, M., Ali, M., Woo, J. (2006). Exponentiated Weibull distribution. Statistica, 66(2), 139–147. [Google Scholar]

35. Ristić, M. M., Balakrishnan, N. (2012). The gamma-exponentiated exponential distribution. Journal of Statistical Computation and Simulation, 82(8), 1191–1206. DOI 10.1080/00949655.2011.574633. [Google Scholar] [CrossRef]

36. Chen, G., Balakrishnan, N. (2018). A general purpose approximate goodness-of-fit test. Journal of Quality Technology, 27(2), 154–161. DOI 10.1080/00224065.1995.11979578. [Google Scholar] [CrossRef]

37. Massey, F. J. Jr. (1951). The Kolmogorov-Smirnov test for goodness of fit. Journal of the American Statistical Association, 46(253), 68–78. DOI 10.1080/01621459.1951.10500769. [Google Scholar] [CrossRef]

38. Liu, J. B., Zhao, J., Cai, Z. Q. (2020). On the generalized adjacency, Laplacian and signless Laplacian spectra of the weighted edge corona networks. Physica A: Statistical Mechanics and Its Applications, 540, 123073. DOI 10.1016/j.physa.2019.123073. [Google Scholar] [CrossRef]

39. Akgül, A. (2018). A novel method for a fractional derivative with non-local and non-singular kernel. Chaos, Solitons & Fractals, 114, 478–482. DOI 10.1016/j.chaos.2018.07.032. [Google Scholar] [CrossRef]