APPLICATION OF KANIADAKIS k -STATISTICS TO EXTREME WIND SPEED LOAD DISTRIBUTIONS
Anna Bushinskaya12 , Sviatoslav Timashev12
1 Science and Engineering Center "Reliability and Safety of Large Systems and Machines", Ural Branch, Russian Academy of Sciences, Russia 2 Ural Federal University, Russia [email protected] [email protected]
Abstract
The article considers application of the Kaniadakis' K-statistics [1-3] (non-extensive statistical mechanics) which introduced in 2001 in the framework of Einstein's special theory of relativity, to the analysis and adequate description of extreme wind loads. The K-deformed Kaniadakis exponential function is used to introduce new classes of K-deformed statistical versions of known distributions. These distributions coincide with the original ones with the exception that their k-deformed tail follows the Pareto power law. This allows converting the original distributions into heavy-tailed distributions that more closely match the experimental data of mixed systems and systems operating under conditions of increased uncertainty. This allows, within the framework of known distributions of loads and impacts, to model above-standard stressors and analyze the near impossible to predict "Black Swan" and "Dragon-King" ultra-rare type of events with humongous consequences.
Keywords: Kaniadakis, K-statistics, heavy-tailed distributions, black swan, dragon-king, wind speed, uncertainty.
I. Introduction
During the operation of a complex technical system, due to its complexity, no matter how carefully the calculations are carried out during its design, there always will be unforeseen impacts due to beyond- design (extreme) loads, which will eventually lead, at best, to its local damage. In this case, it is very important to know whether these damages will cause catastrophic destruction of the system as a whole or to its unsuitability for further operation. Such out-of-design or extreme loads and impacts fall on the tails of the so-called fat-tailed distributions. An example of such a distribution is shown in Fig.1.
The distribution shown in Fig. 1 is divided into two parts: central (solid thick line) and caudal (dashed thick line). Separation boundary K represents a threshold value determined by the specifics of the task. The central part represents the values at design loading (normal operation), the tail represents the beyond design values. The central part is usually described by standard distributions (exponential, Weibull, Rayleigh, lognormal, etc.), the caudal (tail) part is described by power-law or heavy-tailed distributions.
Below, we consider the application of Kaniadakis K-statistics [1-3] to known distributions of loads and impacts, which makes it possible to simulate above-norm stressors in the tails of these distributions and analyze the consequences of beyond-design situations (including " Black Swan"
and "Dragon-King" type disasters) . The use of k -statistics makes it possible to obtain simple analytical closed-form expressions for all major statistical functions such as the probability density function, distribution function, survival function, quantile function, risk function, and cumulative risk function.
A*)
K x
Figure 1: Example of distribution of an impact, load or damage
The analysis was made on the example of extreme wind speeds in the Arctic zone of the Russian Federation, obtained at various weather stations. Information about extremely high wind speeds is very important, since knowledge of the true distribution of wind loads is necessary when designing new and operating existing antennas and masts, bridges, wind turbines, high-rise buildings and other objects. This is especially true nowadays, when the extreme events frequency is observed to be constantly increasing [5].
The statistical theory of extremes is based on the idea that extreme identically distributed independent random variables obey one of the three probability distributions (Gumbel, Frechet and Weibull) [6-8], and the Weibull law is successfully used to approximate the distribution of extreme wind speed. Another approach to studying extreme values is by approximating only the tail of the distribution. For this, the so-called Pareto distribution is used, to which the probabilities of events whose intensity (amplitude) exceeds a certain threshold value [6-8] are subjected. The question of how to assign / determine the point K is not disclosed in this article. This is a separate task, determined by the specifics of the problem under consideration.
The initial data for the analysis are taken from the works [6-8], devoted to the study of anomalous features of the wind regime. The study region includes the coastal zone of the Arctic (from the Kola Peninsula to the Chukchi Peninsula), as well as some inland areas (in total, data from more than 30 meteorological stations was used in the analysis). Standard station wind speed measurements at a height of 10 m, averaged over 10 minutes were used. The analyzed data is for the period of 1966-2013.
According to [6-8] one of the basic principles underlying the theory of extreme random processes is violated - the requirement that all sample data belong to the same set. It is shown that the analyzed sample data includes representatives of two different distributions, each of which is approximated by its own Weibull function. This is a situation where representatives of fundamentally different distributions occur among identical (by nomenclature) quantities. The main array of "intermediate" extremes is called "White Swans", and the appearance of the largest and rarest phenomena in this sample is called "Black Swans" [9]. It should be noted that objects belonging to the same distribution have a similar genesis, i.e. large anomalies differ from their "smaller relatives" only in amplitude or degree of impact. Events that belong to a different distribution have a different genesis and characterize fundamentally different objects, called "Dragons" or "Dragons-Kings" [10].
Anna Bushinskaya, Sviatoslav Timashev RT&A, Special Issue № 4 (70) APPLICATION OF KANIADAKIS k-ST ATIST ICS..._Volume 17, November 2022
II. Mathematical description of k -statistics
Consider a random value (RV) X with probability density function (PDF) f (x). In statistical
mechanics, the general equation for the rate of change of f (t) is a first-order linear ordinary differential equation (ODE):
= ( x ) f ( x ), (1)
where the function r (x) is the decay rate. The solution to this ODE is exponential
( v A
f (t) = c exp
-J r (t ) dt
(2)
with the standard normalization condition defining the constant c:
) dt = 1 (3)
J f ( t ) <
x0
As an exponential decision, consider three simple cases.
1. Exponential model for constant decay rate, ie
r ( x) = 2, (4)
which from Equ. (1) leads to the exponential PDF
f (x) = Ae-Zt. (5)
2. Pareto distribution type I. The PDF of the Pareto distribution is obtained from Eq. (1) with a decay rate function equal to
r (x) = P, p > 1, (6)
x
with PDF in this case
f (x) = ^X0) , p > 1, x e(x0, +») , x0 > 0 .
(7)
3. The K-exponential model [10-13]. has proven useful in many applications. According to [1-4], experimental data indicate that the PDFs should resemble an exponential function for x ^ 0. However, as x ^ 0, the Pareto PDF diverges. On the other hand, for high values of x, many experimental results show a Pareto - like PDF with power-law tails instead of tails with exponential decay. Therefore, as x ^ 0 it follows that r (x) ~ 2, and as x ^ + ^ it follows that r(x) ~ p/x. Thus, the actual decay rate function r(x) should smoothly interpolate between these two modes. A good suggestion for r(x) was introduced in the context of special relativity where the function r(x) is given via the Lorentz factor
yK(q) = >/1 + q2 . (8)
This expression includes the dimensionless momentum q, where the parameter k is the reciprocal of the dimensionless speed of light c , i.e. k « 1/ c . Then, taking
r(x)= . \2 2, (9)
V1 + K2x2 x2
we get for x ^ 0, the decay rate r(x) corresponds to an exponential distribution, i.e r(x) ~ 2, and as x ^ + ^ corresponds to the Pareto distribution, i.e r(x) ~ 1/k x.
The solution of ODE (1) leads to the following PDF:
Anna Bushinskaya, Sviatoslav Timashev RT&A, Special Issue № 4 (70) APPLICATION OF KANIADAKIS k-ST ATIST ICS..._Volume 17, November 2022
f (x) = l(l-k2)expK{-Xx) , (10)
where k is the deformed exponential function given by
expK( x) = (V 1 + k2 x2 + kx ) ^. (11)
for 0 < k < 1.
It is important to note that as k ^ 0 and as x ^ 0, the function expic(x) tends to the usual exponent, i.e.
expK(x) ~ exp(x),
k—0 (12) exp^(x) ~ exp(x).
x—0
On the other hand, the function expK (- x) as x ^ + ^ is a power tail, i.e.,
expK(-x) ~ (2kx)-1/k . (13)
t—
In addition, the K-exponential satisfies the following identity
exp^(x)exp^(-x) = 1. (14)
by analogy with the standard, undeformed exponential.
The K-exponent is a very powerful tool that can be used to formulate a generalized statistical theory capable of considering systems described by distribution functions that have power tails.
Consider the application of the Kaniadakis K-statistic to the Weibull distribution. According to [4], the K-deformed Weibull cumulative distribution function:
FK( x) = 1 - expK(-( x / r)a)), x > 0, a,T,K> 0, (15)
where a is the shape parameter, t is the scale parameter .
Another parameterization of the Weibull distribution is often used. The shape parameter a remains the same, but the scale parameter is replaced with ¡3 = . Then
FK( x) = 1 - expK (-¡xa ), x > 0, a, ¡> 0,0 <k< 1. (16)
It is easy to obtain the K-deformed Weibull PDF :
eK(-pta)a3Kxa-1
dFK( x )
V 3 x"
f (x) = -^ =-r i \-¿. (17)
dx (J 1 + k 2 ¡2 x2a ^
¡K x
Introducing a substitution y = a.3 K x-, yields
3 Kxa
V, , 2 02 2a
1 + k 3 x -1
f K ( x ) =
■y/1 + K2 ¡2 x2
eK (-Pxa)apKxa-1 (1 -y)_ eK (-¡xa)afiKxa-1 y
¡k a1 - y ^ = ¡xa
y
(-¡xa)a3xa-1 a (x/t)"-1 eK (-(x/t)")
(18)
+ K2 ¡2 x2a t + K2 ( x / t)2
The shape parameter (index of power) a quantitatively characterizes the shape of the distribution, which is less (more) pronounced at smaller (larger) values of the parameter. The parameter t is a scale parameter: if t is small, then the distribution will be more concentrated around the mode; if t is large, the distribution will be less concentrated and more scattered. Finally, the parameter K characterizes and measures the heaviness of the right tail: the larger (smaller) its value, the thicker (thinner) the tail. As k ^ 0, the distribution tends to the standard
Weibull distribution. It can be easily verified [4]
( a—-1
lim fK( ——,t,k) = —i — \ exp (-( — It—),
K^-0 T \T ) V '
lim FK (—,a,T, k) = 1 - exp(-(—It)— ).
Since the exponential distribution is a special case of Weibull PDF with a shape parameter
equal to 1, then as k ^ 0 and a = 1, the K-deformed functions tend to the exponential law. For x ^
0+, the distribution behaves similarly to the standard Weibull model, while for large values of x it
_! a approaches a Pareto distribution with a scale parameter z (2k) a and a shape parameter —, i.e.,
K
a
lim fK (——,t,k) = -
-( 2k)-
lim fk (
—,a,T,K
) = 1 -
— K
i __1 \
t(2k) a
—
(20)
From Eq. (15) one can find the quantile function (inverse distribution function) and the survival (reliability) function [4]:
FK 1 (p, a, t, k) = t I ln
1la
= ß a I ln
1la
0 < p < 1,
(21)
1 ' - 1
1 - p ) v 1 - P
Sk(——,t,k) = expK (- ( — I T)a ) = expK (-ß—a ),
where the K-logarithm of lnc(w) is the inverse of expic(w), i.e. lnK(expK (x)) = exp^lnc (x)) = x , and is determined by the formula
ln-(—) = -
2k
The median of K-deformed Weibull distribution
xmed, = r [ln, (2)]« = p- [ln, (2)]« . (22)
The mode xmode, of PDF can be obtained analytically as a function of the parameters a, p and k. If xmode, is the point of PDF maximum, then by equating the derivative of the PDF to zero, we
can find —
mode,,
[4]:
—mode—
a2 + 2k2 (a-1)
2 k'
(a2 -k2 )
4k2 (a2-k2)(a-1)2
1 +-^--1
(a-1)]
a2+2k2'
2a
(23)
According to [4], the moments of K-deformed Weibull distribution can be calculated using the formula:
' 1
r 1+- r
2k 2a
(2k)— 1 + r- rf-l + r
a ^2, 2a,
Where, in particular, the mean, variance and coefficient of variation:
a
K
a+1
a
= T
r
T
a
ar - =
MK[ X ] =
r| 1+1 lrf-1 -±
a J I 2k 2a
( 2K)a
1 +
K
a
+-1 , 2k 2a
D- X ] =
( 2K)a
ru+2 )rf± -1
K^-MK [ X ],
1 + 2- ri—+
a I 2k a
vk[x ] = .
2,K
-1 =
rf1+1 Mf -1
a J V2k a
1 + 2^0+1 a I 2k a
1
Denote
c =
M1+1 VU-
V a J ^ 2k 2a
1+- rpL+J_
a V 2k 2a
-— r
(2k) a V 2k 2a
T+—- ifX~li
a I 2k 2a
-1.
(25)
(26)
Then the connection between the moments of the k -deformed ar,, and standard
a„ =Tr r 1 +
a
Weibull distribution is given by:
a—- = arCr . (27)
Now it is not difficult to obtain expressions relating the main numerical characteristics of the standard and k -deformed Weibull distributions
Mk[ X ] = M [ X ] C1,
D- [X] = a2C2 -M2 [X]C2 = D[X]C2 + M2 [X](C2 + C2),
(28)
V- [ X ] +1 = =
2,K a2C2
1,k a\C\
= (V2 [X] +1)
C
C
2
■2 .
T
2
z
2
2
a
1,K
III. Applications of k -statistics to the analysis of extreme wind speeds
The analysis of extreme values of wind speeds was carried out separately for the cold and warm seasons, which in the Arctic are characterized not only by sharply contrasting temperatures, but also by features of the atmospheric circulation. The summer months here are, in fact, July and August, while the winter season covers not only December, January and February, but usually includes November, March and April as well.
To approximate the distribution function of the frequency of wind speeds, we used the Weibull law, which has the distribution function [6, 7]
n « FK(u ) = 1 - exp (-¡ua~), u > 0, a, ¡> 0,
where u is the wind speed module, n / N characterizes the accumulated frequency. This expression can be transformed
In [-In (N - n) / N] = ln u.
Anna Bushinskaya, Sviatoslav Timashev RT&A, Special Issue № 4 (70) APPLICATION OF KANIADAKIS k-ST AT 1ST ICS..._Volume 17, November 2022
It follows that in special coordinates {ln [ -ln (N - n) / N], ln u } the Weibull probability
distribution is represented by a straight line. The degree of deviation of empirical points from it characterizes, together with known statistical criteria, the applicability of the theoretical distribution law.
Consider examples of the empirical distribution of the frequency of wind speeds (Fig. 2) [6]. It can be seen from Fig. 2 that the set of empirical points has a rectilinear section, however, when moving to especially large values, the regression line bends down. This feature turned out to be typical for the data of all considered meteorological stations [6]. In this case, the selection of a general linear dependence by the least squares' method is possible since most of the points fill the "linear" section. However, such an approximation will not satisfactorily describe the largest values of wind speed (located in the tail of the distribution) and as a result, their probability will be significantly underestimated. It was assumed that the sample included representatives of two different distributions, and each of them (since this is the case of extreme values) can be approximated by the Weibull law.
Figure 2: Empirical distributions of extreme wind speeds (1966 - 2013) according to measurements at Teriberka station (left) in cold periods and at Okunev Nose station (right) in warm periods of the year, plotted in the coordinates of the
Weibull probability distribution
Thus, the selection of two independent straight lines (Weibull distributions) has been carried out. This situation is shown as an example in Fig. 3 [6] for two stations and different seasons.
Figure 3: Empirical distributions of extreme wind speeds of the year (1966 - 2013) according to measurements at Lovozero (left) stations during cold periods and Okunev Nose (right) during warm periods, constructed in the coordinates of the Weibull probability distribution, where the line segments correspond to two different laws of
distribution.
Each group of points with values greater than (u > Uth) and less than (u < Uth) is well approximated by its straight line in a special coordinate system, showing that the set of extremes is formed from values belonging to different general populations.
The obtained parameters of the Weibull distribution for the data of the three stations considered above are presented in Table 1 [7]. For the data of each station for the summer and winter periods, the parameters of two Weibull distributions were determined, describing: the main wind speeds distribution ("white and black swans" - "WS and BS") and the distribution of large (tail) values ("dragons" - "DR").
Table 1: Weibull distribution parameters calculated for the period 1966-2013, separately for two groups of extremes corresponding to two families of distributions "WS and BS" and "DR" (for different periods and speeds, in m/s)
Station Family membership winter period Summer period
Parameter a Parameter f Parameter a Parameter f
Teriberka WS and BS 3.97 1.6 -10"5 4.39 3.1-10"5
DR 1.77 0.012 2.12 0.0081
Okunev Nose WS and BS 3.40 0.0017 4.40 0.0002
DR 0.52 1.1722 0.98 0.3816
Lovozero WS and BS 3.19 0.0013 4.45 0.0003
DR 1.69 0.0429 2.30 0.0202
The PDFs of the obtained distributions are shown in Figs. 4-6. Thus, in order to adequately describe the extremes of wind speed with one distribution, it is necessary to combine the PDFs as follows: before the intersection point of the PDF, the main distribution family is used ("WS and BS" - solid line), after the intersection point, the distribution of large (tail) values ("DR " - dotted line).
We modify the main distribution ("WS and BS") so that its tail part adequately describes large values of extreme wind speed. To do this, we construct a K-deformed Weibull PDF, for which we choose the parameter k so that the right tail corresponds to the tail of the "dragons" distribution. The results obtained are presented in Figs. 7-9.
O.OB ■
0.05
0.04
0.02
0.00
---Weibull (DR) - Weibull (WS and BS)
t t t , t I j h \ \ V 1 \ \
1 / I / t / 1 / 1 / V
i / I / ' / / v.
— i i i
10 20 30
Annual wind speed, m/s
40
Figure 4: PDF of extreme wind speeds (1966 - 2013) according to measurements at Teriberka station in winter, corresponding to two families of distributions "WS and BS" and "DR"
0.30
0.25
0.20
a 0.15 a.
010 0.05 0.00
1 t * ---Weibull (DR) - Weibull (WS and BS)
\ \ \ \
1 / 1 1 i 4 i / 1 i / 1
X / i / X 1 \ 1
/ \ j V
— —- \ \ V \ V
10 15
Annual wind speed, m/s
20
25
Figure 5: PDF of extreme wind speeds (1966 - 2013) according to measurements at Okunev Nose station in summer, corresponding to two families of distributions "WS and BS" and "DR"
0.16 0.14 0L2 0.10 OOS 0.06 0.04 0.02 0.00
— Weibull (DR) Weibull (WS and BS)
/ \ \
t / / / f f V \ ^ \
' / 1 / \ \ > \ \ \
1 / J / 1 / \ V * \
' / ' / \ \ \ \
/
10 15
Annual wind speed, m/5
20
Figure 6: PDF of extreme wind speeds (1966 - 2013) according to measurements at Lovozero station in winter, corresponding to two families of distributions "WS and BS" and "DR"
OOS ■
0.06
o
l-L.
0.04
0.02
0.00
/ A ..... Weibull (DR) - Weibull (WS and BS)
/ ---Kap pa-We ¡bull
/ A / A
f\ \
LA\ V-
10 20 30
Annual wind speed, nn/s
40
Figure 7: PDF of extreme wind speeds according to measurements at Teriberka station (winter period) corresponding to three families of Weibull distributions: "WS and BS", "DR" and k -deformed.
Figure 8: PDF of extreme wind speeds according to measurements at Okunev Nose station (summer period) corresponding to three families of Weibull distributions: "WS and BS", "DR" and K-deformed
From Fig. 7-9 it can be seen that the K-deformed distribution ("dot-dash" line) adequately describes not only the main distribution ("WS and BS" - solid line), but also almost completely coincides with the distribution of "dragons" - large (tail) extreme values (dotted line). Thus, k-statistics allows describing data in which representatives of two different distributions are mixed with a single K-deformed distribution.
Figure 9: PDF of extreme wind speeds according to measurements at Lovozero station (winter period) corresponding to three families of Weibull distributions: "WS and BS", "DR" and K-deformed
The processing of data from all weather stations showed similar results, which convincingly demonstrate the success of applying K-statistics to the analysis of wind speed extremes, which consist of a mixture of values belonging to different general populations.
In [7], to characterize the geography of extremes and compare summer and winter conditions, quantile speed values were constructed using the formula:
Anna Bushinskaya, Sviatoslav Timashev RT&A, Special Issue № 4 (70) APPLICATION OF KANIADAKIS k-ST ATIST ICS..._Volume 17, November 2022
—In- 1
U ( p ) = ln
( P LP 1 -P_
where p is the probability value and U ( p ) is the corresponding quantile value of the wind speed.
The p value is expressed in terms of the "return time", which characterizes the time interval after which the same (or larger) speed anomalies reappear: T = 1/(1- p). For the analysis of extremes, the value p = 0.99 was used. For the summer period (62 days - July and August), the total sample size for 48 years (1966-2013), subject to sieving every 3 days, is ~974 days. The share of events (1 - 0.99 = 0.01) is approximately 10 days. For 48 years, this corresponds to the situation of the appearance of an extreme U(0.99) once in 5 warm seasons. For the cold season, p = 0.99 corresponds to the average time of extreme value U(0.99) appearance once in two cold periods of the year. The values calculated from the station data (separately for "WS and BS" and "DR" and for the winter period) are presented in Table 2. The values calculated from the K-deformed distribution are also presented (see formula (21)).
Table 2: Quantile values of extreme wind speeds (winter period), m/s, U (0.99), calculated separately for three distribution families: "WS and BS", "DR" and K-deformed
Station Distribution family
WS and BS DR K-deformed
Teriberka 24 29 28.98
Lovozero 13 16 1 5.81
As can be seen from Table 2, the same quantile values are significantly larger (by 10-30%) in the distribution of "Dragons" than those of "Swans". Moreover, the values of the K-deformed distribution almost coincide with the values of the "Dragons".
IV. Conclusion
The study of wind speed extremes based on standard observations in the coastal regions of the Russian Arctic shows that they represent two sets of data with different statistical properties, each of them obeying its own Weibull law. The K-statistical approach presented in this paper made it possible to construct a K-deformed Weibull distribution that adequately describes both sets of extreme wind speed data with different statistical characteristics.
It stands to reason to claim that the developed above approach is applicable to any other type of meteorological loads and impacts.
References
[1] Kaniadakis, G. Statistical mechanics in the context of special relativity. Phys. Rev. E 66, 17 (2002).
[2] Kaniadakis, G. Maximum entropy principle and power-law tailed distributions. Eur. Phys. J. B 70, 3-13 (2009).
[3] Kaniadakis, G., Scarfone, AM, Sparavigna, A. & Wada, T. Composition law of kappa-entropy for statistically independent systems. Phys. Rev. E 95, 052112 (2017).
[4] Clementi, F.; Gallegati, M.; Kaniadakis, G. A model of personal income distribution with application to Italian data. Empirical economics. 2011, 39, 559-591.
[5] The second assessment report of RosHydromet on climate change and its consequences on the territory of the Russian Federation. General summary. M., 2014. 58 p.
[6] Kislov A.V., Matveeva T.A. Wind speed extremes in the European sector of the Arctic // Meteorology and Hydrology. 2016. No 7. P. 5-14.
[7] Kislov A.V., Matveeva T.V., Platonov V.S. 2016. General description of the variability of hazardous weather phenomena in the Arctic. - In the book: "Changing climate and the socioeconomic potential of the Russian Arctic", vol. 2. - Moscow, Liga-Vent, pp. 10-45.
[8] Kislov A.V., Matveeva T.A., Platonov V.S. 2015. Wind speed extremes in the Arctic. -Fundamental and Applied Climatology, No. 2, p. 63-80.
[9] Taleb NN The black swan: The impact of the highly improbable fragility. - Random House, 2010, vol. 2.
[10] Sornette D. Dragon-Kings, Black Swans and the prediction of crises // International Journal of Terraspace Science and Engineering, 2009, no. 2 (1), pp. 1-18. 7.