Научная статья на тему 'Asymptotic dependence of average failure rate and MTTF for a recursive, meshed network architecture'

Asymptotic dependence of average failure rate and MTTF for a recursive, meshed network architecture Текст научной статьи по специальности «Математика»

CC BY
58
9
i Надоели баннеры? Вы всегда можете отключить рекламу.
Область наук

Аннотация научной статьи по математике, автор научной работы — C. Tanguy

The paper is concerned with the exact and asymptotic calculations of the availability, average failure rate and MTTF (Mean Time To Failure) for a recursive, meshed architecture proposed by Beichelt and Spross. It shows that the asymptotic size dependences of average failure rate and MTTF are different, but not inverse of each other, as is unfortunately assumed too frequently. Besides, the asymptotic limit is reached for rather small networks

i Надоели баннеры? Вы всегда можете отключить рекламу.
iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.
i Надоели баннеры? Вы всегда можете отключить рекламу.

Текст научной работы на тему «Asymptotic dependence of average failure rate and MTTF for a recursive, meshed network architecture»

ASYMPTOTIC DEPENDENCE OF AVERAGE FAILURE RATE AND MTTF FOR A RECURSIVE, MESHED NETWORK ARCHITECTURE

C. Tanguy

Orange Labs, CORE/MCN/TOP, 38-40 rue du Général Leclerc, 92794 Issy-les-Moulineaux Cedex 9, France

e-mail: [email protected]

ABSTRACT

The paper is concerned with the exact and asymptotic calculations of the availability, average failure rate and MTTF (Mean Time To Failure) for a recursive, meshed architecture proposed by Beichelt and Spross. It shows that the asymptotic size dependences of average failure rate and MTTF are different, but not inverse of each other, as is unfortunately assumed too frequently. Besides, the asymptotic limit is reached for rather small networks.

1 INTRODUCTION

Network availability and reliability have long been a practical issue in telecommunication networks, among others. Quality of Service (QoS) requirements imply high availabilities A, but also a good knowledge of the failure frequency v - and of the average failure rate X=v / A - of (for instance) point-to-point connections, when the system is repairable. When the system is not repairable, an important parameter is the MTTF (Mean Time To Failure). As explained in many textbooks (Shooman 1968, Singh & Billinton 1977, Kuo & Zuo 2003), a system whose failure rate X is constant over time has a reliability described by the exponential distribution R(t) = exp(-X t), so that the MTTF, defined by

MTTF =<t>= J t (-R'(t) dt) = J R(t) dt, (1)

0 0

is in this case MTTFexp = 1/X. This may lead to confusions in repairable systems, where it may still be legitimate to consider constant failure rates for each element of the system, and yet obtain an average failure rate X . If A is the availability of a system made of m elements - whose failures are assumed to be statistically independent - having individual availabilities pi (1 < i < m) and constant failure rates Xi, then (Buzacott 1967, Singh & Billinton 1974, Schneeweiss 1981, 1983, Shi 1981, Hayashi 1991, Druault-Vicard & Tanguy 2006)

— 1 m d A

X = - YXPi —. (2)

Most results of the literature are devoted to series-parallel systems, where all components are identical, with the same (constant) failure rate X. For n components in series, A(p) = pn so that the

— p dA

aggregate failure rate is equal to X = X——=n X . The reliability of n components in series is R(t) =

A dp

[exp(-A t)]n = exp(- n A t), which gives MTTF = 1/(n A). For n components in parallel however, R(t) = 1-(1-exp(-A t))n, which leads to

1 m 1 1

MTTFpara,fe, =— = — A i=i I A

(

ln n + C +—+... 2 n

(3)

y

for n large (Shooman 1968, Kuo 2003) and where C = 0.577216... is the Euler constant. Quite generally, it is therefore important to estimate the reliability and related parameters of large systems in order to get a better understanding of key issues (Kolowrocki 2004).

In this work, we consider a recursive, meshed - not series-parallel - network configuration first considered by Beichelt and Spross (Beichelt & Spross 1989) as well as Prekopa and collaborators (Prekopa et al. 1991). For the repairable case we shall use the availability A(p), and in the non-repairable case the reliability R(p(t)), even though A(p) and R(p) are formally identical for the same network made of identical components. We show in detail that when such a system is large, knowledge of the generating_function of the reliability/availability allows us find the analytic, asymptotic expressions for A and MTTF. These expressions, which both have simple n-dependences, are not the inverse of each other: while for A , we find again a linear dependence in n (Druault-Vicard & Tanguy 2006), we obtain a n"15 dependence for the MTTF. Besides, they are in very good agreement with the exact values even when n remains relatively small.

2 NETWORK ARCHITECTURE: A CASE STUDY 2.1 Description

The network configuration defined by Beichelt and Spross (Beichelt & Spross 1989) is represented in Figure 1. They wanted to estimate the two-terminal reliability between the endpoints of the structure (in the original paper, the destination point was S6).

Following the method developed in (Tanguy 2007), we have been able to show that the two-terminal reliability between S0 and Sn may be expressed as a product of transfer matrices, in which each edge or link probability of functioning is arbitrary. It turns out that this transfer matrix is 15 x 15. However, if nodes are perfect and if links have the same reliability/availability p, things are much simpler, because a single transfer matrix needs be considered, the successive powers of which are to be calculated. Fortunately, these necessarily obey a recursion relation of finite order stemming from the characteristic polynomial of the transfer matrix. When dealing with Rel2(S0 ^ Sn) = Rel2(n), a very useful tool is the generating function formalism (Stanley 1997), since it encodes the exact result in a very concise manner.

2.2 Generating function of the reliability/availability

The generating function G(z) = Zn Rel2(n)(p) zn may eventually be written as G(z) = N(z)/D(z) (Tanguy 2007), where

N(z) = 1 - (1- p)2 p( 1 + 4p + 8p2 - 20p3 + 9p4) z

- (1-p)3 p3 (-2- 7p + 13p2 + 26p3 - 74p4 + 38p5 + 29p6 - 34p7 + 9p8) z2

- (1- p)6 p5(1 + 6p - 17p2 + 4p3 + 3p4 + 5p5 -14p6 + 8p7 + 5p8 - 7p9 + 2p10) z3

- (1- p)9 p8 (-2 + 16p2 - 4p3 - 36p4 + 34p5 - 7p6 - 3p7 + p8) z4

+ (1-p)13 pn(-1 + 3p - p2 - 3 p3 + p4) z5, (4)

D(z) = 1 - p ( 2 + 4p -p2 - 33p3 + 58p4 - 38p5 + 9p6) z

+ (1- p)2 p2(1 + 6p + 11 p2 - 31 p3 - 44p4 + 168p5 -158p6 + 20p7 + 63p8

- 43 p9 + 9 p10) z2

- (1- p)4 p4 (2 + 10p - 2p2 - 73p3 +138p4 -105p5 + 41 p6 - 40p7 + 64p8 - 41 p9 - 5p10

+ 21 p11 -11 p12 + 2 p13) z3 + (1- p)8 p6(1 + 8p + 2p2 - 4p3 - 30p4 + 23p5 + 43p6 - 76p7 + 47 p8 -10p9

- 2 p10 + p11) z 4

- (1-p)12 p9 (2 + 6p + p2 - 18p3 +13p4 + 2p5 - 4p6 + p7) z5 + (1-p)16 p12 z6. (5)

We deduce for « = 6

(6)' ,„

= 1 + 42 p + 328 p2 + 826p3 - 1473 p4 - 11400p5 - 9975 p6 + 61060p7 + 160918 p8

Rel2 (p) _1 . An. ^ . o00 »,2 . Qn.r „^ ^nn „^ 6 . ri-rri »,7 . i ¿Arn o

p 6

- 153606p9 - 1203380p10 - 101102p11 + 6957668 p12 + 2134306p13 - 33913956 p14

- 11462384 p15 + 179889959 p16 - 49002916 p17 - 965490222p18 + 2056136956 p19

- 213511696 p20 - 7360834390p21 + 19329198282p22 - 29836117826 p23

+ 33105011509 p24 - 28179232812 p25 + 18911540288 p26 - 10111211062 p27 + 4305721566 p28 - 1446762862p29 + 376155108p30 - 73146582 p31 + 10029258 p32 - 865872 p33 + 35442 p34 , (6)

so that Rel2(6)(0.9) is equal to 0.9974544308852755355007942390030310588362, which is close to the upper bound given by Beichelt and Spross (Beichelt & Spross 1989). A partial fraction decomposition of G(z) gives

ÇiZ

G(z) = £ ^^ . (7)

i=1 1

There are six eigenvalues Q; a few of them may be pairs of complex conjugate values for some values of p. When the Q's are distinct, (7) immediately gives

Rel2(n)(p| = £ or, gt'. (8)

i = 1

2.3 Asymptotic reliability/availability

In the limit n ^ <x>, a single eigenvalue will prevail in the sum of (8), that of largest modulus. In the following, we shall name it Z+. It is real for the whole range 0 < p < 1 (see Fig. 2), and necessarily goes to 1 whenp ^ 1 because Rel2(M) (p = 1) = 1; all other eigenvalues tend to zero in that limit.

p

Figure 2. Variation of Ç+ with p; Ç+ (0.9) = 0.9999596999379792.

Even though it is not possible to get an analytic expression for Z+ as a function of p (D(z) is of degree 6 in z), we may compute it numerically very effectively, and also derive the expansion of Z+ as a function of q = 1 - p for small q's. Using symbolic software, we deduce from the constraint D(1/Z+) = 0

£+ ^ 1 -4q5 -4q7 + 9q8 + 9q9 +13q10 +... (9)

Whenp is close to zero, we have instead:

£+ ^ p+V2 p3/2 + p2 + ^ p5/2 +... (10)

The prefactor a+ is deduced from p and Z+ because it is closely related to the residue of G(z) at z = 1/Z+. The general result is in fact

a -£+N(1^0 (11)

* D\ (1/£+ )

where D'z = d D(z)/ dz. From the knowledge of p and the numerical value of Z+(p), we simply obtain a+(p), which is plotted in Figure 3.

Here again, we may consider two limits. Forp ^ 1,

a* ^ 1 -2q3 -4q4 +10q5 -7q6 +... (12)

while when p ^ 0,

1 -\/2 1/2 1lV2 3/2

a+ —p--p +.

+ 2 4 16

1.0

0.8

0.6

0.4

0.2

0.0 0.0

0.2

0.4 0.6

P

0.8

1.0

(13)

Figure 3. Variation of Ç+ with p; Ç+ (0.9) = 09976956497611774972.

The essential result is that, when n is large,

Rel2(n)( p) = a+ ç+ „ + (neglig.terms)

(14)

Basically, it looks as if the recursive network is made of n elements, in series, each of which having the reliability/availability Z+. The two asymptotic expressions of A and MTTF we shall derive as functions of n in the following section are a mere consequence of (14).

3 AVERAGE FAILURE RATE

3.1 Exact expression

In the case of identical links with constant failure rate A, (2) gives

a _Vn p dAn(p) An _-_ A-

A„ ( p) A„ ( p) d p

(15)

Knowing = Rel2(n)(p) by recursion (using (4)-(5)), the derivative is easily obtained for arbitrary values 0 < p < 1.

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

3.2 Asymptotic expression

Because Rel2(n) « a+ Z+n for n large, we get (Druault-Vicard & Tanguy 2006)

A„ ~A

„-

d lnç+ d ln a

+-

d ln p d ln p

(16)

Of course, it would be easier to get d ln Z+/d ln p and d ln a+/d ln p if Z+ were known analytically. Still, as in the formal calculation of a+, D(1/Z+) = 0 implies that

d D d p

(1/^+ ) +

g+'( p) ( p)2

+

d D

d z

(1/^+ ) = 0

(17)

from which we deduce Z+'(p) and then

d lng+ d ln p

= p$+

D\ Q/gy) D\ (1/sv)

d ln a+ d ln g+

d ln p d ln p

1- N'z (1/gy) + D''zz (1/^+)

N(1/g+) D'z (1/^+)

+ P-

Np (1/^+) D''zp (1/^+) --p-

N(1/g+) ^ D\ (1/^+)

(18) (19)

Their variations for 0 <p < 1 are displayed in Figures 4-5.

1.0

Figure 4. Variation of d ln Z+ /d lnp with p.

d ln □ d ln p

0.6 0.5 0.4 0.3 0.2 0.1

0.0 0.

0.2

0.4

0.6

0.8

1.0

Figure 5. Variation of d ln a+ /d ln p with p.

p

p

Note that, unsurprisingly, they exhibit singular behaviors in the vicinity of p = 0:

dln^+ > 1+V2 pl/2 + 1W2

1 +— p1/2 + —— p3/2 + ... (20) d ln p 2 8

2H p1/2 -1 p- ^ p3/2 + k (21)

d ln p 4 4 16

Exact results as well as the linear approximation (see (16)) are displayed in Figure 6 for p = 0.9. We see that the agreement is excellent even for n = 2.

0.10

0.08

0.06

□Li

0.04

0.02

0.00

46 n

10

Figure 6. Comparison between exact results (purple) and asymptotic approximation 0.06426+0.0018180 n (orange) for

AIA andp = 0.9.

4 MTTF CALCULATIONS 4.1 Exact expression

We are now considering a non-repairable system, and its reliability Rn(t). Let us recall that

MTTFn =[ Rn (t) dt . (22)

If each element has reliability p(t) = exp (-A t), we can write t = (- 1I A) ln p(t) and then (22) as

MTTF; =IH dpRel2W(p) . AJ 0 p

(23)

We can reuse the results obtained in Section II. Clearly, the exact MTTFn is obtained from (23), since such an integration is routinely performed by mathematical software.

4.2 Asymptotic expression

The calculation of the asymptotic expansion of MTTFn is based again on Rn « a+ Z+n when n is large:

MTTF

\ a ( p) ( p).

AJ 0 p

(24)

We have plotted and Ç+40 in Figure 7. Because vanishes for p ^ 0, the Hp factor does not play a significant role in the integral. As ; increases, the essential contribution to the integral will obviously come from the domain "p close to unity".

0

2

8

Figure 7. Variation withp of Z+ and Z +40.

The best approach is therefore to use q as the variable of integration

MTTF„

\ l-d- (1 - q) çl (1 - q) . AJ 0 1 - q

(25)

The gist of the calculation, quite standard in asymptotic expansions, is to extract the prevailing contribution of the integrand when q ^ 0. We can write

Ç+ = exp(- « (-In Ç+ ))

(26)

and derive the expansion of - ln Z+ in q from (9)

- ln ç+ = 4 q5 + 4 q7 - 9 q8 - 9 q9 - 5 qlu +...,

10

(27)

so that

ç+ = e-4nq5 exp(-«(4q7 -9q8 -9q9 -5q10 + ...))

(28)

This manipulation may seem quite formal, but now we can use a rescaled variable t = 4 n q , or, equivalently, set q = t15/(4 n) /5. Equation (28) then gives

ç+ = eexp

- n

f f , \7/5

v 4 « J

- 9

f , Y/5

v 4 « J

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

+...

= e exp

«

2/5

T

7/5

+

«

3/5

T

,8/5

+.

(29)

Equation (25) leads to

1 r

MTTF ^-F

« ; J0

1 p4« T-45 dT a+1

AJ° 5(4 «)

1/5

1 "(r.)") -,

e x exp

1 - t

L« 1

7/5 8/5

4 + ^ fl^ + ...

«2/5 v 4,

v

«3/5 V 4

. (30)

J

The upper bound of the integral depends on n. However, because of the e-T factor, the error made by replacing this upper bound by vanishes exponentially with n (as also do the already discarded

4/5 -T

contributions of the eigenvalues different from Z+). Consequently, we can merely integrate T e

4

9

multiplied by an expression admittedly depending on t and n, but which can be easily expanded in the n ^ <x> limit, assuming t remains finite. For instance, the leading term of MTTFn is (see (12))

1 r

MTTF; ^ - f ; ; Jq

1 rœ t

-4/5

dT

AJq 5(4 ;)

1/5

e = —

1 r(1/5) 1 r(6/5)

; 5(4;) ; (4;)

1/5

(31)

where r(x) is the Euler gamma function. Using (30), going beyond the leading term is not difficult, and we find

;MTTF; ^ im+1K2Î5I + 2. IËÎ5I - JL+...

(4 ;)1/5 5 (4;)

2/5

25 (4 ;)

3/5

40 ;

0.6958417869 0.2547996219 0.05185668604

- +-7-7- + -

0.025

;

1/5

;

2/5

;

3/5

- +...

;

(32)

(33)

By contrast to the series or parallel cases, the leading term in the asymptotic expansion of the

MTTF has a behavior in n"15, which slowly decreases with n. Each of the following terms of the

expansion adds another ;"15 factor.

4.3 Comparison of exact and asymptotic results

We can now compare (33) with the exact values. The results are displayed in Figure 8. Even for n ^10, the asymptotic expansion gives a very satisfying agreement, despite the limited number of used terms (four).

□MTTF

1.2 1.0 0.8

0.6 0.4 0.2

0.0

10

15

20

25

30

Figure 8. Comparison between exact values (purple) and the four-term asymptotic expansion of (33) (orange) of the

MTTF.

0

5

n

5 CONCLUSION AND OUTLOOK

We have calculated the availability of the architecture studied by Beichelt and Spross, and shown that for perfect nodes and identical links with constant failure rate, the asymptotic expansions of the associated average failure rate and MTTF obey quite different power-law behaviors in n (the extension of the network). It could be useful as a reminder that average failure rate and MTTF are not necessarily the inverse of each other.

The present study may be easily generalized to various recursive networks. Actually, it is possible to find the asymptotic expansion of the MTTF for different classes of large, arbitrary recursive networks, even though the exact generating function is not known (Tanguy 2008).

REFERENCES

1. Beichelt, F. & Spross, L. 1989. Bounds on the reliability of binary coherent systems. IEEE Trans. Reliability 38, 425-427.

2. Buzacott, J. A. 1967. Finding the MTBF of repairable systems by reduction of the reliability block diagram. Microelectron. & Reliab. 6, 105-112.

3. Druault-Vicard, A. & Tanguy, C. 2006. Exact failure frequency calculation for extended systems. Submitted (see also arXiv: cs.PF/0612141).

4. Hayashi, M. 1991. System failure-frequency analysis using a differential operator. IEEE Trans. Reliability 40, 444-447.

5. Kolowrocki, K. 2004. Reliability of Large Systems. Amsterdam: Elsevier.

6. Kuo, W. & Zuo, M. J. 2003. Optimal reliability modeling: principles and applications. Hoboken: Wiley.

7. Prekopa, A, Boros, E. & Lih, K.-W. 1991. The use of binomial moments for bounding network reliability, in Reliability of Computer and Communication Networks (DIMACS 5), F. S. Roberts, F. Hwang, & C. L. Monma (Editors), American Mathematical Society, New Brunswick, 197-212.

8. Schneeweiss, W. G. 1981. Computing failure frequency, MTBF & MTTR via mixed products of availabilities and unavailabilities. IEEE Trans. Reliability R-30, 362-363.

iНе можете найти то, что вам нужно? Попробуйте сервис подбора литературы.

9. Schneeweiss, W. G. 1983. Addendum to: "Computing failure frequency, MTBF & MTTR via mixed products of availabilities and unavailabilities." IEEE Trans. Reliability R-32, 461-462.

10. Shi, D.-H. 1981. General formulas for calculating the steady-state frequency of system failure. IEEE Trans. Reliability R-30, 444-447.

11. Shooman, M. L. 1968. Probabilistic reliability: an engineering approach, New York: McGraw-Hill.

12. Singh, C. & Billinton, R. 1974. A new method to determine the failure frequency of a complex system. IEEE Trans. Reliability R-23, 231-234.

13. Singh, C. & Billinton, R. 1977. System reliability modelling and evaluation. London: Hutchinson.

14. Stanley, R. P. 1997. Enumerative combinatorics, volume 1, chapter 4. Cambridge: Cambridge University Press.

15. Tanguy, C. 2007. What is the probability of connecting two points? J. Phys. A: Math. Theor. 40, 14099-14116.

16. Tanguy, C. 2008. Asymptotic Mean Time To Failure and Higher Moments for Large, Recursive Networks. (submitted).

i Надоели баннеры? Вы всегда можете отключить рекламу.