CC BY
BOUNDS FOR THE RELIABILITY OF DIFFERENT REDUNDANT SYSTEMS

H. Schäbe

TÜV Rheinland InterTraffic GmbH, 51105 Cologne, Germany, e-mail: [email protected]

I. Shubinsky

ul. Nizhegorodskaya 32, str. 15, room 310, 109029 Moscow, Russia, e-mail: [email protected]

ABSTRACT

In many cases, reliability can be improved by using redundant components. This approach is applied especially in information networks. In this paper we study redundant systems with imperfect switches. We show that there exists a limit as the number of redundant components tends to infinity. This limit is computed for components with exponential lifetime distributions, which is the typical distribution for digital equipment used in information systems. For components with distributions belonging to the NBUE or HNBUE classes, bounds are derived.

1. INTRODUCTION

There are two main ways to improve the reliability of a system. The first is to improve the reliability of the components; the second is to implement redundancy. Usually this is done by using more than one component to fulfil the same function, see e.g. Barlow & Proschan (1975). Redundancy means that a technical system contains more means to ensure a function than the necessary minimum. If one disregards factors such as cost and required space, one might conclude that system reliability can be improved to an arbitrarily high level by using redundant items. In this paper we discuss whether this is indeed possible. Redundant components are used mainly in networks, especially in telecommunication networks: if a certain link or node fails, traffic is rerouted to other nodes and links.

In this paper we will show that, under several assumptions, reliability cannot be improved further than to a certain limit.

In section 2 we describe the main assumptions of our model. In the following two sections we consider two extremal modes of standby: hot standby and cold standby. Hot standby means that the load on the standby component is the same as on the main component and that no load sharing between the redundant components occurs. Cold standby describes a situation where the redundant devices do not age at all during their standby phase, i.e. while the main component provides the service. All other modes of standby correspond to ageing behaviour between these two extremes.

In the third section we describe the situation of hot standby, the worst case regarding ageing.

In the fourth section, we discuss the situation of cold standby, no ageing of the standby components.

Section five provides an example and in section six we give a summary and conclusion.

2. MAIN ASSUMPTIONS

For the model, the following assumptions shall hold:

a) Detection of a failure and switching to another component are not perfect and may fail. The probability of failure of switching from the failed component to the redundant one covers the failure of the switch itself when the component failure has been detected, the failure of the detection mechanism when the switch is working, and the failure of both switch and detection mechanism. The resulting probability is denoted by y.

b) The lifetime of the components is random and follows the lifetime distribution F(x) with F(0) = 0 and $\lim_{x \to \infty} F(x) = 1$.

c) The failure times of all redundant components are completely statistically independent from each other.

d) The number of redundant components is not limited.

e) All redundant components have the same lifetime distribution.

f) The lifetime distribution of the components is continuous, differentiable and has a finite mean.

The model has been described in more detail in Shubinsky (2012).

Parallel systems with imperfect switching to redundant components will be called imperfect systems in this paper.

The following figure shows an example of a system with redundant components. Each of the m, possibly different, components has n redundant replications. We will study this type of system for $n \to \infty$.

Figure 1. System with redundant components

In the following subsections we will simplify the system in figure 1 by considering only one component with its redundant replications.

3. HOT STANDBY

For hot standby, all components are under full load from the beginning. So this is in fact a situation of a simple parallel system. Assume that a component with lifetime distribution F(x) is connected in parallel with all its replications. The following figure 2 shows the reliability block diagram of the system. Assume that n components are connected in parallel.

Figure 2. System with parallel structure of components ($k \to \infty$)

The lifetime distribution of the parallel system with hot standby can now be computed as follows.

In order to achieve a redundancy of level k, i.e. k functioning components, k-1 successful switch-overs are necessary, followed by a failure on the k-th switch-over.

The probability of this event is $(1-y)^{k-1} y$. The distribution function of k identical units with lifetime distribution F(x) connected in parallel is

$$F(x)^k. \tag{1}$$

Combining both expressions and summing up, we arrive at

$$\sum_{i=1}^{k} y (1-y)^{i-1} F(x)^i. \tag{2}$$

If now k tends to infinity, this gives

$$\sum_{i=1}^{\infty} y (1-y)^{i-1} F(x)^i = \frac{y F(x)}{1-(1-y)F(x)} = G(x), \tag{3}$$

where G(x) denotes the distribution function of the lifetime of the redundant system. Note that the lifetime distribution of the parallel system is given by an analytic expression. Moreover, one can observe that

$$G(x) = \frac{y F(x)}{1-(1-y)F(x)} \le F(x), \tag{4}$$

which follows easily from

$$y F(x) \le F(x) - (1-y)F(x)^2, \quad \text{i.e.} \quad (1-y)F(x) \ge (1-y)F(x)^2.$$

The latter is obvious since

$$F(x) \ge F(x)^2.$$

From (4) one can see that G(x) is smaller than the distribution function of a single component, but even in the limiting case the failure probability does not vanish. It vanishes only for perfect switching, i.e. y = 0. For all positive values of y, i.e. for imperfect switching, G(x) forms a lower bound for the distribution functions of all systems with a large but finite number of redundant elements. Now we can compute the mean lifetime by

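$$m_G = \int_0^\infty (1-G(x))\,dx = \int_0^\infty \frac{1-F(x)}{1-(1-y)F(x)}\,dx. \tag{5}$$

For an exponential distribution $F(x) = 1-\exp(-\lambda x)$, one computes

$$m_G = \int_0^\infty \frac{\exp(-\lambda x)}{1-(1-y)(1-\exp(-\lambda x))}\,dx = \int_0^\infty \frac{\exp(-\lambda x)}{y+(1-y)\exp(-\lambda x)}\,dx = -\frac{1}{\lambda}\,\frac{\ln(y)}{1-y}. \tag{6}$$

For $y \to 1$ this gives $1/\lambda$, which is the result for the exponential distribution without redundancy. Again, for imperfect switching, $m_G$ always stays bounded and its value is determined by $m_F$ and y.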
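The limit formula and the exponential mean can be checked numerically. The following is a minimal sketch, not part of the original paper: the function names, the parameter values (lam = 1, y = 0.1) and the midpoint-rule integration are illustrative assumptions. It compares the geometric series with its closed form $yF/(1-(1-y)F)$, confirms that G does not exceed F on a grid, and compares the numerically integrated mean with $-\ln(y)/(\lambda(1-y))$.

```python
import math

def hot_standby_cdf(F, y):
    """Closed form of the limiting lifetime CDF: G = y F / (1 - (1 - y) F)."""
    return y * F / (1.0 - (1.0 - y) * F)

def hot_standby_cdf_series(F, y, terms=2000):
    """Partial sum of sum_i y (1-y)^(i-1) F^i, used only for comparison."""
    return sum(y * (1.0 - y) ** (i - 1) * F ** i for i in range(1, terms + 1))

def mean_lifetime_exponential(lam, y, steps=200_000):
    """Midpoint-rule approximation of int_0^inf (1 - G(x)) dx
    for exponential components, F(x) = 1 - exp(-lam x)."""
    x_max = 60.0 / lam            # the integrand is negligible beyond this point
    h = x_max / steps
    total = 0.0
    for j in range(steps):
        x = (j + 0.5) * h
        F = 1.0 - math.exp(-lam * x)
        total += 1.0 - hot_standby_cdf(F, y)
    return total * h

# Illustrative values only (assumptions, not taken from the paper).
lam, y = 1.0, 0.1
closed_form = -math.log(y) / (lam * (1.0 - y))
numeric = mean_lifetime_exponential(lam, y)
print("closed form:", closed_form, "numeric:", numeric)
```

With perfect components replaced by any other lifetime distribution, only the line computing F inside the integration loop would need to change.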

Now, for a distribution function that belongs to the NBUE (new better than used in expectation) or NWUE (new worse than used in expectation) family, we can show that an expression analogous to (6) is an upper (lower) bound on the mean value of the distribution function G.

A lifetime distribution function belongs to the class NBUE (NWUE) if it satisfies

$$\int_x^\infty (1-F(t))\,dt \le (\ge)\; m_F (1-F(x)),$$

where $m_F$ is the mean of F(x), see e.g. Barlow and Proschan (1975).

If now F(x) belongs to the class NBUE (or NWUE), the following inequality holds:

$$m_G \le (\ge)\; -m_F \ln(y)/(1-y). \tag{7}$$

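This result can be proven as follows. We rewrite (5) in the following form:

$$m_G = \int_0^\infty \frac{1-F(x)}{1-(1-y)F(x)}\,dx = -\int_0^\infty \frac{1}{1-(1-y)F(x)}\; d\!\left(\int_x^\infty (1-F(t))\,dt\right). \tag{8}$$

Integrating this expression by parts, we arrive at

$$m_G = \frac{m_F}{1-(1-y)F(0)} + \int_0^\infty \left(\int_x^\infty (1-F(t))\,dt\right) d\!\left(\frac{1}{1-(1-y)F(x)}\right), \tag{9}$$

where the first term equals $m_F$ because F(0) = 0. Using the NBUE (NWUE) property, this can be rewritten as

$$m_G \le (\ge)\; \frac{m_F}{1-(1-y)F(0)} + \int_0^\infty m_F (1-F(x))\; d\!\left(\frac{1}{1-(1-y)F(x)}\right), \tag{10}$$

and integrating by parts again,

$$m_G \le (\ge)\; m_F \int_0^\infty \frac{dF(x)}{1-(1-y)F(x)} = -m_F \ln(y)/(1-y). \tag{11}$$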

This proves (7).
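The bound (7) can be illustrated numerically. The sketch below is an illustration under stated assumptions, not the authors' computation: it uses two Weibull distributions with unit scale that are not discussed in the paper (shape 2, which has an increasing failure rate and is therefore NBUE, and shape 1/2, which has a decreasing failure rate and is therefore NWUE), a hypothetical value y = 0.2, and a simple midpoint rule on a truncated range.

```python
import math

def mean_hot_standby(survival, y, x_max, steps=200_000):
    """Midpoint-rule value of m_G = int_0^inf (1-F)/(1-(1-y)F) dx,
    where survival(x) returns 1 - F(x)."""
    h = x_max / steps
    total = 0.0
    for j in range(steps):
        s = survival((j + 0.5) * h)
        # 1 - (1 - y) F = y + (1 - y)(1 - F)
        total += s / (y + (1.0 - y) * s)
    return total * h

def mean_bound(m_F, y):
    """Right-hand side of the bound: -m_F ln(y) / (1 - y)."""
    return -m_F * math.log(y) / (1.0 - y)

y = 0.2  # illustrative switching failure probability (assumption)

# Weibull, shape 2, scale 1: survival exp(-x^2), mean Gamma(3/2); NBUE case.
m_F_nbue = math.gamma(1.5)
m_G_nbue = mean_hot_standby(lambda x: math.exp(-x * x), y, x_max=10.0)

# Weibull, shape 1/2, scale 1: survival exp(-sqrt(x)), mean Gamma(3); NWUE case.
m_F_nwue = math.gamma(3.0)
m_G_nwue = mean_hot_standby(lambda x: math.exp(-math.sqrt(x)), y, x_max=2500.0)

print("NBUE:", m_G_nbue, "<=", mean_bound(m_F_nbue, y))
print("NWUE:", m_G_nwue, ">=", mean_bound(m_F_nwue, y))
```

The truncation points x_max are chosen so that the neglected tail of the integrand is far below the gap between the computed mean and the bound.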

Using the expression (3), we can derive an inequality for the residual life function $T_{RL}$. The latter is defined by

$$T_{RL}(x) = \int_x^\infty (1-G(t))\,dt.$$

Using (3) we arrive at

$$T_{RL}(x) = \int_x^\infty \frac{1-F(t)}{1-(1-y)F(t)}\,dt = -\int_x^\infty \frac{1}{1-(1-y)F(t)}\; d\!\left(\int_t^\infty (1-F(s))\,ds\right).$$

Integrating by parts, we get

$$T_{RL}(x) = \frac{1}{1-(1-y)F(x)} \int_x^\infty (1-F(t))\,dt + \int_x^\infty \left(\int_t^\infty (1-F(s))\,ds\right) d\!\left(\frac{1}{1-(1-y)F(t)}\right).$$

For an NBUE (NWUE) distribution this leads to

$$T_{RL}(x) \le (\ge)\; \frac{m_F (1-F(x))}{1-(1-y)F(x)} + \int_x^\infty m_F (1-F(t))\; d\!\left(\frac{1}{1-(1-y)F(t)}\right).$$

Integrating by parts again, the right-hand side equals

$$-m_F \int_x^\infty \frac{d(1-F(t))}{1-(1-y)F(t)} = m_F \int_x^\infty \frac{dF(t)}{1-(1-y)F(t)} = \frac{m_F}{1-y} \ln\!\left(\frac{1-(1-y)F(x)}{y}\right).$$

Putting everything together, we arrive at

$$T_{RL}(x) \le (\ge)\; \frac{m_F}{1-y} \ln\!\left(\frac{1-(1-y)F(x)}{y}\right).$$

For x = 0 this reduces to (7). For the exponential distribution, equality holds.
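The equality for the exponential distribution can be checked numerically. The sketch below assumes that the residual life bound reads $\frac{m_F}{1-y}\ln\frac{1-(1-y)F(x)}{y}$, which is the form obtained by carrying out the two integrations by parts described above; the function names and the values lam = 1, y = 0.2 are illustrative assumptions.

```python
import math

def residual_life_numeric(lam, y, x, steps=200_000):
    """Midpoint-rule value of int_x^inf (1 - G(t)) dt for exponential
    components, where 1 - G = (1 - F) / (1 - (1 - y) F)."""
    upper = x + 80.0 / lam        # the integrand is negligible beyond this point
    h = (upper - x) / steps
    total = 0.0
    for j in range(steps):
        s = math.exp(-lam * (x + (j + 0.5) * h))   # s = 1 - F(t)
        total += s / (y + (1.0 - y) * s)
    return total * h

def residual_life_closed(lam, y, x):
    """Assumed closed form: (m_F / (1 - y)) ln((1 - (1 - y) F(x)) / y), m_F = 1/lam."""
    F = 1.0 - math.exp(-lam * x)
    return (1.0 / lam) / (1.0 - y) * math.log((1.0 - (1.0 - y) * F) / y)

lam, y = 1.0, 0.2   # illustrative values (assumptions)
for x in (0.0, 0.5, 2.0, 5.0):
    print(x, residual_life_numeric(lam, y, x), residual_life_closed(lam, y, x))
```

At x = 0 both columns coincide with the mean lifetime $-\ln(y)/(\lambda(1-y))$ of the exponential case.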

4. COLD STANDBY

The case of cold standby is the other extremal case. Here, the lifetime distribution of a parallel system is computed by

$$G(x) = \sum_{i=1}^{\infty} y (1-y)^{i-1} F^{(i)}(x), \tag{12}$$

where $F^{(i)}(x)$ denotes the i-fold convolution of the distribution function F(x). The convolution is defined by

$$F^{(1)}(x) = F(x)$$

for the first-order convolution; all higher orders are defined iteratively by

$$F^{(k+1)}(x) = \int_0^x F^{(k)}(x-t)\,dF(t). \tag{13}$$

Formula (12) is derived from the probability $(1-y)^{i-1} y$ that the system fails at the i-th switch-over, and from the lifetime distribution $F^{(i)}(x)$ of i successively used components.

For the type of distributions given by (12), a general analytical solution does not exist. However, the following results can easily be obtained.

For an exponential distribution with density $f(x) = \lambda \exp(-\lambda x)$ one obtains (see Shubinsky (2012))

$$G(x) = 1-\exp(-\lambda y x). \tag{14}$$

If y = 1 (switching always fails), we arrive at the usual exponential distribution of a single component. The result (14) can easily be derived by using

$$f^{(k)}(x) = \lambda^k x^{k-1} \exp(-\lambda x)/(k-1)! \tag{15}$$

and computing the density g(x).
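The exponential cold standby result can also be illustrated by simulation. The following Monte Carlo sketch is an illustration under stated assumptions, not the authors' procedure: the function name, the seed, the sample size and the values lam = 1, y = 0.25 are hypothetical. Components are used one after another, and after each component failure the switch-over fails with probability y.

```python
import math
import random

def simulate_cold_standby(lam, y, rng):
    """One system lifetime: exponential components are used successively
    until a switch-over fails (probability y per switch-over)."""
    lifetime = rng.expovariate(lam)
    while rng.random() >= y:              # switch-over succeeds with probability 1 - y
        lifetime += rng.expovariate(lam)
    return lifetime

rng = random.Random(12345)                # fixed seed for reproducibility
lam, y, n = 1.0, 0.25, 100_000            # illustrative values (assumptions)
samples = [simulate_cold_standby(lam, y, rng) for _ in range(n)]

mean_sim = sum(samples) / n
mean_formula = 1.0 / (lam * y)            # mean of 1 - exp(-lam y x)

x0 = 2.0
cdf_sim = sum(s <= x0 for s in samples) / n
cdf_formula = 1.0 - math.exp(-lam * y * x0)

print("mean:", mean_sim, "vs", mean_formula)
print("G(2):", cdf_sim, "vs", cdf_formula)
```

The simulated mean and the empirical distribution function should agree with the exponential law of rate $\lambda y$ up to sampling noise.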

Using results of Schäbe (1986), we can also derive other analytical results for special gamma distributions with densities of the form

$$f(x) = \lambda^a x^{a-1} \exp(-\lambda x)/\Gamma(a). \tag{16}$$

The results are given in the following table.

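Table 1. Density functions g(x) of the parallel system for special gamma densities f(x).

Parameter | Density g(x) of the parallel system

a = 1/2 | $y \sqrt{\lambda/(\pi x)}\, \exp(-\lambda x) + \lambda y (1-y) \exp(-\lambda (1-(1-y)^2) x)\, \mathrm{erfc}(-(1-y)\sqrt{\lambda x})$

a = 1 | $\lambda y \exp(-\lambda y x)$

a = 2 | $\frac{\lambda y}{2\sqrt{1-y}} \left( \exp(-(1-\sqrt{1-y})\lambda x) - \exp(-(1+\sqrt{1-y})\lambda x) \right)$

a = 3 | $\frac{\lambda y}{3(1-y)^{2/3}} \exp(-\lambda x) \left( \exp(\lambda x (1-y)^{1/3}) - 2 \exp(-\lambda x (1-y)^{1/3}/2) \cos\!\left(\tfrac{\sqrt{3}}{2} \lambda x (1-y)^{1/3} - \pi/3\right) \right)$

a = 4 | $\frac{\lambda y}{2(1-y)^{3/4}} \exp(-\lambda x) \left( \sinh(\lambda (1-y)^{1/4} x) - \sin(\lambda (1-y)^{1/4} x) \right)$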
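The table entries can be cross-checked against the series (12). The sketch below rests on two assumptions that should be stated plainly: the closed forms coded in `g_closed` are the expressions obtained by inverting the Laplace transform $y\lambda^a/((\lambda+s)^a-(1-y)\lambda^a)$ and are taken here as the intended table entries, and the function names and test values are hypothetical. It uses the fact that the i-fold convolution of a gamma density with shape a is a gamma density with shape ia.

```python
import math

def g_series(x, a, lam, y, terms=400):
    """Density of the cold standby system as the series
    sum_i y (1-y)^(i-1) * gamma(i*a, lam) density at x (requires x > 0, 0 < y < 1)."""
    total = 0.0
    for i in range(1, terms + 1):
        log_term = (math.log(y) + (i - 1) * math.log(1.0 - y)
                    + i * a * math.log(lam) + (i * a - 1.0) * math.log(x)
                    - lam * x - math.lgamma(i * a))
        total += math.exp(log_term)
    return total

def g_closed(x, a, lam, y):
    """Assumed closed forms for a = 1/2, 1, 2, 3, 4."""
    if a == 0.5:
        return (y * math.sqrt(lam / (math.pi * x)) * math.exp(-lam * x)
                + lam * y * (1 - y) * math.exp(-lam * (1 - (1 - y) ** 2) * x)
                * math.erfc(-(1 - y) * math.sqrt(lam * x)))
    if a == 1:
        return lam * y * math.exp(-lam * y * x)
    if a == 2:
        c = math.sqrt(1 - y)
        return lam * y / (2 * c) * (math.exp(-(1 - c) * lam * x)
                                    - math.exp(-(1 + c) * lam * x))
    if a == 3:
        c = (1 - y) ** (1.0 / 3.0)
        z = c * lam * x
        return (lam * y / (3 * c * c) * math.exp(-lam * x)
                * (math.exp(z) - 2 * math.exp(-z / 2)
                   * math.cos(math.sqrt(3) / 2 * z - math.pi / 3)))
    if a == 4:
        c = (1 - y) ** 0.25
        z = c * lam * x
        return (lam * y / (2 * c ** 3) * math.exp(-lam * x)
                * (math.sinh(z) - math.sin(z)))
    raise ValueError("no closed form coded for this shape parameter")

lam, y = 1.5, 0.3   # illustrative values (assumptions)
for a in (0.5, 1, 2, 3, 4):
    print(a, g_series(1.0, a, lam, y), g_closed(1.0, a, lam, y))
```

Agreement of the two columns for several x values is a consistency check of the closed forms, not a proof.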

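Also, it has been shown in Schäbe (1986) that

$$m_G = m_F/y. \tag{17}$$

This agrees with the fact that the number of successively used components is geometrically distributed with mean 1/y. Therefore, no approximation for $m_G$ needs to be given.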

One may note that the mean is limited, even if the number of redundant devices becomes infinite. The distribution function G(x) has no closed-form expression in the general case, so it is worthwhile to have a bound on it. In Schäbe (1986) it has been shown in Theorem 3.2 that if F belongs to the class NBUE (NWUE), the same holds for G. An analogous result has been proven in Theorem 3.4 for the classes HNBUE (harmonic new better than used in expectation) and HNWUE (harmonic new worse than used in expectation). The latter result can be used to give a bound on G. If F is HNBUE (HNWUE), the distribution G satisfies the following inequality for the residual life function, see Klefsjö (1982):

$$\int_x^\infty (1-G(t))\,dt \le (\ge)\; m_G \exp(-x/m_G) = (m_F/y) \exp(-y x/m_F). \tag{18}$$

This expression also shows that an infinite number of redundant devices cannot improve the residual life function beyond a certain value. For HNBUE distributions, we have thus derived an upper bound that holds for an infinitely increasing number of redundant devices.
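The bound (18) can be illustrated for gamma components with a = 2, which have an increasing failure rate and are therefore HNBUE. The sketch below is a hedged illustration: the closed form of the residual life used in `residual_life_gamma2` is an assumption, obtained by integrating twice the density $\frac{\lambda y}{2c}\left(e^{-(1-c)\lambda x} - e^{-(1+c)\lambda x}\right)$ with $c = \sqrt{1-y}$, and the function names and parameter values are hypothetical.

```python
import math

def residual_life_gamma2(x, lam, y):
    """Assumed closed form of int_x^inf (1 - G(t)) dt for gamma components, a = 2."""
    c = math.sqrt(1.0 - y)
    return (y / (2.0 * c * lam)) * (
        math.exp(-(1.0 - c) * lam * x) / (1.0 - c) ** 2
        - math.exp(-(1.0 + c) * lam * x) / (1.0 + c) ** 2
    )

def hnbue_bound(x, m_F, y):
    """Right-hand side of the bound: (m_F / y) exp(-y x / m_F)."""
    return (m_F / y) * math.exp(-y * x / m_F)

lam = 1.0
m_F = 2.0 / lam   # mean of a gamma density with shape 2 and rate lam
for y in (0.1, 0.5, 0.9):
    for x in (0.0, 1.0, 5.0, 30.0):
        print(y, x, residual_life_gamma2(x, lam, y), hnbue_bound(x, m_F, y))
```

At x = 0 both sides equal $m_F/y$, in line with (17); for x > 0 the residual life stays below the bound.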

5. EXAMPLE

In this section we show how the mean lifetime depends on the number of components used for redundancy and on the probability y of failure of switching for a cold standby system. From (14) we have

$$G(x) = 1-\exp(-\lambda y x).$$

For a system as in figure 1, consisting of m components connected in series, each having k redundant replications, this becomes

$$G(x) = 1-\exp(-\lambda y k x).$$

This distribution has mean $1/(\lambda y k)$. The ratio of the mean of the system with redundancy to the mean of a system consisting of one element with failure rate $\lambda$ is

$$R = 1/(y k).$$

Let us now denote by a = 1-y the probability that detection of a fault and switching to the redundant component is successful.

Figure 3. Relation of means R = 1/(yk) depending on a (horizontal axis: a from 0.80 to 0.95; vertical axis: R from 1 to 21; one curve per value of k).


For k = 1 the mean lifetime is plotted as a simple line. One can observe that the mean lifetime grows with increasing degree of redundancy (k). The mean lifetime also increases with increasing a, i.e. with increasing quality of switching.

6. DISCUSSION AND CONCLUSIONS

Now we can provide the following limits for the different types of systems.

Table 2. Overview of the limit values for parallel systems with an infinite number of components.

Characteristic | Limit for hot standby | Limit for cold standby

G(x) | $G(x) = \frac{y F(x)}{1-(1-y)F(x)}$ | $G(x) = \sum_{i=1}^{\infty} y (1-y)^{i-1} F^{(i)}(x)$

$m_G$ | $\le (\ge)\; -m_F \ln(y)/(1-y)$ for F being NBUE (NWUE); equality for the exponential distribution | $m_G = m_F/y$

Residual life $\int_x^\infty (1-G(t))\,dt$ | $\le (\ge)\; \frac{m_F}{1-y} \ln\!\left(\frac{1-(1-y)F(x)}{y}\right)$ for F being NBUE (NWUE); equality holds for the exponential distribution | $\le (\ge)\; (m_F/y) \exp(-y x/m_F)$ for F being HNBUE (HNWUE); equality holds for the exponential distribution

Note that the limit itself is an upper bound for systems with a finite number of redundant components. So the upper bounds for real systems with a finite number of components are given by the NBUE / HNBUE limits. These are given in Table 3.

Table 3. Upper bounds for imperfect parallel systems.

Characteristic | Limit for hot standby | Limit for cold standby

G(x) | $G(x) = \frac{y F(x)}{1-(1-y)F(x)}$ | $G(x) = \sum_{i=1}^{\infty} y (1-y)^{i-1} F^{(i)}(x)$

$m_G$ | $-m_F \ln(y)/(1-y)$ for F being NBUE | $m_G = m_F/y$

Residual life $\int_x^\infty (1-G(t))\,dt$ | $\frac{m_F}{1-y} \ln\!\left(\frac{1-(1-y)F(x)}{y}\right)$ for F being NBUE | $(m_F/y) \exp(-y x/m_F)$ for F being HNBUE

An imperfect system cannot achieve better values than given in the table above for components that satisfy the NBUE or HNBUE property.

In this paper we have obtained distribution functions for parallel systems in the case that switching to redundant devices is not perfect. It has turned out that there exists a limit and reliability cannot be improved up to 1. This can only be reached if switching is perfect.

This implies that, at a certain stage of system development, it is more worthwhile to improve the reliability of the switching algorithm than to implement further redundant devices.

REFERENCES

Barlow, R.E., Proschan, F., Statistical Theory of Reliability, Holt, Rinehart & Winston, New York, 1975.

Klefsjö, B., The HNBUE and HNWUE classes of life distributions, Naval Res. Logist. Quart. 29 (1982), 331-344.

Shubinsky, I., Structural redundancy in information systems. Estimates for boundary values, Reliability, No. 1 (40), 2012, 118-133 (in Russian: Шубинский И.Б. Структурное резервирование в информационных системах. Предельные оценки. - Надежность).

Schäbe, H., A renewal process with information loss, Journal of Information Processing and Cybernetics, vol. 22 (1986), no. 7/8, 423-428.
