Operational Risk Modeling Analytics phần 4 ppt

122 MODELS FOR THE NUMBER OF LOSSES: COUNTlNG DISTRIBUTIONS It is also interesting that the special extreme case with -1 < r < 0 and p -+ 00 is a proper distribution, sometimes called th

Trang 1

THE (a, b, 1) CLASS 121

If the original values were ail available, then the zero-truncated probabilities could have all been obtained by multiplying the original values by 1/(1 - 0.362887) = 1.569580

For the zero-modified random variable, pf = 0.6 arbitrarily From (5.4),

p r = (1 - 0.6)(0.302406)/(1 - 0.362887) = 0.189860 Then

p? = 0.189860 (5 + $+) = 0.110752,

p y = 0.110752 (5 + $4) = 0.055376

In this case, each original negative binomial probability has been multiplied

by (1 - 0.6)/(1 - 0.362887) = 0.627832 Also note that, for j 2 1, p y =

Although we have only discussed the zero-modified distributions of the (a, b, 0) class, the (a, b, 1) class admits additional distributions The (a, b) parameter space can be expanded to admit an extension of the negative binomial distribution to include cases where -1 < T < 0 For the (a, b, 0) class,

T > 0 is required By adding the additional region to the sample space, the

“extended” truncated negative binomial (ETNB) distribution has parameter restrictions ,B > 0, T > -1, T # 0

To show that the recursive equation

p k = p k - l ( U S - 3 , k = 2 , 3 , , (5.8) with po = 0 defines a proper distribution, it is sufficient to show that for any value of pl , the successive values of pk obtained recursively are each positive and that C&pk < co For the ETNB, this must be done for the parameter

Trang 2

122 MODELS FOR THE NUMBER OF LOSSES: COUNTlNG DISTRIBUTIONS

It is also interesting that the special extreme case with -1 < r < 0 and

p -+ 00 is a proper distribution, sometimes called the Sibuya distribution

It has pgf P ( z ) = 1 - (1 - z ) - ~ , and no moments exist (see Exercise 5.8) Distributions with no moments are not particularly interesting for modeling loss numbers (unless the right tail is subsequently modified) because an infinite number of losses are expected If this is the case, the risk manager should be fired!

Example 5.7 Determine the probabilities for an ETNB distribution with r = -0.5 and /3 = 1 Do this both for the truncated version and for the modified version with p f = 0.6 set arbitrarily

We have a = 1/(1 + 1) = 0.5 and b = (-0.5 - 1)(1)/(1 + 1) = -0.75 We also have p r = -0.5(1)/[(1 + l)0.5 - (1 + l)] = 0.853553 Subsequent values are

p; = ( 0.5 - - O,,) (0.853553) = 0.106694,

p; = ( 0.5 - - O,,) (0.106694) = 0.026674

For the modified probabilities, the truncated probabilities need to be multiplied by 0.4 to produce p y = 0.341421, p y = 0.042678, and p y = 0.010670 Note: A reasonable question is to ask if there is a “natural” member of the ETNB distribution, that is, one for which the recursion would begin with pl

rather than pa For that to be the case, the natural value of po would have

to satisfy pl = (0.5 - 0.75/l)p0 = -0.25~0 This would force one of the two probabilities to be negative and so there is no acceptable solution It is easy

0

to show that this occurs for any r < 0

There are no other members of the (a, b, 1) class beyond those discussed above A summary is given in Table 5.4

5.7 COMPOUND FREQUENCY MODELS

A larger class of distributions can be created by the processes of compounding any two discrete distributions The term compounding reflects the idea that

the pgf of the new distribution P(z) is written as

P ( z ) = PrV LPM ( z ) ] 7 (5.11) where PN(z) and PM ( z ) are called the primary and secondary distributions, respectively

The compound distributions arise naturally as follows Let N be a counting random variable with pgf PN(z) Let M I , M2, be identically and

Trang 3

COMPOUND FREQUENCY MODELS 123

Table 5.4 Members of the (a, b, 1) class

bExcluding T = 0, which is the logarithmic distribution

independently distributed counting random variables with pgf PM ( 2 ) As- suming that the Mjs do not depend on N , the pgf of the random sum S =

M I + M2 + + MN (where N = 0 implies that S = 0) is Ps ( Z ) = PN [PM ( 2 ) )

This is shown as

Trang 4

124 MODELS FOR THE NUMBER OF LOSSES: COUNTlNG DlSTRlBUTlONS

the number of losses (errors, injuries, failures, etc.) from the events, then

S represents the total number of losses for all such events This kind of in- terpretation is not necessary to justify the use of a compound distribution

If a compound distribution fits data well, that may be enough justification itself Also, there are other motivations for these distributions, as presented

in Section 5.9

Example 5.8 Demonstrate that any zero-modified distribution is a compound distribution

Consider a primary Bernoulli distribution It has pgf PN(z) = 1 - q + 42

Then consider an arbitrary secondary distribution with pgf PM(z) Then, from formula (5.11) we obtain

PS(z) = PN[PM(z)] = 1 - q + q p M ( z ) From formula (5.3), it is clear that this is the pgf of a ZM distribution with

This distribution is called the Poisson-Poisson or Neyrnan Type A distri-

bution Let PN(z) = e and Phf(z) = eA2('-') Then

Trang 5

COMPOUND FREQUENCY MODELS 125

When X2 is a lot larger than X I (for example, XI = 0.1 and Xz = 10) the

17 resulting distribution will have two local modes

Example 5.10 Demonstrate that the Poisson-logarithmic distribution is a negative binomia1,as compound Poisson-logarithmic distribution

The negative binomial distribution has pgf

where r = A/ ln(l+P) This shows that the negative binomial distribution can

be written as a compound Poisson distribution with a logarithmic secondary

Now, PM(T) can always be written as

h f ( z ) = fo + (1 - fo)Pif(.) (5.12) where P&(z) is the pgf of the conditional distribution over the positive range (in other words, the zero-truncated version)

Theorem 5.11 Suppose the p g f P N ( z ; 0) satisfies

PN(z; 0) = B[O(Z - l)]

for some parameter 0 and some function B ( z ) that is independent of0 That

is, the parameter 0 and the argument z only appear in the pgf as O(z - 1)

Trang 6

126 MODELS FOR THE NUMBER OF LOSSES: COUNTING DISTRIBUTIONS

There may be other parameters as well, and they may appear anywhere in the

pgf Then Ps(z) = P l v [ P ~ ( z ) ; 61 can be rewritten as

5.8 RECURSIVE CALCULATION OF COMPOUND PROBABILITIES

The probability of exactly k losses can be written as

where fin, k = O , l , , is the n-fold convolution of the function f k , k =

0,1, ., that is, the probability that the sum of n random variables which are each independent and identically distributed (iid) with probability function

f k will take on value k

oi)

n = O

Trang 7

RECURSIVE CALCULATION OF COMPOUND PROBABILITIES 127

When P ~ y ( z ) is chosen to be a member of the (a, b, 0) class,

Trang 8

128 MODELS FOR THE NUMBER OF LOSSES: COUNTING DlSTRlBUTlONS

Therefore,

Rearrangement yields the recursive formula (5.16) 0

This recursion (5.16) has become known as the Panjer recursion after its introduction as a computational tool for aggregate losses by Panjer [88] Its use here is numerically equivalent to its use for aggregate losses in Chapter 6

In order to use the recursive formula (5.16), the starting value go is required and is given in Theorem 5.15

Theorem 5.13 If the primary distribution is a member of the (a, b, 1) class, the recursive formula is

[pl - (a + b)PO]fk + c:=, (a + b j / k ) f j g k - j

g k = , k = 1 , 2 , 3 , (5.17) Proof: It is similar to the proof of Theorem 5.12 and is left to the reader 0

Example 5.14 Develop the Panjer recursive formula for the case where the primary distribution is Poisson

1 - afo

In this case a = 0 and b = A, yielding the recursive form

The starting value is, from (5.11),

go = P r ( S = 0 ) = P(O)

= e v j P M ( 0 ) l = P N ( f 0 )

-

- , - W - f o ) (5.18) Distributions of this type are called compound Poisson distributions’ When the secondary distribution is specified, the compound distribution is called

0

Poisson-X, where X is the name of the secondary distribution

The method used to obtain go applies to any compound distribution Theorem 5.15 For any compound distribution, go = PN( f o ) , where PN(z) is the pgf of the primary distribution and fo is the probability that the secondary distribution takes on the value zero

‘In some textbooks, the term conipound distribution, as in “compound Poisson,” refers to what are called in this book “mixed distributions.”

Trang 9

RECURSIVE CALCULATION O f COMPOUND PROBABILITIES 129

Note that the secondary distribution is not required to be in any special form However, to keep the number of distributions manageable, secondary distributions will be selected from the (a, b, 0) or the (a, b, 1) class

Example 5.16 Calculate the probabilities for the Poisson-ETNB distribution where X = 3 for the Poisson distribution and the E T N B distribution has

r = -0.5 and f l = 1

From Example 5.7 the secondary probabilities are fo = 0, f1 = 0.853553,

f2 = 0.106694, and f3 = 0.026674 From equation (5.18), go = exp[-3(1 - O)] = 0.049787 For the Poisson primary distribution, a = 0 and b = 3 The recursive formula (5.16) becomes

X can be any distribution

Example 5.17 Determine the probabilities for a Poisson-zero-modified E T N B distribution where the parameters are X = 7.5, p f = 0.6, r = -0.5, and /3 = 1 From Example 5.7 the secondary probabilities are fo = 0.6, f l = 0.341421,

f 2 = 0.042678, and f3 = 0.010670 From equation (5.18), go = exp[-7.5(1 - 0.6)] = 0.049787 For the Poisson primary distribution, a = 0 and b = 7.5 The recursive formula (5.16) becomes

Trang 10

5.9 A N INVENTORY OF DISCRETE DISTRIBUTIONS

In the previous sections of this chapter, we have introduced the simple (a, b, 0) class, generalized to the (a, b, 1) class, and then used compounding to create

a larger class of distributions In this section, we summarize the distributions introduced in those sections

There are relationships among the various distributions similar to those of Section 4.3.2 The specific relationships are given in Table 5.5

It is clear from earlier developments that members of the (a, b,O) class are special cases of members of the (a, b, 1) class and that zero-truncated distributions are special cases of zero-modified distributions The limiting cases are best discovered through the probability generating function, as was done on page 113, where the Poisson distribution is shown to be a limiting case of the negative binomial distribution

We have not listed compound distributions where the primary distribution

is one of the two parameter models such as the negative binomial or Poisson- inverse Gaussian This was done because these distributions are often them- selves compound Poisson distributions and, as such, are generalizations of distributions already presented This collection forms a particularly rich set

of distributions in terms of shape However, many other distributions are also possible Many others are discussed in Johnson, Kotz, and Kemp [65], Douglas [24], and Panjer and Willmot [93]

The (a, b, 0) class

Trang 11

AN INVENTORY OF DISCRETE DISTRIBUTIONS 131

Table 5.5 Relationships among discrete distributions

Distribution Is a special case of Is a limiting case of Poisson

ZM geometric

Z T negative binomial

ZM negative binomial

Negative binomial Poisson-binomial Poisson-inv Gaussian Polya-Aepplia

Neyman-Ab

ZT negative binomial

ZM negative binomial Geometric-Poisson

ZM negative binomial

ZM binomial

ZM negative binomial, Poisson-ETNB

Poisson-ETNB Poisson-ETNB

Poisson-ETNB aAlso called Poisson-geometric

bAlso called Poisson-Poisson

Trang 12

To distinguish this class from the (a, b, 0) class, the probabilities are denoted

P r ( N = k ) = p p or Pr(N = k ) = p z depending on which subclass is being represented For this class, p f is arbitrary (that is, it is a parameter) and then p v or pT is a specified function of the parameters a and b Subsequent probabilities are obtained recursively as in the (a, b, 0) class: p p = (u +

b/Ic)pE1, k = 2 , 3 , ., with the same recursion for p; There are two sub- classes of this class When discussing their members, we often refer to the

“corresponding” member of the (a, b, 0) class This refers to the member of that class with the same values for a and b The notation Pl, will continue to

be used for probabilities for the corresponding (a, b, 0) distribution

The (a, b, 1) class

5.9.3 The zero-truncated subclass

The members of this class have p: = 0 and therefore it need not be estimated These distributions should only be used when a value of zero is impossible The first factorial moment is p(1) = (a + b ) / [ ( l - a ) ( l - P O ) ] , where po is the value for the corresponding member of the (a, b, 0) class For the logarithmic distribution (which has no corresponding member), p(1) = p/ In(l+P) Higher factorial moments are obtained recursively with the same formula as with the (a, b, 0) class The variance is ( a + b ) [ l - (u + b + l ) p o ] / [ ( l - a ) ( l - po)12.For

those members of the subclass that have corresponding (a, b, 0) distributions, P; = P d ( 1 -Po)

Trang 13

AN INVENTORY OF DISCRETE DISTRIBUTIONS 133

5.9.3.1 Zero-truncated Poisson

5.9.3.2 Zero-truncated geometric

PT =

P: = E[N] =

Trang 14

134 MODELS FOR THE NUMBER OF LOSSES: COUNTING DISTRIBUTIONS

5.9.4 The zero-modified subclass

A zero-modified distribution is created by starting with a truncated distribution and then placing an arbitrary amount of probability at zero This

probability, p f , is a parameter The remaining probabilities are adjusted accordingly Values of p f can be determined from the corresponding zero- truncated distribution as p f = (1 - p f ) p z or from the corresponding ( a , b, 0) distribution as p f = (1 - pf)pk/(l - PO) The same recursion used for the

zero-truncated subclass applies

The mean is 1 - p f times the mean for the corresponding zero-truncated distribution The variance is 1 - pf times the zero-truncated variance plus py(1-pf) times the square of the zero-truncated mean The probability generating function is PM(z) = p? +(1 -pf)P(z), where P(z) is the probability generating function for the corresponding zero-truncated distribution

Trang 15

AN INVENTORY OF DISCRETE DlSTRlBUTlONS 135

5.9.5 The compound class

Members of this class are obtained by compounding one distribution with another That is, let N be a discrete distribution, called the primary distribution and let M I , M 2 , be identically and independently distributed with another discrete distribution, called the secondary distribution The com-

pound distribution is S = M I +- .+ M N The probabilities for the compound distributions are found from the Panjer recursion

k

for k = 1,2, ., where a and b are the usual values for the primary distribution [which must be a member of the (a, b, 0) class] and f j is the probability from the secondary distribution The only two primary distributions listed here are Poisson (for which po = exp[-X(l - fo)]) and geometric [for which po = l/[l+P-pfo]] The probability generating function is P ( z ) = f " [ P ~ ( z ) ] In the following list the primary distribution is always named first For the first, second, and fourth distributions, the secondary distribution is the (a, b, 0) class member with that name

5.9.5.1 Poisson-binomial

This distribution has a Poisson primary distribution and a binomial secondary or, equivalently, a Poisson primary and a zero-truncated secondary distribution

5.9.5.2 Poisson-Poisson

The parameter A1 is for the primary Poisson distribution, and X2 is for the secondary Poisson distribution This distribution is also called the

Neyman Type A

5.9.5.3 Geometric-extended truncated negative binomial

The parameter ,& is for the primary geometric distribution The last two parameters are for the secondary distribution, noting that for T = 0 the secondary distribution is logarithmic The truncated version is used

so that the extension of r is available

Trang 16

136 MODELS FOR THE NUMBER OF LOSSES: COUNTING DISTRIBUTIONS

When r = 0 the secondary distribution is logarithmic, resulting in the negative binomial distribution This distribution is also called the generalized Poisson-Pascal

5.10 A HIERARCHY OF DISCRETE DISTRIBUTIONS

The following table indicates which distributions are special or limiting cases

of others For the special cases, one parameter is set equal t o a constant to create the special case For the limiting cases, two parameters go to infinity

or zero in some special way

Distribution Is a special case of Is a limiting case of Poisson

Negative binomial, Poisson-binomial, Poisson-inv Gaussian, Polya-Aeppli,

Ne y man- A

ZM negative binomial Geometric-Poisson

ZM negative binomial Poisson-ETNB Poisson-ETNB

Trang 17

FURTHER PROPERTIES OF THE COMPOUND POISSON CLASS 137

5.11 FURTHER PROPERTIES OF THE COMPOUND POISSON CLASS

Of central importance within the class of compound frequency models is the class of compound Poisson frequency distributions Physical motivation for this model arises from the fact that the Poisson distribution is often a good model to describe the number of loss-causing accidents, and the number of losses from an accident is often itself a random variable In addition, there are numerous convenient mathematical properties enjoyed by the compound Poisson class In particular, those involving recursive evaluation of the probabilities were also discussed in Section 5.9.5 In addition, there is a close connection between the compound Poisson distributions and the mixed Pois- son frequency distributions which is discussed in more detail in Section 5.13 Here we consider some other properties of these distributions The compound Poisson pgf may be expressed as

where Q(z) is the pgf of the secondary distribution

Example 5.18 Obtain the pgf for the Poisson-ETNB distribution and show that it looks like the pgf of a Poisson-negative binomial distribution

The ETNB distribution has pgf

[ I - p(z - 1)]+ - (1 +

Q(z) = 1 - ( 1 + /3)+

for P > 0, r > -1, and r # 0 Then the Poisson-ETNB distribution has as

the logarithm of its pgf

to estimation and analysis of the parameters

Trang 18

We can compare the skewness (third moment) of these distributions to develop an appreciation of the amount by which the skewness, and hence the tails of these distributions, can vary even when the mean and variance are fixed From equation (5.19) (see Exercise 5.14) and Definition 2.18, the mean and second and third central moments of the compound Poisson distribution are

Negative binomial: p3 = 3a2 - 2p + 2 - 'I2

Note that for fixed mean and variance the third moment only changes through the coefficient in the last term for each of the five distributions For the Poisson distribution, p3 = X = 3a2 - 2p, and so the third term for each expression for p 3 represents the change from the Poisson distribution For the Poisson-binomial distribution, if m = 1, the distribution is Poisson because

it is equivalent to a Poisson-zero-truncated binomial as truncation at zero

Trang 19

FURTHER PROPERTIES OF THE COMPOUND POISSON CLASS 139

leaves only probability at 1 Another view is that from the formula for the third moment (5.21), we have

Example 5.19 The data in Table 5.6 are taken from Hossack et al [55] and give the distribution of the number of losses on accidents involving automobiles

in Australia Determine an appropriate frequency model based on the skewness results of this section

The mean, variance, and third central moment are 0.1254614, 0.1299599,

and 0.1401737, respectively For these numbers,

p3 - 3a2 + 2p

(a2 - P I 2 / P = 7.543865

From among the Poisson-binomial, negative binomial, Polya-Aeppli, Neyman Type A, and Poisson-ETNB distributions, only the latter is appropriate For this distribution, an estimate of r can be obtained from

Trang 20

Table 5.6 Hossack et al data

{qn; n = 0,1,2, }, where qn = [Aiqn(l) + Azqn(2) + + A k q n ( k ) ] / A

Proof: Let Q i ( z ) = Cr=oqn(i)zn for i = 1 , 2 , , k Then Si has pgf

P , , ( z ) = E(zsS) = exp{Ai[Qi(z) - 11) Because the Sis are independent,

One main advantage of this result is computational If we are interested

in the sum of independent compound Poisson random variables, then we do not need to compute the distribution of each compound Poisson random variable separately (i.e., recursively using Example 5.14) because Theorem 5.20 implies that a single application of the compound Poisson recursive formula

in Example 5.14 will suffice The following example illustrates this idea

Example 5.21 Suppose that k = 2 and 5’1 has a compound Poisson distribution with A1 = 2 and secondary distribution qI(1) = 0.2,q2(1) = 0.7, and q3(1) = 0.1 Also, 5’2 (independent of 5’1) has a compound Poisson distribution with A2 = 3 and secondary distribution 42(2) = 0.25,q3(2) = 0.6, and q4(2) = 0.15 Determine the distribution of S = S1 + Sz

Trang 21

FURTHER PROP€RJ/ES OF THE COMPOUND POISSON CLASS 141

We have X = X I + X2 = 2 + 3 = 5 Then

41 = 0.4(0.2) + 0.6(0) = 0.08, q2 = 0.4(0.7) + O.s(O.25) = 0.43,

43 = 0.4(0.1) + O.S(O.6) = 0.40,

q4 = 0.4(0) + 0.6(0.15) = 0.09

Thus, S has a compound Poisson distribution with Poisson parameter X = 5 and secondary distribution q1 = 0.08,qz = 0 4 3 , ~ = 0.40, and 4 4 = 0.09 Numerical values of the distribution of S may be obtained using the recursive formula

beginning with P r ( S = 0) = eP5

In various situations the convolution of negative binomial distributions is

of interest Example 5.22 indicates how this distribution may be evaluated

Example 5.22 (Convolutions of negative binomial distributions) Suppose that Ni has a negative binomial distribution with parameters ri and pi for

i = 1,2, , k and that N I , N2, , Nk are independent Determine the distribution of N = N1 + Nz + + Nk,

The pgf of Ni is Pjv,(z) = 11 - p i ( z - 1)ILT1 and that of N is PN(z) =

n i = , P ~ , ( z ) = nF=,[l - pi(z - l)]-Tz If pi = p for i = 1 , 2 , ,k, then

P N ( z ) = [l - P(.z - 1)]-(T1+T2+"'+Tk), and N has a negative binomial distribution with parameters r = r1 + r2 + + rk and p

If not all the pis are identical, however, we may proceed as follows From Example 5.10,

k

pjv,(z) = [1 - pi(z - 1)]-~1 = e ~ ~ [ Q ~ ( z ) - ' I where X i = ri ln(1 + pz) and

with

But Theorem 5.20 implies that N = Nl + N2 + + Nk has a compound Poisson distribution with Poisson parameter

Trang 22

142 MODELS FOR THE NUMBER OF LOSSES: COUNTING DISTRIBUTIONS

and secondary distribution

The distribution of N may be computed recursively using the formula

5.12 MIXED FREQUENCY MODELS

Many compound distributions can arise in a way that is very different from compounding In this section, we examine mixture distributions by treating one or more parameters as being “random” in some sense This section ex- pands on the ideas discussed in Section 5.3 in connection with the gamma mixture of the Poisson distribution being negative binomial

We assume that the parameter is distributed over the population under consideration (the collective) and that the sampling scheme that generates

our data has two stages First, a value of the parameter is selected Then, given that parameter value, an observation is generated using that parameter value

Let P(zj8) denote the pgf of the number of events (e.g., losses) if the risk parameter is known to be 6 The parameter, 8, might be the Poisson mean, for example, in which case the measurement of risk is the expected number

of events in a fixed time period

Let U(8) = P r ( 0 I 8) be the cdf of 0, where 0 is the risk parameter,

which is viewed as a random variable Then U(8) represents the probability that, when a value of 0 is selected (e.g., a driver is included in our sample), the value of the risk parameter does not exceed 8 Let u(8) be the pf or pdf

of 0 Then

(5.22)

Trang 23

MIXED FREQUENCY MODELS 143

is the unconditional pgf of the number of events (where the formula selected depends on whether 0 is discrete or continuous2) The corresponding proba-

bilities are denoted by

The mixing distribution denoted by U(0) may be of the discrete or continuous type or even a combination of discrete and continuous types Discrete mixtures are mixtures of distributions where the mixing function is of the

discrete type Similarly, continuous mixtures are mixtures of distributions

where the mixing function is of the continuous type This phenomenon of mixing was introduced for continuous mixtures of severity distributions in Section 4.7.5 and for finite discrete mixtures in Section 4.5.2

It should be noted that the mixing distribution is unobservable because the data are drawn from the mixed distribution

Example 5.23 Demonstrate that the zero-modified distributions may be created by using a two-point mixture

Suppose

P ( z ) = p 1 + (1 -p)Pz(z)

This is a (discrete) two-point mixture of a degenerate distribution that places all probability at zero and a distribution with pgf Pz(z) From formula (5.12), this is also a compound Bernoulli distribution 0

Many mixed models can be constructed beginning with a simple distribution Two examples are given here

Example 5.24 Determine the pf for a mixed binomial with a beta mixing distribution This distribution is called binomial-beta, negative hypergeometric,

or Polya-Eggenberger

The beta distribution has pdf

'We could have written the more general P ( z ) = SP(zlO)dU(O), which would include

situations where 0 has a distribution that is partly continuous and partly discrete

Tiêu đề	Operational Risk Modeling Analytics phần 4 ppt
Trường học	University of Finance and Marketing
Chuyên ngành	Operational Risk Modeling
Thể loại	PowerPoint presentation
Thành phố	Ho Chi Minh City

Định dạng
Số trang	46
Dung lượng	2,33 MB