In this chapter, students will be able to understand: Experiments, outcomes and random variables; the probability distribution of a random variable; expected values involving a single random variable; using joint probability density functions; the expected value of a function of several random variables: covariance and correlation; the normal distribution.
Chapter 2
Some Basic Probability Concepts
2.1 Experiments, Outcomes and Random Variables
• A random variable is a variable whose value is unknown until it is observed. The value of a random variable results from an experiment; it is not perfectly predictable.
• A discrete random variable can take only a finite number of values, which can be counted by using the positive integers.
• Discrete variables are also commonly used in economics to record qualitative, or nonnumerical, characteristics. In this role they are sometimes called dummy variables.
• A continuous random variable can take any real value (not just whole numbers) in an interval on the real number line.
2.2 The Probability Distribution of a Random Variable
• The values of random variables are not known until an experiment is carried out, and all possible values are not equally likely. We can make probability statements about certain values occurring by specifying a probability distribution for the random variable.
• If event A is an outcome of an experiment, then the probability of A, which we write as P(A), is the relative frequency with which event A occurs in many repeated trials of the experiment. For any event, 0 ≤ P(A) ≤ 1, and the total probability of all possible events is one.
2.2.1 Probability Distributions of Discrete Random Variables
• When the values of a discrete random variable are listed with their chances of occurring, the resulting table of outcomes is called a probability function or a probability density function.
• The probability density function spreads the total of 1 “unit” of probability over the set of possible values that a random variable can take.
• Consider a discrete random variable, X = the number of heads obtained in a single flip of a coin. The values that X can take are x = 0, 1. If the coin is “fair” then the probability of a head occurring is 0.5. The probability density function, say f(x), for the random variable X is f(0) = 0.5 and f(1) = 0.5.
• “The probability that X takes the value 1 is 0.5” means that the two values 0 and 1 have an equal chance of occurring and, if we flipped a fair coin a very large number of times, the value x = 1 would occur 50 percent of the time. We can denote this as P[X = 1] = f(1) = 0.5, where P[X = 1] is the probability of the event that the random variable X = 1.
• For a discrete random variable X the value of the probability density function f(x) is the probability that the random variable X takes the value x: f(x) = P(X = x).
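As a quick numerical illustration of the coin example, here is a minimal Python sketch (not tied to any particular library) that stores f(x) as a dictionary and checks that the probabilities add to one:

# Probability density (mass) function for X = number of heads in one flip of a fair coin
f = {0: 0.5, 1: 0.5}

print("P(X = 1) =", f[1])                      # 0.5, i.e. f(1) = P(X = 1)
print("total probability =", sum(f.values()))  # 1.0, the single "unit" of probability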
2.2.2 The Probability Density Function of a Continuous Random Variable
• For the continuous random variable Y the probability density function f(y) can be represented by an equation, which can be described graphically by a curve. For continuous random variables the area under the probability density function corresponds to probability.
• For example, the probability density function of a continuous random variable Y might look like the curve shown in Figure 2.1, and the probability that Y takes a value in the interval [a, b], or P[a ≤ Y ≤ b], is the area under the probability density function between the values y = a and y = b. This is shown in Figure 2.1 by the shaded area.
• Since a continuous random variable takes an uncountably infinite number of values, the probability of any one value occurring is zero. That is, P[Y = a] = P[a ≤ Y ≤ a] = 0.
• In calculus, the integral of a function defines the area under it, and therefore
P[a ≤ Y ≤ b] = ∫_a^b f(y) dy.
• For any random variable X, the probability that X is less than or equal to a is denoted F(a). F(x) = P(X ≤ x) is the cumulative distribution function (cdf).
• For a discrete random variable, F(x) = P(X ≤ x) = ∑_{X ≤ x} f(X). In view of the definition of f(x), f(x_i) = F(x_i) − F(x_{i−1}).
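To make the link between f(x) and F(x) concrete, here is a small Python sketch; the four-point distribution is invented purely for illustration:

# Hypothetical pdf of a discrete random variable X taking the values 1, 2, 3, 4
f = {1: 0.125, 2: 0.25, 3: 0.375, 4: 0.25}

def F(a):
    # Cumulative distribution function: F(a) = P(X <= a), accumulating the pdf
    return sum(p for x, p in f.items() if x <= a)

print(F(2))          # 0.375
print(F(2) - F(1))   # 0.25 = f(2), since f(x_i) = F(x_i) - F(x_{i-1})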
2.3 Expected Values Involving a Single Random Variable
• When working with random variables, it is convenient to summarize their probability characteristics using the concept of mathematical expectation. These expectations will make use of summation notation.
2.3.1 The Rules of Summation
1 If X takes n values x_1, …, x_n then their sum is x_1 + x_2 + … + x_n = ∑_{i=1}^{n} x_i.
3 If a is a constant then it can be pulled out in front of a summation: ∑_{i=1}^{n} a·x_i = a ∑_{i=1}^{n} x_i.
7 The arithmetic mean (average) of n values of X is x̄ = (x_1 + x_2 + … + x_n)/n = (1/n) ∑_{i=1}^{n} x_i.
8 An abbreviated form of the summation notation is often used when f(x) is a function of x:
∑_{i=1}^{n} f(x_i) = ∑_i f(x_i) (“Sum over all values of the index i”) = ∑_x f(x) (“Sum over all possible values of X”).
9 Several summation signs can be used in one expression. Suppose the variable Y takes n values and X takes m values, and let f(x, y) = x + y. Then the double summation of this function is
∑_{i=1}^{m} ∑_{j=1}^{n} f(x_i, y_j) = ∑_{i=1}^{m} ∑_{j=1}^{n} (x_i + y_j).
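The double-summation rule can be checked numerically; the Python sketch below evaluates ∑_i ∑_j (x_i + y_j) two ways, using made-up values for X and Y:

# Hypothetical values for X (m values) and Y (n values)
xs = [1.0, 2.0, 3.0]    # m = 3
ys = [10.0, 20.0]       # n = 2

# Double summation of f(x, y) = x + y over all pairs
double_sum = sum(x + y for x in xs for y in ys)

# Same result via the rules of summation: n * sum(x) + m * sum(y)
shortcut = len(ys) * sum(xs) + len(xs) * sum(ys)

print(double_sum, shortcut)   # both 102.0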
2.3.2 The Mean of a Random Variable
• The expected value of a random variable X is the average value of the random variable in an infinite number of repetitions of the experiment (repeated samples); it is denoted E[X]. If X is a discrete random variable, E[X] = ∑_x x·f(x).
• If X is a continuous random variable, the expected value of X is E[X] = ∫ x·f(x) dx, where ∫ means integral over the entire range of values of x.
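The two forms of E[X] can be illustrated with a short Python sketch; the discrete pdf and the continuous density f(y) = 2y on [0, 1] are chosen only for illustration:

# Discrete case: E[X] = sum over x of x * f(x)
f = {0: 0.25, 1: 0.5, 2: 0.25}
print(sum(x * p for x, p in f.items()))          # 1.0

# Continuous case: E[Y] = integral of y * f(y) dy, approximated by a midpoint sum
n, total = 100000, 0.0
for i in range(n):
    y = (i + 0.5) / n            # midpoint of the i-th subinterval of [0, 1]
    total += y * (2.0 * y) / n   # y * f(y) * dy with f(y) = 2y
print(total)                     # close to 2/3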
2.3.3 Expectation of a Function of a Random Variable
• If X is a discrete random variable and g(X) is a function of it, then
E[g(X)] = ∑_x g(x)·f(x) (2.3.2a)
The expected value of a sum of functions of random variables, or the expected value of a sum of random variables, is always the sum of the expected values.
Trang 15• The idea of how to determine the expected value of a function of a continuous random
variable Y, say g(y), is exactly the same as in the discrete case The terms g(y) must be weighted by f(y) and then all those products summed This operation is carried out via integration, but the interpretation of the result is the same Specifically, if Y is a
continuous random variable, then
3 If a and c are constants and X is a random variable, then E[a + cX] = a + c·E[X].
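These rules are easy to verify for a small discrete distribution; the following Python sketch checks E[a + cX] = a + c·E[X] for arbitrary constants (the pdf and constants are chosen only for illustration):

# Hypothetical discrete pdf for X
f = {1: 0.2, 2: 0.5, 3: 0.3}
a, c = 4.0, 2.0

E_X = sum(x * p for x, p in f.items())

# Expectation of the transformed variable a + cX, computed directly
E_a_plus_cX = sum((a + c * x) * p for x, p in f.items())

print(E_a_plus_cX, a + c * E_X)   # both 8.2 (up to floating-point rounding)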
2.3.4 The Variance of a Random Variable
• The variance of a discrete or continuous random variable X, based on the rules in Section 2.3.3, is defined as the expected value of g(X) = [X − E(X)]². Algebraically,
var(X) = σ² = E[(X − µ)²] = ∑_x (x − µ)²·f(x)
where µ = E[X]. Examining g(X) = [X − E(X)]², we observe that the variance of a random variable is the average squared difference between the random variable X and its mean value E[X]. Thus, the variance of a random variable is the weighted average of the squared differences (or distances) between the values x of the random variable X and the mean (center of the probability density function) of the random variable. The larger the variance of a random variable, the greater the average squared distance between the values of the random variable and its mean, or the more “spread out” are the values of the random variable.
• Let a and c be constants, and let Z = a + cX. Then Z is a random variable and its variance is
var(a + cX) = E[(a + cX) − E(a + cX)]² = c²·var(X) (2.3.5)
The result in Equation (2.3.5) says that if you:
1 Add a constant to a random variable, it does not affect its variance, or dispersion. This fact follows, since adding a constant to a random variable shifts the location of its probability density function but leaves its shape, and dispersion, unaffected.
2 Multiply a random variable by a constant, the variance is multiplied by the square of the constant.
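The sketch below checks var(a + cX) = c²·var(X) directly from a small discrete pdf (an illustration only):

f = {1: 0.2, 2: 0.5, 3: 0.3}    # hypothetical pdf for X
a, c = 4.0, 2.0

def mean(pdf):
    return sum(x * p for x, p in pdf.items())

def var(pdf):
    mu = mean(pdf)
    return sum((x - mu) ** 2 * p for x, p in pdf.items())

# pdf of Z = a + cX: the same probabilities attached to shifted and scaled values
fz = {a + c * x: p for x, p in f.items()}

print(var(fz), c ** 2 * var(f))   # both 1.96 (up to floating-point rounding)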
• The square root of the variance of a random variable is called the standard deviation; it is denoted by σ. It, too, measures the spread or dispersion of a distribution, and it has the advantage of being in the same units of measure as the random variable.
• A conditional variance is the variance of the conditional distribution:
var[Y | x] = ∑_y (y − E[Y | x])²·f(y | x) if Y is discrete
var[Y | x] = ∫_y (y − E[Y | x])²·f(y | x) dy if Y is continuous
• Two other measures often used to describe a probability distribution are
skewness = E[(x − µ)³] and kurtosis = E[(x − µ)⁴].
Skewness is a measure of the asymmetry of a distribution. For symmetric distributions, f(µ − x) = f(µ + x) and skewness = 0. For asymmetric distributions, the skewness will be positive (negative) if the “long tail” is in the positive (negative) direction. Kurtosis is a measure of the thickness of the tails of the distribution.
• Two common measures are the skewness coefficient = E[(x − µ)³]/σ³ and the degree of excess = E[(x − µ)⁴]/σ⁴ − 3.
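A short Python sketch of these moment-based measures follows; the right-skewed discrete distribution is made up purely for illustration:

f = {0: 0.5, 1: 0.3, 2: 0.15, 5: 0.05}   # hypothetical pdf with a long right tail

mu = sum(x * p for x, p in f.items())
sigma = sum((x - mu) ** 2 * p for x, p in f.items()) ** 0.5

skewness = sum((x - mu) ** 3 * p for x, p in f.items())   # E[(x - mu)^3]
kurtosis = sum((x - mu) ** 4 * p for x, p in f.items())   # E[(x - mu)^4]

# Unit-free versions of the two measures
print("skewness coefficient:", skewness / sigma ** 3)     # > 0: long tail to the right
print("degree of excess:", kurtosis / sigma ** 4 - 3)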
2.4 Using Joint Probability Density Functions
Frequently we want to make probability statements about more than one random variable at a time. To answer probability questions involving two or more random variables, we must know their joint probability density function. For the continuous random variables X and Y, we use f(x, y) to represent their joint density function. A typical joint density function might look something like Figure 2.3. See Example 2.5.
2.4.1 Marginal Probability Density Functions
• If X and Y are two discrete random variables then
f(x) = ∑_y f(x, y) for each value x can take
f(y) = ∑_x f(x, y) for each value y can take (2.4.1)
Note that the summations in Equation (2.4.1) are over the other random variable, the one that we are eliminating from the joint probability density function. If the random variables are continuous the same idea works, with integrals replacing the summation signs as follows:
f(x) = ∫_y f(x, y) dy and f(y) = ∫_x f(x, y) dx
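The marginal pdfs can be computed mechanically from a small joint pdf; in this Python sketch the joint probabilities are invented for illustration:

# Hypothetical joint pdf f(x, y) for discrete X and Y
joint = {(0, 0): 0.125, (0, 1): 0.375,
         (1, 0): 0.25,  (1, 1): 0.25}

fx, fy = {}, {}
for (x, y), p in joint.items():
    fx[x] = fx.get(x, 0.0) + p   # sum out y to get the marginal pdf of X
    fy[y] = fy.get(y, 0.0) + p   # sum out x to get the marginal pdf of Y

print(fx)   # {0: 0.5, 1: 0.5}
print(fy)   # {0: 0.375, 1: 0.625}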
2.4.2 Conditional Probability Density Functions
• Often the chances of an event occurring are conditional on the occurrence of another event. For discrete random variables X and Y, conditional probabilities can be calculated from the joint probability density function f(x, y) and the marginal probability density function of the conditioning random variable. Specifically, the probability that the random variable X takes the value x given that Y = y is written P[X = x | Y = y]. This conditional probability is given by the conditional probability density function f(x | y):
f(x | y) = P[X = x | Y = y] = f(x, y) / f(y)
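Using the same kind of small joint pdf, the conditional pdf is just the joint pdf divided by the marginal of the conditioning variable (an illustrative Python sketch):

joint = {(0, 0): 0.125, (0, 1): 0.375,
         (1, 0): 0.25,  (1, 1): 0.25}

# Marginal pdf of the conditioning variable Y
fy = {}
for (x, y), p in joint.items():
    fy[y] = fy.get(y, 0.0) + p

def f_x_given_y(x, y):
    # f(x | y) = f(x, y) / f(y)
    return joint[(x, y)] / fy[y]

print(f_x_given_y(0, 1))   # 0.375 / 0.625 = 0.6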
2.4.3 Independent Random Variables
• Two random variables are statistically independent, or independently distributed, if knowing the value that one will take does not reveal anything about what value the other may take. When random variables are statistically independent, their joint probability density function factors into the product of their individual probability density functions, and vice versa. If X and Y are independent random variables, then
f(x, y) = f(x)·f(y) (2.4.3)
for each and every pair of values x and y. The converse is also true.
• If X_1, …, X_n are statistically independent the joint probability density function can be factored and written as
f(x_1, x_2, …, x_n) = f_1(x_1)·f_2(x_2)·…·f_n(x_n)
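Statistical independence can be checked numerically by comparing f(x, y) with f(x)·f(y) for every pair of values; in the Python sketch below the joint pdf is built from the marginals, so the factorization holds by construction:

fx = {0: 0.4, 1: 0.6}                                    # hypothetical marginal pdf of X
fy = {0: 0.3, 1: 0.7}                                    # hypothetical marginal pdf of Y
joint = {(x, y): fx[x] * fy[y] for x in fx for y in fy}  # independent by construction

factorizes = all(abs(joint[(x, y)] - fx[x] * fy[y]) < 1e-12 for x in fx for y in fy)
print(factorizes)   # True: f(x, y) = f(x) f(y) for each and every pair (x, y)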
2.5 The Expected Value of a Function of Several Random Variables: Covariance
and Correlation
In economics we are usually interested in exploring relationships between economic variables. The covariance literally indicates the amount of covariation exhibited by the two random variables.
• If X and Y are random variables, then their covariance is
cov(X, Y) = E[(X − E[X])(Y − E[Y])] (2.5.1)
• If X and Y are discrete random variables, f(x, y) is their joint probability density function, and g(X, Y) is a function of them, then
E[g(X, Y)] = ∑_x ∑_y g(x, y)·f(x, y) (2.5.2)
with integrals replacing the summation signs as follows:
cov( , )X Y =∫ ∫ f x y dxdy( , )
Trang 29• The sign of the covariance between two random variables indicates whether their association is positive (direct) or negative (inverse) The covariance between X and Y
is expected, or average, value of the random product [X – E(X)][Y – E(Y)] If two
random variables have positive covariance then they tend to be positively (or directly) related See Figure 2.4 The values of two random variables with negative covariance tend to be negatively (or inversely) related See Figure 2.5 Zero covariance implies that there is neither positive nor negative association between pairs of values See Figure 2.6
• The magnitude of covariance is difficult to interpret because it depends on the units of measurement of the random variables. The meaning of covariation is revealed more clearly if we divide the covariance between X and Y by their respective standard deviations. The resulting ratio is defined as the correlation between the random variables X and Y. If X and Y are random variables then their correlation is
ρ = cov(X, Y) / √(var(X)·var(Y))
• Independent random variables X and Y have zero covariance, indicating that there is no linear association between them. However, just because the covariance or correlation between two random variables is zero does not mean that they are necessarily independent. Zero covariance means that there is no linear association between the random variables. Even if X and Y have zero covariance, they might have a nonlinear association, like X² + Y² = 1.
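The covariance and correlation of two discrete random variables can be computed directly from their joint pdf, using Equation (2.5.1) together with the summation form of (2.5.2); the joint probabilities below are invented for illustration:

# Hypothetical joint pdf with positive association between X and Y
joint = {(0, 0): 0.3, (0, 1): 0.1,
         (1, 0): 0.1, (1, 1): 0.5}

fx, fy = {}, {}
for (x, y), p in joint.items():
    fx[x] = fx.get(x, 0.0) + p
    fy[y] = fy.get(y, 0.0) + p

EX = sum(x * p for x, p in fx.items())
EY = sum(y * p for y, p in fy.items())

# cov(X, Y) = E[(X - E[X])(Y - E[Y])], computed as a double sum over the joint pdf
cov = sum((x - EX) * (y - EY) * p for (x, y), p in joint.items())

var_x = sum((x - EX) ** 2 * p for x, p in fx.items())
var_y = sum((y - EY) ** 2 * p for y, p in fy.items())
rho = cov / (var_x * var_y) ** 0.5

print(cov, rho)   # about 0.14 and 0.58: positive covariance and correlation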
• If a, b, c, and d are constants and X and Y are random variables, then
cov(a + bX, c + dY) = bd·cov(X, Y)
2.5.1 The Mean of a Weighted Sum of Random Variables
• Let the function g(X, Y) = aX + bY where a and b are constants. This is called a weighted sum. Now use Equation (2.5.2) to find the expectation
E[aX + bY] = a·E[X] + b·E[Y] (2.5.5)
This rule says that the expected value of a weighted sum of two random variables is the weighted sum of their expected values. This rule works for any number of random variables whether they are discrete or continuous.
• If X and Y are random variables, then
E[X + Y] = E[X] + E[Y] (2.5.6)
In general, the expected value of any sum is the sum of the expected values.
2.5.2 The Variance of a Weighted Sum of Random Variables
• If X, Y, and Z are random variables and a, b, and c are constants, then
var[aX + bY + cZ] = a²·var[X] + b²·var[Y] + c²·var[Z]
+ 2ab·cov[X, Y] + 2ac·cov[X, Z] + 2bc·cov[Y, Z] (2.5.7)
Proof: Let X* = X − E[X], Y* = Y − E[Y], and Z* = Z − E[Z]. Then aX + bY + cZ − E[aX + bY + cZ] = aX* + bY* + cZ*, so var[aX + bY + cZ] = E[(aX* + bY* + cZ*)²]; expanding the square and taking expectations term by term gives Equation (2.5.7).
• If X, Y, and Z are independent, or uncorrelated, random variables, then the covariance
terms are zero and:
var[aX + bY + cZ] = a²·var[X] + b²·var[Y] + c²·var[Z] (2.5.8)
• If X, Y, and Z are independent, or uncorrelated, random variables, and if a = b = c = 1,
then
var[X + Y + Z] = var[X] + var[Y] + var[Z] (2.5.9)
• When the “variance of a sum is the sum of the variances,” the random variables involved must be independent, or uncorrelated.
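Equation (2.5.7) can be verified by simulation; the following sketch (using NumPy, with a two-variable weighted sum for brevity) compares the sample variance of aX + bY with the formula:

import numpy as np

rng = np.random.default_rng(0)
n = 1_000_000
a, b = 2.0, -1.0

X = rng.normal(size=n)
Y = 0.5 * X + rng.normal(size=n)        # Y is correlated with X by construction

lhs = np.var(a * X + b * Y, ddof=1)
rhs = (a ** 2 * np.var(X, ddof=1) + b ** 2 * np.var(Y, ddof=1)
       + 2 * a * b * np.cov(X, Y)[0, 1])
print(lhs, rhs)   # identical up to floating-point error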
2.6 The Normal Distribution
• If X is a normally distributed random variable with mean β and variance σ², symbolized as X ~ N(β, σ²), then its probability density function is expressed mathematically as:
f(x) = (1/√(2πσ²))·exp(−(x − β)²/(2σ²)), −∞ < x < ∞ (2.6.1)
• A standard normal random variable is one that has a normal probability density function with mean 0 and variance 1. If X ~ N(β, σ²), then
Z = (X − β)/σ ~ N(0, 1) (2.6.2)
• When using Table 1 and Equations (2.6.2) and (2.6.3) to compute normal probabilities, remember that:
1 The standard normal probability density function is symmetric about zero.
2 The total amount of probability under the density function is 1.
3 Half the probability is on either side of zero.
For example, if X ~ N(3, 9), then using Table 1,
P[4 ≤ X ≤ 6] = P[.33 ≤ Z ≤ 1] = .3413 − .1293 = .2120
• See Figure 2.8. For the purposes of statistical testing, it is useful to know that:
1 The probability that a single observation of a normally distributed variable X will lie within 1.96 standard deviations of its mean is approximately 95%.
2 The probability that a single observation of a normally distributed variable X will lie within 2.57 standard deviations of its mean is approximately 99%.
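These probabilities can be reproduced numerically (assuming SciPy is available); note that the exact answer for the worked example differs slightly from the table-based .2120 because z = 1/3 was rounded to .33:

from scipy.stats import norm

mu, sigma = 3.0, 3.0                      # X ~ N(3, 9)

# P[4 <= X <= 6] = P[(4 - 3)/3 <= Z <= (6 - 3)/3]
p = norm.cdf(6, loc=mu, scale=sigma) - norm.cdf(4, loc=mu, scale=sigma)
print(round(p, 4))                        # 0.2108

# Probability of lying within 1.96 and 2.57 standard deviations of the mean
print(norm.cdf(1.96) - norm.cdf(-1.96))   # about 0.95
print(norm.cdf(2.57) - norm.cdf(-2.57))   # about 0.99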
Exercises
2.1 2.2 2.3 2.4 2.5 2.6 2.7 2.8 2.10 2.11 2.15 2.16 2.18 2.19 2.21