1. Trang chủ
  2. » Kỹ Thuật - Công Nghệ

Exploratory Data Analysis_10 pot

42 156 0
Tài liệu đã được kiểm tra trùng lặp

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Tiêu đề Normal Distribution
Trường học National Institute of Standards and Technology
Chuyên ngành Statistics
Thể loại Essay
Năm xuất bản 2006
Thành phố Gaithersburg
Định dạng
Số trang 42
Dung lượng 2,85 MB

Các công cụ chuyển đổi và chỉnh sửa cho tài liệu này

Nội dung

Function The formula for the hazard function of the normal distribution is where is the cumulative distribution function of the standard normal distribution and is the probability densit

Trang 1

Function

The formula for the hazard function of the normal distribution is

where is the cumulative distribution function of the standard normal

distribution and is the probability density function of the standard

normal distribution

The following is the plot of the normal hazard function

1.3.6.6.1 Normal Distribution

Trang 4

Statistics

Standard Deviation The scale parameter Coefficient of

Comments For both theoretical and practical reasons, the normal distribution is

probably the most important distribution in statistics For example,

Many classical statistical tests are based on the assumption thatthe data follow a normal distribution This assumption should betested before applying these tests

In modeling applications, such as linear and non-linear regression,the error term is often assumed to follow a normal distributionwith fixed location and scale

The normal distribution is used to find significance levels in manyhypothesis tests and confidence intervals

● 1.3.6.6.1 Normal Distribution

http://www.itl.nist.gov/div898/handbook/eda/section3/eda3661.htm (6 of 7) [5/1/2006 9:57:55 AM]

Trang 5

The central limit theorem basically states that as the sample size (N)

becomes large, the following occur:

The sampling distribution of the mean becomes approximatelynormal regardless of the distribution of the original variable

1

The sampling distribution of the mean is centered at thepopulation mean, , of the original variable In addition, thestandard deviation of the sampling distribution of the mean

2

Software Most general purpose statistical software programs, including Dataplot,

support at least some of the probability functions for the normaldistribution

1.3.6.6.1 Normal Distribution

Trang 6

1 Exploratory Data Analysis

where A is the location parameter and (B - A) is the scale parameter The case

where A = 0 and B = 1 is called the standard uniform distribution The

equation for the standard uniform distribution is

Since the general form of probability functions can be expressed in terms ofthe standard distribution, all subsequent formulas in this section are given forthe standard form of the function

The following is the plot of the uniform probability density function

1.3.6.6.2 Uniform Distribution

http://www.itl.nist.gov/div898/handbook/eda/section3/eda3662.htm (1 of 7) [5/1/2006 9:57:56 AM]

Trang 8

Point

Function

The formula for the percent point function of the uniform distribution is

The following is the plot of the uniform percent point function

Hazard

Function

The formula for the hazard function of the uniform distribution is

The following is the plot of the uniform hazard function

1.3.6.6.2 Uniform Distribution

http://www.itl.nist.gov/div898/handbook/eda/section3/eda3662.htm (3 of 7) [5/1/2006 9:57:56 AM]

Trang 9

Hazard

Function

The formula for the cumulative hazard function of the uniform distribution is

The following is the plot of the uniform cumulative hazard function

1.3.6.6.2 Uniform Distribution

Trang 11

Estimation

The method of moments estimators for A and B are

The maximum likelihood estimators for A and B are

1.3.6.6.2 Uniform Distribution

Trang 12

Comments The uniform distribution defines equal probability over a given range for a

continuous distribution For this reason, it is important as a referencedistribution

One of the most important applications of the uniform distribution is in thegeneration of random numbers That is, almost all random number generatorsgenerate random numbers on the (0,1) interval For other distributions, sometransformation is applied to the uniform random numbers

Software Most general purpose statistical software programs, including Dataplot,

support at least some of the probability functions for the uniform distribution

1.3.6.6.2 Uniform Distribution

http://www.itl.nist.gov/div898/handbook/eda/section3/eda3662.htm (7 of 7) [5/1/2006 9:57:56 AM]

Trang 13

1 Exploratory Data Analysis

where t is the location parameter and s is the scale parameter The case

where t = 0 and s = 1 is called the standard Cauchy distribution The

equation for the standard Cauchy distribution reduces to

Since the general form of probability functions can be expressed interms of the standard distribution, all subsequent formulas in this sectionare given for the standard form of the function

The following is the plot of the standard Cauchy probability densityfunction

1.3.6.6.3 Cauchy Distribution

Trang 15

Point

Function

The formula for the percent point function of the Cauchy distribution is

The following is the plot of the Cauchy percent point function

Trang 18

Statistics

Standard Deviation The standard deviation is undefined

Coefficient ofVariation

The coefficient of variation is undefined

Parameter

Estimation

The likelihood functions for the Cauchy maximum likelihood estimatesare given in chapter 16 of Johnson, Kotz, and Balakrishnan Theseequations typically must be solved numerically on a computer

1.3.6.6.3 Cauchy Distribution

http://www.itl.nist.gov/div898/handbook/eda/section3/eda3663.htm (6 of 7) [5/1/2006 9:57:57 AM]

Trang 19

Comments The Cauchy distribution is important as an example of a pathological

case Cauchy distributions look similar to a normal distribution

However, they have much heavier tails When studying hypothesis teststhat assume normality, seeing how the tests perform on data from aCauchy distribution is a good indicator of how sensitive the tests are toheavy-tail departures from normality Likewise, it is a good check forrobust techniques that are designed to work well under a wide variety ofdistributional assumptions

The mean and standard deviation of the Cauchy distribution areundefined The practical meaning of this is that collecting 1,000 datapoints gives no more accurate an estimate of the mean and standarddeviation than does a single point

Software Many general purpose statistical software programs, including Dataplot,

support at least some of the probability functions for the Cauchydistribution

1.3.6.6.3 Cauchy Distribution

Trang 20

1 Exploratory Data Analysis

The formula for the probability density function of the t distribution is

where is the beta function and is a positive integer shape parameter.The formula for the beta function is

In a testing context, the t distribution is treated as a "standardized

distribution" (i.e., no location or scale parameters) However, in adistributional modeling context (as with other probability distributions),

the t distribution itself can be transformed with a location parameter, ,and a scale parameter,

The following is the plot of the t probability density function for 4

different values of the shape parameter

1.3.6.6.4 t Distribution

http://www.itl.nist.gov/div898/handbook/eda/section3/eda3664.htm (1 of 4) [5/1/2006 9:57:57 AM]

Trang 21

These plots all have a similar shape The difference is in the heaviness

of the tails In fact, the t distribution with equal to 1 is a Cauchy

distribution The t distribution approaches a normal distribution as becomes large The approximation is quite good for values of > 30

Cumulative

Distribution

Function

The formula for the cumulative distribution function of the t distribution

is complicated and is not included here It is given in the Evans,Hastings, and Peacock book

The following are the plots of the t cumulative distribution function with

the same values of as the pdf plots above

1.3.6.6.4 t Distribution

Trang 22

Point

Function

The formula for the percent point function of the t distribution does not

exist in a simple closed form It is computed numerically

The following are the plots of the t percent point function with the same

values of as the pdf plots above

1.3.6.6.4 t Distribution

http://www.itl.nist.gov/div898/handbook/eda/section3/eda3664.htm (3 of 4) [5/1/2006 9:57:57 AM]

Trang 23

Probability

Functions

Since the t distribution is typically used to develop hypothesis tests and

confidence intervals and rarely for modeling applications, we omit theformulas and plots for the hazard, cumulative hazard, survival, andinverse survival probability functions

Undefined

However, the t distribution is symmetric in allcases

Kurtosis

It is undefined for less than or equal to 4

Parameter

Estimation

Since the t distribution is typically used to develop hypothesis tests and

confidence intervals and rarely for modeling applications, we omit anydiscussion of parameter estimation

Comments The t distribution is used in many cases for the critical regions for

hypothesis tests and in determining confidence intervals The mostcommon example is testing if data are consistent with the assumedprocess mean

Software Most general purpose statistical software programs, including Dataplot,

support at least some of the probability functions for the t distribution.

1.3.6.6.4 t Distribution

Trang 24

1 Exploratory Data Analysis

probability density function of the F distribution is

where and are the shape parameters and is the gamma function.The formula for the gamma function is

In a testing context, the F distribution is treated as a "standardizeddistribution" (i.e., no location or scale parameters) However, in adistributional modeling context (as with other probability distributions),the F distribution itself can be transformed with a location parameter, ,and a scale parameter,

The following is the plot of the F probability density function for 4different values of the shape parameters

1.3.6.6.5 F Distribution

http://www.itl.nist.gov/div898/handbook/eda/section3/eda3665.htm (1 of 4) [5/1/2006 9:57:58 AM]

Trang 25

where B is the beta function

The following is the plot of the F cumulative distribution function withthe same values of and as the pdf plots above

1.3.6.6.5 F Distribution

Trang 27

Standard Deviation

Coefficient ofVariationSkewness

Parameter

Estimation

Since the F distribution is typically used to develop hypothesis tests andconfidence intervals and rarely for modeling applications, we omit anydiscussion of parameter estimation

Comments The F distribution is used in many cases for the critical regions for

hypothesis tests and in determining confidence intervals Two commonexamples are the analysis of variance and the F test to determine if thevariances of two populations are equal

Software Most general purpose statistical software programs, including Dataplot,

1.3.6.6.5 F Distribution

Trang 28

1 Exploratory Data Analysis

The chi-square distribution results when independent variables with

standard normal distributions are squared and summed The formula forthe probability density function of the chi-square distribution is

where is the shape parameter and is the gamma function Theformula for the gamma function is

In a testing context, the chi-square distribution is treated as a

"standardized distribution" (i.e., no location or scale parameters)

However, in a distributional modeling context (as with other probabilitydistributions), the chi-square distribution itself can be transformed with

a location parameter, , and a scale parameter, The following is the plot of the chi-square probability density functionfor 4 different values of the shape parameter

1.3.6.6.6 Chi-Square Distribution

http://www.itl.nist.gov/div898/handbook/eda/section3/eda3666.htm (1 of 4) [5/1/2006 9:57:59 AM]

Trang 30

1.3.6.6.6 Chi-Square Distribution

http://www.itl.nist.gov/div898/handbook/eda/section3/eda3666.htm (3 of 4) [5/1/2006 9:57:59 AM]

Trang 31

Comments The chi-square distribution is used in many cases for the critical regions

for hypothesis tests and in determining confidence intervals Twocommon examples are the chi-square test for independence in an RxC

contingency table and the chi-square test to determine if the standarddeviation of a population is equal to a pre-specified value

Software Most general purpose statistical software programs, including Dataplot,

support at least some of the probability functions for the chi-squaredistribution

1.3.6.6.6 Chi-Square Distribution

Trang 32

1 Exploratory Data Analysis

where = 0 and = 1 is called the standard exponential distribution.

The equation for the standard exponential distribution is

The general form of probability functions can be expressed in terms ofthe standard distribution Subsequent formulas in this section are givenfor the 1-parameter (i.e., with scale parameter) form of the function.The following is the plot of the exponential probability density function

1.3.6.6.7 Exponential Distribution

http://www.itl.nist.gov/div898/handbook/eda/section3/eda3667.htm (1 of 7) [5/1/2006 9:58:00 AM]

Trang 34

The formula for the hazard function of the exponential distribution is

The following is the plot of the exponential hazard function

1.3.6.6.7 Exponential Distribution

http://www.itl.nist.gov/div898/handbook/eda/section3/eda3667.htm (3 of 7) [5/1/2006 9:58:00 AM]

Trang 36

Function

The formula for the survival function of the exponential distribution is

The following is the plot of the exponential survival function

Trang 37

Statistics

MeanMedian

Standard DeviationCoefficient ofVariation

(Chapter 8) It is also discussed in chapter 19 of Johnson, Kotz, andBalakrishnan

Comments The exponential distribution is primarily used in reliability applications

1.3.6.6.7 Exponential Distribution

Trang 38

1.3.6.6.7 Exponential Distribution

http://www.itl.nist.gov/div898/handbook/eda/section3/eda3667.htm (7 of 7) [5/1/2006 9:58:00 AM]

Trang 39

1 Exploratory Data Analysis

where is the shape parameter, is the location parameter and is the scale

distribution The case where = 0 is called the 2-parameter Weibull distribution.

The equation for the standard Weibull distribution reduces to

Since the general form of probability functions can be expressed in terms of the

standard form of the function

The following is the plot of the Weibull probability density function

1.3.6.6.8 Weibull Distribution

Trang 40

Distribution

Function

The formula for the cumulative distribution function of the Weibull distribution is

The following is the plot of the Weibull cumulative distribution function with thesame values of as the pdf plots above

1.3.6.6.8 Weibull Distribution

http://www.itl.nist.gov/div898/handbook/eda/section3/eda3668.htm (2 of 7) [5/1/2006 9:58:02 AM]

Trang 41

Point

Function

The formula for the percent point function of the Weibull distribution is

The following is the plot of the Weibull percent point function with the samevalues of as the pdf plots above

Hazard

Function

The formula for the hazard function of the Weibull distribution is

The following is the plot of the Weibull hazard function with the same values of

as the pdf plots above

1.3.6.6.8 Weibull Distribution

Trang 42

Hazard

Function

The formula for the cumulative hazard function of the Weibull distribution is

The following is the plot of the Weibull cumulative hazard function with the samevalues of as the pdf plots above

1.3.6.6.8 Weibull Distribution

http://www.itl.nist.gov/div898/handbook/eda/section3/eda3668.htm (4 of 7) [5/1/2006 9:58:02 AM]

Ngày đăng: 21/06/2014, 21:20