1. Trang chủ
  2. » Giáo án - Bài giảng

Statistics in geophysics inferential statistics II

27 130 0

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Định dạng
Số trang 27
Dung lượng 675,71 KB

Các công cụ chuyển đổi và chỉnh sửa cho tài liệu này

Nội dung

Sampling distributionThe sampling distributionfor a statistic including the teststatistic for a hypothesis test is the probability distributiondescribingbatch-to-batch variations of that

Trang 1

Statistics in Geophysics: Inferential Statistics II

Steffen Unkel

Department of Statistics Ludwig-Maximilians-University Munich, Germany

Trang 2

hypotheses

These tests yield a binary decision that a particular hypothesisabout a phenomenon generating the data may be true or not

nonparametric (ordistribution-free) tests

Trang 3

Sampling distribution

The sampling distributionfor a statistic (including the teststatistic for a hypothesis test) is the probability distributiondescribingbatch-to-batch variations of that statistic

The value of a statistic computed from a particular batch ofdata will in general be different from that for the same statisticcomputed using a different batch of data of the same kind

Example: Average January temperature is obtained by

averaging daily temperatures during that month at a

particular location for a given year The statistic is differentfrom year to year

Trang 4

Elements of any hypothesis test

1 Identify a test statisticthat is appropriate to the data andquestion at hand

2 Define anull hypothesis, H0, which defines a reference againstwhich to judge the observed test statistic

3 Define an alternative hypothesis, H1 (or HA)

4 Obtain thenull distribution, which is the sampling distributionfor the test statistic, if H0 is true

5 Compare the observed test statistic to the null distribution Ifthe test statistic falls in a sufficiently improbable region of thenull distribution, H0 is rejected as too implausible to havebeen true given the observed evidence

Trang 5

Test level

The sufficiently improbably region of the null distribution isdefined by therejection level (ortest level) of the test

H0 is rejected if the probability of the observed test statistic,

and all other results at least as unfavourable to H0, is lessthan or equal to the test level

Commonly the 5% level is chosen, although tests conducted

at the 10% level or the 1% level are not unusual

Trang 6

p value

The p valueis the probability that the observed value of thetest statistic, together with all other possible values of the teststatistic that are at least as unfavourable to H0, will occur

Thus, H0 is rejected if the p value is less than or equal to thetest level and is not rejected otherwise

The p value also communicates the confidence with which anull hypothesis has or has not been rejected

Trang 7

Error types and power of a test

We define

specific alternative

Trang 8

Error types and power of a test

Figure: Illustration of the relationship of the probability of a Type I error (horizontal hatching) and the probability of a Type II error (vertical hatching) for a test conducted at the 5% level.

Trang 9

One-sided versus two-sided tests

A statistical test can be either one-sidedor two-sided

A one-sided test is appropriate

if there is a prior reason to expect that violations of H 0 will lead to values of the test statistic on a particular side of the null distribution.

when only values on one tail or the other of the null

distribution are unfavorable to H0, because the way the test statistic has been constructed.

Two-sided tests are appropriate when either very large or verysmall values of the test statistic are unfavourable to the nulldistribution

Trang 10

Confidence intervals: inverting hypothesis tests

the computed confidence interval (CI) around the observedstatistic

The 100 × (1 − α)% CI around an observed statistic

will not contain the null hypothesis value of the test if the test

is significant at the α level ,

and will contain the null value if the test is not significant at the α level

Trang 11

Example: Exact binomial test

Advertisements for a tourist resort claim that, on average, sixdays out of seven are cloudless during winter (6/7 = 0.857)

Assume that we could arrange to take observations on 25independent occasions

If cloudless skies are observed on 15 of those 25 days, is thisobservation consistent with, or does it justify questioning, theclaim?

This problem fits neatly into the parametric setting of thebinomial distribution

Trang 12

Example: Exact binomial test

The test statistic of X = 15 out of n = 25 days has beendictated by the form of the problem

0.857x(1 − 0.857)25−x = 0.0015

Trang 13

Example: Exact binomial test

Trang 14

Approximate binomial test

Approximation of the binomial distribution:

Trang 15

Approximate binomial test

ˆ

π − π 0

q

π0(1−π0) n

and given α, H 0 is rejected if

(a) |z| > z1−α/2

(b) z < −z1−α

(c) z > z 1−α

Trang 16

Example: Approximate binomial test

Trang 17

Continuity correction

Approximation of the binomial with continuity correction:

Let X ∼ B(n, π) For sufficiently large n:

Trang 18

Test for differences of mean for paired samples

t test for the mean

unknown mean µ

If the number of data values making up the sample mean islarge enough for its sampling distribution to be essentiallyGaussian, then the test statistic

T =X − µ¯ 0

S

n ,where S =

q

i =1(Xi− ¯X )2/(n − 1), follows a t distribution

The statistic T resembles the standard Gaussian variable

Z = X −µ¯ 0

σ

of the sample mean has been substituted in the denominator

Trang 19

Test for differences of mean for paired samples

t test for the mean

√ n and given α, H 0 is rejected if

Trang 20

Test for differences of mean for paired samplesComparison of two proportions

Let π1 (π2) be the probability of success in group 1 (group 2)

Trang 21

Test for differences of mean for paired samplesTest statistic

The test statistic is

Trang 22

Test for differences of mean for paired samples

If H0: π1 = π2 is true, we could estimate the commonprobability π by ˆπ = 25/60 = 0.4167

In the upper left corner we would expect to see

0.4167 × 30 = 12.501 successes in group 1, and so

30 − 12.501 = 17.499 failures in the lower left

In the upper right corner we would expect to see

0.4167 × 30 = 12.501 successes in group 2, and so

30 − 12.501 = 17.499 failures in the lower right

Trang 23

Test for differences of mean for paired samplesDecision

The value of the observed test statistic χ2 is

2

(20 − 17.499)217.499

Trang 24

Test for differences of mean for paired samplesExample: comparison of two fertilizers

Response: crop yield

Two independentsamples (each with sample size n = 6)

Crop yield using

fertilizer X : 22, 21, 18, 16, 22, 17.

fertilizer Y : 20, 22, 17, 13, 17, 18.

µX (µY) denotes the mean crop yield using fertilizer X (Y )

Test problem: H0: µX = µY vs H1: µX 6= µY

Trang 25

Test for differences of mean for paired samplesTwo-sample t test

Xk i i d ∼ N (µX, σX2) (k = 1, , n) and Yl i i d ∼ N (µY, σY2)(l = 1, , m)

Reject H0, if |t| > t1−α

2(n + m − 2)

Trang 26

Test for differences of mean for paired samplesExample:

The sample means are

The observed difference is ¯x − ¯y = 1.5

The value of the observed test statistic is

1

3 ×8.2167 = 0.9064

Decision: t = 0.9064 < t0.975(10) = 2.2814: H0 is notrejected (p value = 0.386)

Trang 27

Test for differences of mean for paired samplesPaired t test

An important from of non-independence occurs when the datavalues are paired

The two-sample t test for paired dataanalyzes the differences

Di = Xi− Yi (i = 1, , n) between corresponding members

i =1Di, µD = µX − µY and

q

1 n−1

i =1(Di − ¯D)2

Ngày đăng: 04/12/2015, 17:08

TÀI LIỆU CÙNG NGƯỜI DÙNG

  • Đang cập nhật ...

TÀI LIỆU LIÊN QUAN