Sampling distributionThe sampling distributionfor a statistic including the teststatistic for a hypothesis test is the probability distributiondescribingbatch-to-batch variations of that
Trang 1Statistics in Geophysics: Inferential Statistics II
Steffen Unkel
Department of Statistics Ludwig-Maximilians-University Munich, Germany
Trang 2hypotheses
These tests yield a binary decision that a particular hypothesisabout a phenomenon generating the data may be true or not
nonparametric (ordistribution-free) tests
Trang 3Sampling distribution
The sampling distributionfor a statistic (including the teststatistic for a hypothesis test) is the probability distributiondescribingbatch-to-batch variations of that statistic
The value of a statistic computed from a particular batch ofdata will in general be different from that for the same statisticcomputed using a different batch of data of the same kind
Example: Average January temperature is obtained by
averaging daily temperatures during that month at a
particular location for a given year The statistic is differentfrom year to year
Trang 4Elements of any hypothesis test
1 Identify a test statisticthat is appropriate to the data andquestion at hand
2 Define anull hypothesis, H0, which defines a reference againstwhich to judge the observed test statistic
3 Define an alternative hypothesis, H1 (or HA)
4 Obtain thenull distribution, which is the sampling distributionfor the test statistic, if H0 is true
5 Compare the observed test statistic to the null distribution Ifthe test statistic falls in a sufficiently improbable region of thenull distribution, H0 is rejected as too implausible to havebeen true given the observed evidence
Trang 5Test level
The sufficiently improbably region of the null distribution isdefined by therejection level (ortest level) of the test
H0 is rejected if the probability of the observed test statistic,
and all other results at least as unfavourable to H0, is lessthan or equal to the test level
Commonly the 5% level is chosen, although tests conducted
at the 10% level or the 1% level are not unusual
Trang 6p value
The p valueis the probability that the observed value of thetest statistic, together with all other possible values of the teststatistic that are at least as unfavourable to H0, will occur
Thus, H0 is rejected if the p value is less than or equal to thetest level and is not rejected otherwise
The p value also communicates the confidence with which anull hypothesis has or has not been rejected
Trang 7Error types and power of a test
We define
specific alternative
Trang 8Error types and power of a test
Figure: Illustration of the relationship of the probability of a Type I error (horizontal hatching) and the probability of a Type II error (vertical hatching) for a test conducted at the 5% level.
Trang 9One-sided versus two-sided tests
A statistical test can be either one-sidedor two-sided
A one-sided test is appropriate
if there is a prior reason to expect that violations of H 0 will lead to values of the test statistic on a particular side of the null distribution.
when only values on one tail or the other of the null
distribution are unfavorable to H0, because the way the test statistic has been constructed.
Two-sided tests are appropriate when either very large or verysmall values of the test statistic are unfavourable to the nulldistribution
Trang 10Confidence intervals: inverting hypothesis tests
the computed confidence interval (CI) around the observedstatistic
The 100 × (1 − α)% CI around an observed statistic
will not contain the null hypothesis value of the test if the test
is significant at the α level ,
and will contain the null value if the test is not significant at the α level
Trang 11Example: Exact binomial test
Advertisements for a tourist resort claim that, on average, sixdays out of seven are cloudless during winter (6/7 = 0.857)
Assume that we could arrange to take observations on 25independent occasions
If cloudless skies are observed on 15 of those 25 days, is thisobservation consistent with, or does it justify questioning, theclaim?
This problem fits neatly into the parametric setting of thebinomial distribution
Trang 12Example: Exact binomial test
The test statistic of X = 15 out of n = 25 days has beendictated by the form of the problem
0.857x(1 − 0.857)25−x = 0.0015
Trang 13Example: Exact binomial test
Trang 14Approximate binomial test
Approximation of the binomial distribution:
Trang 15Approximate binomial test
ˆ
π − π 0
q
π0(1−π0) n
and given α, H 0 is rejected if
(a) |z| > z1−α/2
(b) z < −z1−α
(c) z > z 1−α
Trang 16Example: Approximate binomial test
Trang 17Continuity correction
Approximation of the binomial with continuity correction:
Let X ∼ B(n, π) For sufficiently large n:
Trang 18Test for differences of mean for paired samples
t test for the mean
unknown mean µ
If the number of data values making up the sample mean islarge enough for its sampling distribution to be essentiallyGaussian, then the test statistic
T =X − µ¯ 0
S
√
n ,where S =
q
i =1(Xi− ¯X )2/(n − 1), follows a t distribution
The statistic T resembles the standard Gaussian variable
Z = X −µ¯ 0
σ
√
of the sample mean has been substituted in the denominator
Trang 19Test for differences of mean for paired samples
t test for the mean
√ n and given α, H 0 is rejected if
Trang 20Test for differences of mean for paired samplesComparison of two proportions
Let π1 (π2) be the probability of success in group 1 (group 2)
Trang 21Test for differences of mean for paired samplesTest statistic
The test statistic is
Trang 22Test for differences of mean for paired samples
If H0: π1 = π2 is true, we could estimate the commonprobability π by ˆπ = 25/60 = 0.4167
In the upper left corner we would expect to see
0.4167 × 30 = 12.501 successes in group 1, and so
30 − 12.501 = 17.499 failures in the lower left
In the upper right corner we would expect to see
0.4167 × 30 = 12.501 successes in group 2, and so
30 − 12.501 = 17.499 failures in the lower right
Trang 23Test for differences of mean for paired samplesDecision
The value of the observed test statistic χ2 is
2
(20 − 17.499)217.499
Trang 24Test for differences of mean for paired samplesExample: comparison of two fertilizers
Response: crop yield
Two independentsamples (each with sample size n = 6)
Crop yield using
fertilizer X : 22, 21, 18, 16, 22, 17.
fertilizer Y : 20, 22, 17, 13, 17, 18.
µX (µY) denotes the mean crop yield using fertilizer X (Y )
Test problem: H0: µX = µY vs H1: µX 6= µY
Trang 25Test for differences of mean for paired samplesTwo-sample t test
Xk i i d ∼ N (µX, σX2) (k = 1, , n) and Yl i i d ∼ N (µY, σY2)(l = 1, , m)
Reject H0, if |t| > t1−α
2(n + m − 2)
Trang 26Test for differences of mean for paired samplesExample:
The sample means are
The observed difference is ¯x − ¯y = 1.5
The value of the observed test statistic is
1
3 ×8.2167 = 0.9064
Decision: t = 0.9064 < t0.975(10) = 2.2814: H0 is notrejected (p value = 0.386)
Trang 27Test for differences of mean for paired samplesPaired t test
An important from of non-independence occurs when the datavalues are paired
The two-sample t test for paired dataanalyzes the differences
Di = Xi− Yi (i = 1, , n) between corresponding members
i =1Di, µD = µX − µY and
q
1 n−1
i =1(Di − ¯D)2