Chapter GoalsAfter completing this chapter, you should be able to: Recognize situations in which to use analysis of variance Understand different analysis of variance designs Perfo
Trang 2Chapter Goals
After completing this chapter, you should be able to:
Recognize situations in which to use analysis of variance
Understand different analysis of variance designs
Perform a single-factor hypothesis test and interpret results
Conduct and interpret post-analysis of variance pairwise
comparisons procedures
Set up and perform randomized blocks analysis
Analyze two-factor analysis of variance test with replications results
Trang 3Chapter Overview
Analysis of Variance (ANOVA)
F-test
F-test Tukey-
Kramer test Fisher’s Least Significant
Difference test
One-Way
ANOVA
Randomized Complete Block ANOVA
Two-factor ANOVA with replication
Trang 4General ANOVA Setting
Investigator controls one or more independent variables
Called factors (or treatment variables)
Each factor contains two or more levels (or categories/classifications)
Observe effects on dependent variable
Response to levels of independent variable
Experimental design: the plan used to test hypothesis
Trang 5 Populations are normally distributed
Populations have equal variances
Samples are randomly and independently drawn
Trang 6Completely Randomized
Design
Experimental units (subjects) are assigned randomly to treatments
Only one factor or independent variable
With two or more treatment levels
Analyzed by
One-factor analysis of variance (one-way ANOVA)
Called a Balanced Design if all factor levels have equal sample size
Trang 7Hypotheses of One-Way
ANOVA
All population means are equal
i.e., no treatment effect (no variation in means among groups)
At least one population mean is different
i.e., there is a treatment effect
Does not mean that all population means are different (some pairs may be the same)
k 3
2 1
same the
are means
population the
of all Not
:
H A
Trang 82 1
same the
are μ
all Not
:
3 2
Trang 9One-Factor ANOVA
At least one mean is different:
The Null Hypothesis is NOT true (Treatment Effect is present)
k 3
2 1
same the
are μ
all Not
:
3 2
or
(continue d)
Trang 10Partitioning the Variation
Total variation can be split into two parts:
SST = Total Sum of Squares SSB = Sum of Squares Between SSW = Sum of Squares Within SST = SSB + SSW
Trang 11Partitioning the Variation
Total Variation = the aggregate dispersion of the individual
data values across the various factor levels (SST)
Within-Sample Variation = dispersion that exists among the data values within a particular factor level (SSW)
Between-Sample Variation = dispersion among the factor sample means (SSB)
SST = SSB + SSW
(continue d)
Trang 12Partition of Total Variation
Variation Due to Factor (SSB)
Variation Due to Random
Sampling (SSW)
Total Variation (SST)
Commonly referred to as:
Sum of Squares Within
Sum of Squares Error
Sum of Squares Unexplained
Within Groups Variation
Commonly referred to as:
Sum of Squares Between
Sum of Squares Among
Sum of Squares Explained
Among Groups Variation
Trang 13Total Sum of Squares
ij
i
) x x
SST = Total sum of squares
k = number of populations (levels or treatments)
n i = sample size from population i
x ij = j th measurement from population i
x = grand mean (mean of all data values)
SST = SSB + SSW
Trang 1411 x ) ( x x ) ( x x ) x
(
SST
k
kn − +
+
− +
−
=
Trang 15Sum of Squares Between
Where:
SSB = Sum of squares between
k = number of populations
n i = sample size from population i
x i = sample mean from population i
x = grand mean (mean of all data values)
2 1
) x x
( n
Trang 16Between-Group Variation
Variation Due to Differences Among Groups
i
2 1
) x x
( n
Mean Square Between = SSB/degrees of freedom
Trang 172 1
1 ( x x ) n ( x x ) n ( x x ) n
Trang 18Sum of Squares Within
Where:
SSW = Sum of squares within
k = number of populations
n i = sample size from population i
x i = sample mean from population i
2 1
1
) x x
Trang 19Within-Group Variation
Summing the variation within each group and then adding over all groups
i
µ
k N
SSW MSW
1
) x x
Trang 202 1
11 x ) ( x x ) ( x x ) x
(
k − +
+
− +
−
=
Trang 21One-Way ANOVA Table
N - k
F =
Trang 22One-Factor ANOVA
F Test Statistic
Test statistic
MSB is mean squares between variances
MSW is mean squares within variances
Trang 23Interpreting One-Factor
ANOVA
F Statistic
The F statistic is the ratio of the between estimate
of variance and the within estimate of variance
The ratio must always be positive
df 1 = k -1 will typically be small
df 2 = N - k will typically be large
The ratio should be close to 1 if
H 0 : μ 1 = μ 2 = … = μ k is true
H 0 : μ 1 = μ 2 = … = μ k is false
Trang 24One-Factor ANOVA
F Test Example
You want to see if three
different golf clubs yield
different distances You
randomly select five
measurements from trials on
an automated driving
machine for each club At
the 05 significance level, is
there a difference in mean
Trang 25205.8 x
226.0 x
249.2
Trang 26One-Factor ANOVA Example
SSW = (254 – 249.2) 2 + (263 – 249.2) 2 +…+ (204 – 205.8) 2 = 1119.6
MSB = 4716.4 / (3-1) = 2358.2
MSW = 1119.6 / (15-3) = 93.3 25.275
93.3 2358.2
Trang 27There is evidence that
at least one μ i differs from the rest
2358.2 MSW
MSB
Critical Value:
F α = 3.885
Trang 29The Tukey-Kramer Procedure
Tells which population means are significantly different
e.g.: μ1 = μ2 ≠ μ3
Done after rejection of equal means in ANOVA
Allows pair-wise comparisons
Compare absolute mean differences with critical range
x
μ 1 = μ 2 μ 3
Trang 30Tukey-Kramer Critical Range
where:
q α = Value from standardized range table
with k and N - k degrees of freedom for the desired level of α
MSW = Mean Square Within
n i and n j = Sample sizes from populations (levels) i and j
1 2
MSW q
Range Critical
Trang 31The Tukey-Kramer Procedure: Example
1 Compute absolute mean differences:
Club 1 Club 2 Club 3
249.2 x
x
23.2 226.0
249.2 x
x
3 2
3 1
2 1
2 Find the q value from the table in appendix J
with k and N - k degrees of freedom for
the desired level of α
3.77
q α =
Trang 32The Tukey-Kramer Procedure: Example
5 All of the absolute mean differences are greater than critical range
Therefore there is a significant difference between each pair of
means at 5% level of significance
16.285 5
1 5
1 2
93.3 3.77
n
1 n
1 2
MSW q
Range
Critical
j i
=
3 Compute Critical Range:
20.2 x
x
43.4 x
x
23.2 x
x
3 2
3 1
2 1
Trang 33Tukey-Kramer in PHStat
Trang 34Randomized Complete Block
ANOVA
Like One-Way ANOVA, we test for equal population
means (for different factor levels, for example)
but we want to control for possible variation from a
second factor (with two or more levels)
Used when more than one factor may influence the
value of the dependent variable, but only one is of key interest
Levels of the secondary factor are called blocks
Trang 35Partitioning the Variation
Total variation can now be split into three parts:
SST = Total sum of squares SSB = Sum of squares between factor levels SSBL = Sum of squares between blocks
SSW = Sum of squares within levels
SST = SSB + SSBL + SSW
Trang 36Sum of Squares for Blocking
Where:
k = number of levels for this factor
b = number of blocks
x j = sample mean from the j th block
x = grand mean (mean of all data values)
2 1
) x x
( k
Trang 37Partitioning the Variation
Total variation can now be split into three parts:
Trang 38square Mean
square Mean
MSBL
) b
)(
k (
SSW within
square Mean
Trang 39Randomized Block ANOVA
Trang 40Blocking Test
Blocking test: df1 = b - 1
df2 = (k – 1)(b – 1)
MSBL MSW
μ μ
μ :
H 0 b1 = b2 = b3 =
equal are
means block
all Not
:
H A
F =
Reject H 0 if F > F α
Trang 41 Main Factor test: df1 = k - 1
df2 = (k – 1)(b – 1)
MSB MSW
k 3
2 1
equal are
means population
all Not
Trang 42Fisher’s Least Significant Difference Test
To test which population means are significantly
different
e.g.: μ1 = μ2 ≠ μ3
Done after rejection of equal means in randomized block ANOVA design
Allows pair-wise comparisons
Compare absolute mean differences with critical range
x
µ 1 = µ 2 µ 3
Trang 43Fisher’s Least Significant
Difference (LSD) Test
where:
t α /2 = Upper-tailed value from Student’s t-distribution
for α /2 and (k -1)(n - 1) degrees of freedom MSW = Mean square within from ANOVA table
b = number of blocks
k = number of levels of the main factor
b
2 MSW
t LSD = α /2
Trang 44etc
x x
x x
x x
3 2
3 1
2 1
t LSD = α /2
If the absolute mean difference
is greater than LSD then there
is a significant difference
between that pair of means at
the chosen level of significance
Compare:
? LSD x
x
Is i − j >
Trang 45Two-Way ANOVA
Examines the effect of
Two or more factors of interest on the dependent variable
e.g.: Percent carbonation and line speed on soft drink bottling process
Interaction between the different levels of these two factors
e.g.: Does the effect of one particular percentage of carbonation depend on which level the line speed is set?
Trang 46Two-Way ANOVA
Assumptions
Populations are normally distributed
Populations have equal variances
Independent random samples are drawn
(continued)
Trang 47Two-Way ANOVA Sources of Variation
Two Factors of interest: A and B
a = number of levels of factor A
b = number of levels of factor B
N = total number of observations in all cells
Trang 48Two-Way ANOVA Sources of Variation
SST Total Variation
Trang 49Two Factor ANOVA Equations
n k
ijk x ) x
) x x
( n
) x x
( n
Total Sum of Squares:
Sum of Squares Factor A:
Sum of Squares Factor B:
Trang 50Two Factor ANOVA Equations
2
) x x
x x
( n
i
b j
j i
n k
ij ijk x ) x
Trang 51Two Factor ANOVA Equations
where:
Mean
Grand n
ab
x x
a i
b j
n k
level each
of
Mean n
b
x x
b j
n k
level each
of
Mean n
a
x x
a i
n k
of
Mean n
a = number of levels of factor A
b = number of levels of factor B n’ = number of replications in each cell
(continued)
Trang 52Mean Square Calculations
factor square
factor square
Mean
B
) b
)(
a (
SS n
interactio square
SSE error
square Mean
MSE
−
=
=
Trang 53Two-Way ANOVA:
The F Test Statistic
F Test for Factor B Main Effect
F Test for Interaction Effect
H 0 : μ A1 = μ A2 = μ A3 = • • •
H A : Not all μ Ai are equal
H 0 : factors A and B do not interact
to affect the mean response
H A : factors A and B do interact
F Test for Factor A Main Effect
Trang 54Two-Way ANOVA Summary Table
Factor B SSB b – 1 MSB
= SSB /(b – 1)
MSBMSE
AB
(Interaction) SSAB (a – 1)(b – 1) MSAB
= SSAB / [(a – 1)(b – 1)]
MSABMSE
Error SSE N – ab MSE =
SSE/(N – ab)
Total SST N – 1
Trang 55Features of Two-Way ANOVA
F Test
Degrees of freedom always add up
N-1 = (N-ab) + (a-1) + (b-1) + (a-1)(b-1)
Total = error + factor A + factor B + interaction
The denominator of the F Test is always the same but
the numerator is different
The sums of squares always add up
SST = SSE + SSA + SSB + SSAB
Total = error + factor A + factor B + interaction
Trang 57Chapter Summary
Described one-way analysis of variance
The logic of ANOVA
ANOVA assumptions
F test for difference in k means
The Tukey-Kramer procedure for multiple comparisons
Described randomized complete block designs
F test
Fisher’s least significant difference test for multiple comparisons
Described two-way analysis of variance
Examined effects of multiple factors and interaction