Chapter 9: Confidence Intervals 219Introduction to Confidence Intervals for the Mean ...220 Confidence Intervals for the Mean with Large Samples and Sigma Known ...221 Confidence Intervals f
Trang 4Published by the Penguin Group
Penguin Group (USA) Inc., 375 Hudson Street, New York, New York 10014, USA
Penguin Group (Canada), 90 Eglinton Avenue East, Suite 700, Toronto, Ontario M4P 2Y3, Canada (a division of Pearson Penguin Canada Inc.)
Penguin Books Ltd., 80 Strand, London WC2R 0RL, England
Penguin Ireland, 25 St Stephen’s Green, Dublin 2, Ireland (a division of Penguin Books Ltd.)Penguin Group (Australia), 250 Camberwell Road, Camberwell, Victoria 3124, Australia (a division of Pearson Australia Group Pty Ltd.)
Penguin Books India Pvt Ltd., 11 Community Centre, Panchsheel Park, New Delhi—110 017, India
Penguin Group (NZ), 67 Apollo Drive, Rosedale, North Shore, Auckland 1311, New Zealand (a division of Pearson New Zealand Ltd.)
Penguin Books (South Africa) (Pty.) Ltd., 24 Sturdee Avenue, Rosebank, Johannesburg 2196, South Africa
Penguin Books Ltd., Registered Offices: 80 Strand, London WC2R 0RL, England
Copyright © 2009 by W Michael Kelley
All rights reserved No part of this book shall be reproduced, stored in a retrieval system, or transmitted by any means, electronic, mechanical, photocopying, recording, or otherwise, without written permission from the publisher No patent liability is assumed with respect to the use of the information contained herein Although every precaution has been taken in the preparation of this book, the publisher and author assume no responsibility for errors or omissions Neither is any liability assumed for damages resulting from the use of information contained herein For information, address Alpha Books, 800 East 96th Street, Indianapolis,
IN 46240
Library of Congress Catalog Card Number: 2006926601
Note: This publication contains the opinions and ideas of its author It is intended to provide helpful and informative material on the subject matter covered It is sold with the understanding that the author and publisher are not engaged in rendering professional services in the book If the reader requires personal assistance or advice, a competent professional should be consulted
The author and publisher specifically disclaim any responsibility for any liability, loss, or risk, personal or otherwise, which is incurred as a consequence, directly or indirectly, of the use and application of any of the contents of this book
ISBN: 1-101-15010-6
Trang 5Introduction
Frequency Distributions 2
Histograms 5
Bar Charts 8
Pie Charts 14
Line Charts 19
Scatter Charts 21
Chapter 2: Calculating Descriptive Statistics: Measures of Central Tendency 25 Mean 26
Median 30
Midrange 32
Mode 33
Percentile 36
Weighted Mean 42
Mean of a Frequency Distribution 45
Mean of a Grouped Frequency Distribution 47
Chapter 3: Calculating Descriptive Statistics: Measures of Variation 51 Range 52
Interquartile Range 54
Outliers 58
Visualizing Distributions 62
Stem-and-Leaf Plot 66
Variance and Standard Deviation of a Population 71
Variance and Standard Deviation for Grouped Data 81
Chebyshev’s Theorem 85
HjbbVg^o^c\YViV^ciVWaZh!X]Vgih!VcY\gVe]h H]dl^c\ndjgYViV^cViVWaZ
H]dl^c\ndjg[gZfjZcXnY^hig^Wji^dc^cVX]Vgi
H]dl^c\ndjgXViZ\dg^XVaYViV^cVX]Vgi H]dl^c\ndjgXViZ\dg^XVaYViV^cVX^gXaZ
H]dl^c\ndjgYViVdkZgi^bZ^cVX]Vgi H]dl^c\gZaVi^dch]^ehWZilZZcildkVg^VWaZh^cVX]Vgi
;^cY^c\i]ZXZciZgd[i]Z YViV I]ZVkZgV\Z
G^\]ihbVX`^ci]Zb^YYaZ
;^cY^c\i]Zbdhi[gZfjZcikVajZ
DcVhXVaZ[gdb&id&%%
6kZgV\^c\jh^c\Y^[[ZgZcilZ^\]ih 6kZgV\^c\Y^hXgZiZYViV 8VaXjaVi^c\i]ZbZVcd[\gdjeZYYViV 9ZiZgb^c^c\i]ZY^heZgh^dcd[i]ZYViV
=dll^YZ^hndjgYViV4
;^cY^c\i]Zb^YYaZ*%eZgXZcid[i]ZYViV HZeVgVi^c\i]Z\ddYYViV[gdbi]ZWVY 7dm"VcY"l]^h`ZgeadihVcYY^hig^Wji^dcY^V\gVbh
I]ZÓdlZgedlZgd[YViV I]ZbdhiXdbbdclVnhidbZVhjgZY^heZgh^dc Ejii^c\i]ZhiVcYVgYYZk^Vi^dcidldg`
8VaXjaVi^c\Y^heZgh^dch[dg[gZfjZcXnY^hig^Wji^dch
=Va[lVnWZilZZci]ZZcYed^cih
Trang 6Chapter 4: Introduction to Probability 89
Types of Probability 90
Ađition Rules for Probability 98
Conditional Probability 106
The Multiplication Rule for Probability .116
Bayes’ Theorem 120
Chapter 5: Counting Principles and Probability Distributions 123 Fundamental Counting Principle 124
Permutations 127
Combinations 129
Probability Distributions .135
Chapter 6: Discrete Probability Distributions 141 Binomial Probability Distribution 142
Poisson Probability Distribution .149
The Poisson Distribution as an Approximation to the Binomial Distribution .156
Hypergeometric Probability Distribution 159
Chapter 7: Continuous Probability Distributions 165 Normal Pobability Distribution .166
The Empirical Rule .179
Using the Normal Distribution to Approximate the Binomial Distribution .182
Continuous Uniform Distribution .186
Exponential Distribution 189
Chapter 8: Sampling and Sampling Distributions 195 Probability Sampling .196
Sampling Distribution of the Mean 197
Finite Population Correction Factor .205
Sampling Distribution of the Proportion .207
Finite Population Correction Factor for the Sampling Distribution of the Proportion 215
L]ViVgZi]ZX]VcXZh4 HiVgi^c\l^i]i]ZWVh^Xh 8dbW^c^c\egdWVW^âi^Zhjh^c\ÆdgÇ EgdWVW^âi^Zhi]ViYZeZcYdcdi]ZgZkZcih IlddgbdgZZkZcihdXXjgg^c\Vii]ZhVbZi^bZ 6cdi]ZglVnidXVaXjaViZXdcY^i^dcVaegdWVW^âi^Zh
DYYhndjXVcXdjcidc
=dlegdWVWaZ^h^ii]ViildhZeVgViZZkZcihdXXjg4
=dlbVcnlVnhXVcndjVggVc\ZVXdaaZXi^dcd[i]^c\h4 L]Zci]ZdgYZgd[dW_ZXih^hcdi^bedgiVci
EgdWVW^âinjh^c\Y^hXgZiZYViV 7^cdb^Va!Ed^hhdc!VcY]neZg\ZdbZig^X Jh^c\XdZ[ÒX^Zcihi]ViVgZXdbW^cVi^dch 9ZiZgb^c^c\egdWVW^âi^ZhdkZgheZX^ÒX^ciZgkVah
6W^cdb^Vah]dgiXji
9ZiZgb^c^c\egdWVW^âi^Zhl]ZcZkZcihVgZcdi^cYZeZcYZci
7ZaaXjgkZhVcYo"hXdgZh DcZ!ild!VcYi]gZZhiVcYVgYYZk^Vi^dch[gdbi]ZbZVc
<d^c\[gdbVWZaa"h]VeZYidVWdm"h]VeZYY^hig^Wji^dc ẦZi]ZEd^hhdcY^hig^Wji^dc!WjiXdci^cjdjh
GVcYdbkVg^VWaZhi]ViVgZcÉil]daZcjbWZgh 6cdi]ZgW^cdb^VaegdWVW^âinh]dgiXji
Ldg`^c\l^i]VhjWhZid[VedejaVi^dc HdbVcnlVnhid\Vi]ZgVhVbeaZ
EgZY^Xi^c\i]ZWZ]Vk^dgd[hVbeaZbZVch
EgZY^Xi^c\egdedgi^dch[gdbVhbVaaedejaVi^dc
H Vbeâc\Y^hig^Wji^dcd[i]ZbZVcl^i]VhbVaaedejaVi^dc EgZY^Xi^c\i]ZWZ]Vk^dgd[Y^hXgZiZgVcYdbkVg^VWaZh
Trang 7Chapter 9: Confidence Intervals 219
Introduction to Confidence Intervals for the Mean .220
Confidence Intervals for the Mean with Large Samples and Sigma Known 221
Confidence Intervals for the Mean with Small Samples and Sigma Known 225
Confidence Intervals for the Mean with Small Samples and Sigma Unknown .229
Confidence Intervals for the Mean with Large Samples and Sigma Unknown .235
Confidence Intervals for the Proportion .239
Chapter 10: Hypothesis Testing for a Single Population 243 Introduction to Hypothesis Testing for the Mean .244
Hypothesis Testing for the Mean with n v 30 and Sigma Known .247
Hypothesis Testing for the Mean with n < 30 and Sigma Known .255
Hypothesis Testing for the Mean with n < 30 and Sigma Unknown .259
Hypothesis Testing for the Mean with n > 30 and Sigma Unknown .265
Hypothesis Testing for the Proportion .271
Chapter 11: Hypothesis Testing for Two Populations 279 Hypothesis Testing for Two Means with n > 30 and Sigma Known .280
Hypothesis Testing for Two Means with n < 30 and Sigma Known .286
Hypothesis Testing for Two Means with n < 30 and Sigma Unknown .289
Hypothesis Testing for Two Means with n v 30 and Sigma Unknown .299
Hypothesis Testing for Two Means with Dependent Samples .302
Hypothesis Testing for Two Proportions .309
Chapter 12: Chi-Square and Variance Tests 317 Chi-Square Goodness-of-Fit Test 318
Chi-Square Test for Independence .331
Hypothesis Test for a Single Population Variance .338
Hypothesis Test for Two Population Variances .346
Chapter 13: Analysis of Variance 351 One-Way ANOVA: Completely Randomized Design .352
One-Way ANOVA: Randomized Block Design .371
=dljcgZegZhZciVi^kZXdjaYVhVbeaZWZ4 8ZcigVaâb^ii]ZdgZbidi]ZgZhXjZ Â[Zl^i]djii]ZXZcigVaâb^ii]ZdgZb
>cigdYjX^c\i]ZHijYZciÉhi"Y^hig^Wji^dc
:hi^bVi^c\eZgXZciV\Zh[gdbVedejaVi^dc LZaXdbZWVX`!XZcigVaâb^ii]ZdgZb
L]ViVgZcjaaVcYVaiZgcVi^kZ]nedi]ZhZh4
IZhi^c\eZgXZciV\Zh^chiZVYd[bZVch
I^bZidgZ_ZXihdbZcjaa]nedi]ZhZh
8Vaâc\dci]ZXZcigVaâb^ii]ZdgZbdcXZV\V^c I]ZncZZYidWZcdgbVaanY^hig^WjiZY
7g^c\^c\WVX`i]Zi"Y^hig^Wji^dc ẦZi]ZaVhihZXi^dc!Wjil^i]o"hXdgZh
=nedi]Zh^ôc\
8dbeVg^c\ildedejaVi^dcbZVch
Cdh^\bV hbVaahVbeaZh2i"Y^hig^Wji^dc
Oh^chiZVYd[Ih L]Vi]VeeZchl]Zci]ZildhVbeaZhVgZgZaViZY4 8dbeVg^c\edejaVi^dceZgXZciV\Zh L]ZcedejaVi^dchcZZYidWZcdgbVaanY^hig^WjiZY
IZhi^c\XViZ\dg^XVaYViV[dgkVg^Vi^dc
>hi]ZYViVY^hig^WjiZYi]ZlVnndji]dj\]i^ildjaYWZ4
6gZi]ZkVg^VWaZhgZaViZY4 IZhi^c\kVg^Vi^dc^chiZVYd[i]ZbZVc
>cigdYjX^c\i]Z;"Y^hig^Wji^dc 8dbeVg^c\bjai^eaZbZVchl^i]i]Z;"Y^hig^Wji^dc
I]ZbdhiWVh^X6CDK6egdXZYjgZ 6YY^c\VWadX`^c\kVg^VWaZidi]ZiZhi Ejii^c\hVbeaZhidldg`
Trang 8Chapter 14: Correlation and Simple Regression Analysis 389
Correlation 390
Simple Regression Analysis .396
Chapter 15: Nonparametric Tests 413 The Sign Test with a Small Sample Size 414
The Sign Test with a Large Sample Size 418
The Paired-Sample Sign Test (n f 25) 421
The Paired-Sample Sign Test (n # 25) 423
The Wilcoxon Rank Sum Test for Small Samples .425
The Wilcoxon Rank Sum Test for Large Samples .428
The Wilcoxon Signed-Rank Test .431
The Kruskal-Wallis Test 436
The Spearman Rank Correlation Coefficient Test .442
Chapter 16: Forecasting 449 Simple Moving Average .450
Weighted Moving Average .454
Exponential Smoothing 458
Exponential Smoothing with Trend Adjustment 462
Trend Projection and Seasonality .468
Causal Forecasting 477
Chapter 17: Statistical Process Control 483 Introduction to Statistical Process Control 484
Statistical Process Control for Variable Measurement .484
Statistical Process Control for Attribute Measurement Using p-charts 491
Statistical Process Control for Attribute Measurement Using c-charts 495
Process Capability Ratio 498
Process Capability Index .500
;^cY ^c\gZaVi^dch]^ehWZilZZcildkVg^VWaZh 9ZhXg^W^c\i]ZhigZc\i]VcYY^gZXi^dcd[VgZaVi^dch]^e
A^cZd[WZhiÒi
IZhihi]ViYdcdigZfj^gZVhhjbei^dchVWdjii]ZedejaVi^dch
IZhii]ZbZY^Vcd[VhVbeaZ IZhibZY^Vchjh^c\o"hXdgZh 6eeani]Zh^\ciZhiidildYZeZcYZciYViVhZih
JhZo"hXdgZhidbZVhjgZgVc`Y^[[ZgZcXZh 9^[[ZgZcXZ^cbV\c^ijYZWZilZZcYZeZcYZcihVbeaZh 8dbeVg^c\bdgZi]VcildedejaVi^dch
8dggZaVi^c\YViVhZihVXXdgY^c\idgVc`Y^[[ZgZcXZh
8dbW^cZi]Zh^\ciZhiVcYo"hXdgZhidiZhieV^gZYYViV I]ZbV\c^ijYZd[Y^[[ZgZcXZhWZilZZcildhVbeaZh
EgZY^Xi^c\[jijgZkVajZhd[gVcYdbkVg^VWaZh I]ZbdhiWVh^X[dgZXVhi^c\iZX]c^fjZ GZXZciYViV^hlZ^\]iZYbdgZ]ZVk^an 6hZa["XdggZXi^c\[dgZXVhi^c\iZX]c^fjZ
6YYigZcYhidi]ZhZa["XdggZXi^c\bZi]dY 6XXdjci[dgigZcYhVcYhZVhdcVa^cÓjZcXZh I]Z^cYZeZcYZcikVg^VWaZYdZhcÉi]VkZidWZi^bZ
Jh^c\hiVi^hi^XhidbZVhjgZfjVa^in
BZVcVcYgVc\ZXdcigdaX]Vgih 8VaXjaViZi]Zegdedgi^dcd[YZ[ZXi^kZ^iZbh 8djci^c\i]ZcjbWZgd[YZ[ZXi^kZ^iZbh
>hVegdXZhhXVeVWaZd[eZg[dgb^c\VXXdgY^c\idYZh^\c4 BZVhjg^c\XVeVW^a^in[dgVegdXZhhi]Vi]Vhh]^[iZY
:meadg^c\i]ZY^[[ZgZciineZhd[fjVa^inbZVhjgZbZci
Trang 9Chapter 18: Contextualizing Statistical Concepts 503
Trang 10>cigdYjXi^dc 6gZndj^cVhiVi^hi^XhXaVhh4NZh4I]ZcndjcZZYi]^hWđ`#=ZgZÉhl]n/
;VXi&/I]ZWZhilVnidaZVgchiVi^hi^Xh^hWnldg`^c\djihiVi^hi^XhegdWaZbh#I]ZgZÉh cdYZcn^c\^i#>[ndjXdjaYÒ\jgZi]^hXaVhhdji_jhiWngZVY^c\i]ZiZmiWđ`dgiV`^c\
\đYcdiZh^cXaVhh!ZkZgnWdYnldjaYeVhhl^i]Ón^c\Xdadgh#Jc[dgijcViZan!i]Z]Vgh] igji]^hi]Vindj]VkZidWjX`aZYdlcVcYldg`egdWaZbhdjijci^andjgÒc\ZghVgZ cjbW#
;VXi'/BdhiiZmiWđ`hdcaniZaandjl]Vii]ZVchlZghidi]Z^gegVXi^XZegdWaZbhVgZ Wjicdi]dlidYdi]ZbHjgZndjgiZmiWđ`bVn]VkZ&,*egdWaZbh[dgZkZgnidêX! Wjibdhid[i]Zbdcan\^kZndji]ZVchlZgh#I]VibZVch^[ndjYdcÉi\Zii]ZVchlZg g^\]indjÉgZidiVaanhXgZlZY@cdl^c\ndjÉgZlgdc\^hcd]ZaeViVaa^[ndjYdcÉi`cdlL=N ndjÉgZlgdc\#HiVi^hi^XhiZmiWđ`hh^idcV]j\Zi]gdcZ!ầZi]Z<gZViVcYIZgg^WaZ DoVcYhVn!ÆCdeZ!ignV\V^c!ÇVcYndjYd#DkZgVcYdkZg!jhjVaan\Zii^c\i]ZegdWaZb lgdc\#L]ViVYZâ\]i[jalVnidaZVgcAZiÉhcdiZkZc\Zi^cidl]ni]ZndcaniZaandj i]ZVchlZghidi]ZdYYegdWaZbh#9dZhi]VibZVci]ZWđ`ÉhVXijVaVji]dgY^YcÉi ZkZc[ZZaầZldg`^c\djii]ZZkZcdcZh4
;VXi(/:kZcl]ZcbVi]Wđ`hignidh]dlndji]ZhiZeh[dgVegdWaZb!i]ZnYdV adjhn_dW#BVi]eZdeaZadkZidh`^ehiZeh#NdjÉaaWZ[daadl^c\Vadc\ÒcZl^i]VcZmeaVcVi^dc VcYi]ZcVaad[VhjYYZcWVb!ndjÉgZadhi#NdjÉaai]^c`idndjghZa[!Æ=dlY^Yi]ZnYd i]Vi4ÇdgÆL]ZgZi]Z]ZX`Y^Yi]Vi)'XdbZ[gdb4>ilVhcÉii]ZgZ^ci]ZaVhihiZeÇ L]nYdVabdhiVaad[i]ZhZWđ`hVhhjbZi]Vi^cdgYZgidldg`djiVegdWaZbdceV\Z '%%!ndjÉYWZiiZg`cdleV\Zh&i]gdj\]& ầZi]ZWVX`d[ndjg]VcY4NdjYdcÉilVci idheZcYi]ZgZhid[ndjgâ[Zdc]dbZldg`Ndj_jhilVciid`cdll]nndj`ZZe\Zii^c\ VcZ\Vi^kZcjbWZgl]ZcndjÉgZign^c\idXVaXjaViZi]ZegdWVW^âind[YgVl^c\V[jaa ]djhZ[gdbVYZX`d[XVgYh!l]^X]ndjl^aaaZVgc^c8]VeiZg*#
;VXi)/GZVY^c\âhihd[[VXih^h[jc[dgVl]^aZ!Wjii]Zc^i\ZihdaY#AZiÉhXjiid i]ZX]VhZ#?jhiVWdjiZkZgnh^c\aZ`^cYd[hiVi^hi^XhegdWaZbndjXdjaYedhh^Wangjc
^cid^h^c]ZgZÅV[iZgVaa!i]^hWđ`^h]jbdc\djh>[Vi]djhVcYegdWaZbhVgZcÉiZcdj\]! i]ZcndjÉkZ\dihdbZ`^cYd[XgVonhiVih]jc\Zg!bn[g^ZcY!VcY>ÉYhZZ`egd[Zhh^dcVa ]Zae#I]^hegVXi^XZWđ`lVh\đYViÒghi!WjiidbV`Z^i\gZVi!lZlZcii]gdj\]VcY ldg`ZYdjiVaai]ZegdWaZbhVcYiđ`cdiZh^ci]ZbVg\^chl]ZclZi]dj\]ihdbZi]^c\ lVhXdc[jh^c\dgcZZYZYVâiiaZbdgZZmeaVcVi^dc#LZVahdYgZlâiiaZh`jaahcZmiid i]Z]VgYZhiegdWaZbh!hdndjÉY`cdlcdiid[gZV`dji^[i]ZnlZgZiđX]VaaZc\^c\# 6[iZgVaa!^[ndjÉgZldg`^c\dcVegdWaZbVcYndjÉgZidiVaanhijbeZY!^hcÉi^iWZiiZgid
Trang 11LZi]^c`ndjÉaaWZeaZVhVcianhjgeg^hZYWn]dlYZiV^aZYi]ZVchlZgZmeaVcVi^dchVgZ!VcYlZ ]deZndjÉaaÒcYdjga^iiaZcdiZh]Zae[jaVadc\i]ZlVn#8VaajhXgVon!WjilZi]^c`i]VieZdeaZ l]dlVciidaZVgchiVi^hi^XhVcYVgZl^aa^c\idheZcYi]Zi^bZYg^aa^c\i]Z^glVni]gdj\]
egVXi^XZegdWaZbhh]djaYVXijVaanWZVWaZidÒ\jgZi]ZegdWaZbhdjiVcYaZVgcVhi]Zn\d! Wjii]ViÉh_jhidjg'#
DcZÒcValdgYd[lVgc^c\#6adid[hiVi^hi^XhXaVhhZhjhZXVaXjaVidghi]Vildg`Vaad[i]Z
[dgbjaVhdji[dgndj#>ci]^hWdd`!lZÉgZh]dl^c\ndj]dli]Z[dgbjaVhldg`!Wjii]ZgZÉhV
adid[Vg^i]bZi^X^ckdakZY!hdlZgdjcYi]ZYZX^bVahd[[#I]^hbZVchi]ZXVaXjaVidgVchlZg bVnY^[[ZgViZZcnW^i[gdbdjgVchlZgh!Wjii]^hlVnndjVXijVaan\ZiidhZZi]ZhiZehVcY jcYZghiVcYl]ViÉh\d^c\dc#
>cX#XVccdiViiZhiidi]ZVXXjgVXnd[i]^h^c[dgbVi^dc#JhZd[ViZgb^ci]^hWdd`h]djaYcdi WZgZ\VgYZYVhV[[ZXi^c\i]ZkVa^Y^ind[VcnigVYZbVg`dghZgk^XZbVg`#
Trang 129ZY^XVi^dc 7dW/I]^hWdd`ldjaYcdi]VkZWZZcedhh^WaZl^i]djii]Zadk^c\hjeedgid[bnl^[ZVcYWZhi [g^ZcY!9ZWW^Z#NdjgZcXdjgV\ZbZciVcYWZa^Z[^cbZlZgZVXdchiVcihdjgXZd[^che^gVi^dc# I]Vc`ndj[dgndjgjcZcY^c\eVi^ZcXZl^i]bZYjg^c\i]^hegd_ZXi#>adkZndjValVnh# B^`Z/;dgA^hV!C^X`!:g^c!VcYHVgV!i]Z[djggZVhdchVcni]^c\^cbna^[Z^hldgi]Yd^c\#
Trang 13Chapter 1
DISPLAYING DESCRIPTIVE STATISTICS
The main focus of descriptive statistics is to summarize and present data This chapter demonstrates a variety of techniques available to display descriptive statistics Presenting data graphically allows the user to extract information more efficiently.
I]ZgZVgZbVcnY^[[ZgZciiddahVkV^aVWaZ[dgY^heaVn^c\YZhXg^ei^kZ hiVi^hi^Xh#;gZfjZcXnY^hig^Wji^dchVgZVh^beaZlVnidhjbbVg^oZgVl YViV^ciVWaZhVcYbV`Zi]Z^c[dgbVi^dcbdgZjhZ[ja#=^hid\gVbhXdckZgi i]ZhZiVWaZh^cidX]VgihVcYegdk^YZVe^XijgZd[i]ZYViV#7VgVcYe^Z X]Vgihd[[ZgVkVg^Zind[lVnhidY^heaVnXViZ\dg^XVaYViV#;^cVaan!a^cZ VcYhXViiZgX]VgihVaadlndjidk^ZlgZaVi^dch]^ehWZilZZcildkVg^VWaZh V\gVe]^XVa[dgbVi# ^c
Trang 141.1 Develop a frequency distribution summarizing this data.
A frequency distribution is a two-column table In the left column, list each value in the data set from least to greatest Count the number of times each value appears and record those totals in the right column
Note: Problems 1.1–1.3 refer to the data set in Problem 1.1, the daily demand for hammers at
a hardware store over the last 20 days.
1.2 Develop a relative frequency distribution for the data
Divide the frequency of each daily demand by the total number of data values (20)
Daily Demand Frequency Relative Frequency
Trang 15Note: Problems 1.1–1.3 refer to the data set in Problem 1.1, the daily demand for hammers at
a hardware store over the last 20 days.
1.3 Develop a cumulative relative frequency distribution for the data
Note: Problems 1.4–1.6 refer to the data set below, the number of calls per day made from a
cell phone for the past 30 days.
Cell Phone Calls per Day
1.4 Develop a frequency distribution summarizing the data
Because this data has many possible outcomes, you should group the number
of calls per day into groups, which are known as classes One option is the 2 k v n
rule to determine the number of classes, where k equals the number of classes
and n equals the number of data points Given n = 30, the best value for k is 5.
Calculate the width W of each class.
Set the size of each class to 3 and list the classes in the left column of the
frequency distribution Count the number of values contained in each group
and list those values in the right column
I]ZXjbjaVi^kZ gZaVi^kZ[gZfjZcXn [dgVeVgi^XjaVggdl^hi]Z gZaVi^kZ[gZfjZcXn[dgi]Vi gdleajhi]ZXjbjaVi^kZ gZaVi^kZ[gZfjZcXn[dgi]Z egZk^djhgdl#I]ZaVhiXjbj" aVi^kZgZaVi^kZ[gZfjZcXn h]djaYValVnhWZ&#%%#
7ZXVjhZ')
2(' (%#
Trang 16Calls per Day Frequency
1.5 Develop a relative frequency distribution for the data
Divide the frequency of each class by the total number of data values (30)
Calls per Day Frequency Relative Frequency
1.6 Develop a cumulative relative frequency distribution for the data
The cumulative relative frequency for a particular row is the relative frequency (calculated in Problem 1.5) for that row plus the cumulative relative frequency for the previous row
Trang 17Zm^hihWZilZZc XdajbchWZXVjhZ i]ZYViV^h
Y^hXgZiZ#9^hXgZiZ YViV^hYViVi]Vi i]ViXVciV`ZdcV XdjciVWaZcjbWZgd[ edhh^WaZkVajZh!^c i]^hXVhZZ^i]Zg 6!7!8!9!dg;#
Trang 181.8 Develop a histogram for the frequency distribution below, the commuting distance for 50 employees of a particular company.
First, develop a frequency distribution for the data Using the 2k v n rule, set
k = 6 because 26= 64 v 40 Calculate the width W of each class.
Set the size of each class equal to 10 and count the number of values contained
Trang 19Miles per Tank Frequency
The height of each bar in the histogram reflects the frequency for each group
of miles per tank of gas
1.10 Develop a histogram for the data set below, the number of home runs hit by 40
Major League Baseball players during the 2008 season
Develop a frequency distribution for the data Apply the 2k v n rule and set k = 6
because 26= 64 v 40 Calculate the width W of each class.
Set the size of each class equal to 4 and count the number of values contained
Trang 20Home Runs Frequency
Trang 21A column bar chart uses vertical bars to represent categorical data The height
of each bar corresponds to the value of each category
1.12 Construct a column bar chart for the data below, a company’s monthly sales
^hdg\Vc^oZY^c Y^hXgZiZ\gdjeh!hjX] Vhi]Zbdci]h^c i]^hegdWaZb#
Trang 221.13 Construct a horizontal bar chart for the data set below, weekly donations collected at a local church.
Trang 23Note: Problems 1.15–1.16 refer to the data set below, weekly sales data in units for
1.15 Construct a grouped column bar chart for the data, grouping by week
Because there are two data values for each time period (a value for Store 1 and
a value for Store 2), you should use a grouped column bar chart
I]ZhVaZhYViV [dgZVX]lZZ` XdbeVgZhi]ZhVaZh [dgZVX]hidgZ#
Trang 24Note: Problems 1.15–1.16 refer to the data set in Problem 1.15, weekly sales data in units for two stores.
1.16 Construct a stacked column bar chart for the data, grouping by store
Each column represents the total units sold each week between the two stores
Note: Problems 1.17–1.20 refer to the data set below, the investment portfolio for three different investors in thousands of dollars.
Investor 1 Investor 2 Investor 3
1.17 Construct a grouped horizontal bar chart, grouping by investor
Three horizontal bars are arranged side-by-side for each investor, indicating the amount of each investment type
Trang 25Note: Problems 1.17–1.20 refer to the data set in Problem 1.17, showing the investment
portfolio for three different investors in thousands of dollars.
1.18 Construct a grouped horizontal bar chart, grouping by investment type
Arrange three horizontal bars representing the investors, side-by-side, for each
investment type
Note: Problems 1.17–1.20 refer to the data set in Problem 1.17, the investment portfolio for
three different investors in thousands of dollars.
1.19 Construct a stacked horizontal bar chart, grouping by investor
Each investor is represented by three horizontally stacked bars that indicate that
investor’s total investments by type
:VX]WVg
^ci]^hhiVX`ZY X]VgigZegZhZcih i]ZidiVa^ckZhibZcih d[Vh^c\aZ^ckZhidg# I]ZidiVaaZc\i]d[ ZVX]WVggZegZhZcih i]ZidiVaVbdjci
^ckZhiZYWnZVX] eZghdc#
Trang 26Note: Problems 1.17–1.20 refer to the data set in Problem 1.17, the investment portfolio for three different investors in thousands of dollars.
1.20 Construct a stacked horizontal bar chart, grouping by investment type.Represent each investment type using three horizontally stacked bars
Trang 27Multiply each relative frequency distribution by 360 to calculate the
corresponding central angle for each category in the pie chart A central
angle has a vertex at the center of the circle and sides that intersect the circle,
defining the boundaries of each category in a pie chart
Grade Relative Frequency Central Angle
The central angle determines the size of each pie segment
1.22 Construct a pie chart for the data in the table below, the number of total wins
recorded by the Green Bay Packers football team in five recent seasons
I]ZXZcigVa Vc\aZ[dgi]^h XViZ\dgn^h-(ï#
Trang 28Year Number of Wins Relative Frequency
Trang 291.23 Construct a pie chart for the data in the table below, an individual investor’s
The total investment is $78,000 Divide the figure for each category by this
number to determine the percentage of the total investment each category
Multiply each percentage by 360 to calculate the central angle for each category
in the pie chart
Use the central angles calculated above to draw appropriately sized sectors of
the pie chart If you have difficulty visualizing angles, use a protractor
I]Zh^oZd[
i]ZhZ\bZci
XdggZhedcYhidi]Z eZgXZciV\Zd[i]Z
^ckZhibZci^ci]Vi XViZ\dgn#
Trang 301.24 Construct a pie chart for the frequency distribution below, the daily high temperature (in degrees Fahrenheit) in a particular city over the last 40 days.
Determine the relative frequency distribution for each temperature range
Daily High Temperature Frequency Relative Frequency
Calculate the central angle for each category in the pie chart
Trang 31Line Charts
9ViVdkZgi^bZ^cVX]Vgi
1.25 Construct a line chart for the data in the table below, the number of wins
recorded by the Philadelphia Phillies for seven seasons
Place the time variable (year) on the x-axis and place the variable of interest
(wins) on the y-axis.
1.26 Construct a line chart for the data in the table below, the percent change in
annual profit for a company by year
Trang 32Place the time variable (year) on the x-axis and place the variable of interest (percent change) on the y-axis.
1.27 Construct a line chart for the data in the table below, the population of Delaware by decade during the 1800s
Trang 33Scatter Charts
>aajhigViZgZaVi^dch]^ehWZilZZcildkVg^VWaZh
1.28 Construct a scatter chart for the data in the table below, the number of hours
eight students studied for an exam and the scores they earned on the exam
Place the independent variable (study hours) on the x-axis and the dependent
variable (exam score) on the y-axis.
I]ZYZeZcYZci kVg^VWaZ^hi]Z kVg^VWaZi]ViX]Vc\Zh VhVgZhjaid[X]Vc\Zh
^ci]Z^cYZeZcYZci kVg^VWaZ#HijYn^c\adc\Zg h]djaY\^kZndjV]^\]Zg
\gVYZ#I]ZgZkZghZ YdZhcÉibV`ZhZchZÅ
\Zii^c\V]^\]ZgiZhi hXdgZYdZhcÉigZhjai^cndj hijYn^c\adc\Zg[dgi]Vi iZhi#
HdbZhiVi^hi^Xh Wdd`hjhZÆZmeaVcVidgnÇ VcYÆgZhedchZÇ^chiZVY d[Æ^cYZeZcYZciÇVcY ÆYZeZcYZci#Ç
Trang 341.29 Construct a scatter chart for the data below, the mileage and selling price of eight used cars.
Trang 351.30 Construct a scatter chart for the data in the table below, eight graduate
students’ grade point averages (GPA) and entrance exam scores for M.B.A
Place the independent variable (GMAT) on the x-axis and the dependent
variable (GPA) on the y-axis.
<B6IhXdgZ
^hi]Z^cYZeZcYZci kVg^VWaZWZXVjhZV
<B6IhXdgZ^hhjeedhZY idegZY^Xi]dllZaa ndjaaYd^c\gVYjViZ hX]dda#
Trang 37Chapter 2
CALCULATING DESCRIPTIVE STATISTICS: MEASURES OF CENTRAL TENDENCY
One of the most common roles served by descriptive statistics is termining the central tendency of data This chapter investigates the primary means by which the “center” of a data set can be described, including the mean, median, mode, and percentiles.
de-9ViVbVnWZgZedgiZY^cZ^i]ZggVldghjbbVg^oZY[dgb#IdjcYZghiVcY i]ZXZcigVaiZcYZcXnd[gVlYViV!i]ZbZVc!bZY^Vc!bdYZ!VcYl Z^\]iZY bZVcXVcWZkZgn]Zae[ja#I]Zedh^i^dcd[gVlYViVXVcWZYZhXg^WZYjh^ c\ eZgXZci^aZh#HjbbVg^oZYYViV!^ci]Z[dgbd[[gZfjZcXnY^hig^Wji^dch!XVc WZYZhXg^WZYjh^c\VlZ^\]iZYbZVciZX]c^fjZ#
;^cY^c\i]ZXZciZgd[i]ZYViV
Trang 38Mean
I]ZVkZgV\Z
2.1 The table below lists the number of students enrolled in the five different statistics courses offered by a college Calculate the mean number of students per class
Number of Students
The mean µ of a data set is the sum of the data divided by the population size.
Notice that a mean can be heavily influenced by an extreme value in the data The only difference between the enrollment numbers in Problems 2.1 and 2.2
is the size of the last class That class increased from 15 to 75 students and caused the mean class size to grow from 21.2 to 33.2
2.3 The table below lists the time, in minutes, it takes seven random customers to check out at a local grocery store Calculate the mean time it takes a customer
Trang 392.4 A consumer group tested the gas mileage of two car models in five trials
Which model averages more miles per gallon?
Trial 1 Trial 2 Trial 3 Trial 4 Trial 5
Model B averages more miles per gallon than Model A
2.5 The table below reports the number of minutes eight randomly selected airline
flights were either early (negative values) or late (positive values) arriving at
their destinations Calculate the sample mean
Number of Minutes Early or Late
Combine the positive and negative values and divide by the sample size
The average flight is 8.6 minutes late
2.6 The following table lists the daily percent increase (or decrease) of a stock
price over a five-day period Calculate the mean daily change in the stock price
Percent Increase or Decrease
If the sum is less than zero, the average change is a decrease, whereas a
positive sum indicates an average increase
The stock price decreased an average of 1.76 percent per day
6YY^c\&'VcY Ä&%^hi]ZhVbZ VhhjWigVXi^c\
&%[gdb&'/
&' Ä&%2&'Ä&%#
>[i]ZVkZgV\Z lVhÄ-#+!^ildjaY bZVci]Vii]Z VkZgV\ZÓ^\]ilVh -#+b^cjiZhZVgan#
Trang 402.7 The table below lists the points scored by three basketball players over six games Identify the player with the highest average points per game.
Player Points Scored per Game
Calculate each player’s scoring average separately
Paul averaged the most points per game, 21.7
Note: Problems 2.8–2.9 refer to the data set below, the daily demand for tires at a particular store over a seven-day period.
The demand forecast for January 11 is 29.3 tires