1. Trang chủ
  2. » Kinh Doanh - Tiếp Thị

2009 the humongous book of statistics

562 77 0

Đang tải... (xem toàn văn)

Tài liệu hạn chế xem trước, để xem đầy đủ mời bạn chọn Tải xuống

THÔNG TIN TÀI LIỆU

Thông tin cơ bản

Định dạng
Số trang 562
Dung lượng 12,95 MB

Các công cụ chuyển đổi và chỉnh sửa cho tài liệu này

Nội dung

Chapter 9: Confidence Intervals 219Introduction to Confidence Intervals for the Mean ...220 Confidence Intervals for the Mean with Large Samples and Sigma Known ...221 Confidence Intervals f

Trang 4

Published by the Penguin Group

Penguin Group (USA) Inc., 375 Hudson Street, New York, New York 10014, USA

Penguin Group (Canada), 90 Eglinton Avenue East, Suite 700, Toronto, Ontario M4P 2Y3, Canada (a division of Pearson Penguin Canada Inc.)

Penguin Books Ltd., 80 Strand, London WC2R 0RL, England

Penguin Ireland, 25 St Stephen’s Green, Dublin 2, Ireland (a division of Penguin Books Ltd.)Penguin Group (Australia), 250 Camberwell Road, Camberwell, Victoria 3124, Australia (a division of Pearson Australia Group Pty Ltd.)

Penguin Books India Pvt Ltd., 11 Community Centre, Panchsheel Park, New Delhi—110 017, India

Penguin Group (NZ), 67 Apollo Drive, Rosedale, North Shore, Auckland 1311, New Zealand (a division of Pearson New Zealand Ltd.)

Penguin Books (South Africa) (Pty.) Ltd., 24 Sturdee Avenue, Rosebank, Johannesburg 2196, South Africa

Penguin Books Ltd., Registered Offices: 80 Strand, London WC2R 0RL, England

Copyright © 2009 by W Michael Kelley

All rights reserved No part of this book shall be reproduced, stored in a retrieval system, or transmitted by any means, electronic, mechanical, photocopying, recording, or otherwise, without written permission from the publisher No patent liability is assumed with respect to the use of the information contained herein Although every precaution has been taken in the preparation of this book, the publisher and author assume no responsibility for errors or omissions Neither is any liability assumed for damages resulting from the use of information contained herein For information, address Alpha Books, 800 East 96th Street, Indianapolis,

IN 46240

Library of Congress Catalog Card Number: 2006926601

Note: This publication contains the opinions and ideas of its author It is intended to provide helpful and informative material on the subject matter covered It is sold with the understanding that the author and publisher are not engaged in rendering professional services in the book If the reader requires personal assistance or advice, a competent professional should be consulted

The author and publisher specifically disclaim any responsibility for any liability, loss, or risk, personal or otherwise, which is incurred as a consequence, directly or indirectly, of the use and application of any of the contents of this book

ISBN: 1-101-15010-6

Trang 5

Introduction

Frequency Distributions 2

Histograms 5

Bar Charts 8

Pie Charts 14

Line Charts 19

Scatter Charts 21

Chapter 2: Calculating Descriptive Statistics: Measures of Central Tendency 25 Mean 26

Median 30

Midrange 32

Mode 33

Percentile 36

Weighted Mean 42

Mean of a Frequency Distribution 45

Mean of a Grouped Frequency Distribution 47

Chapter 3: Calculating Descriptive Statistics: Measures of Variation 51 Range 52

Interquartile Range 54

Outliers 58

Visualizing Distributions 62

Stem-and-Leaf Plot 66

Variance and Standard Deviation of a Population 71

Variance and Standard Deviation for Grouped Data 81

Chebyshev’s Theorem 85

HjbbVg^o^c\YViV^ciVWaZh!X]Vgih!VcY\gVe]h H]dl^c\ndjgYViV^cViVWaZ

H]dl^c\ndjg[gZfjZcXnY^hig^Wji^dc^cVX]Vgi

H]dl^c\ndjgXViZ\dg^XVaYViV^cVX]Vgi H]dl^c\ndjgXViZ\dg^XVaYViV^cVX^gXaZ

H]dl^c\ndjgYViVdkZgi^bZ^cVX]Vgi H]dl^c\gZaVi^dch]^ehWZilZZcildkVg^VWaZh^cVX]Vgi

;^cY^c\i]ZXZciZgd[i]Z YViV I]ZVkZgV\Z

G^\]ihbVX`^ci]Zb^YYaZ

;^cY^c\i]Zbdhi[gZfjZcikVajZ

DcVhXVaZ[gdb&id&%%

6kZgV\^c\jh^c\Y^[[ZgZcilZ^\]ih 6kZgV\^c\Y^hXgZiZYViV 8VaXjaVi^c\i]ZbZVcd[\gdjeZYYViV 9ZiZgb^c^c\i]ZY^heZgh^dcd[i]ZYViV

=dll^YZ^hndjgYViV4

;^cY^c\i]Zb^YYaZ*%eZgXZcid[i]ZYViV HZeVgVi^c\i]Z\ddYYViV[gdbi]ZWVY 7dm"VcY"l]^h`ZgeadihVcYY^hig^Wji^dcY^V\gVbh

I]ZÓdlZgedlZgd[YViV I]ZbdhiXdbbdclVnhidbZVhjgZY^heZgh^dc Ejii^c\i]ZhiVcYVgYYZk^Vi^dcidldg`

8VaXjaVi^c\Y^heZgh^dch[dg[gZfjZcXnY^hig^Wji^dch

=Va[lVnWZilZZci]ZZcYed^cih

Trang 6

Chapter 4: Introduction to Probability 89

Types of Probability 90

Ađition Rules for Probability 98

Conditional Probability 106

The Multiplication Rule for Probability .116

Bayes’ Theorem 120

Chapter 5: Counting Principles and Probability Distributions 123 Fundamental Counting Principle 124

Permutations 127

Combinations 129

Probability Distributions .135

Chapter 6: Discrete Probability Distributions 141 Binomial Probability Distribution 142

Poisson Probability Distribution .149

The Poisson Distribution as an Approximation to the Binomial Distribution .156

Hypergeometric Probability Distribution 159

Chapter 7: Continuous Probability Distributions 165 Normal Pobability Distribution .166

The Empirical Rule .179

Using the Normal Distribution to Approximate the Binomial Distribution .182

Continuous Uniform Distribution .186

Exponential Distribution 189

Chapter 8: Sampling and Sampling Distributions 195 Probability Sampling .196

Sampling Distribution of the Mean 197

Finite Population Correction Factor .205

Sampling Distribution of the Proportion .207

Finite Population Correction Factor for the Sampling Distribution of the Proportion 215

L]ViVgZi]ZX]VcXZh4 HiVgi^c\l^i]i]ZWVh^Xh 8dbW^c^c\egdWVW^âi^Zhjh^c\ÆdgÇ EgdWVW^âi^Zhi]ViYZeZcYdcdi]ZgZkZcih IlddgbdgZZkZcihdXXjgg^c\Vii]ZhVbZi^bZ 6cdi]ZglVnidXVaXjaViZXdcY^i^dcVaegdWVW^âi^Zh

DYYhndjXVcXdjcidc

=dlegdWVWaZ^h^ii]ViildhZeVgViZZkZcihdXXjg4

=dlbVcnlVnhXVcndjVggVc\ZVXdaaZXi^dcd[i]^c\h4 L]Zci]ZdgYZgd[dW_ZXih^hcdi^bedgiVci

EgdWVW^âinjh^c\Y^hXgZiZYViV 7^cdb^Va!Ed^hhdc!VcY]neZg\ZdbZig^X Jh^c\XdZ[ÒX^Zcihi]ViVgZXdbW^cVi^dch 9ZiZgb^c^c\egdWVW^âi^ZhdkZgheZX^ÒX^ciZgkVah

6W^cdb^Vah]dgiXji

9ZiZgb^c^c\egdWVW^âi^Zhl]ZcZkZcihVgZcdi^cYZeZcYZci

7ZaaXjgkZhVcYo"hXdgZh DcZ!ild!VcYi]gZZhiVcYVgYYZk^Vi^dch[gdbi]ZbZVc

<d^c\[gdbVWZaa"h]VeZYidVWdm"h]VeZYY^hig^Wji^dc ẦZi]ZEd^hhdcY^hig^Wji^dc!WjiXdci^cjdjh

GVcYdbkVg^VWaZhi]ViVgZcÉil]daZcjbWZgh 6cdi]ZgW^cdb^VaegdWVW^âinh]dgiXji

Ldg`^c\l^i]VhjWhZid[VedejaVi^dc HdbVcnlVnhid\Vi]ZgVhVbeaZ

EgZY^Xi^c\i]ZWZ]Vk^dgd[hVbeaZbZVch

EgZY^Xi^c\egdedgi^dch[gdbVhbVaaedejaVi^dc

H Vbeâc\Y^hig^Wji^dcd[i]ZbZVcl^i]VhbVaaedejaVi^dc EgZY^Xi^c\i]ZWZ]Vk^dgd[Y^hXgZiZgVcYdbkVg^VWaZh

Trang 7

Chapter 9: Confidence Intervals 219

Introduction to Confidence Intervals for the Mean .220

Confidence Intervals for the Mean with Large Samples and Sigma Known 221

Confidence Intervals for the Mean with Small Samples and Sigma Known 225

Confidence Intervals for the Mean with Small Samples and Sigma Unknown .229

Confidence Intervals for the Mean with Large Samples and Sigma Unknown .235

Confidence Intervals for the Proportion .239

Chapter 10: Hypothesis Testing for a Single Population 243 Introduction to Hypothesis Testing for the Mean .244

Hypothesis Testing for the Mean with n v 30 and Sigma Known .247

Hypothesis Testing for the Mean with n < 30 and Sigma Known .255

Hypothesis Testing for the Mean with n < 30 and Sigma Unknown .259

Hypothesis Testing for the Mean with n > 30 and Sigma Unknown .265

Hypothesis Testing for the Proportion .271

Chapter 11: Hypothesis Testing for Two Populations 279 Hypothesis Testing for Two Means with n > 30 and Sigma Known .280

Hypothesis Testing for Two Means with n < 30 and Sigma Known .286

Hypothesis Testing for Two Means with n < 30 and Sigma Unknown .289

Hypothesis Testing for Two Means with n v 30 and Sigma Unknown .299

Hypothesis Testing for Two Means with Dependent Samples .302

Hypothesis Testing for Two Proportions .309

Chapter 12: Chi-Square and Variance Tests 317 Chi-Square Goodness-of-Fit Test 318

Chi-Square Test for Independence .331

Hypothesis Test for a Single Population Variance .338

Hypothesis Test for Two Population Variances .346

Chapter 13: Analysis of Variance 351 One-Way ANOVA: Completely Randomized Design .352

One-Way ANOVA: Randomized Block Design .371

=dljcgZegZhZciVi^kZXdjaYVhVbeaZWZ4 8ZcigVaâb^ii]ZdgZbidi]ZgZhXjZ Â[Zl^i]djii]ZXZcigVaâb^ii]ZdgZb

>cigdYjX^c\i]ZHijYZciÉhi"Y^hig^Wji^dc

:hi^bVi^c\eZgXZciV\Zh[gdbVedejaVi^dc LZaXdbZWVX`!XZcigVaâb^ii]ZdgZb

L]ViVgZcjaaVcYVaiZgcVi^kZ]nedi]ZhZh4

IZhi^c\eZgXZciV\Zh^chiZVYd[bZVch

I^bZidgZ_ZXihdbZcjaa]nedi]ZhZh

8Vaâc\dci]ZXZcigVaâb^ii]ZdgZbdcXZV\V^c I]ZncZZYidWZcdgbVaanY^hig^WjiZY

7g^c\^c\WVX`i]Zi"Y^hig^Wji^dc ẦZi]ZaVhihZXi^dc!Wjil^i]o"hXdgZh

=nedi]Zh^ôc\

8dbeVg^c\ildedejaVi^dcbZVch

Cdh^\bV hbVaahVbeaZh2i"Y^hig^Wji^dc

Oh^chiZVYd[Ih L]Vi]VeeZchl]Zci]ZildhVbeaZhVgZgZaViZY4 8dbeVg^c\edejaVi^dceZgXZciV\Zh L]ZcedejaVi^dchcZZYidWZcdgbVaanY^hig^WjiZY

IZhi^c\XViZ\dg^XVaYViV[dgkVg^Vi^dc

>hi]ZYViVY^hig^WjiZYi]ZlVnndji]dj\]i^ildjaYWZ4

6gZi]ZkVg^VWaZhgZaViZY4 IZhi^c\kVg^Vi^dc^chiZVYd[i]ZbZVc

>cigdYjX^c\i]Z;"Y^hig^Wji^dc 8dbeVg^c\bjai^eaZbZVchl^i]i]Z;"Y^hig^Wji^dc

I]ZbdhiWVh^X6CDK6egdXZYjgZ 6YY^c\VWadX`^c\kVg^VWaZidi]ZiZhi Ejii^c\hVbeaZhidldg`

Trang 8

Chapter 14: Correlation and Simple Regression Analysis 389

Correlation 390

Simple Regression Analysis .396

Chapter 15: Nonparametric Tests 413 The Sign Test with a Small Sample Size 414

The Sign Test with a Large Sample Size 418

The Paired-Sample Sign Test (n f 25) 421

The Paired-Sample Sign Test (n # 25) 423

The Wilcoxon Rank Sum Test for Small Samples .425

The Wilcoxon Rank Sum Test for Large Samples .428

The Wilcoxon Signed-Rank Test .431

The Kruskal-Wallis Test 436

The Spearman Rank Correlation Coefficient Test .442

Chapter 16: Forecasting 449 Simple Moving Average .450

Weighted Moving Average .454

Exponential Smoothing 458

Exponential Smoothing with Trend Adjustment 462

Trend Projection and Seasonality .468

Causal Forecasting 477

Chapter 17: Statistical Process Control 483 Introduction to Statistical Process Control 484

Statistical Process Control for Variable Measurement .484

Statistical Process Control for Attribute Measurement Using p-charts 491

Statistical Process Control for Attribute Measurement Using c-charts 495

Process Capability Ratio 498

Process Capability Index .500

;^cY ^c\gZaVi^dch]^ehWZilZZcildkVg^VWaZh 9ZhXg^W^c\i]ZhigZc\i]VcYY^gZXi^dcd[VgZaVi^dch]^e

A^cZd[WZhiÒi

IZhihi]ViYdcdigZfj^gZVhhjbei^dchVWdjii]ZedejaVi^dch

IZhii]ZbZY^Vcd[VhVbeaZ IZhibZY^Vchjh^c\o"hXdgZh 6eeani]Zh^\ciZhiidildYZeZcYZciYViVhZih

JhZo"hXdgZhidbZVhjgZgVc`Y^[[ZgZcXZh 9^[[ZgZcXZ^cbV\c^ijYZWZilZZcYZeZcYZcihVbeaZh 8dbeVg^c\bdgZi]VcildedejaVi^dch

8dggZaVi^c\YViVhZihVXXdgY^c\idgVc`Y^[[ZgZcXZh

8dbW^cZi]Zh^\ciZhiVcYo"hXdgZhidiZhieV^gZYYViV I]ZbV\c^ijYZd[Y^[[ZgZcXZhWZilZZcildhVbeaZh

EgZY^Xi^c\[jijgZkVajZhd[gVcYdbkVg^VWaZh I]ZbdhiWVh^X[dgZXVhi^c\iZX]c^fjZ GZXZciYViV^hlZ^\]iZYbdgZ]ZVk^an 6hZa["XdggZXi^c\[dgZXVhi^c\iZX]c^fjZ

6YYigZcYhidi]ZhZa["XdggZXi^c\bZi]dY 6XXdjci[dgigZcYhVcYhZVhdcVa^cÓjZcXZh I]Z^cYZeZcYZcikVg^VWaZYdZhcÉi]VkZidWZi^bZ

Jh^c\hiVi^hi^XhidbZVhjgZfjVa^in

BZVcVcYgVc\ZXdcigdaX]Vgih 8VaXjaViZi]Zegdedgi^dcd[YZ[ZXi^kZ^iZbh 8djci^c\i]ZcjbWZgd[YZ[ZXi^kZ^iZbh

>hVegdXZhhXVeVWaZd[eZg[dgb^c\VXXdgY^c\idYZh^\c4 BZVhjg^c\XVeVW^a^in[dgVegdXZhhi]Vi]Vhh]^[iZY

:meadg^c\i]ZY^[[ZgZciineZhd[fjVa^inbZVhjgZbZci

Trang 9

Chapter 18: Contextualizing Statistical Concepts 503

Trang 10

>cigdYjXi^dc 6gZndj^cVhiVi^hi^XhXaVhh4NZh4I]ZcndjcZZYi]^hWđ`#=ZgZÉhl]n/

 ™ ;VXi&/I]ZWZhilVnidaZVgchiVi^hi^Xh^hWnldg`^c\djihiVi^hi^XhegdWaZbh#I]ZgZÉh cdYZcn^c\^i#>[ndjXdjaYÒ\jgZi]^hXaVhhdji_jhiWngZVY^c\i]ZiZmiWđ`dgiV`^c\

\đYcdiZh^cXaVhh!ZkZgnWdYnldjaYeVhhl^i]Ón^c\Xdadgh#Jc[dgijcViZan!i]Z]Vgh] igji]^hi]Vindj]VkZidWjX`aZYdlcVcYldg`egdWaZbhdjijci^andjgÒc\ZghVgZ cjbW#

 ™ ;VXi'/BdhiiZmiWđ`hdcaniZaandjl]Vii]ZVchlZghidi]Z^gegVXi^XZegdWaZbhVgZ Wjicdi]dlidYdi]ZbHjgZndjgiZmiWđ`bVn]VkZ&,*egdWaZbh[dgZkZgnidêX! Wjibdhid[i]Zbdcan\^kZndji]ZVchlZgh#I]VibZVch^[ndjYdcÉi\Zii]ZVchlZg g^\]indjÉgZidiVaanhXgZlZY@cdl^c\ndjÉgZlgdc\^hcd]ZaeViVaa^[ndjYdcÉi`cdlL=N ndjÉgZlgdc\#HiVi^hi^XhiZmiWđ`hh^idcV]j\Zi]gdcZ!ầZi]Z<gZViVcYIZgg^WaZ DoVcYhVn!ÆCdeZ!ignV\V^c!ÇVcYndjYd#DkZgVcYdkZg!jhjVaan\Zii^c\i]ZegdWaZb lgdc\#L]ViVYZâ\]i[jalVnidaZVgcAZiÉhcdiZkZc\Zi^cidl]ni]ZndcaniZaandj i]ZVchlZghidi]ZdYYegdWaZbh#9dZhi]VibZVci]ZWđ`ÉhVXijVaVji]dgY^YcÉi ZkZc[ZZaầZldg`^c\djii]ZZkZcdcZh4

 ™ ;VXi(/:kZcl]ZcbVi]Wđ`hignidh]dlndji]ZhiZeh[dgVegdWaZb!i]ZnYdV adjhn_dW#BVi]eZdeaZadkZidh`^ehiZeh#NdjÉaaWZ[daadl^c\Vadc\ÒcZl^i]VcZmeaVcVi^dc VcYi]ZcVaad[VhjYYZcWVb!ndjÉgZadhi#NdjÉaai]^c`idndjghZa[!Æ=dlY^Yi]ZnYd i]Vi4ÇdgÆL]ZgZi]Z]ZX`Y^Yi]Vi)'XdbZ[gdb4>ilVhcÉii]ZgZ^ci]ZaVhihiZeÇ L]nYdVabdhiVaad[i]ZhZWđ`hVhhjbZi]Vi^cdgYZgidldg`djiVegdWaZbdceV\Z '%%!ndjÉYWZiiZg`cdleV\Zh&i]gdj\]& ầZi]ZWVX`d[ndjg]VcY4NdjYdcÉilVci idheZcYi]ZgZhid[ndjgâ[Zdc]dbZldg`Ndj_jhilVciid`cdll]nndj`ZZe\Zii^c\ VcZ\Vi^kZcjbWZgl]ZcndjÉgZign^c\idXVaXjaViZi]ZegdWVW^âind[YgVl^c\V[jaa ]djhZ[gdbVYZX`d[XVgYh!l]^X]ndjl^aaaZVgc^c8]VeiZg*#

 ™ ;VXi)/GZVY^c\âhihd[[VXih^h[jc[dgVl]^aZ!Wjii]Zc^i\ZihdaY#AZiÉhXjiid i]ZX]VhZ#?jhiVWdjiZkZgnh^c\aZ`^cYd[hiVi^hi^XhegdWaZbndjXdjaYedhh^Wangjc

^cid^h^c]ZgZÅV[iZgVaa!i]^hWđ`^h]jbdc\djh>[Vi]djhVcYegdWaZbhVgZcÉiZcdj\]! i]ZcndjÉkZ\dihdbZ`^cYd[XgVonhiVih]jc\Zg!bn[g^ZcY!VcY>ÉYhZZ`egd[Zhh^dcVa ]Zae#I]^hegVXi^XZWđ`lVh\đYViÒghi!WjiidbV`Z^i\gZVi!lZlZcii]gdj\]VcY ldg`ZYdjiVaai]ZegdWaZbhVcYiđ`cdiZh^ci]ZbVg\^chl]ZclZi]dj\]ihdbZi]^c\ lVhXdc[jh^c\dgcZZYZYVâiiaZbdgZZmeaVcVi^dc#LZVahdYgZlâiiaZh`jaahcZmiid i]Z]VgYZhiegdWaZbh!hdndjÉY`cdlcdiid[gZV`dji^[i]ZnlZgZiđX]VaaZc\^c\# 6[iZgVaa!^[ndjÉgZldg`^c\dcVegdWaZbVcYndjÉgZidiVaanhijbeZY!^hcÉi^iWZiiZgid

Trang 11

LZi]^c`ndjÉaaWZeaZVhVcianhjgeg^hZYWn]dlYZiV^aZYi]ZVchlZgZmeaVcVi^dchVgZ!VcYlZ ]deZndjÉaaÒcYdjga^iiaZcdiZh]Zae[jaVadc\i]ZlVn#8VaajhXgVon!WjilZi]^c`i]VieZdeaZ l]dlVciidaZVgchiVi^hi^XhVcYVgZl^aa^c\idheZcYi]Zi^bZYg^aa^c\i]Z^glVni]gdj\]

egVXi^XZegdWaZbhh]djaYVXijVaanWZVWaZidÒ\jgZi]ZegdWaZbhdjiVcYaZVgcVhi]Zn\d! Wjii]ViÉh_jhidjg'–#

DcZÒcValdgYd[lVgc^c\#6adid[hiVi^hi^XhXaVhhZhjhZXVaXjaVidghi]Vildg`Vaad[i]Z

[dgbjaVhdji[dgndj#>ci]^hWdd`!lZÉgZh]dl^c\ndj]dli]Z[dgbjaVhldg`!Wjii]ZgZÉhV

adid[Vg^i]bZi^X^ckdakZY!hdlZgdjcYi]ZYZX^bVahd[[#I]^hbZVchi]ZXVaXjaVidgVchlZg bVnY^[[ZgViZZcnW^i[gdbdjgVchlZgh!Wjii]^hlVnndjVXijVaan\ZiidhZZi]ZhiZehVcY jcYZghiVcYl]ViÉh\d^c\dc#

>cX#XVccdiViiZhiidi]ZVXXjgVXnd[i]^h^c[dgbVi^dc#JhZd[ViZgb^ci]^hWdd`h]djaYcdi WZgZ\VgYZYVhV[[ZXi^c\i]ZkVa^Y^ind[VcnigVYZbVg`dghZgk^XZbVg`#

Trang 12

9ZY^XVi^dc 7dW/I]^hWdd`ldjaYcdi]VkZWZZcedhh^WaZl^i]djii]Zadk^c\hjeedgid[bnl^[ZVcYWZhi [g^ZcY!9ZWW^Z#NdjgZcXdjgV\ZbZciVcYWZa^Z[^cbZlZgZVXdchiVcihdjgXZd[^che^gVi^dc# I]Vc`ndj[dgndjgjcZcY^c\eVi^ZcXZl^i]bZYjg^c\i]^hegd_ZXi#>adkZndjValVnh# B^`Z/;dgA^hV!C^X`!:g^c!VcYHVgV!i]Z[djggZVhdchVcni]^c\^cbna^[Z^hldgi]Yd^c\#

Trang 13

Chapter 1

DISPLAYING DESCRIPTIVE STATISTICS

The main focus of descriptive statistics is to summarize and present data This chapter demonstrates a variety of techniques available to display descriptive statistics Presenting data graphically allows the user to extract information more efficiently.

I]ZgZVgZbVcnY^[[ZgZciiddahVkV^aVWaZ[dgY^heaVn^c\YZhXg^ei^kZ hiVi^hi^Xh#;gZfjZcXnY^hig^Wji^dchVgZVh^beaZlVnidhjbbVg^oZgVl YViV^ciVWaZhVcYbV`Zi]Z^c[dgbVi^dcbdgZjhZ[ja#=^hid\gVbhXdckZgi i]ZhZiVWaZh^cidX]VgihVcYegdk^YZVe^XijgZd[i]ZYViV#7VgVcYe^Z X]Vgihd[[ZgVkVg^Zind[lVnhidY^heaVnXViZ\dg^XVaYViV#;^cVaan!a^cZ VcYhXViiZgX]VgihVaadlndjidk^ZlgZaVi^dch]^ehWZilZZcildkVg^VWaZh V\gVe]^XVa[dgbVi# ^c

Trang 14

1.1 Develop a frequency distribution summarizing this data.

A frequency distribution is a two-column table In the left column, list each value in the data set from least to greatest Count the number of times each value appears and record those totals in the right column

Note: Problems 1.1–1.3 refer to the data set in Problem 1.1, the daily demand for hammers at

a hardware store over the last 20 days.

1.2 Develop a relative frequency distribution for the data

Divide the frequency of each daily demand by the total number of data values (20)

Daily Demand Frequency Relative Frequency

Trang 15

Note: Problems 1.1–1.3 refer to the data set in Problem 1.1, the daily demand for hammers at

a hardware store over the last 20 days.

1.3 Develop a cumulative relative frequency distribution for the data

Note: Problems 1.4–1.6 refer to the data set below, the number of calls per day made from a

cell phone for the past 30 days.

Cell Phone Calls per Day

1.4 Develop a frequency distribution summarizing the data

Because this data has many possible outcomes, you should group the number

of calls per day into groups, which are known as classes One option is the 2 k v n

rule to determine the number of classes, where k equals the number of classes

and n equals the number of data points Given n = 30, the best value for k is 5.

Calculate the width W of each class.

Set the size of each class to 3 and list the classes in the left column of the

frequency distribution Count the number of values contained in each group

and list those values in the right column

 I]ZXjbjaVi^kZ gZaVi^kZ[gZfjZcXn [dgVeVgi^XjaVggdl^hi]Z gZaVi^kZ[gZfjZcXn[dgi]Vi gdleajhi]ZXjbjaVi^kZ gZaVi^kZ[gZfjZcXn[dgi]Z egZk^djhgdl#I]ZaVhiXjbj" aVi^kZgZaVi^kZ[gZfjZcXn h]djaYValVnhWZ&#%%#

 7ZXVjhZ')

2(' (%#

Trang 16

Calls per Day Frequency

1.5 Develop a relative frequency distribution for the data

Divide the frequency of each class by the total number of data values (30)

Calls per Day Frequency Relative Frequency

1.6 Develop a cumulative relative frequency distribution for the data

The cumulative relative frequency for a particular row is the relative frequency (calculated in Problem 1.5) for that row plus the cumulative relative frequency for the previous row

Trang 17

Zm^hihWZilZZc XdajbchWZXVjhZ i]ZYViV^h

Y^hXgZiZ#9^hXgZiZ YViV^hYViVi]Vi i]ViXVciV`ZdcV XdjciVWaZcjbWZgd[ edhh^WaZkVajZh!^c i]^hXVhZZ^i]Zg 6!7!8!9!dg;#

Trang 18

1.8 Develop a histogram for the frequency distribution below, the commuting distance for 50 employees of a particular company.

First, develop a frequency distribution for the data Using the 2k v n rule, set

k = 6 because 26= 64 v 40 Calculate the width W of each class.

Set the size of each class equal to 10 and count the number of values contained

Trang 19

Miles per Tank Frequency

The height of each bar in the histogram reflects the frequency for each group

of miles per tank of gas

1.10 Develop a histogram for the data set below, the number of home runs hit by 40

Major League Baseball players during the 2008 season

Develop a frequency distribution for the data Apply the 2k v n rule and set k = 6

because 26= 64 v 40 Calculate the width W of each class.

Set the size of each class equal to 4 and count the number of values contained

Trang 20

Home Runs Frequency

Trang 21

A column bar chart uses vertical bars to represent categorical data The height

of each bar corresponds to the value of each category

1.12 Construct a column bar chart for the data below, a company’s monthly sales

^hdg\Vc^oZY^c Y^hXgZiZ\gdjeh!hjX] Vhi]Zbdci]h^c i]^hegdWaZb#

Trang 22

1.13 Construct a horizontal bar chart for the data set below, weekly donations collected at a local church.

Trang 23

Note: Problems 1.15–1.16 refer to the data set below, weekly sales data in units for

1.15 Construct a grouped column bar chart for the data, grouping by week

Because there are two data values for each time period (a value for Store 1 and

a value for Store 2), you should use a grouped column bar chart

 I]ZhVaZhYViV [dgZVX]lZZ` XdbeVgZhi]ZhVaZh [dgZVX]hidgZ#

Trang 24

Note: Problems 1.15–1.16 refer to the data set in Problem 1.15, weekly sales data in units for two stores.

1.16 Construct a stacked column bar chart for the data, grouping by store

Each column represents the total units sold each week between the two stores

Note: Problems 1.17–1.20 refer to the data set below, the investment portfolio for three different investors in thousands of dollars.

Investor 1 Investor 2 Investor 3

1.17 Construct a grouped horizontal bar chart, grouping by investor

Three horizontal bars are arranged side-by-side for each investor, indicating the amount of each investment type

Trang 25

Note: Problems 1.17–1.20 refer to the data set in Problem 1.17, showing the investment

portfolio for three different investors in thousands of dollars.

1.18 Construct a grouped horizontal bar chart, grouping by investment type

Arrange three horizontal bars representing the investors, side-by-side, for each

investment type

Note: Problems 1.17–1.20 refer to the data set in Problem 1.17, the investment portfolio for

three different investors in thousands of dollars.

1.19 Construct a stacked horizontal bar chart, grouping by investor

Each investor is represented by three horizontally stacked bars that indicate that

investor’s total investments by type

 :VX]WVg

^ci]^hhiVX`ZY X]VgigZegZhZcih i]ZidiVa^ckZhibZcih d[Vh^c\aZ^ckZhidg# I]ZidiVaaZc\i]d[ ZVX]WVggZegZhZcih i]ZidiVaVbdjci

^ckZhiZYWnZVX] eZghdc#

Trang 26

Note: Problems 1.17–1.20 refer to the data set in Problem 1.17, the investment portfolio for three different investors in thousands of dollars.

1.20 Construct a stacked horizontal bar chart, grouping by investment type.Represent each investment type using three horizontally stacked bars

Trang 27

Multiply each relative frequency distribution by 360 to calculate the

corresponding central angle for each category in the pie chart A central

angle has a vertex at the center of the circle and sides that intersect the circle,

defining the boundaries of each category in a pie chart

Grade Relative Frequency Central Angle

The central angle determines the size of each pie segment

1.22 Construct a pie chart for the data in the table below, the number of total wins

recorded by the Green Bay Packers football team in five recent seasons

 I]ZXZcigVa Vc\aZ[dgi]^h XViZ\dgn^h-(ï#

Trang 28

Year Number of Wins Relative Frequency

Trang 29

1.23 Construct a pie chart for the data in the table below, an individual investor’s

The total investment is $78,000 Divide the figure for each category by this

number to determine the percentage of the total investment each category

Multiply each percentage by 360 to calculate the central angle for each category

in the pie chart

Use the central angles calculated above to draw appropriately sized sectors of

the pie chart If you have difficulty visualizing angles, use a protractor

 I]Zh^oZd[

i]ZhZ\bZci

XdggZhedcYhidi]Z eZgXZciV\Zd[i]Z

^ckZhibZci^ci]Vi XViZ\dgn#

Trang 30

1.24 Construct a pie chart for the frequency distribution below, the daily high temperature (in degrees Fahrenheit) in a particular city over the last 40 days.

Determine the relative frequency distribution for each temperature range

Daily High Temperature Frequency Relative Frequency

Calculate the central angle for each category in the pie chart

Trang 31

Line Charts

9ViVdkZgi^bZ^cVX]Vgi

1.25 Construct a line chart for the data in the table below, the number of wins

recorded by the Philadelphia Phillies for seven seasons

Place the time variable (year) on the x-axis and place the variable of interest

(wins) on the y-axis.

1.26 Construct a line chart for the data in the table below, the percent change in

annual profit for a company by year

Trang 32

Place the time variable (year) on the x-axis and place the variable of interest (percent change) on the y-axis.

1.27 Construct a line chart for the data in the table below, the population of Delaware by decade during the 1800s

Trang 33

Scatter Charts

>aajhigViZgZaVi^dch]^ehWZilZZcildkVg^VWaZh

1.28 Construct a scatter chart for the data in the table below, the number of hours

eight students studied for an exam and the scores they earned on the exam

Place the independent variable (study hours) on the x-axis and the dependent

variable (exam score) on the y-axis.

 I]ZYZeZcYZci kVg^VWaZ^hi]Z kVg^VWaZi]ViX]Vc\Zh VhVgZhjaid[X]Vc\Zh

^ci]Z^cYZeZcYZci kVg^VWaZ#HijYn^c\adc\Zg h]djaY\^kZndjV]^\]Zg

\gVYZ#I]ZgZkZghZ YdZhcÉibV`ZhZchZÅ

\Zii^c\V]^\]ZgiZhi hXdgZYdZhcÉigZhjai^cndj hijYn^c\adc\Zg[dgi]Vi iZhi#

 HdbZhiVi^hi^Xh Wdd`hjhZÆZmeaVcVidgnÇ VcYÆgZhedchZÇ^chiZVY d[Æ^cYZeZcYZciÇVcY ÆYZeZcYZci#Ç

Trang 34

1.29 Construct a scatter chart for the data below, the mileage and selling price of eight used cars.

Trang 35

1.30 Construct a scatter chart for the data in the table below, eight graduate

students’ grade point averages (GPA) and entrance exam scores for M.B.A

Place the independent variable (GMAT) on the x-axis and the dependent

variable (GPA) on the y-axis.



<B6IhXdgZ

^hi]Z^cYZeZcYZci kVg^VWaZWZXVjhZV

<B6IhXdgZ^hhjeedhZY idegZY^Xi]dllZaa ndjaaYd^c\gVYjViZ hX]dda#

Trang 37

Chapter 2

CALCULATING DESCRIPTIVE STATISTICS: MEASURES OF CENTRAL TENDENCY

One of the most common roles served by descriptive statistics is termining the central tendency of data This chapter investigates the primary means by which the “center” of a data set can be described, including the mean, median, mode, and percentiles.

de-9ViVbVnWZgZedgiZY^cZ^i]ZggVldghjbbVg^oZY[dgb#IdjcYZghiVcY i]ZXZcigVaiZcYZcXnd[gVlYViV!i]ZbZVc!bZY^Vc!bdYZ!VcYl Z^\]iZY bZVcXVcWZkZgn]Zae[ja#I]Zedh^i^dcd[gVlYViVXVcWZYZhXg^WZYjh^ c\ eZgXZci^aZh#HjbbVg^oZYYViV!^ci]Z[dgbd[[gZfjZcXnY^hig^Wji^dch!XVc WZYZhXg^WZYjh^c\VlZ^\]iZYbZVciZX]c^fjZ#

;^cY^c\i]ZXZciZgd[i]ZYViV

Trang 38

Mean

I]ZVkZgV\Z

2.1 The table below lists the number of students enrolled in the five different statistics courses offered by a college Calculate the mean number of students per class

Number of Students

The mean µ of a data set is the sum of the data divided by the population size.

Notice that a mean can be heavily influenced by an extreme value in the data The only difference between the enrollment numbers in Problems 2.1 and 2.2

is the size of the last class That class increased from 15 to 75 students and caused the mean class size to grow from 21.2 to 33.2

2.3 The table below lists the time, in minutes, it takes seven random customers to check out at a local grocery store Calculate the mean time it takes a customer

Trang 39

2.4 A consumer group tested the gas mileage of two car models in five trials

Which model averages more miles per gallon?

Trial 1 Trial 2 Trial 3 Trial 4 Trial 5

Model B averages more miles per gallon than Model A

2.5 The table below reports the number of minutes eight randomly selected airline

flights were either early (negative values) or late (positive values) arriving at

their destinations Calculate the sample mean

Number of Minutes Early or Late

Combine the positive and negative values and divide by the sample size

The average flight is 8.6 minutes late

2.6 The following table lists the daily percent increase (or decrease) of a stock

price over a five-day period Calculate the mean daily change in the stock price

Percent Increase or Decrease

If the sum is less than zero, the average change is a decrease, whereas a

positive sum indicates an average increase

The stock price decreased an average of 1.76 percent per day

 6YY^c\&'VcY Ä&%^hi]ZhVbZ VhhjWigVXi^c\

&%[gdb&'/

&' Ä&%2&'Ä&%#



>[i]ZVkZgV\Z lVhÄ-#+!^ildjaY bZVci]Vii]Z VkZgV\ZÓ^\]ilVh -#+b^cjiZhZVgan#

Trang 40

2.7 The table below lists the points scored by three basketball players over six games Identify the player with the highest average points per game.

Player Points Scored per Game

Calculate each player’s scoring average separately

Paul averaged the most points per game, 21.7

Note: Problems 2.8–2.9 refer to the data set below, the daily demand for tires at a particular store over a seven-day period.

The demand forecast for January 11 is 29.3 tires

Ngày đăng: 09/08/2017, 10:31

🧩 Sản phẩm bạn có thể quan tâm