Government works International Standard Book Number 1-58488-059-7 Library of Congress Card Number 99-045786 Printed in the United States of America 2 3 4 5 6 7 8 9 0 Printed on acid-free
Trang 1standard probability
and
Statistics tables
Trang 2CHAPMAN & HALL/CRC
DANIEL ZWILLINGER
Rensselaer Polytechnic Institute
Troy, New York STEPHEN KOKOSKA
Bloomsburg University Bloomsburg, Pennsylvania
Boca Raton London New York Washington, D.C.
Trang 3This book contains information obtained from authentic and highly regarded sources Reprinted material
is quoted with permission, and sources are indicated A wide variety of references are listed Reasonable efforts have been made to publish reliable data and information, but the author and the publisher cannot assume responsibility for the validity of all materials or for the consequences of their use.
Neither this book nor any part may be reproduced or transmitted in any form or by any means, electronic
or mechanical, including photocopying, microfilming, and recording, or by any information storage or retrieval system, without prior permission in writing from the publisher.
The consent of CRC Press LLC does not extend to copying for general distribution, for promotion, for creating new works, or for resale Specific permission must be obtained in writing from CRC Press LLC for such copying.
Direct all inquiries to CRC Press LLC, 2000 N.W Corporate Blvd., Boca Raton, Florida 33431
Trademark Notice: Product or corporate names may be trademarks or registered trademarks, and are used only for identification and explanation, without intent to infringe.
Visit the CRC Press Web site at www.crcpress.com
© 2000 by Chapman & Hall/CRC
No claim to original U.S Government works International Standard Book Number 1-58488-059-7 Library of Congress Card Number 99-045786 Printed in the United States of America 2 3 4 5 6 7 8 9 0
Printed on acid-free paper
Library of Congress Cataloging-in-Publication Data
Zwillinger, Daniel, CRC standard probability and statistics tables and formulae / Daniel Zwillinger, Stephen Kokoska.
1957-p cm.
Includes bibliographical references and index.
ISBN 1-58488-059-7 (alk paper)
1 Probabilities—Tables 2 Mathematical statistics—Tables I Kokoska, Stephen.
II Title.
QA273.3 Z95 1999
Trang 4It has long been the established policy of CRC Press to publish, in handbookform, the most up-to-date, authoritative, logically arranged, and readily us-able reference material available This book fills the need in probability andstatistics
Prior to the preparation of this book the contents of similar books were sidered It is easy to fill a statistics reference book with many hundred pages
con-of tables—indeed, some large books contain statistical tables for only a singletest The authors of this book focused on the basic principles of statistics
We have tried to ensure that each topic had an understandable textual troduction as well as easily understood examples There are more than 80examples; they usually follow the same format: start with a word problem,interpret the words as a statistical problem, find the solution, interpret thesolution in words
in-We have organized this reference in an efficient and useful format in-We believeboth students and researchers will find this reference easy to read and under-stand Material is presented in a multi-sectional format, with each sectioncontaining a valuable collection of fundamental reference material—tabularand expository This Handbook serves as a guide for determining appropriatestatistical procedures and interpretation of results We have assembled themost important concepts in probability and statistics, as experienced throughour own teaching, research, and work in industry
For most topics, concise yet useful tables were created In most cases, thetables were re-generated and verified against existing tables Even very mod-est statistical software can generate many of the tables in the book—often tomore decimal places and for more values of the parameters The values inthis book are designed to illustrate the range of possible values and act as ahandy reference for the most commonly needed values
This book also contains many useful topics from more advanced areas of tics, but these topics have fewer examples Also included are a large collection
statis-of short topics containing many classical results and puzzles Finally, a section
on notation used in the book and a comprehensive index are also included
Trang 5In line with the established policy of CRC Press, this Handbook will be kept
as current and timely as is possible Revisions and anticipated uses of newermaterials and tables will be introduced as the need arises Suggestions for theinclusion of new material in subsequent editions and comments concerningthe accuracy of stated information are welcomed
If any errata are discovered for this book, they will be posted to
http://vesta.bloomu.edu/~skokoska/prast/errata
Many people have helped in the preparation of this manuscript The authorsare especially grateful to our families who have remained lighthearted andcheerful throughout the process A special thanks to Janet and Kent, and toJoan, Mark, and Jen
Daniel Zwillingerzwillinger@alum.mit.edu
Stephen Kokoskaskokoska@planetx.bloomu.edu
ACKNOWLEDGMENTS
Plans 6.1–6.6, 6A.1–6A.6, and 13.1–13.5 (appearing on pages 331–337) originally appeared
on pages 234–237, 276–279, and 522–523 of W G Cochran and G M Cox, Experimental Designs, Second Edition, John Wiley & Sons, Inc, New York, 1957 Reprinted by permission
of John Wiley & Sons, Inc.
The tables of Bartlett’s critical values (in section 10.6.2) are from D D Dyer and J P.
Keating, “On the Determination of Critical Values for Bartlett’s Test”, JASA, Volume 75,
1980, pages 313–319 Reprinted with permission from the Journal of American Statistical Association Copyright 1980 by the American Statistical Association All rights reserved.
The tables of Cochran’s critical values (in section 10.7.1) are from C Eisenhart, M W.
Hastay, and W A Wallis, Techniques of Statistical Analysis, McGraw-Hill Book
Companies.
The tables of Dunnett’s critical values (in section 12.1.4.5) are from C W Dunnett, “A
Multiple Comparison Procedure for Comparing Several Treatments with a Control”, JASA, Volume 50, 1955, pages 1096–1121 Reprinted with permission from the Journal of Amer- ican Statistical Association Copyright 1980 by the American Statistical Association All
rights reserved.
The tables of Duncan’s critical values (in section 12.1.4.3) are from L Hunter, “Critical
Values for Duncan’s New Multiple Range Test”, Biometrics, 1960, Volume 16, pages 671–
Copyright 1960 by the American Statistical Association All rights reserved.
Table 15.1is reproduced, by permission, from ASTM Manual on Quality Control of rials, American Society for Testing and Materials, Philadelphia, PA, 1951.
Mate-The table in section 15.1.2 and much of Chapter 18 originally appeared in D Zwillinger,
Standard Mathematical Tables and Formulae, 30th edition, CRC Press, Boca Raton, FL,
1995 Reprinted courtesy of CRC Press, LLC.
Much of section 17.17 is taken from the URL http://members.aol.com/johnp71/javastat.html Permission courtesy of John C Pezzullo.
Trang 62.1 Tabular and graphical procedures
2.2 Numerical summary measures
4 Functions of Random Variables
4.1 Finding the probability distribution
4.2 Sums of random variables
Trang 75 Discrete Probability Distributions
5.1 Bernoulli distribution
5.2 Beta binomial distribution
5.3 Beta Pascal distribution
5.10 Rectangular (discrete uniform) distribution
6 Continuous Probability Distributions
Trang 86.24 Triangular distribution
6.25 Uniformdistribution
6.26 Weibull distribution
6.27 Relationships among distributions
7 Standard Normal Distribution
7.1 Density function and related functions
7.2 Critical values
7.3 Tolerance factors for normal distributions
7.4 Operating characteristic curves
7.5 Multivariate normal distribution
7.6 Distribution of the correlation coefficient
7.7 Circular normal probabilities
7.8 Circular error probabilities
8 Estimation
8.1 Definitions
8.2 Cram´er–Rao inequality
8.3 Theorems
8.4 The method of moments
8.5 The likelihood function
8.6 The method of maximum likelihood
8.7 Invariance property of MLEs
8.8 Different estimators
8.9 Estimators for small samples
8.10 Estimators for large samples
9 Confidence Intervals
9.1 Definitions
9.2 Common critical values
9.3 Sample size calculations
9.4 Summary of common confidence intervals
9.5 Confidence intervals: one sample
9.6 Confidence intervals: two samples
9.7 Finite population correction factor
10 Hypothesis Testing
Trang 910.1 Introduction
10.2 The Neyman–Pearson lemma
10.3 Likelihood ratio tests
10.4 Goodness of fit test
10.5 Contingency tables
10.6 Bartlett’s test
10.7 Cochran’s test
10.8 Number of observations required
10.9 Critical values for testing outliers
10.10 Significance test in 2× 2 contingency tables
10.11 Determining values in Bernoulli trials
11 Regression Analysis
11.1 Simple linear regression
11.2 Multiple linear regression
13.6 Confounding in 2n factorial experiments
13.7 Tables for design of experiments
13.8 References
14 Nonparametric Statistics
14.1 Friedman test for randomized block design
Trang 1014.2 Kendall’s rank correlation coefficient
14.3 Kolmogorov–Smirnoff tests
14.4 Kruskal–Wallis test
14.5 The runs test
14.6 The sign test
14.7 Spearman’s rank correlation coefficient
14.8 Wilcoxon matched-pairs signed-ranks test
14.9 Wilcoxon rank–sum(Mann–Whitney) test
14.10 Wilcoxon signed-rank test
15 Quality Control and Risk Analysis
15.1 Quality assurance
15.2 Acceptance sampling
15.3 Reliability
15.4 Risk analysis and decision rules
16 General Linear Models
16.1 Notation
16.2 The general linear model
16.3 Summary of rules for matrix operations
16.4 Quadratic forms
16.5 General linear hypothesis of full rank
16.6 General linear model of less than full rank
17.7 Measure theoretical probability
17.8 Monte Carlo integration techniques
17.9 Queuing theory
17.10 Randommatrix eigenvalues
Trang 1118.14 Sums of powers of integers
18.15 Tables of orthogonal polynomials
18.16 References
Notation
Trang 12com-Almost every table is presented along with a textual description and at leastone example using a value from the table Most concepts are illustrated withexamples and step-by-step solutions Several data sets are described in thischapter; they are used in this book in order for users to be able to checkalgorithms.
The emphasis of this book is on what is often called basic statistics Most
real-world statistics users will be able to refer to this book in order to quicklyverify a formula, definition, or theorem In addition, the set of tables hereshould make this a complete statistics reference tool Some more advanceduseful and current topics, such as Brownian motion and decision theory arealso included
We have established a few data sets which we have used in examples out this book With these, a user can check a local statistics programbyverifying that it returns the same values as given in this book For exam-ple, the correlation coefficient between the first 100 elements of the sequence
through-of integers{1, 2, 3 } and the first 100 elements of the sequence of squares {1, 4, 9 } is 0.96885 Using this value is an easy way to check for correct
computation of a computer program These data sets may be obtained fromhttp://vesta.bloomu.edu/~skokoska/prast/data
Trang 13Ticket data: Forty randomspeeding tickets were selected fromthe courthouse
records in Columbia County The speed indicated on each ticket is given inthe table below
Swimming pool data: Water samples from 35 randomly selected pools in
Bev-erly Hills were tested for acidity The following table lists the PH for eachsample
Soda pop data: A new soda machine placed in the Mathematics Building on
campus recorded the following sales data for one week in April
1 W G Cochran and G M Cox, Experimental Designs, Second Edition,
John Wiley & Sons, Inc., New York, 1957
2 C J Colbourn and J H Dinitz, CRC Handbook of Combinatorial
De-signs, CRC Press, Boca Raton, FL, 1996.
3 L Devroye, Non-Uniform Random Variate Generation, Springer–Verlag,
New York, 1986
4 W Feller, An Introduction to Probability Theory and Its Applications,
Volumes 1 and 2, John Wiley & Sons, New York, 1968
5 C W Gardiner, Handbook of Stochastic Methods, Second edition, Springer–
Verlag, New York, 1985
6 D J Sheskin, Handbook of Parametric and Nonparametric Statistical
Procedures, CRC Press LLC, Boca Raton, FL, 1997.
Trang 152.2.25 Measures of kurtosis
2.2.26 Data transformations
2.2.27 Sheppard’s corrections for grouping
Numerical descriptive statistics and graphical techniques may be used to marize information about central tendency and/or variability
A stem-and-leaf plot is a a graphical summary used to describe a set of servations (as symmetric, skewed, etc.) Each observation is displayed on thegraph and should have at least two digits Split each observation (at the samepoint) into a stem (one or more of the leading digit(s)) and a leaf (remainingdigits) Select the split point so that there are 5–20 total stems List thestems in a column to the left, and write each leaf in the corresponding stemrow
ob-Example 2.1 : Construct a stem-and-leaf plot for the Ticket Data (page 2)
(2) Record, or tally, the number of observations in each class, called thefrequency of each class
(3) Compute the proportion of observations in each class, called the relativefrequency
(4) Compute the proportion of observations in each class and all preceding
classes, called the cumulative relative frequency
Trang 16Example 2.2 : Construct a frequency distribution for the Ticket Data (page 2).
Solution:
(S1) Determine the classes It seems reasonable to use 40 to less than 50, 50 to less
than 60, , 90 to less than 100.
Note: For continuous data, one end of each class must be open This ensures
that each observation will fall into only one class The open end of each classmay be either the left or right, but should be consistent
(S2) Record the number of observations in each class
(S3) Compute the relative frequency and cumulative relative frequency for each class.(S4) The resulting frequency distribution is inFigure 2.2
(rela-Example 2.3 : Construct a frequency histogram for the Ticket Data (page 2)
Solution:
(S1) Using the frequency distribution in Figure 2.2, construct rectangles above eachclass, with height equal to class frequency
(S2) The resulting histogram is inFigure 2.3
Note: A probability histogramis constructed so that the area of each rectangle
equals the relative frequency If the class widths are unequal, this histogrampresents a more accurate description of the distribution
A frequency polygon is a line plot of points with x coordinate being class midpoint and y coordinate being class frequency Often the graph extends to
Trang 17Figure 2.3: Frequency histogramfor Ticket Data.
an additional empty class on both ends The relative frequency may be used
(S2) The resulting frequency polygon is inFigure 2.4
Figure 2.4: Frequency polygon for Ticket Data
An ogive, or cumulative frequency polygon, is a plot of cumulative
fre-quency versus the upper class limit Figure 2.5is an ogive for the Ticket Data(page 2)
Another type of frequency polygon is a more-than cumulative frequency
poly-gon For each class this plots the number of observations in that class andevery class above versus the lower class limit
Trang 18Figure 2.5: Ogive for Ticket Data.
A bar chart is often used to graphically summarize discrete or categorical
data A rectangle is drawn over each bin with height proportional to frequency.The chart may be drawn with horizontal rectangles, in three dimensions, andmay be used to compare two or more sets of observations Figure 2.6is a barchart for the Soda Pop Data (page 2)
Figure 2.6: Bar chart for Soda Pop Data
A pie chart is used to illustrate parts of the total A circle is divided into
slices proportional to the bin frequency Figure 2.7is a pie chart for the SodaPop Data (page 2)
Chernoff faces are used to illustrate trends in multidimensional data Theyare effective because people are used to differentiating between facial features.Chernoff faces have been used for cluster, discriminant, and time-series anal-yses Facial features that might be controllable by the data include:
(a) ear: level, radius
(b) eyebrow: height, slope, length
(c) eyes: height, size, separation, eccentricity, pupil position or size
Trang 19Figure 2.7: Pie chart for Soda Pop Data.
(d) face: width, half-face height, lower or upper eccentricity
(e) mouth: position of center, curvature, length, openness
(f) nose: width, length
The Chernoff faces inFigure 2.8 come from data about this book For theeven chapters:
(a) eye size is proportional to the approximate number of pages
(b) mouth size is proportional to the approximate number of words(c) face shape is proportional to the approximate number of occurrences ofthe word “the”
The data are as follows:
Number of words 4514 5426 12234 2392 9948 18418 8179 11739 5186Occurrences of “the” 159 147 159 47 153 118 264 223 82
An interactive programfor creating Chernoff faces is available at http://www.hesketh.com/schampeo/projects/Faces/interactive.shtml See H
Chernoff, “The use of faces to represent points in a K-dimensional space graphically,” Journal of the American Statistical Association, Vol 68, No 342,
1973, pages 361–368
The following conventions will be used in the definitions and formulas in thissection
(C1) Ungrouped data: Let x1, x2, x3, , x n be a set of observations
(C2) Grouped data: Let x1, x2, x3, , x k be a set of class marks from a quency distribution, or a representative set of observations, with corre-
Trang 20fre-Figure 2.8: Chernoff faces for chapter data.
sponding frequencies f1, f2, f3, , f k The total number of observations
is n =
k
i=1
f i Let c denote the (constant) width of each bin and x oone
of the class marks selected to be the computing origin Each class mark,
x i , may be coded by u i = (x i − x o )/c Each u i will be an integer andthe bin mark taken as the computing origin will be coded as a 0
Trang 21Let w i ≥ 0 be the weight associated with observation x i The total weight is
Trang 22For grouped data:
For grouped data, select the class containing the largest frequency, called
the modal class Let L be the lower boundary of the modal class, d L thedifference in frequencies between the modal class and the class immediately
below, and d H the difference in frequencies between the modal class and theclass immediately above The mode may be approximated by
M o ≈ L + c · d L
The median, ˜x, is another measure of central tendency, resistant to outliers.
For ungrouped data, arrange the observations in order fromsmallest to largest
If n is odd, the median is the middle value If n is even, the median is the
mean of the two middle values
For grouped data, select the class containing the median (median class) Let
L be the lower boundary of the median class, f mthe frequency of the medianclass, and CF the sumof frequencies for all classes below the median class (acumulative frequency) The median may be approximated by
Note: If x > ˜ x the distribution is positively skewed If x < ˜ x the distribution
is negatively skewed If x ≈ ˜x the distribution is approximately symmetric.
A trimmed mean is a measure of central tendency and a compromise between
a mean and a median The mean is more sensitive to outliers, and the median
is less sensitive to outliers Order the observations fromsmallest to largest
Trang 23Delete the smallest p% and the largest p% of the observations The p% trimmed mean, xtr (p), is the arithmetic mean of the remaining observations.
Note: If p% of n (observations) is not an integer, several (computer)
algo-rithms exist for interpolating at each end of the distribution and for
deter-mining xtr (p)
Example 2.5 : Using the Swimming Pool data (page 2) find the mean, median, andmode Compute the geometric mean and the harmonic mean, and verify the relationshipbetween these three measures
The first quartile, Q1, is the median of the smallest (n + 1)/2 tions; and the third quartile, Q3, is the median of the largest (n + 1)/2
observa-observations
For grouped data, the quartiles are computed by applying equation (2.13) forthe median Compute the following:
L1= the lower boundary of the class containing Q1
L3= the lower boundary of the class containing Q3
f1= the frequency of the class containing the first quartile
f3= the frequency of the class containing the third quartile
CF1= cumulative frequency for classes below the one containing Q1
CF3= cumulative frequency for classes below the one containing Q3
Trang 24The (approximate) quartiles are given by
Deciles split the data into 10 parts
(1) For ungrouped data, arrange the observations in order fromsmallest to
largest The ith decile, D i (for i = 1, 2, , 9), is the i(n + 1)/10th servation It may be necessary to interpolate between successive values.(2) For grouped data, apply equation (2.13) (as in equation (2.14)) for the
ob-median to find the approximate deciles D i is in the class containing
the i n/10th largest observation
Percentiles split the data into 100 parts
(1) For ungrouped data, arrange the observations in order fromsmallest to
largest The ith percentile, P i (for i = 1, 2, , 99), is the i(n + 1)/100th
observation It may be necessary to interpolate between successive ues
val-(2) For grouped data, apply equation (2.13) (as in equation (2.14)) for the
median to find the approximate percentiles P iis in the class containing
the i n/100th largest observation
Trang 25(1) For ungrouped data:
dis-by σ with a subscript indicating the statistic.
2.2.14.1 Standard error of the mean
The standard error of the mean is used in hypothesis testing and is an
indi-cation of the accuracy of the estimate x.
Trang 262.2.15 Root mean square
(1) For ungrouped data:
RMS =
1
zontal or vertical scale The inner and outer fences are used in constructing
a box plot and are markers used in identifying mild and extreme outliers
Inner Fences: Q1− 1.5 · IQR, Q1 + 1.5 · IQR
Trang 27A general description:
Multiple box plots on the same measurement axis may be used to comparethe center and spread of distributions Figure 2.9 presents box plots for ran-domly selected August residential electricity bills for three different parts ofthe country
Figure 2.9: Example of multiple box plots
Trang 28The coefficient of quartile variation is a measure of variability.
Moments are used to characterize a set of observations
(1) For ungrouped data:
The rth moment about the origin:
(−1) j m
r −j x j . (2.33)
(2) For grouped data:
The rth moment about the origin:
(−1) j m
n
i=1
Trang 30For grouped data, suppose every class interval has width c If both tails of the
distribution are very flat and close to the measurement axis, the grouped dataapproximation to the sample variance may be improved by using Sheppard’scorrection,−c2/12:
corrected variance = grouped data variance− c2
Example 2.7 : Consider the grouped Ticket Data (page 2) as presented in the
fre-quency distribution in Example 2.2 (on page 5) Find the corrected sample variance andcorrected sample moments
Trang 313.2.1 The product rule for ordered pairs
3.2.2 The generalized product rule for k-tuples
3.3.1 Relative frequency concept of probability
3.3.2 Axioms of probability (discrete sample space)
3.3.3 The probability of an event
3.3.4 Probability theorems
3.3.5 Probability and odds
3.3.6 Conditional probability
3.3.7 The multiplication rule
3.3.8 The law of total probability
3.3.9 Bayes’ theorem
3.3.10 Independence
3.4 Random variables
3.4.1 Discrete random variables
3.4.2 Continuous random variables
Trang 323.6.10 Moment generating function
3.6.11 Linear combination of random variables
(1) A set A is a collection of objects called the elements of the set.
a ∈ A means a is an element of the set A.
a
A = {a, b, c} is used to denote the elements of the set A.
(2) The null set, denoted by φ or { }, is the empty set; the set that contains
no elements
(3) Two sets A and B are equal, written A = B, if
1) every element of A is an element of B, and
2) every element of B is an element of A.
(4) The set A is a subset of the set B if every element of A is also in B; written A ⊂ B (or B ⊃ A) For every set A, φ ⊂ A.
(5) If A ⊂ B and B ⊂ A then A = B and A is an improper subset of B.
If A ⊂ B and there is at least one element of B not in A then A is
a proper subset of B The subset symbol ⊂ is often used to denote a proper subset while the symbol ⊆ indicates an improper subset.
(6) Let S be the universal set, the set consisting of all elements of interest For any set A, A ⊂ S.
(7) The complement of the set A, denoted A , is the set consisting of all
elements in S but not in A (Figure 3.1)
(8) For any two sets A and B:
The union of A and B, denoted A ∪B, is the set consisting of all elements
in A, or B, or both (Figure 3.2)
The intersection of A and B, denoted A ∩ B, is the set consisting of all
elements in both A and B (Figure 3.3)
Trang 33(9) A and B are disjoint or mutually exclusive if A ∩ B = φ (Figure 3.4).
Figure 3.1: Shaded region =
exclu-For the following properties, suppose A, B, and C are sets It is necessary
to assume these sets lie in a universal set S only in those properties that explicitly involve S.
(1) Closure
(a) There is a unique set A ∪ B.
(b) There is a unique set A ∩ B.
Trang 34(e) If A ⊂ B, then A ∪ B = B and A ∩ B = A.
(8) Properties of (set complement)
(a) For every set A, there is a unique set A .
Suppose A1, A2, A3, , A n is a collection of sets
(a) The generalized union, A1∪ A2 ∪ · · · ∪ A n, is the set consisting of
all elements in at least one A i
(b) The generalized intersection, A1∩A2 ∩· · ·∩A n, is the set consisting
of all elements in every A i
In an equally likely outcome experiment, computing the probability of anevent involves counting The following techniques are useful for determiningthe number of outcomes in an event and/or the sample space
If the first element of an ordered pair can be selected in n1 ways, and for each
of these n1 ways the second element of the pair can be selected in n2 ways,
then the number of possible pairs is n1n2
Trang 353.2.2 The generalized product rule for k-tuples
Suppose a sample space, or set, consists of ordered collections of k-tuples.
If there are n1 choices for the first element, and for each choice of the first
element there are n2 choices for the second element, , and for each of the first k − 1 elements there are n k choices for the kth element, then there are
The binomial coefficientn
n n
n n
= 0(f) 2n
n
+· · · +n+m
Trang 363!2! = 10 subsets containing exactly 3 elements.
(S2) The subsets are
(a, b, c) (a, b, d) (a, b, e) (a, c, d) (a, c, e) (a, d, e) (b, c, d) (b, c, e) (b, d, e) (c, d, e)
There are 4 ways in which a sample of k elements can be obtained from a set
of n distinguishable objects.
combination with replacement C R (2, 2) = 3 aa, ab, and bb
permutation with replacement P R (2, 2) = 4 aa, ab, ba, and bb
Trang 373.2.7 Balls into cells
There are 8 different ways in which n balls can be placed into k cells.
2
+· · · +n
is the Stirling cycle number (see page 525) and p k (n) is the number
of partitions of the number n into exactly k integer pieces (see page 523) Given n distinguishable balls and k distinguishable cells, the number of ways
in which we can place n1 balls into cell 1, n2 balls into cell 2, , n k balls
into cell k, is given by the multinomial coefficient n
n1,n2, ,n k
of ways of choosing n1 objects, then n2 objects, , then n k objects froma
collection of n distinct objects without regard to order This requires that
k
j=1 n j = n.
Other ways to interpret the multinomial coefficient:
(1) Permutations (all objects not distinct): Given n1 objects of one kind,
n2 objects of a second kind, , n k objects of a kth kind, and n1+
n2+· · · + n k = n The number of permutations of the n objects is
n1,n2, ,n k
(2) Partitions: The number of ways of partitioning a set of n distinct objects into k subsets with n1objects in the first subset, n2objects in the second
subset, , and n k objects in the kth subset is n
n1,n2, ,n k
.The multinomial symbol is numerically evaluated as
Trang 38{ab, c, d} {ab, d, c} {ac, b, d} {ac, d, b}
{ad, b, c} {ad, c, b} {bc, a, d} {bc, d, a}
{bd, a, c} {bd, c, a} {cd, a, b} {cd, b, a}
(a) The number of ways to arrange n distinct objects in a row is n!; this is the number of permutations of n objects.
Example 3.11 : For the three objects{a, b, c} the number of arrangements is
3! = 6 These permutations are{abc, bac, cab, acb, bca, cba}.
(b) The number of ways to arrange n non-distinct objects (assuming that there are k types of objects, and n i copies of each object of type i) is
the multinomial coefficient n
n1,n2, ,n k
Example 3.12 : For the set{a, a, b, c} the parameters are n = 4, k = 3, n1= 2,
n2 = 1, and n3= 1 Hence, there are 4
2,1,1
= 2! 1! 1!4! = 12 arrangements, theyare:
aabc aacb abac abca acab acba baac baca bcaa caab caba cbaa
(c) A derangement is a permutation of objects, in which object i is not in the ith location
Example 3.13 : All the derangements of{1, 2, 3, 4} are:
2143 2341 2413
3142 3412 3421
4123 4312 4321
The number of derangements of n elements, D n, satisfies the recursion
relation: D n = (n − 1) (D n −1 + D n −2 ), with the initial values D1 = 0
The sample space of an experiment, denoted S, is the set of all possible
out-comes Each outcome of the sample space is also called an element of the
sample space or a sample point An event is any collection of outcomes tained in the sample space A simple event consists of exactly one outcome and a compound event consists of more than one outcome.
Trang 39con-3.3.1 Relative frequency concept of probability
Suppose an experiment is conducted n identical and independent times and
n(A) is the number of times the event A occurs The quotient n(A)/n is the
relative frequency of occurrence of the event A As n increases, the relative frequency converges to the limiting relative frequency of the event A The probability of the event A, Prob [A], is this limiting relative frequency.
(1) For any event A, Prob [A] ≥ 0.
(2) Prob [S] = 1.
(3) If A1, A2, A3, , is a finite or infinite collection of pairwise mutually
exclusive events of S, then
Prob [A1∪ A2 ∪ A3 ∪ · · ·] = Prob [A1 ] + Prob [A2] + Prob [A3] +· · ·
(3.5)
The probability of an event A is the sumof Prob [a i ] for all sample points a i
in the event A:
a i ∈A
If all of the outcomes in S are equally likely:
Prob [A] = n(A)
n(S) =
number of outcomes in A
(1) Prob [φ] = 0 for any sample space S.
(2) If A and A are complementary events, Prob [A] + Prob [A ] = 1.
(3) For any events A and B, if A ⊂ B then Prob [A] ≤ Prob [B].
(4) For any events A and B,
Prob [A ∪ B] = Prob [A] + Prob [B] − Prob [A ∩ B] (3.8)
If A and B are mutually exclusive events, Prob [A ∩ B] = 0 and
(5) For any events A and B,
Prob [A] = Prob [A ∩ B] + Prob [A ∩ B ] (3.10)
Trang 40(6) For any events A, B, and C,
Prob [A ∪ B ∪ C] =Prob [A] + Prob [B] + Prob [C]
− Prob [A ∩ B] − Prob [A ∩ C] − Prob [B ∩ C]
≤ n
i=1
Equality holds if the events are pairwise mutually exclusive
If the probability of an event A is Prob [A] then
odds for A = Prob [A]/Prob [A ], Prob [A ]
If the odds for the event A are a:b, then Prob [A] = a/(a + b).
Example 3.14 : The odds of a fair coin coming up heads are 1:1; that it, is has aprobability of 1/2
The odds of a die showing a “1” are 5:1 against; that it, there is a probability of 5/6that a “1” does not appear
The conditional probability of A given the event B has occurred is
Prob [A |B] = Prob [A Prob [B] ∩ B] , Prob [B] > 0 (3.14)
(1) If Prob [A1∩ A2 ∩ · · · ∩ A n −1 ] > 0 then
Prob [A1∩ A2 ∩ · · · ∩ A n ] =Prob [A1]· Prob [A2 |A1]
· Prob [A3 |A1 ∩ A2]
· · · Prob [A n |A1 ∩ A2 ∩ · · · ∩ A n −1 ].
(3.15)
(2) If A ⊂ B, then Prob [A|B] = Prob [A]/Prob [B] and
Prob [B |A] = 1.
(3) Prob [A |B] = 1 − Prob [A|B].
Example 3.15 : A local bank offers loans for three purposes: home (H), automobile(A), and personal (P), and two different types: fixed rate (FR) and adjustable rate(ADJ) The joint probability table given below presents the proportions for the various
... concept of probability3.3.2 Axioms of probability (discrete sample space)
3.3.3 The probability of an event
3.3.4 Probability theorems
3.3.5 Probability and odds
3.3.6... A and B,
Prob [A ∪ B] = Prob [A] + Prob [B] − Prob [A ∩ B] (3.8)
If A and B are mutually exclusive events, Prob [A ∩ B] = and< /i>
(5) For any events A and. .. loans for three purposes: home (H), automobile(A), and personal (P), and two different types: fixed rate (FR) and adjustable rate(ADJ) The joint probability table given below presents the proportions