Table 13.12 Asymptotic Partial Likelihood Inference on the Bladder Cancer Data from the Fitted PWP Gap Time Models with Stratum-Specific or Common Coefficients95% Confidence Interval Regres
Trang 1Table 13.11 Rearranged Data from Table 13.9 for
Fitting PWP Gap Time Model with Common
Using the notations in Table 13.9, let GT denote the gaptime, then
GT: TR—TL Replacing TR and TL in Tables 13.8 and 13.9 by GT, the data
are ready for SAS and other software Table 13.11 is the corresponding tablefor the same six patients in Table 13.9 using gap times Using the notation ofExample 13.6, the second product in(13.4.5) for stratum 2 is
The results from fitting the PWP gaptime model to all the data in Table13.6 with stratum-specific coefficients and common coefficients are given inTable 13.12 Again, the number of initial tumors is the only significant
Trang 2Table 13.12 Asymptotic Partial Likelihood Inference on the Bladder Cancer Data from the Fitted PWP Gap Time Models with Stratum-Specific or Common Coefficients
95% Confidence Interval Regression Standard Chi-Square Hazards
Variable Coefficient Error Statistic p Ratio Lower Upper
Model with Stratum-Specific Coefficients
Suppose that the text file ‘‘C:EX13d4d1.DAT’’ contains the successivecolumns in Table 13.8 for the entire data set in Table 13.6: NR, TL, TR, CS,T1, T2, T3, T4, N1, N2, N3, N4, S1, S2, S3, and S4, and the text file
‘‘C:EX13d4d2.DAT’’ contains the seven successive columns in Table 13.9: NR,
TL, TR, CS, TRT, N, and S The following SAS code can be used to obtainthe PWP models in Table 13.10
data w1;
infile ‘c: ex13d4d1.dat’ missover;
input nr tl tr cs t1 t2 t3 t4 n1 n2 n3 n4 s1 s2 s3 s4;
run;
title ‘‘PWP model with stratified coefficients‘;
proc phreg data : w1;
Trang 3model (tl, tr)*cs(0) : t1 t2 t3 t4 n1 n2 n3 n4 s1 s2 s3 s4 / ties : efron;
title ‘‘PWP model with common coefficients‘;
proc phreg data : w1;
model (tl, tr)*cs(0) : trt n s / ties : efron;
GT, CS, T1, T2, T3, T4, N1, N2, N3, N4, S1, S2, S3, and S4 The text file
‘‘C:KEX13d4d4.DAT’’ contains the successive six columns from Table 13.11: NR,
GT, CS, TRT, N, and S The following SAS, SPSS, and BMDP codes can beused to obtain the PWP gaptime models in Table 13.12
title ‘‘PWP gaptime model with stratified coefficients’’;
proc phreg data : w1;
model gt*cs(0) : t1 t2 t3 t4 n1 n2 n3 n4 s1 s2 s3 s4 / ties : efron;
title ‘‘PWP gaptime model with common coefficients‘;
proc phreg data : w1;
model gt*cs(0) : trt n s / ties : efron;
Trang 4before that time The multiplicative hazard function h(t, xG) for the ith person is
h(t, xG):YG(t)h(t) exp[bxG(t)]
where YG(t), an indicator, equals 1 when the ith person is under observation (at
risk) at time t and 0 otherwise and h(t) is an unspecified underlying hazard
Trang 5Table 13.13 Rearranged Data from Table 13.7 for
of this likelihood function and the estimation of the coefficients can be found
in Fleming and Harrington(1991) and Andersen et al (1993) Similar to thePWP models, software packages are available to carry out the computationprovided that the data are arranged in a certain format The following exampleillustrates the terms in(13.4.6) and the data format required by SAS
To explain the terms in the likelihood function, we use the data of the sixpeople in Table 13.7 In this model, every recurrent event is considered to beindependent Therefore, we can rearrange the data by person and by event time
‘‘within’’ an individual Table 13.13 shows the rearranged data For example,the person with ID: 4 had two recurrences, at 12 and 16, and the follow-uptime ended at 18 The time intervals(TL, TR] are (0, 12], (12, 16], and (16,18],and 12 and 16 are uncensored observations and 18 censored, since there was
no tumor recurrence at 18 For patients with ID: 1 and 2 (i : 1, 2), the
respective second product terms in(13.4.6) are equal to 1 since
2, for all t For patient 3 (i
recurrence time of the patient) Thus, the respective second product has only
Trang 6Table 13.14 Asymptotic Partial Likelihood Inference on the Bladder Cancer Data from the Fitted AG Model
95% Confidence Interval Regression Standard Chi-Square Hazards
Variable Coefficient Error Statistic p Ratio Lower Upper
one term at t: 3 and the denominator of this term sums over all the patients
who are under observation and at risk at time t: 3 From Figure 13.1 it iseasily seen that the sum is over all six patients; that is, the respective secondproduct is
exp(bx)
For patient 4(i: 4), the second product in (13.4.6) contains two terms One
is for t : 12 (the first recurrence time), and at t : 12, patients 2, 3, 4, 5, and 6
are still under observation, and therefore the denominator of the term sums
over patients 2 to 6 The other term is for t: 16 (the second recurrence time)and the denominator sums over patients 2, 4, 5, and 6 Patient 3 is no longer
under observation after t : 14 Thus, the second product term for i : 4 is
exp(bx)
Hexp(bxH);
exp(bx)exp(bx) ;Hexp(bxH) (13.4.8)
Similarly, we can construct each term in (13.4.6) and the partial likelihoodfunction
Using SAS, we obtain the results in Table 13.14 The AG model identifiestreatment and number of initial tumor as significant covariates Comparedwith placebo, thiotepa does slow down tumor recurrence
Readers can construct the SAS codes for the AG model by using Table 13.13and by following the codes given in Example 13.6
Wei et al Model
By using a marginal approach, Wei, Lin, and Weissfeld (1989) proposed amodel, the WLW model, for the analysis of recurrent failures The failures may
be recurrences of the same kind of event or events of different natures,depending on how the stratification is defined If the strata are defined by the
Trang 7times of repeated failures of the same type, similar to the strata defined in thePWP models, it can be used to analyze repeated failures of the same kind Thedifference between the PWP models and the WLW model is that the latterconsiders each event as a separate process and treats each stratum-specific(marginal) partial likelihood separately In the stratum-specific (marginal)
partial likelihood of stratum s, people who have experienced the (s9 1)thfailure contribute either one uncensored or one censored failure time depending
on whether or not they experience a recurrence in stratum s, and the other
subjects contribute only censored times(forced as censored times) Therefore,each stratum contains everyone in the study This is different from the PWPmodels, in which subjects who have not experienced the (s9 1)th failure are
not included in stratum s If the strata are defined by the type of failure, the
WLW model acts like the competing risks model defined in Section 13.3, andthe type-specific(marginal) partial likelihood for the jth type simply treats all failures of types other than j in the data as censored.
For the kth stratum of the ith person, the hazard function is assumed to
have the form
hIG(t) :YIG(t)hI(t) exp(bIxIG), t 0 (13.4.9) where YIG(t) :1, if the ith person in the kth stratum is under observation, 0, otherwise, hI(t) is an unspecified underlying hazard function Let RI(tIG) denote the risk set with people at risk at the ith distinct uncensored time tIG in the kth stratum Then the specific partial likelihood for the kth stratum is
otherwise The coefficients bI are stratum specific In practice, if we are
interested in the overall effect of the covariates, we can assume that thecoefficients from different strata are equal(provided that there are no qualitat-ive differences among the strata), combine the strata and draw conclusionsabove the ‘‘average effect’’ of the covariates We again called the coefficients ofthese covariates common coefficients The event time is from the beginning ofthe study in this model
Similar to the PWP and AG models, the data must be arranged in a certainformat in order to use available software to carry out estimation of thecoefficients and tests of significance of the covariates Using the same data as
in Examples 13.6 and 13.7, the following example illustrates the terms in thestratum-specific likelihood function and the use of software
compo-nents in the stratum-specific likelihood function in(13.4.10) The format thedata have to be in for the available software, such as SAS, SPSS, and BMDP,
Trang 8Table 13.15 Rearranged Data from Table 13.7 for Fitting WLW Model with
of the event time(censored or not, TR) In stratum 2(NR: 2), the three people(with ID: 4, 5, and 6) whose times to the second tumor recurrence areuncensored observations Patients 1 and 2 had censored time at 9 and 59,respectively Patient 3, who had no second recurrence and was observed until
14 months, is considered censored at 14 The other strata are constructed in asimilar manner Using the data arrangement in Table 13.15, we can see that forthe second stratum, the likelihood function in(13.4.10) has three terms, one foreach of persons 5, 6, and 4, whose
the risk set at time t: 16 has two individuals (ID : 4 and 2); for patient 5,
the risk set at time t: 12 contains five individuals (ID : 2, 3, 4, 5, and 6); and
for patient 6, the risk set at time t: 15 has three individuals (ID : 2, 4, and
6) Let xH be the covariate vector of the patient with ID:j in stratum 2; then
Trang 9Table 13.16 Rearranged Data from Table 13.7 for
Fitting WLW Model with Common Coefficients
in the average overall effect of the covariates, we combine T1—T4, N1—N4, and S1—S4 The rearranged data for the six patients are given in Table 13.16.
Trang 10Table 13.17 Asymptotic Partial Likelihood Inference on the Bladder Cancer Data from the Fitted WLW Models with Stratum-Specific or Common Coefficients
95% Confidence Interval Regression Standard Chi-Square Hazards
Variable Coefficient Error Statistic p Ratio Lower Upper
Model with Stratum-Specific Coefficients
recurrences The signs of the coefficients for T1—T4 suggest that thiotepa may
slow down tumor growth, but the evidence is not statistically significant Themodel with common coefficients suggests that thiotepa is significantly moreeffective in prolonging the recurrence time The results suggest that whenlooking at each stratum independently, there is no strong evidence thatthiotepa is more effective than placebo However, the combined estimate of thecommon coefficient provides stronger evidence that thiotepa is more effectiveover the course of the study
In Cox’s proportional hazards model and other regression methods, a keyassumption is that observed survival or event times are independent However,
in many practical situations, failure times are observed from related individuals
Trang 11or from successive recurrent events or failures of the same person For example,
in an epidemiological study of heart disease, some of the participants may befrom the same family and therefore are not independent These families with
multiple participants may be called clusters In this case, the regression
methods we introduced earlier may not be appropriate Several types of modelsintroduced especially for related observations are discussed by Andersen et al.(1993), Liang et al (1995), Klein and Moeschberger (1997), and Ibrahim et al.(2001) Details about these models are beyond the scope of this book In thefollowing, we introduce briefly the frailty models
The frailty models assume that there is an unmeasured random variable
(frailty) in the hazard function This random variable accounts for the variation
or heterogeneity among individuals in a cluster It is also assumed that the
frailty is independent of censoring Let n be the total number of participants in the study, some of them related and forming clusters Let vG be the unknown random variable, frailty, associated with the ith cluster, 1 i n The frailty
model associated with the proportional hazards model can be written in terms
of the log hazard function as
log[hGH(t; xGH vG)] :log[h(t)] ;vG ;bxGH (13.5.1)for 1 j mG and 1 in, where b denotes the p;1 column vector of unknown regression coefficients, xGH is the covariate vector of the jth person in the ith cluster, mG is the number of individuals in the ith cluster, and h(t) is an
unknown underlying hazard function Compared with the Cox proportional
hazards model, the difference here is the random effect vG Because vG remains the same in the ith cluster, the association between failure and covariates
within each cluster in this model is assumed to have a symmetric pattern In afamily study, this model can be used, for example, to model failure timesobserved from siblings by treating each family as a cluster This model wasproposed by Vaupel et al (1979) and developed and discussed by manyresearchers, including Clayton and Cuzick(1985) The main approach to this
model is to assume that vG follows a parametric distribution.The frailty model in(13.5.1) can be extended to handle more complicated
situations For example, the frailty can be a time-dependent variable [replace
vG by vG(t) in (13.5.1)] The frailty model with vG(t) can be used to model
successive or recurrent failure time as an alternative to the models in Section13.4 Another example is that there may be more than one type of frailty in
each cluster, and vG in (13.5.1) can be replaced by vG ;uG or vG;uG ;wG, and
so on
Inferences of these frailty models are also based on either a likelihoodfunction or a partial likelihood function Since the models involve a parametricdistribution, the likelihood or partial likelihood functions are complicated andare beyond the level of this book
The frailty models have not been used widely primarily because of the lack
of commercially available software There are some computer programs
Trang 12available; for example, a SAS macro is available for a gamma frailty model atthe Web site of Klein and Moeschberger (1997), and another program isdescribed by Jenkins(1997).
Bibliographical Remarks
Most of the major references for nonproportional hazards models have been
cited in the text of this chapter Applications of these models include: stratified
models: Vasan et al.(1997), Aaronson et al (1997), and Yakovlev et al (1999);
frailty models: Yashin and Iachine(1997), Kessing et al (1999), Siegmund et al.(1999), Albert (2000), Lee and Yau (2001), Wienke el al (2001), and Xue (2001);
competing risks models: Mackenbach et al.(1995), Fish et al (1998), Albertsen
et al (1998), Blackstone and Lytle (2000), Yan et al (2000), and Tai et al.(2001)
EXERCISES
13.1 Consider the cancer-free times from the participants with IDs 15 to 23
in Table 13.1 Follow Example 13.1 to construct the partial likelihoodfunction based on the observed cancer-free times from these nine partici-pants
13.2 Consider the survival times from 30 resected melanoma patients in Table3.1 Let AGEG denote age group, AGEG: 1 if age 45 and AGEG : 2otherwise Fit the survival times with an AGEG-stratified Cox propor-tional hazards model with the covariates age, gender, initial stage, andtreatment received Discuss the association of the treatment received withthe survival time
13.3 Using the data in Table 12.4, following Example 13.5 and the samplecodes for SAS, SPSS, or BMDP, fit the competing risk model for stroke,CHD, other CVD, or STROKE/CHD separately, and discuss the resultsobtained
13.4 Using the rearranged data in Tables 13.7 to 13.13 and followingExamples 13.6 to 13.8, complete construction of the remaining terms inthe partial likelihood function based on the PWP model(13.4.2), PWPgaptime model(13.4.3), and AG model (13.4.9), and the remaining threemarginal likelihood functions based on the WLW model(13.4.13)
Trang 13C H A P T E R 14
Identification of Risk Factors
Related to Dichotomous and
Polychotomous Outcomes
In biomedical research we are often interested in whether a certain related event will occur and the important factors that influence its occurrence.Such events may involve two or more possible outcomes; examples are thedevelopment of a given condition and response to a given treatment If thegiven condition is diabetes and we are only interested in whether someonedevelops the disease(yes or no), the outcome is binary or dichotomous If weare interested in whether the person develops impaired glucose tolerance,diabetes, or remains having normal glucose tolerance, there are three possible
survival-outcomes, or we say the outcome is trichotomous Similarly, response to a given treatment can have dichotomous (response or no response) or polychotomous
outcomes(complete response, partial response, or no response)
To determine whether one is likely to develop a given disease, we need toknow the important characteristics (or factors) related to its development.High- and low-risk groups can then be defined accordingly Factors closely
related to the development of a given disease are usually called risk factors or
risk variables by epidemiologists We shall use these terms in a broader sense
to mean factors closely related to the occurrence of any event of interest Forexample, to find out whether a woman will develop breast cancer because one
of her relatives did, we need to know whether a family history of breast cancer
is an important risk factor Therefore, we need to know the following:
1 Of age, race, family history of breast cancer, number of pregnancies,experience of breast-feeding, and use of oral contraceptives — which aremost important?
2 Can we predict, on the basis of the important risk factors, whether awoman will develop breast cancer or is more likely to develop breastcancer than another person?
377
Trang 14In this chapter we introduce several methods for answering these tions The general approach is to relate various patient characteristics(or independent variables, or covariates) to the occurrence of an event(dependent or response variable) on the basis of data collected frompatients in each of the outcome groups In the case of dichotomous out-comes, there are two outcome groups For example, to relate variables such
ques-as age, race, and number of pregnancies to the development of breques-ast cancer,
we need to collect information about these variables from a group of breastcancer patients as well as from a group of healthy normal women For an eventwith polychotomous outcomes, we need to collect data from each outcomegroup
Often, a large number of patient characteristics deserve consideration.These characteristics may be demographic variables such as age; geneticvariables such as gene variant or phenotype; behavioral variables such assmoking or drinking behavior and use of estrogen or progesterone medic-ation; environmental variables such as exposure to sun, air pollution, oroccupational dust; or clinical variables such as blood cell counts, weight, andblood pressure The number of possible risk factors can be reduced throughmedical knowledge of the disease and careful examination of the possible riskfactors individually
In Section 14.1 we present two methods for examination of individualvariables One is to compare the distribution of each possible risk variableamong the outcome groups The other method is the chi-square test for acontingency table This test is particularly useful when the risk variables arecategorical: for example, dichotomous or trichotomous In this case, a 2;c or
r ;c contingency table can be set up and a chi-square test performed In
Section 14.2 we discuss logistic, conditional logistic, and other regressionmodels for binary responses and for examining the possible risk variablessimultaneously Models for multiple outcomes are discussed in Section 14.3
14.1.1 Comparing the Distributions of Risk Variables Among Groups
When the outcome is binary, it is often convenient to call an observation a
success or a failure Success may mean that a survival-related event occurred, and failure that it failed to occur Thus, a success may be a responding
patient, a patient who survives more than five years after surgery, or aperson who develops a given disease A failure may be a nonrespond-ing patient, a patient who dies within five years after surgery, or a personwho does not develop a given disease A preliminary examination of thedata can compare the distribution of the risk variables in the success andfailure groups This method is especially appropriate if the risk variable is
Trang 15Table 14.1 Ages of 71 Leukemia Patients (Years)
Responders 20, 25, 26, 26, 27, 28, 28, 31, 33, 33, 36, 40, 40, 45, 45, 50, 50, 53 56,
62, 71, 74, 75, 77, 18, 19, 22, 26, 27, 28, 28, 28, 34, 37, 47, 56, 19 Nonresponders 27, 33, 34, 37, 43, 45, 45, 47, 48, 51, 52, 53, 57, 59, 59, 60, 60, 61, 61,
61, 63, 65, 71, 73, 73, 74, 80, 21, 28, 36, 55, 59, 62, 83
Source: Hart et al.(1977) Data used by permission of the author.
continuous If, for example, the risk factor x is weight and the dependent variable y is having cardiovascular disease, we may compare the weight
distribution of patients who have developed disease to that of disease-freepatients If the disease group has significantly higher weights than those of thedisease-free group, we may consider weight an important risk factor Common-
ly used statistical methods for comparing two distributions are the t-test for
two independent samples if the assumption of normality holds and the
Mann—Whitney U-test if the normality assumption is violated and a
non-parametric test is preferred
Similarly, if there are more than two possible outcomes, we can use analysis
of variance or the Kruskal—Wallis nonparametric test to compare the multiple
distributions of a continuous variable The following example compares the agedistribution of responders with that of nonresponders in a cancer clinical trial
and 34 nonresponders (response is defined as a complete response only) —given in Table 14.1 Figure 14.1 gives us the estimated age distributions of the
two groups By using the Mann—Whitney U-test (or Gehan’s generalizedWilcoxon test), we find that the difference in age between responders andnonresponders is statistically significant(p 0.01) In consequence, a questionmay arise as to what age is critical Can we say that patients under 50 mayhave a better chance of responding than do patients over 50? To answer thisquestion, one can dichotomize the age data and use the chi-square test,discussed next
14.1.2 Chi-Square Test and Odds Ratio
The chi-square test and the odds ratio are most appropriate when theindependent variable is categorical If the independent variable is dichotomous,
a 2;2 table can be used to represent the data Any variables that are notdichotomous can be made so (with a loss of some information) by choos-ing a cutoff point: for example, age less than 50 years For multiple-outcome events, 2;c or r;c tables can be constructed The independent
Trang 16Figure 14.1 Age distribution of responders and nonresponders.
variables are then examined to find which ones (in some sense) provide thebest risk associations with the dependent variable We first consider binaryoutcomes and independent variables that have two categories; that is, we set
up a 2;2 contingency table similar to Table 14.2 for each independent variableand look for a high degree of proportionality
The first step is to calculate the sample proportion of successes in the two
risk groups, a/C and b/C Further analysis of the table is concerned with the
precision of these proportions A standard chi-square test can be used
Trang 17Table 14.2 General Setup of a 2 ;2 Contingency Table
Proportion of successes (success rate) a/C b/C
If the rates of success for the two groups E and E are exactly equal, the
expected number of patients in the ijth cell (ith row and jth column) is
since the overall success rate is R/N and there are C individuals in the E
group Similar expected numbers can be obtained for each of the four cells Let
OGH be the number of patients observed in the ijth cell Then the discrepancies
can be measured by the differences (OGH9EGH) In a rough sense, the greater the
discrepancies, the more evidence we have against the null hypothesis that the
success rates are the same for the two groups The chi-square test is based on
these discrepancies Let
Under the null hypothesis, X follows the chi-square distribution with 1 degree
of freedom(df) The hypothesis of equal success rates for groups E and E is
rejected if X ?, where ? is the 100 percentage point of the chi-square
distribution with 1 degree of freedom An alternative way to compute X is
X :(ad 9 bc)N
Trang 18The odds ratio(Cornfield, 1951) is a commonly used measure of association
in 2;2 tables The odds ratio (OR) is the ratio of two odds: the odds of success
when the risk factor is present and the odds of success when the risk factor isabsent In terms of probabilities,
OR:P(success E)/P(failure E)
P(success E)/P(failure E) (14.1.4)Using the notation in Table 14.2, P(success E) and P(failure E) may
be estimated by a/C and c/C, respectively Similarly, P(success E) and
P(failure E) may be estimated, respectively, by b/C and d/C Therefore,
the numerator and denominator of (14.1.4) may be estimated, respectively,by
a/C
c/C:
a c
and
b/C
d/C:
b d
Consequently, the OR may be estimated by
OR :a/c
b/d:ad
which is also referred to as the cross-product ratio.
Several methods are available for an interval estimate of OR: for example,Cornfield (1956) and Woolf (1955) Cornfield’s method, which requires aniterative procedure, is considered more accurate but more complicated thanWoolf’s method Woolf suggests using the logarithm of OR The standard error
of log OR may be estimated by
The confidence interval for OR can be obtained by taking the antilog of the
confidence limits for log OR If log OR3 and logOR* are the upper and lower
Trang 19confidence limits for log OR, elog OR3 and elog OR* are the upper and lowerconfidence limits for OR.
Notice that in(14.1.5), if b or c is zero, OR is undefined If any one of thefour cell frequencies is zero, the estimated standard error in (14.1.6) is alsoundefined Should this occur, some statisticians(Haldane, 1956; Fleiss, 1979,1981) suggest that 0.5 be added to each cell before using (14.1.5) and (14.1.6)
to solve the computational problem However, if the cell frequencies are assmall as zero, the addition of 0.5 to each cell will substantially affect theresulting estimate of OR and its standard error (Mantel, 1977; Miettinen,1979) The estimates so obtained must be interpreted with caution
An odds ratio of 1 indicates that the odds of success are the same whether
or not the risk factor is present An odds ratio greater than 1 means that theodds in favor of success is higher when the risk factor is present, and thereforethere is a positive association between the risk factor and success Similarly, anodds ratio of less than 1 signifies a negative association between the risk factorand success The interpretation should not be based totally on the pointestimate A confidence interval is always more meaningful, just as in any otherestimation procedure
The chi-square statistic in(14.1.2) may be used to test the null hypothesis
that there is no association between the risk factor and success, or H: OR:1.
The following example illustrates the chi-square test and odds ratio
(Example 14.1), age is considered one of the possible risk variables Thefollowing 2;2 table is constructed
Age 50 Age 50 Total
Trang 20getting a X value of 10.16 if the two response rates are equal in the population
is less than 0.01 Hence the difference between the two response rates issignificant at the 1% level
The estimate odds ratio, according to(14.1.5), is
OR:(27)(22)(10)(12): 4.95
The data show that the odds in favor of response are almost five times higher
in patients under 50 years of age than in patients at least 50 years old Thedifference is significantly different, as indicated by the chi-square test above
To obtain a confidence interval for OR, we first compute log OR : 1.60.The estimated standard error of log OR following (14.1.6) is
A 95% confidence interval for log OR is 1.60< 1.96(0.515), or (0.59, 2.61), and
a 95% confidence interval for OR is (e
interval may be due to the small cell frequencies Note that the standard error
of log OR is inversely related to the cell frequencies
In this example, the cutoff point, 50, was chosen arbitrarily It is often ofinterest to try more than one cutoff point if the number of observations in eachcell is not too small
There are cases where the independent variable has c 2 classes Thechi-square test can be extended to 2;c tables The odds ratio method can also
be extended to handle polychotomous independent variables It is done byselecting one of the classes as the reference class(the E group) and calculatingthe measure of association of each of the other classes relative to the reference
class For multiple-outcome events, the chi-square test can be extended to r ;c
tables The expected frequencies are computed just as in(14.1.1), and
compu-tation of X [chi-square distributed with (r 9 1)(c 9 1) degrees of freedom] is
the same as in(14.1.2) except that the sum is over all r ;c cells For details, see
Snedecor and Cocharan(1967, Sec 9.7) The following example illustrates theprocedures
patients, another possible risk variable is the marrow absolute leukemic
infil-trate, which is defined as the percentage of the total marrow that is either blast
cells or promyelocytes It is believed that patients should be classified into threeclasses:
in parentheses are expected frequencies For example, 18.68: (39)(34)/71
Trang 21Marrow Absolute Infiltrate
The question is whether the difference in marrow absolute leukemic infiltrate is
related to response The value of X is
X :(49 8.34)
8.34 ;(129 7.66)
7.66 ; % ;(39 7.66)
7.66 : 10.17
The number of degrees of freedom is 39 1 : 2 With X : 10.17 and 2 degrees
of freedom, the probability that the three absolute infiltrate groups have thesame response rate is less than 0.01 The data suggest that patients with a highpercentage of marrow absolute infiltrate tend to have a high response rate.Marrow absolute infiltrate may be an important factor in predicting response.The OR
the reference class(or group) For example, for the90% class, the odds ratio
is 13;12/4;3 : 13 The 95% confidence intervals for the ORs are obtainedusing(14.1.6) Although the odds ratio for the 46—90% group is larger than 1,
the 95% confidence intervals covers 1 Therefore, the point estimate, 3.16,cannot be taken too seriously It appears that the major difference is betweenthe
Individual examination of each independent variable can provide only apreliminary idea of how important each variable is by itself The relativeimportance of all the variables has to be examined simultaneously usingmultivariate methods In the following section we discuss the linear logisticregression analysis
MODELS FOR DICHOTOMOUS RESPONSES
14.2.1 Logistic Regression Model for Prospective Studies
In a typical prospective study, a random sample of subjects is taken and thevalues of the independent variables are measured at a given time(usually called
baseline measurements) The subjects are then followed for a given period of
Trang 22time and the outcome (dependent) variable is measured at the end of thefollow-up Therefore, for a prospective study, the independent variables areregarded as fixed quantities during the follow-up, but the outcomes are randomand unknown The purpose of a prospective study is to examine the outcomesand relate them to the baseline measurements Examples of prospective studiesare cohort epidemiologic studies and clinical trials.
Suppose that there are n subjects and to some of whom the event of interest occurred They are called successes; the others are failures Let yG :1 if the ith subject is a success and yG : 0 if the ith subject is a failure Suppose that for each of the n subjects, p independent variables xG, xG, , xGN are measured.
These variables can be either qualitative, such as gender and race, or tative, such as blood pressure and white blood cell count The problem is to
quanti-relate the independent variables, xG, , xGN, to the dichotomous dependent variable yG.
Let PG be the probability of success, PG :P(yG :1 xG, , xGN), for the ith
subject The logistic regression model, proposed by Cox(1970) assumes thatthe dependence of the probability of success on independent variables is
PG: P(yG :1 xG): exp(NHbHxGH)
1; exp(NHbHxGH) (14.2.1)
and
19 PG:P(yG:0xG) :1; exp(NHbHxGH)1 (14.2.2)
where xG :(xG% xGN), xGY1, and bH are unknown coefficients The logarithm
of the ratio of PG and 19PG is a simple linear function of the xGH’s.Let
G:log19 PG PG :HN bHxGH (14.2.3) G:log[PG/(1 9PG)] is called the logistic transform of PG and (14.2.3) is a linear
logistic model Another name for G is log odds Thus, the model relates the independent variables to the logistic transform of PG, or log odds The probability of success PG can then be found from (14.2.3) or (14.2.1) In many
ways(14.2.3) is the most useful analog for dichotomous response data of theordinary regression model for normally distributed data
To estimate the coefficients bH’s, Cox suggests the maximum likelihood method Let y, y, , yL be observations with dichotomous values on n
subjects The likelihood function based on the binomial distribution contains afactor(14.2.1) whenever yG :1 and (14.2.2) whenever yG: 0 Thus, the likeli-
Trang 23exp(NHbHtH) LG[1;exp(NHbHxGH)]
(14.2.4) where tH: LGxGHyG The log-likelihood function is
function in (14.2.5) can be obtained by solving the following p equations
HH Then the estimated inverse of the I matrix, I \, is the
asymptotic covariance matrix of the bH’s If we use the notation in Section 7.1
and let b : (b, b, , bN) denote the MLE of b, the estimated covariance
matrix of the MLE b
denotes the ijth element of V (b) or the ijth element of I\.
The coefficients so obtained indicate the relationships between the variablesand the log odds in favor of success For a continuous variable, the correspond-ing coefficient gives the change in the log odds for an increase of 1 unit in thevariable For a categorical variable, the coefficient is equal to the log odds ratio(see Section 14.1)
An approximate 100(19 )% confidence interval for bH is
where Z? is the 100(19/2) percentile of the standard normal distribution.
Trang 24To test the hypothesis that some of the bH’s are zero, a likelihood ratio test can be used For example, to test H:bH:0, the log-likelihood ratio test
statistic is
X*:92[l(b, b, , bH\, 0, bH>, , bN) 9l(b, b, b, , bN)]
(14.2.9)where the first term is the maximized log-likelihood subject to the constraint
bH:0 If the hypothesis is true, X* is distributed asymptotically as chi-square
with 1 degree of freedom
An alternative test for the significance of the coefficients is the Wald test,
which can be written as
X5: b H
Under the null hypothesis that bH:0, X5 has an asymptotic chi-square
distribution with 1 degree of freedom Although the Wald test is used by many,
it is less powerful than the likelihood ratio test (Hauck and Donner, 1977;Jennings, 1986) In other words, the Wald test often leads the user to concludethat the coefficient(consequently, the respective risk factor) is not significantwhen, in fact, it is significant
Similar to earlier discussion of model selection, forward, backward, andstepwise variable selection methods can be used to select the risk factors thatare significantly associated with a dichotomous response The independent
variables xGH in this model do not have to be the original variables They can
be any meaningful transforms of the original variables: for example, the
logarithm of the original variable, log xGH, and the deviation of the variable from its mean, xGH9 xH.From(14.2.1) and (14.2.2), the logarithm of the odds ratio for ith and kth
subjects is
log PG/(1 9PG) PI/(1 9PI):
square test, the Hosmer—Lemeshow(Hosmer and Lemeshow, 1980) test, a teststatistic suggested by Tsiatis (1980), and the score of Brown (1982) In the
following, we introduce the Hosmer—L emeshow test.
Trang 25Let pG be the estimate of PG obtained from the fitted logistic regression equation for the ith subject, i : 1, , n The pG’s can be arranged in ascending
order from smallest to largest Those probabilities and the corresponding
subjects are then divided into g groups according to some cutoff points of the probability For example, let g: 10 and the cutoff points of the probability be
equal to k/10, k: 1, 2, , 10 Thus, the first group contains all subjects whoseestimated probabilities are less than or equal to 0.1, the second group containsall subjects whose estimated probabilities are less than or equal to 0.2, and so
on Let nI be the number of subjects in the kth group The estimated expected number of successes for the kth group is
EI: L I H pH k : 1, 2, , g The Hosmer—Lemeshow test statistic is defined as
of freedom The test is basically a chi-square test of the discrepancy between
the observed and predicted frequencies of success Thus, a C value larger than
the 100 percentage point of the chi-square distribution (or p value less than
) indicates that the model is inadequate
Similar to other chi-square goodness-of-fit tests, the approximation depends
on the estimated expected frequencies being reasonably large If a large number(say, far more than 20%) of the expected frequencies are less than 5, the
approximation may not be appropriate and the p value must be interpreted
carefully If this is the case, adjacent groups may be combined to increase theestimated expected frequencies However, Hosmer and Lemeshow warn that if
fewer than six groups are used to calculate C, the test would be insensitive and
would almost always indicate that the model is adequate
Most statistical software packages provide programs for logistic regressionanalysis: for example, SAS(procedures LOGISTIC, PHREG, and CATMOD),BMDP(procedures LR and PR), and SPSS (procedures NOMREG, PROBIT,PLUM, and LOGISTIC) Most of them provide estimates of the coefficientsand test statistics, variable selection procedures, and tests of goodness of fit
Trang 26Example 14.4 In a study of 238 non-insulin-dependent diabetic patients,
10 covariates are considered possible risk factors for proteinuria(the outcomevariable) The logistic regression method is used to identify the most importantrisk factors and to predict the probability of proteinuria on the basis of theserisk factors The 10 potential risk factors are age, gender(1, male; 2, female),smoking status(0, no; 1, yes), percentage of ideal body mass index, hyperten-sion(0, no; 1, yes), use of insulin (0, no; 1, yes), glucose control (0, no; 1, yes),duration of diabetes mellitus (DM) in years, total cholesterol, and total
triglyceride Among the 238 patients, 69 have proteinuria ( yG:1).Using the stepwise procedure in BMDP, it is estimated that at step 1, themodel contains only b and b: 90.896 and l(b) in (14.2.5) is 9143.292 At
step 2, duration of diabetes is added to the model because its maximumlog-likelihood value is the largest among all the covariates The MLEs of the
two coefficients are b :91.467 and b:90.055, and l(b,b):9139.429.
Since
X*:92[l(b) 9l(b, b)] : 7.726
which is significant(p: 0.005), the duration of DM is related significantly to
the chance of proteinuria The Hosmer—Lemeshow test statistic for goodness of
fit with only duration of DM in the model, C: 9.814 with 8 degrees of
freedom, gives a p value of 0.278.
At step 3, gender is added to the model because its addition yields the largestmaximum log-likelihood value among all the remaining covariates The maxi-
mum log-likelihood value, l(b ,b,b) :9137.749, b:91.453, b:90.060, and b :90.279 To test if gender is significantly related to proteinuria after
duration of DM, we perform the likelihood ratio test
X* :92[l(b, b) 9l(b, b, b)] :3.360
which is significant at p: 0.067 The stepwise procedure terminates after thethird step because no other covariates are significant enough to enter the
regression model; that is, none of the other covariates have a p value less than
0.15, which is set by the program (BMDP) If any covariate already in theregression becomes insignificant after some other variables are in, the insignifi-
cant variable would be removed The p values for entering and removing a
variable can be determined by the user The default values for entering andremoving a variable are, respectively, 0.10 and 0.15 Thus, the procedureidentifies duration of DM and gender as the two most important risk factorsbased on the data given A question that may be raised at this point is whetherone should include gender in the equation since its significance level is largerthan the commonly used 0.05 The recommendation is to include it since it is
close to 0.05 and since the p value should not be the only basis for determining
whether a covariate should be included in the model In addition, the
Hosmer—Lemeshow goodness of fit test statistic C: 5.036, when gender is
Trang 27Table 14.3 Estimated Coefficients for a Linear Logistic Regression Model Using Data from Diabetic Patients
Estimated Standard Variable Coefficient Error Coefficient/SE exp(coefficient)
included, yields a p value of 0.754 Thus, inclusion of this covariate improves
considerably the adequacy of the model Thus, the final regression equationwith the two significant risk factors is
log PG
19 PG : 91.453 ; 0.060 (duration of DM) 9 0.279 (gender)
Table 14.3 gives the details for the estimated coefficients
The signs of the coefficients indicate that male patients and patients with alonger duration of diabetes have a higher chance of proteinuria Furthermore,for each increase of one year in duration of diabetes, the log odds increase by0.060 Probabilities of proteinuria can be estimated following (14.2.1) Forexample, the probability of developing proteinuria for a male patient who hashad diabetes for 15 years is
P: e
where90.832 is obtained by substituting the values of the two covariates inthe fitted equation; that is,91.453;0.060(15)90.279(1): 90.832 Similarly,for a female patient who has the same duration of diabetes, the probability is0.248
In addition to individual variables, interaction terms can be included in thelogistic regression model If the association between an independent variable
x and the dependent variable y is not the same in different levels of another
variable, x, there is interaction between x and x To check if there is interaction, one can include the product of x and x in the regression model
and test the significance of this new variable The following example illustratesthe procedure
certain types of cancer It is also well known that adriamycin is highly toxic
... always indicate that the model is adequateMost statistical software packages provide programs for logistic regressionanalysis: for example, SAS(procedures LOGISTIC, PHREG, and CATMOD),BMDP(procedures... following (14.2.1) Forexample, the probability of developing proteinuria for a male patient who hashad diabetes for 15 years is
P: e
where90 .83 2 is obtained... class="page_container" data- page="26">
Example 14.4 In a study of 2 38 non-insulin-dependent diabetic patients,
10 covariates are considered possible risk factors for proteinuria(the