Statistical Methods for Survival Data Analysis Third Edition phần 8 pot

Table 13.12 Asymptotic Partial Likelihood Inference on the Bladder Cancer Data from the Fitted PWP Gap Time Models with Stratum-Specific or Common Coefficients95% Confidence Interval Regres

Trang 1

Table 13.11 Rearranged Data from Table 13.9 for

Fitting PWP Gap Time Model with Common

Using the notations in Table 13.9, let GT denote the gaptime, then

GT: TR—TL Replacing TR and TL in Tables 13.8 and 13.9 by GT, the data

are ready for SAS and other software Table 13.11 is the corresponding tablefor the same six patients in Table 13.9 using gap times Using the notation ofExample 13.6, the second product in(13.4.5) for stratum 2 is

The results from fitting the PWP gaptime model to all the data in Table13.6 with stratum-specific coefficients and common coefficients are given inTable 13.12 Again, the number of initial tumors is the only significant

Trang 2

Table 13.12 Asymptotic Partial Likelihood Inference on the Bladder Cancer Data from the Fitted PWP Gap Time Models with Stratum-Speciﬁc or Common Coefﬁcients

95% Conﬁdence Interval Regression Standard Chi-Square Hazards

Variable Coefﬁcient Error Statistic p Ratio Lower Upper

Model with Stratum-Specific Coefficients

Suppose that the text ﬁle ‘‘C:EX13d4d1.DAT’’ contains the successivecolumns in Table 13.8 for the entire data set in Table 13.6: NR, TL, TR, CS,T1, T2, T3, T4, N1, N2, N3, N4, S1, S2, S3, and S4, and the text ﬁle

‘‘C:EX13d4d2.DAT’’ contains the seven successive columns in Table 13.9: NR,

TL, TR, CS, TRT, N, and S The following SAS code can be used to obtainthe PWP models in Table 13.10

data w1;

inﬁle ‘c: ex13d4d1.dat’ missover;

input nr tl tr cs t1 t2 t3 t4 n1 n2 n3 n4 s1 s2 s3 s4;

run;

title ‘‘PWP model with stratiﬁed coefﬁcients‘;

proc phreg data : w1;

Trang 3

model (tl, tr)*cs(0) : t1 t2 t3 t4 n1 n2 n3 n4 s1 s2 s3 s4 / ties : efron;

title ‘‘PWP model with common coefﬁcients‘;

model (tl, tr)*cs(0) : trt n s / ties : efron;

GT, CS, T1, T2, T3, T4, N1, N2, N3, N4, S1, S2, S3, and S4 The text ﬁle

‘‘C:KEX13d4d4.DAT’’ contains the successive six columns from Table 13.11: NR,

GT, CS, TRT, N, and S The following SAS, SPSS, and BMDP codes can beused to obtain the PWP gaptime models in Table 13.12

title ‘‘PWP gaptime model with stratiﬁed coefﬁcients’’;

model gt*cs(0) : t1 t2 t3 t4 n1 n2 n3 n4 s1 s2 s3 s4 / ties : efron;

title ‘‘PWP gaptime model with common coefﬁcients‘;

model gt*cs(0) : trt n s / ties : efron;

Trang 4

before that time The multiplicative hazard function h(t, xG) for the ith person is

h(t, xG):YG(t)h(t) exp[bxG(t)]

where YG(t), an indicator, equals 1 when the ith person is under observation (at

risk) at time t and 0 otherwise and h(t) is an unspeciﬁed underlying hazard

Trang 5

of this likelihood function and the estimation of the coefﬁcients can be found

in Fleming and Harrington(1991) and Andersen et al (1993) Similar to thePWP models, software packages are available to carry out the computationprovided that the data are arranged in a certain format The following exampleillustrates the terms in(13.4.6) and the data format required by SAS

To explain the terms in the likelihood function, we use the data of the sixpeople in Table 13.7 In this model, every recurrent event is considered to beindependent Therefore, we can rearrange the data by person and by event time

‘‘within’’ an individual Table 13.13 shows the rearranged data For example,the person with ID: 4 had two recurrences, at 12 and 16, and the follow-uptime ended at 18 The time intervals(TL, TR] are (0, 12], (12, 16], and (16,18],and 12 and 16 are uncensored observations and 18 censored, since there was

no tumor recurrence at 18 For patients with ID: 1 and 2 (i : 1, 2), the

respective second product terms in(13.4.6) are equal to 1 since

2, for all t For patient 3 (i

recurrence time of the patient) Thus, the respective second product has only

Trang 6

Table 13.14 Asymptotic Partial Likelihood Inference on the Bladder Cancer Data from the Fitted AG Model

one term at t: 3 and the denominator of this term sums over all the patients

who are under observation and at risk at time t: 3 From Figure 13.1 it iseasily seen that the sum is over all six patients; that is, the respective secondproduct is

exp(bx)

For patient 4(i: 4), the second product in (13.4.6) contains two terms One

is for t : 12 (the ﬁrst recurrence time), and at t : 12, patients 2, 3, 4, 5, and 6

are still under observation, and therefore the denominator of the term sums

over patients 2 to 6 The other term is for t: 16 (the second recurrence time)and the denominator sums over patients 2, 4, 5, and 6 Patient 3 is no longer

under observation after t : 14 Thus, the second product term for i : 4 is

exp(bx)

Hexp(bxH);

exp(bx)exp(bx) ;Hexp(bxH) (13.4.8)

Similarly, we can construct each term in (13.4.6) and the partial likelihoodfunction

Using SAS, we obtain the results in Table 13.14 The AG model identiﬁestreatment and number of initial tumor as signiﬁcant covariates Comparedwith placebo, thiotepa does slow down tumor recurrence

Readers can construct the SAS codes for the AG model by using Table 13.13and by following the codes given in Example 13.6

Wei et al Model

By using a marginal approach, Wei, Lin, and Weissfeld (1989) proposed amodel, the WLW model, for the analysis of recurrent failures The failures may

be recurrences of the same kind of event or events of different natures,depending on how the stratification is defined If the strata are defined by the

Trang 7

times of repeated failures of the same type, similar to the strata defined in thePWP models, it can be used to analyze repeated failures of the same kind Thedifference between the PWP models and the WLW model is that the latterconsiders each event as a separate process and treats each stratum-specific(marginal) partial likelihood separately In the stratum-specific (marginal)

partial likelihood of stratum s, people who have experienced the (s9 1)thfailure contribute either one uncensored or one censored failure time depending

on whether or not they experience a recurrence in stratum s, and the other

subjects contribute only censored times(forced as censored times) Therefore,each stratum contains everyone in the study This is different from the PWPmodels, in which subjects who have not experienced the (s9 1)th failure are

not included in stratum s If the strata are deﬁned by the type of failure, the

WLW model acts like the competing risks model deﬁned in Section 13.3, andthe type-speciﬁc(marginal) partial likelihood for the jth type simply treats all failures of types other than j in the data as censored.

For the kth stratum of the ith person, the hazard function is assumed to

have the form

hIG(t) :YIG(t)hI(t) exp(bIxIG), t 0 (13.4.9) where YIG(t) :1, if the ith person in the kth stratum is under observation, 0, otherwise, hI(t) is an unspeciﬁed underlying hazard function Let RI(tIG) denote the risk set with people at risk at the ith distinct uncensored time tIG in the kth stratum Then the speciﬁc partial likelihood for the kth stratum is

otherwise The coefﬁcients bI are stratum speciﬁc In practice, if we are

interested in the overall effect of the covariates, we can assume that thecoefficients from different strata are equal(provided that there are no qualitat-ive differences among the strata), combine the strata and draw conclusionsabove the ‘‘average effect’’ of the covariates We again called the coefficients ofthese covariates common coefficients The event time is from the beginning ofthe study in this model

Similar to the PWP and AG models, the data must be arranged in a certainformat in order to use available software to carry out estimation of thecoefﬁcients and tests of signiﬁcance of the covariates Using the same data as

in Examples 13.6 and 13.7, the following example illustrates the terms in thestratum-speciﬁc likelihood function and the use of software

compo-nents in the stratum-speciﬁc likelihood function in(13.4.10) The format thedata have to be in for the available software, such as SAS, SPSS, and BMDP,

Trang 8

Table 13.15 Rearranged Data from Table 13.7 for Fitting WLW Model with

of the event time(censored or not, TR) In stratum 2(NR: 2), the three people(with ID: 4, 5, and 6) whose times to the second tumor recurrence areuncensored observations Patients 1 and 2 had censored time at 9 and 59,respectively Patient 3, who had no second recurrence and was observed until

14 months, is considered censored at 14 The other strata are constructed in asimilar manner Using the data arrangement in Table 13.15, we can see that forthe second stratum, the likelihood function in(13.4.10) has three terms, one foreach of persons 5, 6, and 4, whose

the risk set at time t: 16 has two individuals (ID : 4 and 2); for patient 5,

the risk set at time t: 12 contains ﬁve individuals (ID : 2, 3, 4, 5, and 6); and

for patient 6, the risk set at time t: 15 has three individuals (ID : 2, 4, and

6) Let xH be the covariate vector of the patient with ID:j in stratum 2; then

Trang 9

Fitting WLW Model with Common Coefﬁcients

in the average overall effect of the covariates, we combine T1—T4, N1—N4, and S1—S4 The rearranged data for the six patients are given in Table 13.16.

Trang 10

Table 13.17 Asymptotic Partial Likelihood Inference on the Bladder Cancer Data from the Fitted WLW Models with Stratum-Speciﬁc or Common Coefﬁcients

Model with Stratum-Specific Coefficients

recurrences The signs of the coefﬁcients for T1—T4 suggest that thiotepa may

slow down tumor growth, but the evidence is not statistically significant Themodel with common coefficients suggests that thiotepa is significantly moreeffective in prolonging the recurrence time The results suggest that whenlooking at each stratum independently, there is no strong evidence thatthiotepa is more effective than placebo However, the combined estimate of thecommon coefficient provides stronger evidence that thiotepa is more effectiveover the course of the study

In Cox’s proportional hazards model and other regression methods, a keyassumption is that observed survival or event times are independent However,

in many practical situations, failure times are observed from related individuals

Trang 11

or from successive recurrent events or failures of the same person For example,

in an epidemiological study of heart disease, some of the participants may befrom the same family and therefore are not independent These families with

multiple participants may be called clusters In this case, the regression

methods we introduced earlier may not be appropriate Several types of modelsintroduced especially for related observations are discussed by Andersen et al.(1993), Liang et al (1995), Klein and Moeschberger (1997), and Ibrahim et al.(2001) Details about these models are beyond the scope of this book In thefollowing, we introduce brieﬂy the frailty models

The frailty models assume that there is an unmeasured random variable

(frailty) in the hazard function This random variable accounts for the variation

or heterogeneity among individuals in a cluster It is also assumed that the

frailty is independent of censoring Let n be the total number of participants in the study, some of them related and forming clusters Let vG be the unknown random variable, frailty, associated with the ith cluster, 1 i n The frailty

model associated with the proportional hazards model can be written in terms

of the log hazard function as

log[hGH(t; xGH vG)] :log[h(t)] ;vG ;bxGH (13.5.1)for 1 j mG and 1 in, where b denotes the p;1 column vector of unknown regression coefﬁcients, xGH is the covariate vector of the jth person in the ith cluster, mG is the number of individuals in the ith cluster, and h(t) is an

unknown underlying hazard function Compared with the Cox proportional

hazards model, the difference here is the random effect vG Because vG remains the same in the ith cluster, the association between failure and covariates

within each cluster in this model is assumed to have a symmetric pattern In afamily study, this model can be used, for example, to model failure timesobserved from siblings by treating each family as a cluster This model wasproposed by Vaupel et al (1979) and developed and discussed by manyresearchers, including Clayton and Cuzick(1985) The main approach to this

model is to assume that vG follows a parametric distribution.The frailty model in(13.5.1) can be extended to handle more complicated

situations For example, the frailty can be a time-dependent variable [replace

vG by vG(t) in (13.5.1)] The frailty model with vG(t) can be used to model

successive or recurrent failure time as an alternative to the models in Section13.4 Another example is that there may be more than one type of frailty in

each cluster, and vG in (13.5.1) can be replaced by vG ;uG or vG;uG ;wG, and

so on

Inferences of these frailty models are also based on either a likelihoodfunction or a partial likelihood function Since the models involve a parametricdistribution, the likelihood or partial likelihood functions are complicated andare beyond the level of this book

The frailty models have not been used widely primarily because of the lack

of commercially available software There are some computer programs

Trang 12

available; for example, a SAS macro is available for a gamma frailty model atthe Web site of Klein and Moeschberger (1997), and another program isdescribed by Jenkins(1997).

Bibliographical Remarks

Most of the major references for nonproportional hazards models have been

cited in the text of this chapter Applications of these models include: stratiﬁed

models: Vasan et al.(1997), Aaronson et al (1997), and Yakovlev et al (1999);

frailty models: Yashin and Iachine(1997), Kessing et al (1999), Siegmund et al.(1999), Albert (2000), Lee and Yau (2001), Wienke el al (2001), and Xue (2001);

competing risks models: Mackenbach et al.(1995), Fish et al (1998), Albertsen

et al (1998), Blackstone and Lytle (2000), Yan et al (2000), and Tai et al.(2001)

EXERCISES

13.1 Consider the cancer-free times from the participants with IDs 15 to 23

in Table 13.1 Follow Example 13.1 to construct the partial likelihoodfunction based on the observed cancer-free times from these nine partici-pants

13.2 Consider the survival times from 30 resected melanoma patients in Table3.1 Let AGEG denote age group, AGEG: 1 if age 45 and AGEG : 2otherwise Fit the survival times with an AGEG-stratiﬁed Cox propor-tional hazards model with the covariates age, gender, initial stage, andtreatment received Discuss the association of the treatment received withthe survival time

13.3 Using the data in Table 12.4, following Example 13.5 and the samplecodes for SAS, SPSS, or BMDP, ﬁt the competing risk model for stroke,CHD, other CVD, or STROKE/CHD separately, and discuss the resultsobtained

13.4 Using the rearranged data in Tables 13.7 to 13.13 and followingExamples 13.6 to 13.8, complete construction of the remaining terms inthe partial likelihood function based on the PWP model(13.4.2), PWPgaptime model(13.4.3), and AG model (13.4.9), and the remaining threemarginal likelihood functions based on the WLW model(13.4.13)

Trang 13

C H A P T E R 14

Identiﬁcation of Risk Factors

Related to Dichotomous and

Polychotomous Outcomes

In biomedical research we are often interested in whether a certain related event will occur and the important factors that inﬂuence its occurrence.Such events may involve two or more possible outcomes; examples are thedevelopment of a given condition and response to a given treatment If thegiven condition is diabetes and we are only interested in whether someonedevelops the disease(yes or no), the outcome is binary or dichotomous If weare interested in whether the person develops impaired glucose tolerance,diabetes, or remains having normal glucose tolerance, there are three possible

survival-outcomes, or we say the outcome is trichotomous Similarly, response to a given treatment can have dichotomous (response or no response) or polychotomous

outcomes(complete response, partial response, or no response)

To determine whether one is likely to develop a given disease, we need toknow the important characteristics (or factors) related to its development.High- and low-risk groups can then be deﬁned accordingly Factors closely

related to the development of a given disease are usually called risk factors or

risk variables by epidemiologists We shall use these terms in a broader sense

to mean factors closely related to the occurrence of any event of interest Forexample, to ﬁnd out whether a woman will develop breast cancer because one

of her relatives did, we need to know whether a family history of breast cancer

is an important risk factor Therefore, we need to know the following:

1 Of age, race, family history of breast cancer, number of pregnancies,experience of breast-feeding, and use of oral contraceptives — which aremost important?

2 Can we predict, on the basis of the important risk factors, whether awoman will develop breast cancer or is more likely to develop breastcancer than another person?

377

Trang 14

In this chapter we introduce several methods for answering these tions The general approach is to relate various patient characteristics(or independent variables, or covariates) to the occurrence of an event(dependent or response variable) on the basis of data collected frompatients in each of the outcome groups In the case of dichotomous out-comes, there are two outcome groups For example, to relate variables such

ques-as age, race, and number of pregnancies to the development of breques-ast cancer,

we need to collect information about these variables from a group of breastcancer patients as well as from a group of healthy normal women For an eventwith polychotomous outcomes, we need to collect data from each outcomegroup

Often, a large number of patient characteristics deserve consideration.These characteristics may be demographic variables such as age; geneticvariables such as gene variant or phenotype; behavioral variables such assmoking or drinking behavior and use of estrogen or progesterone medic-ation; environmental variables such as exposure to sun, air pollution, oroccupational dust; or clinical variables such as blood cell counts, weight, andblood pressure The number of possible risk factors can be reduced throughmedical knowledge of the disease and careful examination of the possible riskfactors individually

In Section 14.1 we present two methods for examination of individualvariables One is to compare the distribution of each possible risk variableamong the outcome groups The other method is the chi-square test for acontingency table This test is particularly useful when the risk variables arecategorical: for example, dichotomous or trichotomous In this case, a 2;c or

r ;c contingency table can be set up and a chi-square test performed In

Section 14.2 we discuss logistic, conditional logistic, and other regressionmodels for binary responses and for examining the possible risk variablessimultaneously Models for multiple outcomes are discussed in Section 14.3

14.1.1 Comparing the Distributions of Risk Variables Among Groups

When the outcome is binary, it is often convenient to call an observation a

success or a failure Success may mean that a survival-related event occurred, and failure that it failed to occur Thus, a success may be a responding

patient, a patient who survives more than ﬁve years after surgery, or aperson who develops a given disease A failure may be a nonrespond-ing patient, a patient who dies within ﬁve years after surgery, or a personwho does not develop a given disease A preliminary examination of thedata can compare the distribution of the risk variables in the success andfailure groups This method is especially appropriate if the risk variable is

Trang 15

Table 14.1 Ages of 71 Leukemia Patients (Years)

Responders 20, 25, 26, 26, 27, 28, 28, 31, 33, 33, 36, 40, 40, 45, 45, 50, 50, 53 56,

62, 71, 74, 75, 77, 18, 19, 22, 26, 27, 28, 28, 28, 34, 37, 47, 56, 19 Nonresponders 27, 33, 34, 37, 43, 45, 45, 47, 48, 51, 52, 53, 57, 59, 59, 60, 60, 61, 61,

61, 63, 65, 71, 73, 73, 74, 80, 21, 28, 36, 55, 59, 62, 83

Source: Hart et al.(1977) Data used by permission of the author.

continuous If, for example, the risk factor x is weight and the dependent variable y is having cardiovascular disease, we may compare the weight

distribution of patients who have developed disease to that of disease-freepatients If the disease group has signiﬁcantly higher weights than those of thedisease-free group, we may consider weight an important risk factor Common-

ly used statistical methods for comparing two distributions are the t-test for

two independent samples if the assumption of normality holds and the

Mann—Whitney U-test if the normality assumption is violated and a

non-parametric test is preferred

Similarly, if there are more than two possible outcomes, we can use analysis

of variance or the Kruskal—Wallis nonparametric test to compare the multiple

distributions of a continuous variable The following example compares the agedistribution of responders with that of nonresponders in a cancer clinical trial

and 34 nonresponders (response is deﬁned as a complete response only) —given in Table 14.1 Figure 14.1 gives us the estimated age distributions of the

two groups By using the Mann—Whitney U-test (or Gehan’s generalizedWilcoxon test), we ﬁnd that the difference in age between responders andnonresponders is statistically signiﬁcant(p 0.01) In consequence, a questionmay arise as to what age is critical Can we say that patients under 50 mayhave a better chance of responding than do patients over 50? To answer thisquestion, one can dichotomize the age data and use the chi-square test,discussed next

14.1.2 Chi-Square Test and Odds Ratio

The chi-square test and the odds ratio are most appropriate when theindependent variable is categorical If the independent variable is dichotomous,

a 2;2 table can be used to represent the data Any variables that are notdichotomous can be made so (with a loss of some information) by choos-ing a cutoff point: for example, age less than 50 years For multiple-outcome events, 2;c or r;c tables can be constructed The independent

Trang 16

Figure 14.1 Age distribution of responders and nonresponders.

variables are then examined to ﬁnd which ones (in some sense) provide thebest risk associations with the dependent variable We ﬁrst consider binaryoutcomes and independent variables that have two categories; that is, we set

up a 2;2 contingency table similar to Table 14.2 for each independent variableand look for a high degree of proportionality

The ﬁrst step is to calculate the sample proportion of successes in the two

risk groups, a/C and b/C Further analysis of the table is concerned with the

precision of these proportions A standard chi-square test can be used

Trang 17

Table 14.2 General Setup of a 2 ;2 Contingency Table

Proportion of successes (success rate) a/C b/C

If the rates of success for the two groups E and E are exactly equal, the

expected number of patients in the ijth cell (ith row and jth column) is

since the overall success rate is R/N and there are C individuals in the E

group Similar expected numbers can be obtained for each of the four cells Let

OGH be the number of patients observed in the ijth cell Then the discrepancies

can be measured by the differences (OGH9EGH) In a rough sense, the greater the

discrepancies, the more evidence we have against the null hypothesis that the

success rates are the same for the two groups The chi-square test is based on

these discrepancies Let

Under the null hypothesis, X follows the chi-square distribution with 1 degree

of freedom(df) The hypothesis of equal success rates for groups E and E is

rejected if X ?, where ? is the 100 percentage point of the chi-square

distribution with 1 degree of freedom An alternative way to compute X is

X :(ad 9 bc)N

Trang 18

The odds ratio(Cornﬁeld, 1951) is a commonly used measure of association

in 2;2 tables The odds ratio (OR) is the ratio of two odds: the odds of success

when the risk factor is present and the odds of success when the risk factor isabsent In terms of probabilities,

OR:P(success E)/P(failure E)

P(success E)/P(failure E) (14.1.4)Using the notation in Table 14.2, P(success E) and P(failure E) may

be estimated by a/C and c/C, respectively Similarly, P(success E) and

P(failure E) may be estimated, respectively, by b/C and d/C Therefore,

the numerator and denominator of (14.1.4) may be estimated, respectively,by

a/C

c/C:

a c

and

b/C

d/C:

b d

Consequently, the OR may be estimated by

OR :a/c

b/d:ad

which is also referred to as the cross-product ratio.

Several methods are available for an interval estimate of OR: for example,Cornﬁeld (1956) and Woolf (1955) Cornﬁeld’s method, which requires aniterative procedure, is considered more accurate but more complicated thanWoolf’s method Woolf suggests using the logarithm of OR The standard error

of log OR may be estimated by

The conﬁdence interval for OR can be obtained by taking the antilog of the

conﬁdence limits for log OR If log OR3 and logOR* are the upper and lower

Trang 19

conﬁdence limits for log OR, elog OR3 and elog OR* are the upper and lowerconﬁdence limits for OR.

Notice that in(14.1.5), if b or c is zero, OR is undeﬁned If any one of thefour cell frequencies is zero, the estimated standard error in (14.1.6) is alsoundeﬁned Should this occur, some statisticians(Haldane, 1956; Fleiss, 1979,1981) suggest that 0.5 be added to each cell before using (14.1.5) and (14.1.6)

to solve the computational problem However, if the cell frequencies are assmall as zero, the addition of 0.5 to each cell will substantially affect theresulting estimate of OR and its standard error (Mantel, 1977; Miettinen,1979) The estimates so obtained must be interpreted with caution

An odds ratio of 1 indicates that the odds of success are the same whether

or not the risk factor is present An odds ratio greater than 1 means that theodds in favor of success is higher when the risk factor is present, and thereforethere is a positive association between the risk factor and success Similarly, anodds ratio of less than 1 signiﬁes a negative association between the risk factorand success The interpretation should not be based totally on the pointestimate A conﬁdence interval is always more meaningful, just as in any otherestimation procedure

The chi-square statistic in(14.1.2) may be used to test the null hypothesis

that there is no association between the risk factor and success, or H: OR:1.

The following example illustrates the chi-square test and odds ratio

(Example 14.1), age is considered one of the possible risk variables Thefollowing 2;2 table is constructed

Age 50 Age 50 Total

Trang 20

getting a X value of 10.16 if the two response rates are equal in the population

is less than 0.01 Hence the difference between the two response rates issigniﬁcant at the 1% level

The estimate odds ratio, according to(14.1.5), is

OR:(27)(22)(10)(12): 4.95

The data show that the odds in favor of response are almost ﬁve times higher

in patients under 50 years of age than in patients at least 50 years old Thedifference is signiﬁcantly different, as indicated by the chi-square test above

To obtain a conﬁdence interval for OR, we ﬁrst compute log OR : 1.60.The estimated standard error of log OR following (14.1.6) is

A 95% conﬁdence interval for log OR is 1.60< 1.96(0.515), or (0.59, 2.61), and

a 95% conﬁdence interval for OR is (e

interval may be due to the small cell frequencies Note that the standard error

of log OR is inversely related to the cell frequencies

In this example, the cutoff point, 50, was chosen arbitrarily It is often ofinterest to try more than one cutoff point if the number of observations in eachcell is not too small

There are cases where the independent variable has c 2 classes Thechi-square test can be extended to 2;c tables The odds ratio method can also

be extended to handle polychotomous independent variables It is done byselecting one of the classes as the reference class(the E group) and calculatingthe measure of association of each of the other classes relative to the reference

class For multiple-outcome events, the chi-square test can be extended to r ;c

tables The expected frequencies are computed just as in(14.1.1), and

compu-tation of X [chi-square distributed with (r 9 1)(c 9 1) degrees of freedom] is

the same as in(14.1.2) except that the sum is over all r ;c cells For details, see

Snedecor and Cocharan(1967, Sec 9.7) The following example illustrates theprocedures

patients, another possible risk variable is the marrow absolute leukemic

inﬁl-trate, which is deﬁned as the percentage of the total marrow that is either blast

cells or promyelocytes It is believed that patients should be classiﬁed into threeclasses:

in parentheses are expected frequencies For example, 18.68: (39)(34)/71

Trang 21

Marrow Absolute Inﬁltrate

The question is whether the difference in marrow absolute leukemic inﬁltrate is

related to response The value of X is

X :(49 8.34)

8.34 ;(129 7.66)

7.66 ; % ;(39 7.66)

7.66 : 10.17

The number of degrees of freedom is 39 1 : 2 With X : 10.17 and 2 degrees

of freedom, the probability that the three absolute infiltrate groups have thesame response rate is less than 0.01 The data suggest that patients with a highpercentage of marrow absolute infiltrate tend to have a high response rate.Marrow absolute infiltrate may be an important factor in predicting response.The OR

the reference class(or group) For example, for the90% class, the odds ratio

is 13;12/4;3 : 13 The 95% conﬁdence intervals for the ORs are obtainedusing(14.1.6) Although the odds ratio for the 46—90% group is larger than 1,

the 95% conﬁdence intervals covers 1 Therefore, the point estimate, 3.16,cannot be taken too seriously It appears that the major difference is betweenthe

Individual examination of each independent variable can provide only apreliminary idea of how important each variable is by itself The relativeimportance of all the variables has to be examined simultaneously usingmultivariate methods In the following section we discuss the linear logisticregression analysis

MODELS FOR DICHOTOMOUS RESPONSES

14.2.1 Logistic Regression Model for Prospective Studies

In a typical prospective study, a random sample of subjects is taken and thevalues of the independent variables are measured at a given time(usually called

baseline measurements) The subjects are then followed for a given period of

Trang 22

time and the outcome (dependent) variable is measured at the end of thefollow-up Therefore, for a prospective study, the independent variables areregarded as ﬁxed quantities during the follow-up, but the outcomes are randomand unknown The purpose of a prospective study is to examine the outcomesand relate them to the baseline measurements Examples of prospective studiesare cohort epidemiologic studies and clinical trials.

Suppose that there are n subjects and to some of whom the event of interest occurred They are called successes; the others are failures Let yG :1 if the ith subject is a success and yG : 0 if the ith subject is a failure Suppose that for each of the n subjects, p independent variables xG, xG, , xGN are measured.

These variables can be either qualitative, such as gender and race, or tative, such as blood pressure and white blood cell count The problem is to

quanti-relate the independent variables, xG, , xGN, to the dichotomous dependent variable yG.

Let PG be the probability of success, PG :P(yG :1 xG, , xGN), for the ith

subject The logistic regression model, proposed by Cox(1970) assumes thatthe dependence of the probability of success on independent variables is

PG: P(yG :1 xG): exp(NHbHxGH)

1; exp(NHbHxGH) (14.2.1)

and

19 PG:P(yG:0xG) :1; exp(NHbHxGH)1 (14.2.2)

where xG :(xG% xGN), xGY1, and bH are unknown coefﬁcients The logarithm

of the ratio of PG and 19PG is a simple linear function of the xGH’s.Let

G:log19 PG PG :HN bHxGH (14.2.3) G:log[PG/(1 9PG)] is called the logistic transform of PG and (14.2.3) is a linear

logistic model Another name for G is log odds Thus, the model relates the independent variables to the logistic transform of PG, or log odds The probability of success PG can then be found from (14.2.3) or (14.2.1) In many

ways(14.2.3) is the most useful analog for dichotomous response data of theordinary regression model for normally distributed data

To estimate the coefﬁcients bH’s, Cox suggests the maximum likelihood method Let y, y, , yL be observations with dichotomous values on n

subjects The likelihood function based on the binomial distribution contains afactor(14.2.1) whenever yG :1 and (14.2.2) whenever yG: 0 Thus, the likeli-

Trang 23

exp(NHbHtH) LG[1;exp(NHbHxGH)]

(14.2.4) where tH: LGxGHyG The log-likelihood function is

function in (14.2.5) can be obtained by solving the following p equations

HH Then the estimated inverse of the I matrix, I \, is the

asymptotic covariance matrix of the bH’s If we use the notation in Section 7.1

and let b : (b, b, , bN) denote the MLE of b, the estimated covariance

matrix of the MLE b

denotes the ijth element of V (b) or the ijth element of I\.

The coefficients so obtained indicate the relationships between the variablesand the log odds in favor of success For a continuous variable, the correspond-ing coefficient gives the change in the log odds for an increase of 1 unit in thevariable For a categorical variable, the coefficient is equal to the log odds ratio(see Section 14.1)

An approximate 100(19 )% conﬁdence interval for bH is

where Z? is the 100(19/2) percentile of the standard normal distribution.

Trang 24

To test the hypothesis that some of the bH’s are zero, a likelihood ratio test can be used For example, to test H:bH:0, the log-likelihood ratio test

statistic is

X*:92[l(b, b, , bH\, 0, bH>, , bN) 9l(b, b, b, , bN)]

(14.2.9)where the ﬁrst term is the maximized log-likelihood subject to the constraint

bH:0 If the hypothesis is true, X* is distributed asymptotically as chi-square

with 1 degree of freedom

An alternative test for the signiﬁcance of the coefﬁcients is the Wald test,

which can be written as

X5: b H

Under the null hypothesis that bH:0, X5 has an asymptotic chi-square

distribution with 1 degree of freedom Although the Wald test is used by many,

it is less powerful than the likelihood ratio test (Hauck and Donner, 1977;Jennings, 1986) In other words, the Wald test often leads the user to concludethat the coefficient(consequently, the respective risk factor) is not significantwhen, in fact, it is significant

Similar to earlier discussion of model selection, forward, backward, andstepwise variable selection methods can be used to select the risk factors thatare signiﬁcantly associated with a dichotomous response The independent

variables xGH in this model do not have to be the original variables They can

be any meaningful transforms of the original variables: for example, the

logarithm of the original variable, log xGH, and the deviation of the variable from its mean, xGH9 xH.From(14.2.1) and (14.2.2), the logarithm of the odds ratio for ith and kth

subjects is

log PG/(1 9PG) PI/(1 9PI):

square test, the Hosmer—Lemeshow(Hosmer and Lemeshow, 1980) test, a teststatistic suggested by Tsiatis (1980), and the score of Brown (1982) In the

following, we introduce the Hosmer—L emeshow test.

Trang 25

Let pG be the estimate of PG obtained from the ﬁtted logistic regression equation for the ith subject, i : 1, , n The pG’s can be arranged in ascending

order from smallest to largest Those probabilities and the corresponding

subjects are then divided into g groups according to some cutoff points of the probability For example, let g: 10 and the cutoff points of the probability be

equal to k/10, k: 1, 2, , 10 Thus, the ﬁrst group contains all subjects whoseestimated probabilities are less than or equal to 0.1, the second group containsall subjects whose estimated probabilities are less than or equal to 0.2, and so

on Let nI be the number of subjects in the kth group The estimated expected number of successes for the kth group is

EI: L I H pH k : 1, 2, , g The Hosmer—Lemeshow test statistic is deﬁned as

of freedom The test is basically a chi-square test of the discrepancy between

the observed and predicted frequencies of success Thus, a C value larger than

the 100 percentage point of the chi-square distribution (or p value less than

) indicates that the model is inadequate

Similar to other chi-square goodness-of-ﬁt tests, the approximation depends

on the estimated expected frequencies being reasonably large If a large number(say, far more than 20%) of the expected frequencies are less than 5, the

approximation may not be appropriate and the p value must be interpreted

carefully If this is the case, adjacent groups may be combined to increase theestimated expected frequencies However, Hosmer and Lemeshow warn that if

fewer than six groups are used to calculate C, the test would be insensitive and

would almost always indicate that the model is adequate

Most statistical software packages provide programs for logistic regressionanalysis: for example, SAS(procedures LOGISTIC, PHREG, and CATMOD),BMDP(procedures LR and PR), and SPSS (procedures NOMREG, PROBIT,PLUM, and LOGISTIC) Most of them provide estimates of the coefﬁcientsand test statistics, variable selection procedures, and tests of goodness of ﬁt

Trang 26

Example 14.4 In a study of 238 non-insulin-dependent diabetic patients,

10 covariates are considered possible risk factors for proteinuria(the outcomevariable) The logistic regression method is used to identify the most importantrisk factors and to predict the probability of proteinuria on the basis of theserisk factors The 10 potential risk factors are age, gender(1, male; 2, female),smoking status(0, no; 1, yes), percentage of ideal body mass index, hyperten-sion(0, no; 1, yes), use of insulin (0, no; 1, yes), glucose control (0, no; 1, yes),duration of diabetes mellitus (DM) in years, total cholesterol, and total

triglyceride Among the 238 patients, 69 have proteinuria ( yG:1).Using the stepwise procedure in BMDP, it is estimated that at step 1, themodel contains only b and b: 90.896 and l(b) in (14.2.5) is 9143.292 At

step 2, duration of diabetes is added to the model because its maximumlog-likelihood value is the largest among all the covariates The MLEs of the

two coefﬁcients are b :91.467 and b:90.055, and l(b,b):9139.429.

Since

X*:92[l(b) 9l(b, b)] : 7.726

which is signiﬁcant(p: 0.005), the duration of DM is related signiﬁcantly to

the chance of proteinuria The Hosmer—Lemeshow test statistic for goodness of

ﬁt with only duration of DM in the model, C: 9.814 with 8 degrees of

freedom, gives a p value of 0.278.

At step 3, gender is added to the model because its addition yields the largestmaximum log-likelihood value among all the remaining covariates The maxi-

mum log-likelihood value, l(b ,b,b) :9137.749, b:91.453, b:90.060, and b :90.279 To test if gender is signiﬁcantly related to proteinuria after

duration of DM, we perform the likelihood ratio test

X* :92[l(b, b) 9l(b, b, b)] :3.360

which is signiﬁcant at p: 0.067 The stepwise procedure terminates after thethird step because no other covariates are signiﬁcant enough to enter the

regression model; that is, none of the other covariates have a p value less than

0.15, which is set by the program (BMDP) If any covariate already in theregression becomes insigniﬁcant after some other variables are in, the insigniﬁ-

cant variable would be removed The p values for entering and removing a

variable can be determined by the user The default values for entering andremoving a variable are, respectively, 0.10 and 0.15 Thus, the procedureidentiﬁes duration of DM and gender as the two most important risk factorsbased on the data given A question that may be raised at this point is whetherone should include gender in the equation since its signiﬁcance level is largerthan the commonly used 0.05 The recommendation is to include it since it is

close to 0.05 and since the p value should not be the only basis for determining

whether a covariate should be included in the model In addition, the

Hosmer—Lemeshow goodness of ﬁt test statistic C: 5.036, when gender is

Trang 27

Table 14.3 Estimated Coefﬁcients for a Linear Logistic Regression Model Using Data from Diabetic Patients

Estimated Standard Variable Coefficient Error Coefficient/SE exp(coefficient)

included, yields a p value of 0.754 Thus, inclusion of this covariate improves

considerably the adequacy of the model Thus, the ﬁnal regression equationwith the two signiﬁcant risk factors is

log PG

19 PG : 91.453 ; 0.060 (duration of DM) 9 0.279 (gender)

Table 14.3 gives the details for the estimated coefﬁcients

The signs of the coefﬁcients indicate that male patients and patients with alonger duration of diabetes have a higher chance of proteinuria Furthermore,for each increase of one year in duration of diabetes, the log odds increase by0.060 Probabilities of proteinuria can be estimated following (14.2.1) Forexample, the probability of developing proteinuria for a male patient who hashad diabetes for 15 years is

P: e

where90.832 is obtained by substituting the values of the two covariates inthe ﬁtted equation; that is,91.453;0.060(15)90.279(1): 90.832 Similarly,for a female patient who has the same duration of diabetes, the probability is0.248

In addition to individual variables, interaction terms can be included in thelogistic regression model If the association between an independent variable

x and the dependent variable y is not the same in different levels of another

variable, x, there is interaction between x and x To check if there is interaction, one can include the product of x and x in the regression model

and test the signiﬁcance of this new variable The following example illustratesthe procedure

certain types of cancer It is also well known that adriamycin is highly toxic

Most statistical software packages provide programs for logistic regressionanalysis: for example, SAS(procedures LOGISTIC, PHREG, and CATMOD),BMDP(procedures... following (14.2.1) Forexample, the probability of developing proteinuria for a male patient who hashad diabetes for 15 years is

P: e

where90 .83 2 is obtained... class="page_container" data- page="26">

Example 14.4 In a study of 2 38 non-insulin-dependent diabetic patients,

10 covariates are considered possible risk factors for proteinuria(the

Định dạng
Số trang	54
Dung lượng	267,25 KB