Ebook methods in human growth research part 2

8 Parametric models for postnatal growth Roland C Hauspie Free University of Brussels and Luciano Molinari Kinderspital, Zürich Why model growth data? Growth can be considered as the process that mak[.]

Trang 1

postnatal growth

Roland C Hauspie

Free University of Brussels

and

Luciano Molinari

Kinderspital, Z¨urich

Why model growth data?

Growth can be considered as the process that makes children change in size and shape over time The dynamics of growth is best understood from the analysis

of longitudinal data, i.e from serial measurements taken at regular intervals on the same subject Table 8.1 gives an example of longitudinal growth data for height of a boy measured at birth and at each birthday thereafter up to the age of

18 years Such data usually form the basis to estimate the underlying process of growth, which is supposed to be continuous Recent analysis of frequent mea-surements of size (at daily or weekly intervals) with high-precision techniques (such as knemometry where measurement error is about 0.1 mm) has shown that the growth process is, at microlevel, not as smooth as we usually assume (Hermanussen, 1998; Lampl, 1999) However, we may readily assume that the growth process is continuous when we are dealing with measurements taken at yearly intervals, or even 3- to 6-monthly intervals, using classical anthropomet-ric techniques Various mathematical models have been proposed to estimate such a smooth growth curve on the basis of a set of discrete measurements of growth of the same subject over time (Marubini and Milani, 1986; Hauspie,

1989, 1998; Simondon et al., 1992; Bogin, 1999).

The main goals of mathematical modelling of longitudinal growth data are:

Methods in Human Growth Research, eds R C Hauspie, N Cameron and L Molinari.

Published by Cambridge University Press C Cambridge University Press 2004.

205

Trang 2

Table 8.1 Attained height (in cm) and yearly increments in

height (in cm/year) of a boy taken at birth and at each

subsequent birth date up to the age of 18 years; these data are used in the examples of Figures 8.1 to 8.7

r To estimate the continuous growth process from a set of discontinuous mea-sures of growth in order to obtain a smooth graphical representation of the growth curve

r To estimate growth between measurement occasions (in that sense, curve ﬁtting is an interpolation technique)

r To summarize the growth data by a limited number of constants or function parameters (therefore curve ﬁtting is also a data reduction technique)

r To estimate particular milestones of the growth process (the so-called bio-logical parameters) such as ﬁnal size or age, size and velocity at take-off and at peak velocity, which characterize the shape of the growth curve and usually form the basis for further analysis

r To estimate a smooth velocity curve representing instantaneous velocity (i.e

by taking the mathematical ﬁrst derivative of the ﬁtted curve)

r To estimate the ‘typical average’ curve in the population, such as the mean-constant curve in the case of structural growth models

Trang 3

Fitting a growth model consists of ﬁnding the set of function parameters that

yield the best-ﬁtting curve The best-ﬁtting curve is usually estimated by the

least-squares method, i.e the method that yields the curve with the smallest value for the sum of squared residuals (deviations of the observations from the ﬁtted curve) Other parameter estimation methods may be envisaged A thorough discussion of this topic can be found in Chapter 9

Non-structural versus structural models

Broadly speaking, we can subdivide growth models into structural (or para-metric) and non-structural models (Bock and Thissen, 1980) Non-structural models do not postulate a particular form of the growth curve They provide smoothing techniques suppressing measurement error and short-term

varia-tion Typical examples are polynomials and cubic splines (Largo et al., 1978).

Non-structural models:

r Do not postulate a particular form of the growth curve

r Usually have a large number of parameters with no biological interpretation

r Do not tend to an asymptotic value

r Are usually unstable in the extremities of the data range

r Are easy to ﬁt

Figure 8.1 shows the example of the fit of a 4th-degree (Figure 8.1a) and 9th-degree polynomial (Figure 8.1b) to the yearly increments in height of the boy shown in Table 8.1 The dashed line shows the subject’s yearly increments in height (i.e the differences between height measurements, one year apart) com-pared to the polynomial fits It is obvious that the 4th-degree polynomial (with five parameters) does not adequately describe the yearly increments in height The 9th-degree polynomial (with ten parameters) performs much better, but still cannot correctly describe the maximum increment in height Polynomials can be considered as inadequate to fit growth data over wide age ranges, but can be used to fit growth over reasonably short-term intervals (a few years) The flexibility of polynomials can be much improved by approaches like smoothing splines, which consist of series of lower-order polynomials (3rd-degree or cubic, for example) that are fitted over only a small range of ages, and which are connected by constraints of continuity, i.e equality of the first and second derivative at the points of transition or ‘knots’ These models give considerable better fits than higher-order polynomials and are better able to model local variations in the growth pattern such as the mid-growth spurt or the decrease in velocity prior to the adolescent growth spurt (Goldstein, 1984) The subjective element in this approach is the determination of the number and

Trang 4

5

10

15

20

25

30

Age, years

(a)

y=a0 +a t1 +a t2 +a t3 +a t4

0 5 10 15 20 25 30

Age, years

(b)

Figure 8.1 Yearly increments in height of the boy whose growth data are shown in

Table 8.1 (a) Fit of 4th degree polynomial, (b) ﬁt of 9th degree polynomial y=

increment, t = age in years, a0to a kthe function parameters (After Hauspie and Chrzastek-Spruch, 1999.)

position of the knots in the ﬁtting procedure Other most useful developments

of non-structural approaches in modelling growth are discussed in Chapter 7 Structural or parametric models:

r Imply a basic functional form of the growth model

r Usually have fewer parameters that allow some functional/biological inter-pretation

r Usually tend to an upper asymptote (ﬁnal size)

Structural or parametric models sometimes impose too rigid a shape on the growth data, which may result in slight, but systematic, bias They also require more sophisticated curve-ﬁtting techniques, but most statistical and several graphical software packages nowadays offer the possibility of non-linear regres-sion analysis of user-deﬁned functions The estimation algorithms may vary from one type of software to another, but they are all based on iterative numeric minimization techniques and require more or less rough guesses of the values

of the function parameters to be estimated, the so-called starting values Good starting values will often allow an iterative technique to converge to a solution more quickly

Reaching convergence means that the numeric minimization procedure has found a minimum in the multidimensional plane of the sum of squared devia-tions However, it may occur that the process has encountered a local minimum but has not yet reached the absolute true minimum, which then leads to a

mis-ﬁt of the data The occurrence of local minima in the estimation procedures

is intrinsic to non-linear regression and does not depend on the minimization algorithm, but rather on the functional form of the growth model The risk of

Trang 5

reaching convergence at a local minimum is greater if the starting values are badly chosen, the scatter in the data is large (usually not a problem in longitu-dinal data except for outliers due to erroneous measurements), and the range

of the data is insufﬁcient for the model at hand A robust model is one that has few or no local minima near the real minimum and hence is not too sensitive towards the choice of starting values To test for the presence of a false min-imum, one can run the curve-ﬁtting procedure with different sets of starting values of the parameters If they all converge to nearly the same solution, then

it is very likely that you have found the true minimum If one set of starting values results in a substantially lower sum of squares, then you should keep them as the new starting values and repeat the procedure since it is likely that you are now nearer to the true minimum (see Chapter 9 for a more extensive discussion of numerical minimization techniques)

Usually, the population means for the function parameters can serve as start-ing values for these numerical minimization techniques although individual adjustments are sometimes required When studying a new population, one can utilize starting values taken from the literature Tables 8.2 and 8.3 provide sets

of starting values for the models discussed in this chapter They will not neces-sarily be suitable to ﬁt all growth curves in a speciﬁc population, but they can

be used in a preliminary analysis of speciﬁc data A more optimal set of starting values can then be obtained from the means of the successful ﬁts

Most structural growth models are monotonously increasing functions and are therefore in the first place designed to describe growth of skeletal dimensions for which, strictly speaking, we have only positive growth For this reason, structural models are not suitable for traits such as body weight, body mass index and skinfolds, for instance The latter traits may show negative as well as positive growth, and the general shape of the growth pattern of those traits usually does not match the functional form of the models Non-structural approaches are more apt to fit those traits Most structural models designed to describe adolescent growth tend to an upper asymptote (final size) towards the end of the growth phase, and also allow for an adolescent spurt Therefore, they are suitable for postcranial skeletal dimensions (length and width measurements

of the body), but perform badly for measurements of the head and face, which have virtually no adolescent growth spurt

Growth in infancy and childhood

A long time ago, Jenss and Bayley (1937) proposed a four-parameter non-linear model which ﬁts satisfactorily growth data from birth to 8 years The formulation of the Jenss curve is as follows:

y = a + bt − e c +dt

Trang 6

50

60

70

80

90

100

110

120

130

Age, months

Jenss-Bayley

a= 79.85

b = 0.5043

c= 3.427

d= -0.09687

40 50 60 70 80 90 100 110 120 130

Age, months

Count

a= 48.88

b = 0.3781

c= 9.395

Figure 8.2 Fit of Jenss–Bayley model and Count model to the data of growth in height from birth to 8 years of age (Table 8.1).

where y is size, t is age, and a, b, c and d are the four function parameters The model has a linear component (a + bt) in which the parameter b determines the

childhood growth velocity and an exponential component (ec +dt), determining

the decreasing growth rate shortly after birth

The Jenss curve has been successfully applied by Deming and Washburn (1963), Manwani and Agarwal (1973), Berkey (1982), and several others The model is suitable for describing growth of body length and of various dimensions

of the head (typical head circumference) during infancy and early childhood

It has often been used to ﬁt weight data as well, despite the problems that may arise when growth in weight is not monotonously increasing or has an irregular pattern, which often occurs shortly after birth

Another model that ﬁts early childhood data fairly well is the three-parameter

model proposed by Count (1942, 1943), slightly modiﬁed by Livshits et al.

(2000) in order to allow the inclusion of birth data:

y = a + bt + c ln(t + 1)

where y is size, t is age, and a, b and c are the three function parameters.

Figure 8.2 shows the height data of Table 8.1 for ages 0 to 8 years with Jenss– Bayley and Count curve ﬁttings For the purpose of ﬁtting those two models,

it is better to express the ages in months The estimates of the parameters for the respective fits are shown in the figures At first glance, both models seem to describe adequately body length during the first 8 years of postnatal life, although visual inspection of the graphs shows a slightly better fit of the Jenss–Bayley curve The residual standard deviation (RSD) is 0.48 cm for the

Trang 7

Table 8.2 Starting values for ﬁtting the Jenss–Bayley model and the Count model to growth of body length, weight and head circumference with numeric minimization algorithms (when age is expressed in months)

Parameter Length (cm) Weight (kg)

Head circumference (cm) Length (cm) Weight (kg)

Head circumference (cm)

Jenss–Bayley curve and 0.93 cm for the Count curve Both models are fairly robust towards the choice of starting values Table 8.2 gives a set of starting values for body length (in cm), body weight (in kg) and head circumference (in cm) when the age is expressed in months The sex differences in growth are much smaller than the normal variations in growth during infancy so that a single set of starting values sufﬁces for both genders

Berkey (1982) compared the reliability, efficiency, precision and goodness-of-fit of the Count and Jenss–Bayley models and concluded that the latter model fitted the growth data better than Count’s model, especially prior to 1 year of age Berkey and Reed (1973) have greatly enhanced the flexibility of the Count function by adding one or more deceleration terms They proposed the following two functions:

Reed 1st order y = a + bt + c ln(t) + d

t

Reed 2nd order y = a + bt + c ln(t) + d1

where y is size, t is age, and a, b, c, d and a, b, c, d1, d2are the parameters The Reed models can accommodate one or more inﬂection points (depend-ing on the number of reciprocal terms), allow(depend-ing the description of one or more periods of growth acceleration and thus ﬁtting a wider variety of both normal and abnormal growth patterns in early childhood However, if birth

is included, then chronological age since birth cannot be used, and an alter-native age scale has to be chosen Berkey and Reed (1973) suggest the age

transformation t = (months since birth + 9)/9, which assigns t = 0 at con-ception and t= 1 at birth They showed that the four-parameter Reed model provided signiﬁcantly better overall ﬁts than the Jenss–Bayley model, which has also four parameters Moreover, by the fact that the Reed models are linear

Trang 8

in their constants, they can be ﬁtted by simpler statistical methods than the

non-linear Jenss–Bayley curve Simondon et al (1992) made an interesting

com-parison of ﬁve growth models to ﬁt weight data between birth and 13 months of age

Growth at adolescence

Logistic and Gompertz functions

The ﬁrst attempts to ﬁt the adolescent growth cycle were made by using the logistic and the Gompertz function These models are special cases of the generalized logistic model (Nelder, 1961) of which the differential equation integrates to:

y = K (1 + ce −bt)1/(1−m) for m > 1

For m > 1, this curve has an S-shape with a lower and upper asymptote, equal to zero and K, and one point of inﬂection Parameter b is a rate constant, determining the spread of the curve along the time axis, while parameter c is

an integration constant For the purpose of ﬁtting the adolescent growth cycle,

the lower asymptote is set different from zero by adding a constant P.

For m= 2, the generalized logistic leads to the autocatalytic or logistic curve, which can, after reparameterization, be written in the form:

1+ ea −bt

where y is size, t is age, and with P, K, a = log(c) and b as stated above In the

logistic model, relative growth rate (growth velocity divided by size) declines linearly with size Hence, the curve is symmetrical around its inﬂection point

yI= P + K/2 at tI= a/b (age at peak velocity) with maximal peak velocity given by bK /4.

For m= 1, the generalized logistic equation breaks down, but it can be shown

that for m→ 1, the model leads to the Gompertz curve, in which relative growth rate declines exponentially with size:

y = P + K e−ea −bt

where y is size and t is age The Gompertz curve is asymmetrical around its point of inﬂection: yI = P + K/e ≈ P + 0.37K, at tI = a/b (age at maxi-mal velocity) with maximaxi-mal velocity given by bK /e In both, the logistic and

Gompertz function, the inﬂection point is functionally related to the amount of adolescent growth (respectively 50% and±37%)

Trang 9

The logistic and Gompertz functions were used to ﬁt the adolescent growth

data of several body dimensions (Deming, 1957; Marubini et al., 1971, 1972; Tanner et al., 1976) In a longitudinal study of 35 Belgian girls (Hauspie et al.,

1980), it was shown that both models fit adolescent data well with pooled residual variances of 0.45 cm2for the logistic and 0.61 cm2for the Gompertz function (total number of degrees of freedom 110) Nevertheless, Wilcoxon’s signed rank test revealed significantly better fits with the logistic than with the

Gompertz function (P <0.05) However, none of the derived biological variables

differed signiﬁcantly between the two models

The major drawback of both models is that the lower age bound of the data to

be ﬁtted (i.e the cut-off point between the prepubertal and adolescent growth cycle) has to be determined arbitrarily for each individual This cut-off point is usually taken as the age at minimal prepubertal growth velocity (age at take-off), obtained through a graphical inspection of a plot of the yearly increments

in function of age This procedure is, in practice, not always so easy and may lead to subjective decisions Nevertheless, errors in assessing the take-off point

of less than 1 year should not greatly affect the estimates of age and velocity

at the peak of the growth spurt (Marubini, 1978; Hauspie, 1981) Due to these drawbacks and inconveniences both the logistic and Gompertz functions have been abandoned as ways to ﬁt adolescent growth, but they have proved to

be still very useful as elements of more complex models describing wider age ranges and for which the graphical determination of the age at take-off is not required

Preece–Baines model 1 (PB1)

Preece and Baines (1978) proposed the following multiplicative exponential-logistic model:

es0(t −θ)+ es1(t −θ)

where y is size, t is age, and h1, h θ , s0, s1andθ are the ﬁve function parameters Adult size is given by parameter h1 Parameterθ locates the adolescent growth

spurt along the time axis This parameter is highly correlated with age at peak

velocity h θ is the size at age θ The parameters s0 and s1 are growth-rate constants, related to prepubertal and pubertal velocity

This ﬁve-parameter model is designed to ﬁt the adolescent growth cycle starting from childhood The success with which the model describes adolescent growth depends, among other things, on how much childhood data is included

in the ﬁtting procedure It was shown that the lower limit of the age range

Trang 10

Figure 8.3 Plot of Preece–Baines model 1 (PB1) for the longitudinal data of the boy whose growth data are given in Table 8.1 RSD, residual standard deviation For deﬁnitions of parameters, see text.

should not be under 2 years of age, and that the ﬁt of the adolescent growth cycle is substantially better if the age range includes data from not more than a

few years before the age at take-off (Hauspie et al., 1980) Therefore, the PB1

model should essentially be considered as a model to ﬁt the adolescent growth cycle from before take-off up to adulthood

Figure 8.3 shows the results of the PB1 function, fitted to the data of Table 8.1 including measurements from age 2 years onwards The values of the function parameters for this particular fit are shown in the figure The upper part of the graph shows the PB1 fit plotted onto the raw measurements for

Tiêu đề	Parametric Models for Postnatal Growth
Tác giả	Roland C. Hauspie, Luciano Molinari
Trường học	Free University of Brussels
Chuyên ngành	Human Growth Research
Thể loại	book chapter
Năm xuất bản	2004
Thành phố	Cambridge

Định dạng
Số trang	20
Dung lượng	357,58 KB