D-Optimal designs D-optimal designs are often used when classical designs do not apply or work D-optimal designs are one form of design provided by a computer algorithm.. You start with
Trang 25 Process Improvement
5.5 Advanced topics
5.5.2 What is a computer-aided design?
5.5.2.1 D-Optimal designs
D-optimal
designs are
often used
when
classical
designs do
not apply or
work
D-optimal designs are one form of design provided by a computer algorithm These types of computer-aided designs are particularly useful when classical designs do not apply.
Unlike standard classical designs such as factorials and fractional factorials, D-optimal design matrices are usually not orthogonal and effect estimates are correlated.
These designs
are always
an option
regardless of
model or
resolution
desired
These types of designs are always an option regardless of the type of model the experimenter wishes to fit (for example, first order, first order plus some interactions, full quadratic, cubic, etc.) or the objective specified for the experiment (for example, screening, response surface, etc.) D-optimal designs are straight optimizations based on a chosen optimality criterion and the model that will be fit The optimality criterion used in generating D-optimal designs is one of maximizing
|X'X|, the determinant of the information matrix X'X.
You start
with a
candidate set
of runs and
the algorithm
chooses a
D-optimal set
of design
runs
This optimality criterion results in minimizing the generalized variance
of the parameter estimates for a pre-specified model As a result, the 'optimality' of a given D-optimal design is model dependent That is, the experimenter must specify a model for the design before a
computer can generate the specific treatment combinations Given the total number of treatment runs for an experiment and a specified model, the computer algorithm chooses the optimal set of design runs
from a candidate set of possible design treatment runs This candidate
set of treatment runs usually consists of all possible combinations of various factor levels that one wishes to use in the experiment.
In other words, the candidate set is a collection of treatment 5.5.2.1 D-Optimal designs
Trang 3the set of treatment runs.
No guarantee Note: There is no guarantee that the design the computer generates is
actually D-optimal.
D-optimal
designs are
particularly
useful when
resources are
limited or
there are
constraints
on factor
settings
The reasons for using D-optimal designs instead of standard classical designs generally fall into two categories:
standard factorial or fractional factorial designs require too many runs for the amount of resources or time allowed for the
experiment
1
the design space is constrained (the process space contains factor settings that are not feasible or are impossible to run).
2
Industrial
example
demostrated
with JMP
software
Industrial examples of these two situations are given below and the process flow of how to generate and analyze these types of designs is also given The software package used to demonstrate this is JMP version 3.2 The flow presented below in generating the design is the flow that is specified in the JMP Help screens under its D-optimal platform.
Example of
D-optimal
design:
problem
setup
Suppose there are 3 design variables (k = 3) and engineering judgment
specifies the following model as appropriate for the process under investigation
The levels being considered by the researcher are (coded)
X1: 5 levels (-1, -0.5, 0, 0.5, 1) X2: 2 levels (-1, 1)
X3: 2 levels (-1, 1)
One design objective, due to resource limitations, is to use n = 12
design points.
5.5.2.1 D-Optimal designs
http://www.itl.nist.gov/div898/handbook/pri/section5/pri521.htm (2 of 5) [5/1/2006 10:31:03 AM]
Trang 4Create the
candidate set
Given the above experimental specifications, the first thing to do toward generating the design is to create the candidate set The candidate set is a data table with a row for each point (run) you want considered for your design This is often a full factorial You can create
a candidate set in JMP by using the Full Factorial design given by the Design Experiment command in the Tables menu The candidate set for this example is shown below Since the candidate set is a full factorial in all factors, the candidate set contains (5)*(2)*(2) = 20 possible design runs.
Table
containing
the candidate
set
TABLE 5.1 Candidate Set for Variables X1, X2, X3
5.5.2.1 D-Optimal designs
Trang 5Specify (and
run) the
model in the
Fit Model
dialog
Once the candidate set has been created, specify the model you want in the Fit Model dialog Do not give a response term for the model! Select D-Optimal as the fitting personality in the pop-up menu at the bottom
of the dialog Click Run Model and use the control panel that appears Enter the number of runs you want in your design (N=12 in this
example) You can also edit other options available in the control panel This control panel and the editable options are shown in the table below These other options refer to the number of points chosen
at random at the start of an excursion or trip (N Random), the number
of worst points at each K-exchange step or iteration (K-value), and the number of times to repeat the search (Trips) Click Go.
For this example, the table below shows how these options were set
and the reported efficiency values are relative to the best design found.
Table
showing JMP
D-optimal
control panel
and efficiency
report
D-Optimal Control Panel Optimal Design Controls
Best Design
The
algorithm
computes
efficiency
numbers to
zero in on a
D-optimal
design
The four line efficiency report given after each search shows the best design over all the excursions (trips) D-efficiency is the objective, which is a volume criterion on the generalized variance of the estimates The efficiency of the standard fractional factorial is 100%, but this is not possible when pure quadratic terms such as (X1)2 are included in the model.
The efficiency values are a function of the number of points in the design, the number of independent variables in the model, and the maximum standard error for prediction over the design points The best design is the one with the highest D-efficiency The A-efficiencies and G-efficiencies help choose an optimal design when multiple excursions produce alternatives with similar D-efficiency.
5.5.2.1 D-Optimal designs
http://www.itl.nist.gov/div898/handbook/pri/section5/pri521.htm (4 of 5) [5/1/2006 10:31:03 AM]
Trang 6Using several
excursions
(or trips)
recommended
The search for a D-optimal design should be made using several excursions or trips In each trip, JMP 3.2 chooses a different set of random seed points, which can possibly lead to different designs The Save button saves the best design found The standard error of
prediction is also saved under the variable OptStdPred in the table.
The selected
design should
be
randomized
The D-optimal design using 12 runs that JMP 3.2 created is listed below in standard order The design runs should be randomized before the treatment combinations are executed.
Table
showing the
D-optimal
design
selected by
the JMP
software
TABLE 5.2 Final D-optimal Design
Parameter
estimates are
usually
correlated
To see the correlations of the parameter estimates for the best design found, you can click on the Correlations button in the D-optimal Search Control Panel In most D-optimal designs, the correlations among the estimates are non-zero However, in this particular example, the correlations are zero.
Other
software may
generate a
different
D-optimal
design
Note: Other software packages (or even other releases of JMP) may
have different procedures for generating D-optimal designs - the above example is a highly software dependent illustration of how to generate
a D-optimal design.
5.5.2.1 D-Optimal designs
Trang 75.5.2.2 Repairing a design
http://www.itl.nist.gov/div898/handbook/pri/section5/pri522.htm (2 of 2) [5/1/2006 10:31:03 AM]
Trang 8(sampling
variability)
affects the
final
answers and
should be
taken into
account
The response models are fit from experimental data that usually contain random variability due to uncontrollable or unknown causes This implies that an experiment, if repeated, will result in
a different fitted response surface model that might lead to different optimal operating conditions Therefore, sampling variability should be considered in experimental optimization.
In contrast, in classical optimization techniques the functions are deterministic and given.
2
Optimization
process
requires
input of the
experimenter
The fitted responses are local approximations, implying that the optimization process requires the input of the experimenter (a person familiar with the process) This is in contrast with classical optimization which is always automated in the form of some computer algorithm.
3
5.5.3 How do you optimize a process?
Trang 95 Process Improvement
5.5 Advanced topics
5.5.3 How do you optimize a process?
5.5.3.1 Single response case
5.5.3.1.1 Single response: Path of steepest ascent
Starting at
the current
operating
conditions, fit
a linear
model
If experimentation is initially performed in a new, poorly understood production process,
chances are that the initial operating conditions X1, X2, ,X k are located far from the region
where the factors achieve a maximum or minimum for the response of interest, Y A first-order
model will serve as a good local approximation in a small region close to the initial operating conditions and far from where the process exhibits curvature Therefore, it makes sense to fit a simple first-order (or linear polynomial) model of the form:
Experimental strategies for fitting this type of model were discussed earlier Usually, a 2k-p
fractional factorial experiment is conducted with repeated runs at the current operating conditions (which serve as the origin of coordinates in orthogonally coded factors)
Determine the
directions of
steepest
ascent and
continue
experimenting
until no
further
improvement
occurs - then
iterate the
process
The idea behind "Phase I" is to keep experimenting along the direction of steepest ascent (or descent, as required) until there is no further improvement in the response At that point, a new fractional factorial experiment with center runs is conducted to determine a new search
direction This process is repeated until at some point significant curvature in is detected
This implies that the operating conditions X1, X2, ,X k are close to where the maximum (or
minimum, as required) of Y occurs When significant curvature, or lack of fit, is detected, the
experimenter should proceed with "Phase II" Figure 5.2 illustrates a sequence of line searches
when seeking a region where curvature exists in a problem with 2 factors (i.e., k=2).
FIGURE 5.2: A Sequence of Line Searches for a 2-Factor Optimization Problem
5.5.3.1.1 Single response: Path of steepest ascent
http://www.itl.nist.gov/div898/handbook/pri/section5/pri5311.htm (1 of 6) [5/1/2006 10:31:05 AM]
Trang 10Two main
decisions:
search
direction and
length of step
There are two main decisions an engineer must make in Phase I:
determine the search direction;
1
determine the length of the step to move from the current operating conditions
2
Figure 5.3 shows a flow diagram of the different iterative tasks required in Phase I This diagram is intended as a guideline and should not be automated in such a way that the experimenter has no input in the optimization process
Flow chart of
iterative
search
process
5.5.3.1.1 Single response: Path of steepest ascent
Trang 11Procedure for Finding the Direction of Maximum Improvement
The direction
of steepest
ascent is
determined by
the gradient
of the fitted
model
Suppose a first-order model (like above) has been fit and provides a useful approximation As long as lack of fit (due to pure quadratic curvature and interactions) is very small compared to the main effects, steepest ascent can be attempted To determine the direction of maximum improvement we use
the estimated direction of steepest ascent, given by the gradient of , if the objective is
to maximize Y;
1
the estimated direction of steepest descent, given by the negative of the gradient of , if
the objective is to minimize Y.
2
The direction
of steepest
ascent
depends on
the scaling
convention
-equal
variance
scaling is
recommended
The direction of the gradient, g, is given by the values of the parameter estimates, that is, g' =
(b1, b2, , b k ) Since the parameter estimates b1, b2, , b k depend on the scaling convention for the factors, the steepest ascent (descent) direction is also scale dependent That is, two
experimenters using different scaling conventions will follow different paths for process improvement This does not diminish the general validity of the method since the region of the search, as given by the signs of the parameter estimates, does not change with scale An equal
variance scaling convention, however, is recommended The coded factors x i, in terms of the
factors in the original units of measurement, X i, are obtained from the relation
This coding convention is recommended since it provides parameter estimates that are scale independent, generally leading to a more reliable search direction The coordinates of the factor settings in the direction of steepest ascent, positioned a distance from the origin, are given by:
Solution is a
simple
equation
This problem can be solved with the aid of an optimization solver (e.g., like the solver option
of a spreadsheet) However, in this case this is not really needed, as the solution is a simple equation that yields the coordinates
Equation can
be computed
for increasing
values of
An engineer can compute this equation for different increasing values of and obtain different factor settings, all on the steepest ascent direction
To see the details that explain this equation, see Technical Appendix 5A
Example: Optimization of a Chemical Process
5.5.3.1.1 Single response: Path of steepest ascent
http://www.itl.nist.gov/div898/handbook/pri/section5/pri5311.htm (3 of 6) [5/1/2006 10:31:05 AM]
Trang 12by search
example
It has been concluded (perhaps after a factor screening experiment) that the yield (Y, in %) of a
chemical process is mainly affected by the temperature (X 1, in C) and by the reaction time
(X 2, in minutes) Due to safety reasons, the region of operation is limited to
Factor levels The process is currently run at a temperature of 200 C and a reaction time of 200 minutes A
process engineer decides to run a 22 full factorial experiment with factor levels at
factor low center high
Orthogonally
coded factors
Five repeated runs at the center levels are conducted to assess lack of fit The orthogonally coded factors are
Experimental
results
The experimental results were:
x 1 x 2 X 1 X 2 Y (= yield)
ANOVA table The corresponding ANOVA table for a first-order polynomial model, obtained using the
DESIGN EASE statistical software, is
SUM OF MEAN F SOURCE SQUARES DF SQUARE VALUE PROB>F
MODEL 503.3035 2 251.6517 4.810 0.0684 CURVATURE 8.1536 1 8.1536 0.1558 0.7093 RESIDUAL 261.5935 5 52.3187
LACK OF FIT 37.6382 1 37.6382 0.6722 0.4583 5.5.3.1.1 Single response: Path of steepest ascent