Tilburg University
Maximizing the simulation output
Kleijnen, J.P.C.; Pala, O.
Published in:
Simulation: Technical journal of the Society for Computer Simulation
Publication date:
1999
Document Version
Publisher's PDF, also known as Version of record Link to publication in Tilburg University Research Portal
Citation for published version (APA):
Kleijnen, J. P. C., & Pala, O. (1999). Maximizing the simulation output: A competition. Simulation: Technical journal of the Society for Computer Simulation, 73(3), 168-173.
http://sim.sagepub.com/
SIMULATION
http://sim.sagepub.com/content/73/3/168
The online version of this article can be found at:
DOI: 10.1177/003754979907300304
1999 73: 168
SIMULATION
Jack P. C. Kleijnen and Özge Pala
Maximizing the Simulation Output: A Competition
Published by:
http://www.sagepublications.com
On behalf of:
Society for Modeling and Simulation International (SCS)
can be found at:
SIMULATION
Additional services and information for
168
TECHNICAL ARTICLE
Maximizing
the
Simulation
Output:
A
Competition
Jack
P.C.
Kleijnen
and
Özge
Pala
Department
of InformationSystems (BIK)/Center
for Economic Research(CentER)
School of
Management
and Economics(FEW),
Tilburg
University (KUB)
Postbox 90153, 5000 LE
Tilburg,
The NetherlandsE-mail:
kleijnen@kub.nl;
O.Pala@kub.nlThe following
competition
wasorganized by
the Business Sectionof
the NetherlandsSociety for
Statistics and
Operations
Research (VVS): maxi-mize theoutput
of a given
simulation modelby
selecting
the best combinationof
sixinputs;
only
32 runs arepermitted.
Twelve teamscompeted;
these teams
came from industry
and academia. This paper is writtenby
thewinning
team,ex-plaining
itsdesign
andanalysis.
Thatdesign
pro-ceeded instages.
First,
aspecial design
was usedto estimate all main
effects
andtwo-factor
inter-actions(namely,
Rechtschaffner’s
saturatedde-sign).
Thenquadratic effects
were estimatedby
changing factors
one at a time.Finally,
there-sulting
estimated second-orderpolynomial
wasused to
estimate
theoptimal input
combination.The paper
presents a combination
of design
of
ex-periment techniques
and
common
sense that mayhave
more
applications in solving
real1. Introduction: The
Competition Explained
The
following problem
was defined in the VVSBulle-tin
(November
1997, pages150-151;
December 1997,pages
162-163).
The translation from theoriginal
Dutch text into
English
is ours.&dquo;Optimize
your ownoutput!
You havedevel-oped
an advancedcomputer
model thatcom-putes
theoutput
of thesynthesis
of zeolite ongauze
pads,
forgiven
values of thefollowing
Rules of the game:
1.
[Given
is thefollowing
table.]
2. We
[the
organizers
of thecompetition]
wille-mail you a similar
list,
including
thecorre-sponding
output.
Note:
Of course, the table above is
only
anexample,
in which
only
the factorsA, B,
and C werevaried. You are
permitted
to vary more factors or fewer factors aslong
as you indicate foreach of the six factors how you wish to set its value. In the
example eight
runs were offered.So 24 runs remain for new
experiments.
You
yourself
determine how you willspread
the 32 runs over theexperiments,
e.g., oneex-periment
with 32 runs, twoexperiments
with16 runs, one
experiment
of 16 runs and two ofeight
runs, etc....You can
register
no later than 5January
1998 ...&dquo;
At the start of our
search,
this was all we knewabout the
problem!
In otherwords,
we had noinforma-tion on the process
itself,
the ranges of itsinputs
orfactors,
say,zj
with j
= 1, ..., 6, etc. We did know oneinput
combination and itsresulting
output;
we callthis latter run
the free
base run.(The
initial estimateswill turn out to be poor, which is a realistic
situation.)
We
organize
thisreport
on our search as follows:Section 2. Solution
Strategy
SelectedSection 3. Rechtschaffner’s Saturated R-5
Design
Section 4.
Quadratic
Effects: One-at-a-timeDe-sign
Section 5.
Re-estimating
theOptimal
Combina-tion
Section 6. Conclusions
Section 7.
Epilogue
Section 8. Final Comments on the
Competition
Appendix.
All 33 Runs withInputs
andOutputs
2. Solution
Strategy
SelectedAny
simulation modelimplies
aninput/output
(I/O)
function or response surface. Since the simulation
model of this
competition
represents
a chemicalsys-tem, we assume that interactions among the six
fac-tors are
important.
Moreover,
it concerns a maximiza-tionproblem,
so we assume thatquadratic
effects areimportant.
Therefore weapproximate
the I/Ofunc-tion
by
asecond-degree polynomial
over the whole area ofexperimentation.
Thispolynomial
has 28param-eters : one overall mean or
intercept,
say,/30’
six main or first-order effects(3j ,
15 two-factor interactions{3j;j’
(j ’ > j; j ’
= 2,...,6),
and sixquadratic
effects/3j; j’
’
Which
experimental design
should we select toesti-mate these
parameters?
We have a
tight
&dquo;computer
budget&dquo; allowing only
32 runs. To estimate all
effects,
we need 27 more runsbesides the free base run. Since we do not wish to
spend
most of ourcomputer
budget
in oneshot,
weproceed
stage-wise:
computer
runs are executed oneby
one. We further focus on interactions, before qua-dratic effects(also
see Section8).
Once we have alsoestimated the
quadratic
effects,
we take the sixpartial
derivatives
8y / 8z, ,
equate
them to zero, and estimatethe
optimum
factor combination.3. Rechtschaffner’s Saturated Resolution-5
Design
Our
strategy
implies
that we first estimate the overallmean, the six main
effects,
and the fifteen two-factorinteractions
(in
total,
22effects).
Because of thetight
computer
budget,
we select a saturateddesign,
that is, adesign
with a number of runs, say, nequal
to thenum-ber of
effects,
q. There are severaltypes
of saturateddesigns,
satisfying
different criteria.By
definition,
resolution-5
(R-5)
designs
give
unbiased estimators of the overall mean, all maineffects,
and all two-factorinteractions. We select a saturated R-5
design
that isreadily
available,
namely
thedesign
derived in Recht-schaffner[1]
andreplicated
inKleijnen
[2,
pp310-311]
]
(see
Table1).
This table
gives
the standardized factor values, say,x: - stands for -1, and + for 1;
further, -
means that the factor has its lowestvalue,
and + means that thefactor has its
highest
value in theexperiment.
We let+
correspond
to a 10% increase of the factor relative tothe base
value;
forexample,
factor A or z, has a basevalue of 150
(see
Section1),
so its +equals
165. Stan-dardizationimplies
that effects can bedirectly
com-pared-without thinking
about their different units(factor
A is in mM, factor C inC°) :
it reveals the mostimportant
factors. In the nextstage, however,
we shall use theoriginal
scales. Also seeKleijnen
[3].
3.1 Main
Effects Only:
FirstEight
RunsTable 1. Rechtschaffner’s saturated R-5
design
[1], in standardized values (- is -1; + is 1)This estimation
requires
at least seven runs. Run #1 isthe free run. Now we execute runs #2
through
#8 inTable 1
(actually,
runs #2through
#7 would havesuf-ficed,
but we were misledby
the fact that a 2k-Pdesign
would have
required eight
runs).
Theresulting
esti-mators may be biased
by
two-factor interactions andquadratic
effects. Hence it isdangerous
to declare afactor
unimportant
when its estimated main effect isnot
significant!
To estimate the effects
(3,
we useordinary
least squares(OLS),
giving,
say,(3.
Theresulting
first-orderpolyno-mial
gives
excellent fit:R-square
is0.99999,
andR-square
adjusted
for the number of effects is 0.99996.We use
SPSS,
which assumesnormally identically
andindependently
distributed(NIID)
fitting
errors withconstant variance
(estimated
to be0.015162).
Further,
SPSS
applies
Student’s t statistic to estimate 95%con-fidence
intervals;
their low and upper limits aredis-played
in the last two columns of Table 2(all
effectshave
roughly
the same standard error,namely
0.007;
see column
3).
All main effects aresignificant
(last
two
columns).
Actually,
we useonly
themagnitudes
of the OLSpoint
estimates(column 2)
to sort the factors. This showsthat factor B is the most
important
factor;
factor D the leastimportant;
factor F theonly &dquo;negative&dquo;
factor.However,
these areonly
tentativeconclusions,
be-cause main effect estimators may be biasedby
higher-order effects and statistical
significance
testing
as-sumes NIID. Our conclusion after the firststage
isthat there is not
enough
information to eliminate afactor or to make any
changes
in the factor levels.3.2 Two-Factor Interactions:
Remaining
RunsNext we execute the
remaining
runs #9through
#22in Table 1. The
outputs
turn out to vary between 90.369 and 99.204(base
output
90.900);
see theAppendix.
Since the
design
issaturated,
R-square
is 1.0. The factor estimateschange:
(a) (30 =
94.3616;(b)
the (3~ ’s
become0.79105,1.79775, 0.5415, 0.41918, 0.61275,
and-0.66778;
(c)
the
~3j; ~~’s
equal
0.00225,
except
for~1;6
=0.0014,
and
P4;6
= 0.002225. These estimatessuggest
that alltwo-factor interactions are
unimportant
(see
Section7).
4.
Quadratic
Effects: One-at-a-TimeDesign
Next we estimate the
quadratic
effects,
by changing
one factor at a time. Each factor should have at leastthree values: we
change
zj
to, say,cj
withcj ~
-1 andCj 7=
1.Moreover,
we execute runs oneby
one(chang-ing
the level ofonly
onefactor).
After each run, were-estimate the main
effect,
interactions, andquadratic
effect of that oneinput.
If theresulting
estimatedopti-mum value of that
input
lies far outside the currentrange, we are
searching
in the wrong area!The first factor we
change
is theseemingly
mostim-portant
factor,
B(see
Section3.2).
Furthermore,
wechange
this factor in the combination thatyielded
thehighest
output
so far(run
#7; see theAppendix).
SinceB’s estimated main effect is
positive,
we increase B’svalue. We do so
by
another 10%, whichgives
z2 = 484(or
x2 = 3.2: a 10%change
in z is not a 10%change
inx).
This increases theoutput
to 102.79.After
adding
this run to theprevious
22 runs, weestimate the second-order
polynomial. Taking
its par-tial derivativesay/azj ,
equating
them to zero, andsolving
gives
the estimatedoptimum input
values.(The
values for the other factors besides B do not make sense: theirquadratic
effects are notyet
estimated.)
The
&dquo;optimal&dquo;
B value turns out to be far away: x2 =15.4843 or 22 = 729.68599.
Next we also increase the factors A and C
through
G
by
20% in theoriginal
scales;
we decrease Fby
that samepercentage:
runs #24through
#28. Were-esti-mate the
polynomial;
the overall mean and maineffects remain close to those in Section 3.2; the interac-tions remain
unchanged;
thequadratic
effects are-0.011488, -0.041071, -0.016225, -0.004175, -0.023099,
and -0.019517.
5.
Re-estimating
theOptimal Input
CombinationAfter run #28, we re-estimate the second-order
poly-nomial,
whichgives
the estimatedoptimum input
values:
530.9438, 955.02,
623.2063,51.96925,
647.079, and 74.437. This combination is theinput
for run #29,which
gives
anoutput
of 145.4481(a
drastic increaseof
41.04%,
compared
with thehighest
output
sofar).
Again re-estimating
thepolynomial
gives
overallmean, main and
quadratic
effects thathardly
change,
and interactions thatchange
quite
a bit. There-esti-mated
optimal input
values are shown in theAppen-dix. These values are the
input
for run#30,
whichyields
a further increase to 159.5943.Next we re-estimate the
optimal
inputs
and find-7.0495 for factor F; such a
negative
value, however,
isimpossible
since F denotes the factor copper.There-fore we
keep
F’s level at zero in the next run(run
#31).
This
yields
anoutput
of157.5518,
a decreasecompared
with the
immediately preceding
run.Next we
again
re-estimate the effects and findthey
hardly change.
We re-estimate theoptimal inputs:
some
inputs
increase, somedecrease,
factor F becomespositive again
(58.2545),
which is moremeaningful.
Run #32
yields
anoutput
of 151.3(a decrease).
Finally,
we re-estimate theoptimal
input
values forrun #33, which
yields
anoutput
of 152.6. This is notthe maximum
output
over all 33 runs; the maximumin our search is that of run #30
(namely,
159.5943).
6. Conclusions
Our
computer
budget
was restricted to a total of 33runs,
including
the free base run(provided
in theprob-lem
definition).
We used the first 22 runs to estimatethe six main effects and the 15 two-factor interactions,
besides the overall mean. To
specify
these runs weused Rechschaffner’s saturated
design
(Table 1),
andwe
changed
the factorsby
10%(see
Appendix).
These runs gaveoutputs
that increasedby
no more than 9%(90.9
in the base run became 99.2 in run#7).
Next we estimated the
quadratic
effects. We usedsix runs,
increasing
each factor one at a time,by
20%(runs
#23through
#28).
This increased theoutput
to amaximum of 103.1, a modest increase
(see
run#24).
For the
remaining
five runs(#29
through
#33)
weused the five combinations estimated to be
optimal,
using
the second-orderpolynomial
re-estimated aftereach run. These runs gave
substantially
improved
outputs.
The overall maximum
output
turns out to be there-sult of run #30; this maximum is 159.5943. This is a 76% increase
compared
with the baseoutput,
90.9000.Ob-viously,
our estimated maximum is notnecessarily
the
global
maximum(we
might
havegotten
stuck at alocal
maximum).
Actually,
the true maximumoutput
turns out to be 160
(see
Section7),
so we have succeededin
approximating
the true maximum veryclosely!
7.
Epilogue
After we finished the search for the maximum
160. The simulation model that was a black box to us,
turned out to be:
y = 160 +
-(zl - 420)2/5000 - (z2 -
870)2/10000
-(Z3 - 480)2/10000 -
(z4 -
40)2/70
-(Z5 - 520)2/10000 -
(z6 -
40)
2/1000
+ +30/ {[(zl - 420)(z6 -
40)/1000]2
+5} -
30/5.
So there are no main effects and no interactionsex-cept
for that between z, and z2. There is no randomnoise. The
optimal input
values are 420,..., 40(com-pare with the values of run
#30).
The last term(30/5)
is
subtracted,
because the value of the interaction termfor the
optimal
input
values is30/(02
+5).
8. Final Comments on the
Competition
We were
disappointed
to learn that the simulation model wasonly
a mathematicalfunction,
not areal-life
problem
that we werehelping
to solve. This factexplains why
theparticipants
did notget
anyinforma-tion on the process itself and the ranges of its
inputs.
Hence,
in our view thecompetition
was unrealistic: inreal life the
analysts
accumulate muchknowledge
while
developing
their simulation model. This1-nnIAll-edge
concerns both the model and theunderlying
realsystem.
In reallife,
analysts
andproblem &dquo;owners&dquo;
should
cooperate!
Notwithstanding
this criticism, notonly
we foundthis an
interesting
andchallenging problem:
12 teamscompeted, employed by
operations
research andsta-tistics
departments
of well-known internationalcom-panies (Philips,
Unilever),
research institutes(TNO,
DLO),
and universities(Amsterdam,
Tilburg).
We wonthe
competition,
but it was a&dquo;photo
finish&dquo;: ourmaxi-mum
output
was 159.6, whereas thesecond-place
out-put
was 159.4.On
hindsight,
interactions were not soimportant
(see
Section7),
so an R-4design might
have beenbet-ter than Rechtschaffner’s R-5
design
(Table 1).
How-ever, a
complication
is that we usedstage-wise
experi-mentation. In this
approach,
classicaldesigns
(such
as2k-p
designs)
were not suited: we were limited to 32runs
altogether,
and we also wanted to estimate thequadratic
effects. This limit alsoimplies
that we couldnot
apply Response
SurfaceMethodology
(RSM),
which combines a series of localdesigns
withsteepest
ascent.
Kleijnen
[3]
gives
details,
including nearly
100references;
we limit our references to thosepublica-that
really
used.a
joint
paperby
the betterteams).
At themeeting
atwhich the
competitors presented
theirsolutions,
itturned out that
typically
ourstrategy
gaverelatively
low results
(compared
with ourcompetitors) during
the first 28 runs; in runs #29
through
#33, however,
ourstrategy
accelerated and overtook thecompeti-tors’
outputs.
In
general,
ourstrategy
seems agood
heuristic forreal-life
applications. Obviously
no heuristic isalways
&dquo;best&dquo;
(it
would not be aheuristic).
Determining
when aparticular
heuristic isapplicable
is rather difficult.One
practical
solutionmight
be:apply
the heuristicthat is most familiar
(&dquo;a
carpenter
can solve anyprob-lem with a
hammer&dquo;).
Ourstrategy
was a combinationof
design
ofexperiment
techniques
and common sense.Moreover, in some other
respects
thiscompetition
was realistic: the number of runs was limited
(to 32),
and there was a deadline
(5
January
1998).
So thetechniques applied
in this paper may have moreap-plications
insolving
realproblems.
9.Postscript
In
May
1999 thiscompetition
wasrepeated
with fiveteams of students at the
University
ofCanterbury
int-11r4S4-c 1, --1- ~T..~.. 1wl- -- &dquo;- -
autl- or
Christchurch,
New Zealand(when
the first ~o authorvisited the
University
as aVisiting
ErskineFellow).
Each team consisted of two members. These teams
used
strategies
that differed from thestrategy
that the authorsapplied. Actually,
the students did nottry
toestimate
quadratic
effects and interactions. Insteadthey
fitted first-orderpolynomials
to localinput/out-put
data, each time followedby
severalsteepest
as-cent trials. In
hindsight,
interactions may indeed beignored
in thiscompetition!
Thewinning
teamsuc-ceeded in
obtaining
anoutput
of159.4-very
close tothe authors’
output
of 159.6 and the true maximum of 160(the
&dquo;worst&dquo; team realized anoutput
of139.2).
Sodifferent
strategies
mayyield
(roughly)
the samere-sult :
&dquo;many
roads lead to Rome!&dquo;10. References
[1] Rechtschaffner, R.L. "Saturated Fractions of 2n and 3n
Facto-rial Designs." Technometrics, Vol. 9, pp 569-575, 1967.
[2] Kleijnen, J.P.C. Statistical Tools for Simulation Practitioners,
Marcel Dekker, NY, 1987.
[3] Kleijnen, J.P.C. "Experimental Design for Sensitivity Analysis, Optimization, and Validation of Simulation Models."
Hand-book of Simulation, Jerry Banks (ed.), Wiley, NY, 1998.
Acknowledgment
Jack P.C.
Kleijnen
is a Professor of Simulation andInforma-tion
Systems.
His research concerns simulation,mathemati-cal statistics, information systems, and
logistics,
which have led to six books andnearly
160 articles. He has been aconsultant for several
organizations
in the U.S. andEurope,
and has served on many international editorial boards and scientific committees. He spent several years in the U.S., atboth universities and
companies,
and received a number of internationalfellowships
and awards. More information isprovided
athttp:llcwis.kub.nll-few5lcenterlstafflkleijnenl.
6zge
Pala is a PhD student inOperations
Research. She earned her BSc in IndustrialEngineering
fromBogazici
University
in Istanbul,Turkey,
and an MSc inManagement
Science fromTilburg
University
in The Netherlands. Her research interests concern systemdynamics methodology,
simulation and soft OR