Chris Siefert
Department of Mathematics
College of William and Mary
cmsief@cs.wm.edu
A planned set of evaluations of a deterministic function is called a computer experiment. The problem of optimization of such experiments is a highly non-trivial one. Typically, evaluating function the f, requires running a computationally expensive simulation of some kind of physical system e.g. the helicopter rotor blade studied by Booker (1996). Due to high-frequency, low-amplitude variations in the function f, often induced by numerical error in the computer simulation, derivative information if it can be found, is not helpful. Hence, derivative-based optimization methods, such as quasi-Newton methods would be inappropriate for this problem. Likewise, due to the high computational expense of evaluating the objective function, direct search methods would also be inappropriate for computer experiments, as they require a large number of function evaluations. Since these two mainstays of traditional optimization would fail to be effective for computer experiments, different methods are required.
The idea behind the Surrogate Management Framework (SMF), outlined by Booker et al.(1999), is to use a global approximation to our objective function f, to accelerate the search for a minimizer. The SMF is uses this approximation to guide the pattern search, which is a type of direct search. The specific implementation of the Surrogate Management Framework that was completed by Trosset and Torczon (1997) is called the Model-Assisted Grid Search (MAGS). The algorithm has since been modified, and has been renamed the Model-Assisted Pattern Search (MAPS) by Trosset and Torczon (1999).
The purpose of this report is to study the Model-Assisted Pattern Search more fully and to subject it to more comprehensive testing than has been done to this date. This report explains and discusses the MAPS algorithm in Section 2, discusses previous computational experiments in Section 3 and proposes new computational experiments in Section 4. Section 5 offers preliminary results of the proposed experiments in Section 4, and Section 6 outlines future plans for completing this research. Finally, the paper concludes in Section 7.
The essential logic of the Model-Assisted Pattern Search is shown below in the figure reproduced from Trosset and Torczon (1999).
As a special case of SMF, MAPS inherits its guarantee of global convergence. This guarantee is derived from the fact that SMF is based on the pattern search. See Booker et al.(1999) and Lewis et al.(1998) for information about the convergence of pattern search. Like pattern search, MAPS searches on a sequence of grids.
Grid are refined according to pattern search rules, i.e. only when f(xt) >
f(xc) for every xt in the ``core pattern'' of points at xc,
denoted as
.
is the set of sites
at which f has been evaluated, so when
the pattern search grid refinement criterion has been
satisfied, and grid refinement is allowed.
A key feature of MAPS is its use of global approximations to help
the pattern search find promising trial points. The current global
approximation,
, is constructed by a technique known as
kriging, explained by Sacks et al.(1989) and
Koehler and Owen (1996). Before the pattern search segment of
MAPS commences, an initial approximation is constructed by selecting
an initial set of design sites, evaluating f there, and then kriging
the results. Previous studies of MAGS/MAPS selected the initial
design sites with Latin hypercube sampling.
To search for trial points, MAPS optimizes a search criterion, Sc.
The original MAGS algorithm used
as the search
criterion; however Torczon and Trosset (1998) and
Trosset (1998) argued that this choice is problematic. They
suggested that MAGS is too single-minded in its pursuit of a minimizer
of
and that it gives insufficient attention to improving
the quality of
. Trosset (1998) proposed setting Sc equal to
, where Dc(x) measures a point's potential
to improve the approximation and wc is the weight placed on Dc(x).
Influenced by Cox and John (1997), Trosset and Torczon (1997)
adopted
.
Previous experiments with MAGS/MAPS have focused on the importance of sequential experimentation. Trosset and Torczon (1997, 1999) compared MAGS/MAPS to an the DACE approach proposed by Welch and Sacks (1991). In DACE, a single experiment is performed, a single approximation is constructed and minimized, and a single validation evaluation is performed at the putative minimizer. Trosset and Torczon (1997) found that MAGS outperformed DACE for several choice of V, the permitted number of function evaluations. However, these experiments considered just one objective function. Trosset and Torczon (1999) considered 20 objective functions, constructed by generating 10 realizations of a stochastic process and adding a constant or a quadratic trend. One of these functions was used in this report as well. Again, in Trosset and Torczon's (1999) report MAPS performed DACE. However, only one implementation of MAPS was tested.
Other than this limited testing, however, very little testing has been
performed on the MAPS algorithm. There has been no experimentation
with varying the number of evaluations, N, devoted to the initial
design, and there has also been no experimentation on the effects of a
small value of N. Likewise, little testing has been done with
varying the mesh size
, and the weighting schedule which
determines wc for each iteration of the loop in the MAPS algorithm.
Many computational experiments remain to be performed. This study investigates four specific questions, outlined below.
The experiments described in this section were performed a on randomly
generated objective functions of p=2 variables. This function was
generated using methods described by Trosset (1999). It is one
of the objective functions studied by Trosset and Torczon (1997),
the sum of their realizations of a Gaussian process and a constant
trend. Unless otherwise noted, all experiments were
performed with a budget of function evaluations V=2, N=5 initial
design sites, a weighting scheme of
, and a
initial mesh fineness of 0.1. A plot of the function over the
region of interest is attached in Appendix A.
Several tests of the MAPS algorithm were performed n order to study the effects of the proportion N/V, of function evaluations allocated to the initial design. These tests had 20 runs of MAPS each, and were run for N=3,5,7,11,13 for V=20 and V=25 function evaluations. The figures below illustrate the results of these tests.
Figure 1 and 2 support the theory outlined above -- MAPS tended to perform poorly if small quantities of function evaluations were allocated to the initial design. Likewise, MAPS performance also suffered on average, when a relatively high proportion of function evaluation was allocated to the initial design. Although the ideal balance cannot be determined by these preliminary results, the goal of finding such a balance should certainly be a future research goal.
Varying the mesh size produced mixed results. For the function tested in Figure 3, the results seem to suggest that smaller mesh sizes perform the best. This supports the hypothesis that larger mesh sizes tended to waste function evaluations evaluating Core patterns prior to grid refinement.
Another question dealt with in this report is that of the weighting
schedules. Specifically, how do weighting schedules interact with
mesh size. It is reasonable to suppose that Trosset and Torczon's
weighting schedules,
, will do better for
most functions when given a smaller mesh. But how are other
weighting schedules affected by mesh size? Specifically, how are the
schedules based on Cox and John's SDO design (1997) with
and the original MAGS weight-free design
affected by the mesh size.
The results here are fairly interesting. Since the function evaluated in Figure 4 is the same function tested in the testing of mesh sizes, the results are fairly clear -- smaller mesh sizes tend to yield a significant performance gain in the MAPS algorithm. Likewise, the Cox and John inspired weighting schedule shown in Figure 5 shows a similar gain.
However, the ``No Weight'' weighting schedule has a significantly smaller performance gain when it uses a smaller mesh. In fact, this apparent gain is in all likelihood illusory, a product of the small sample size used for these preliminary results. So it is reasonable to infer from these results that there is some sort of interaction between mesh size and weighting scheme.
Finally, testing on Latin hypercube initial designs has also yielded some interesting results. The figure below shows four initial Latin hypercube designs. In examining these figures, one notes that there is no distinguishing features which sets the designs that yield good results apart from the design that yield poor results. More specifically, it seems to be difficult to discern, a priori if a given Latin hypercube initial design is ``good''. Thus alternatives to Latin hypercubes initial designs should be investigated.

Currently, the code used for the MAPS algorithm is written in S-PLUS, a high-level mathematical programming language. Although S-PLUS supports many advanced features, such non-linear optimization, it is also quite inefficient when it comes to execution time. The present version takes hours to run a set of tests on a mid-level pentium. Over the summer, the author plans to implement the MAPS algorithm in C++. This language is far more portable than S-PLUS, as nearly every system has a C++ compiler available. Also, execution time should be much faster for the C++ implementation, as the C++ can take advantage of optimizing compilers, while S-PLUS is stuck using its interpreter.
Across the course of the next academic year, the author plans to study more closely the affect of these facets of MAPS, as well as others, on the algorithm performance. The author also intends to test MAPS on a wide variety of randomly-generated functions, and in doing so produce much more definitive results than those in the present report.
In conclusion, the MAPS algorithm provides a potentially useful method for optimization on a limited budget. However, there has been little study into the effect of algorithm's various parameters, and this report has taken a preliminary look into this area. The tests performed here suggest that MAPS performance is inversely related to mesh size, for certain weighting schedules, but that the choice of weighting schedule can mitigate or possibly reverse that effect. This report also suggests MAPS tends to perform best on allocations of evaluations to the initial design that are medium-sized. However, finding the optimal level is an area for future research. Finally, as for Latin hypercube initial designs, this report can say little, other than to suggest that more research should go into the use of other experimental designs.

This document was generated using the LaTeX2HTML translator Version 97.1 (release) (July 13th, 1997)
Copyright © 1993, 1994, 1995, 1996, 1997, Nikos Drakos, Computer Based Learning Unit, University of Leeds.
The command line arguments were:
latex2html -no_navigation -split 0 maps_math490.tex.
The translation was initiated by Chris Siefert on 5/17/1999