*********************************
There is now a CONTENT FREEZE for Mercury while we switch to a new platform. It began on Friday, March 10 at 6pm and will end on Wednesday, March 15 at noon. No new content can be created during this time, but all material in the system as of the beginning of the freeze will be migrated to the new platform, including users and groups. Functionally the new site is identical to the old one. webteam@gatech.edu
*********************************
TITLE: The Knowledge Gradient Policy for Optimal Learning
SPEAKER: Dr. Warren Powell
ABSTRACT:
There are many applications which require collecting information, where the time or cost required to make a measurement may be high. A measurement may require running an expensive simulation, testing a molecular compound in a lab, estimating the presence of a disease in the population, or field testing a market price or business policy in the market place. There are elegant, optimal techniques for very specialized problems such as multi-armed bandit problems, and a host of heuristics and techniques developed for specialized problem classes. The knowledge gradient is a technique that guides measurement decisions using what might be described as classical steepest ascent which requires finding the expected value of a single measurement. This technique is myopically optimal, and is also asymptotically optimal, with strong supporting evidence for problems with finite budgets. The appeal of the method is its generality, allowing it to address problems that have been previously viewed as belonging to completely separate communities. However, it introduces a specific computational challenge which has to be overcome before it can be used for a particular application. The idea will be illustrated on discrete choice problems with correlated beliefs, scalar problems (e.g. optimizing prices), continuous multidimensional problems (e.g. finding the best set of parameters to optimize a simulation), and drug discovery.