Approximate Dynamic Programming for High-Dimensional Resource Allocation Problems

*********************************
There is now a CONTENT FREEZE for Mercury while we switch to a new platform. It began on Friday, March 10 at 6pm and will end on Wednesday, March 15 at noon. No new content can be created during this time, but all material in the system as of the beginning of the freeze will be migrated to the new platform, including users and groups. Functionally the new site is identical to the old one. webteam@gatech.edu
*********************************

Event Details
  • Date/Time:
    • Thursday November 4, 2004
      10:00 am - 10:59 pm
  • Location: Executive Classroom, ISyE Main Building
  • Phone:
  • URL:
  • Email:
  • Fee(s):
    N/A
  • Extras:
Contact
Barbara Christopher
Industrial and Systems Engineering
Contact Barbara Christopher
404.385.3102
Summaries

Summary Sentence: Approximate Dynamic Programming for High-Dimensional Resource Allocation Problems

Full Summary: Approximate Dynamic Programming for High-Dimensional Resource Allocation Problems

Dynamic programming has long been relegated to small problems due to the well-known "curse of dimensionality." We show that there are actually three curses of dimensionality, but that these can be overcome by using a different form of the optimality equations along with carefully chosen functional approximations. In particular, the use of separable, piecewise linear approximations of the value function is particularly well suited for discrete resource allocation problems, allowing us to solve dynamic programs where the dimensionality of the state variable is in the tens of thousands. We illustrate these methods in the context of several large-scale freight transportation applications, and discuss research issues that have arisen in these applications. One challenge has been the management of resources with complex attributes, which produces dynamic programs where the dimensionality of the state vector can measure in the millions. We show how hierarchical learning strategies can be used to produce effective approximations, helping to solve the "explore vs. exploit" dilemma that is well-known in approximate dynamic programming. As a byproduct, the value functions provide accurate gradient information, avoiding the need to perform statistically unreliable "what if" simulations to determine the effect of changes in fleet size and mix.

Additional Information

In Campus Calendar
No
Groups

School of Industrial and Systems Engineering (ISYE)

Invited Audience
No audiences were selected.
Categories
Seminar/Lecture/Colloquium
Keywords
No keywords were submitted.
Status
  • Created By: Barbara Christopher
  • Workflow Status: Published
  • Created On: Oct 8, 2010 - 7:39am
  • Last Updated: Oct 7, 2016 - 9:52pm