*********************************
There is now a CONTENT FREEZE for Mercury while we switch to a new platform. It began on Friday, March 10 at 6pm and will end on Wednesday, March 15 at noon. No new content can be created during this time, but all material in the system as of the beginning of the freeze will be migrated to the new platform, including users and groups. Functionally the new site is identical to the old one. webteam@gatech.edu
*********************************
Title: Scalable and Safe Deep Learning Architectures for Stochastic Optimal Control Using Forward-Backward Stochastic Differential Equations
Date: August 24, 2022
Time: 10:00 am to noon E.T.
Location: Conference room MK317 in the Montgomery Knight Aerospace Engineering Building
Meeting Link: Teams meeting link
Marcus Aloysius Pereira
Robotics Ph.D. Candidate
School of Aerospace Engineering
Georgia Institute of Technology
Committee:
Dr. Evangelos Theodorou (advisor), School of Aerospace Engineering, Georgia Institute of Technology
Dr. Enlu Zhou, School of Industrial and Systems Engineering, Georgia Institute of Technology
Dr. Samuel Coogan, School of Electrical and Computer Engineering, Georgia Institute of Technology
Dr. Kyriakos G. Vamvoudakis, School of Aerospace Engineering, Georgia Institute of Technology
Dr. Yongxin Chen, School of Aerospace Engineering, Georgia Institute of Technology
Dr. Ioannis Exarchos, Microsoft
Abstract:
Stochastic Optimal Control (SOC) in continuous-time requires solving the Hamilton-Jacobi-Bellman (HJB) equation which suffers from the well-known curse-of-dimensionality. Instead of directly attempting to solve the HJB, one can obtain probabilistic representations of the solution via the Nonlinear Feynman-Kac lemma which relates the unique solution of the HJB to a system of Forward-Backward Stochastic Differential Equations (FBSDEs). This thesis develops novel algorithms that leverage the function approximation capabilities of deep recurrent neural networks to solve systems of FBSDEs and the resulting deep FBSDE framework is memory efficient, provides temporally smoother controls, is immune to compounding approximation errors and can be employed for long time-horizons owing to the underlying Long-Short Term Memory network architecture. Starting from a Vanilla SOC problem, the framework is extended to problem formulations such as dynamics with control-multiplicative noise, dynamics with non-affine controls and non-quadratic control cost functions, safety-critical tasks which employs Stochastic Control Barrier Functions and for L1-SOC in minimum-fuel aerospace applications. Each problem formulation is accompanied with necessary structural modifications to the deep learning architecture to enable end-to-end learning. In order to improve the scalability of the framework, especially for safety critical multi-robot problems, this thesis then proposes a novel Decentralized Approach to Safe SOC using the aforementioned Deep FBSDE framework and the well-known Alternating Direction Method of Multipliers (ADMM) algorithm. Using simulations, the efficacy of the decentralized approach is demonstrated on challenging tasks involving many robots and safety constraints. Finally, using this as the backbone, a novel sim2real approach is developed which empowers the deep FBSDE framework to be directly deployed on hardware after training in simulation and the approach is tested on the Robotarium platform. This marks the first work to deploy FBSDE-based controllers on real hardware.