PhD Defense by Marcus Aloysius Pereira

*********************************
There is now a CONTENT FREEZE for Mercury while we switch to a new platform. It began on Friday, March 10 at 6pm and will end on Wednesday, March 15 at noon. No new content can be created during this time, but all material in the system as of the beginning of the freeze will be migrated to the new platform, including users and groups. Functionally the new site is identical to the old one. webteam@gatech.edu
*********************************

Event Details

Date/Time:
- Wednesday August 24, 2022
  10:00 am - 12:00 pm
Location: MK317 in the Montgomery Knight Aerospace Engineering Building
Phone:
URL: TEAMS
Email:
Fee(s):
N/A
Extras:

Contact

No contact information submitted.

Summaries

Summary Sentence: Scalable and Safe Deep Learning Architectures for Stochastic Optimal Control Using Forward-Backward Stochastic Differential Equations

Full Summary: No summary paragraph submitted.

Title: Scalable and Safe Deep Learning Architectures for Stochastic Optimal Control Using Forward-Backward Stochastic Differential Equations

Date: August 24, 2022

Time: 10:00 am to noon E.T.

Location: Conference room MK317 in the Montgomery Knight Aerospace Engineering Building

Meeting Link: Teams meeting link

Marcus Aloysius Pereira

Robotics Ph.D. Candidate

School of Aerospace Engineering

Georgia Institute of Technology

Committee:

Dr. Evangelos Theodorou (advisor), School of Aerospace Engineering, Georgia Institute of Technology

Dr. Enlu Zhou, School of Industrial and Systems Engineering, Georgia Institute of Technology

Dr. Samuel Coogan, School of Electrical and Computer Engineering, Georgia Institute of Technology

Dr. Kyriakos G. Vamvoudakis, School of Aerospace Engineering, Georgia Institute of Technology

Dr. Yongxin Chen, School of Aerospace Engineering, Georgia Institute of Technology

Dr. Ioannis Exarchos, Microsoft

Abstract:

Stochastic Optimal Control (SOC) in continuous-time requires solving the Hamilton-Jacobi-Bellman (HJB) equation which suffers from the well-known curse-of-dimensionality. Instead of directly attempting to solve the HJB, one can obtain probabilistic representations of the solution via the Nonlinear Feynman-Kac lemma which relates the unique solution of the HJB to a system of Forward-Backward Stochastic Differential Equations (FBSDEs). This thesis develops novel algorithms that leverage the function approximation capabilities of deep recurrent neural networks to solve systems of FBSDEs and the resulting deep FBSDE framework is memory efficient, provides temporally smoother controls, is immune to compounding approximation errors and can be employed for long time-horizons owing to the underlying Long-Short Term Memory network architecture. Starting from a Vanilla SOC problem, the framework is extended to problem formulations such as dynamics with control-multiplicative noise, dynamics with non-affine controls and non-quadratic control cost functions, safety-critical tasks which employs Stochastic Control Barrier Functions and for L1-SOC in minimum-fuel aerospace applications. Each problem formulation is accompanied with necessary structural modifications to the deep learning architecture to enable end-to-end learning. In order to improve the scalability of the framework, especially for safety critical multi-robot problems, this thesis then proposes a novel Decentralized Approach to Safe SOC using the aforementioned Deep FBSDE framework and the well-known Alternating Direction Method of Multipliers (ADMM) algorithm. Using simulations, the efficacy of the decentralized approach is demonstrated on challenging tasks involving many robots and safety constraints. Finally, using this as the backbone, a novel sim2real approach is developed which empowers the deep FBSDE framework to be directly deployed on hardware after training in simulation and the approach is tested on the Robotarium platform. This marks the first work to deploy FBSDE-based controllers on real hardware.

Additional Information

In Campus Calendar

Groups

Graduate Studies

Invited Audience

Faculty/Staff, Public, Undergraduate students

Categories

Other/Miscellaneous

Keywords

Phd Defense

Status

Created By: Tatianna Richardson
Workflow Status: Published
Created On: Aug 11, 2022 - 11:48am
Last Updated: Aug 11, 2022 - 11:48am

Georgia Tech

PhD Defense by Marcus Aloysius Pereira

Additional Information