PhD Defense by Marcus Aloysius Pereira


Event Details
  • Date/Time:
    • Wednesday August 24, 2022
      10:00 am - 12:00 pm
  • Location: MK317 in the Montgomery Knight Aerospace Engineering Building
  • URL: TEAMS
  • Fee(s):
    N/A
Summaries

Summary Sentence: Scalable and Safe Deep Learning Architectures for Stochastic Optimal Control Using Forward-Backward Stochastic Differential Equations


Title: Scalable and Safe Deep Learning Architectures for Stochastic Optimal Control Using Forward-Backward Stochastic Differential Equations

 

Date: August 24, 2022

Time: 10:00 am to noon E.T.

Location: Conference room MK317 in the Montgomery Knight Aerospace Engineering Building

Meeting Link: Teams meeting link

 

Marcus Aloysius Pereira 

Robotics Ph.D. Candidate 

School of Aerospace Engineering 

Georgia Institute of Technology

 

Committee:

Dr. Evangelos Theodorou (advisor), School of Aerospace Engineering, Georgia Institute of Technology

Dr. Enlu Zhou, School of Industrial and Systems Engineering, Georgia Institute of Technology

Dr. Samuel Coogan, School of Electrical and Computer Engineering, Georgia Institute of Technology

Dr. Kyriakos G. Vamvoudakis, School of Aerospace Engineering, Georgia Institute of Technology

Dr. Yongxin Chen, School of Aerospace Engineering, Georgia Institute of Technology

Dr. Ioannis Exarchos, Microsoft

 

Abstract:

Stochastic Optimal Control (SOC) in continuous time requires solving the Hamilton-Jacobi-Bellman (HJB) equation, which suffers from the well-known curse of dimensionality. Instead of attempting to solve the HJB equation directly, one can obtain probabilistic representations of its solution via the nonlinear Feynman-Kac lemma, which relates the unique solution of the HJB equation to a system of Forward-Backward Stochastic Differential Equations (FBSDEs). This thesis develops novel algorithms that leverage the function approximation capabilities of deep recurrent neural networks to solve systems of FBSDEs. The resulting deep FBSDE framework is memory efficient, provides temporally smoother controls, is immune to compounding approximation errors, and can be employed for long time horizons owing to the underlying Long Short-Term Memory (LSTM) network architecture. Starting from a vanilla SOC problem, the framework is extended to several problem formulations: dynamics with control-multiplicative noise, dynamics with non-affine controls and non-quadratic control cost functions, safety-critical tasks that employ Stochastic Control Barrier Functions, and L1-SOC for minimum-fuel aerospace applications. Each problem formulation is accompanied by the necessary structural modifications to the deep learning architecture to enable end-to-end learning. To improve the scalability of the framework, especially for safety-critical multi-robot problems, this thesis then proposes a novel decentralized approach to safe SOC using the aforementioned deep FBSDE framework and the well-known Alternating Direction Method of Multipliers (ADMM) algorithm. The efficacy of the decentralized approach is demonstrated in simulation on challenging tasks involving many robots and safety constraints. Finally, using this as the backbone, a novel sim2real approach is developed that enables the deep FBSDE framework to be deployed directly on hardware after training in simulation; the approach is tested on the Robotarium platform. This marks the first work to deploy FBSDE-based controllers on real hardware.

Additional Information

In Campus Calendar
No
Groups

Graduate Studies

Invited Audience
Faculty/Staff, Public, Undergraduate students
Categories
Other/Miscellaneous
Keywords
Phd Defense
Status
  • Created By: Tatianna Richardson
  • Workflow Status: Published
  • Created On: Aug 11, 2022 - 11:48am
  • Last Updated: Aug 11, 2022 - 11:48am