SCS Recruiting Seminar: Yuanzhi Li

*********************************
There is now a CONTENT FREEZE for Mercury while we switch to a new platform. It began on Friday, March 10 at 6pm and will end on Wednesday, March 15 at noon. No new content can be created during this time, but all material in the system as of the beginning of the freeze will be migrated to the new platform, including users and groups. Functionally the new site is identical to the old one. webteam@gatech.edu
*********************************

Event Details
  • Date/Time:
    • Tuesday January 15, 2019 - Wednesday January 16, 2019
      11:00 am - 11:59 am
  • Location: KACB 1116W
  • Phone:
  • URL:
  • Email:
  • Fee(s):
    N/A
  • Extras:
Contact

Tess Malone, Communications Officer

tess.malone@cc.gatech.edu

Summaries

Summary Sentence: Towards Deeper Understandings of Deep Learning

Full Summary: No summary paragraph submitted.

Media
  • Yuanzhi Li Yuanzhi Li
    (image/jpeg)

TITLE: Towards Deeper Understandings of Deep Learning

ABSTRACT:

Recent breakthroughs in machine learning often involve learning highly non-convex models, especially deep neural networks. Though many empirical works have demonstrated the success of these methods, the formal study of the principles behind them is less established.

This talk will show a few of the recent results towards developing such principles. In particular, we focus on the over-parameterized neural networks for multi-class classifications. We will show that stochastic gradient descent (SGD) on over-parameterized deep neural networks provably finds the global minimum for the training objective. Moreover, we also prove that such perfect fitting can also be extended to test data set when the labels are generated by certain teaching networks.

This talk will also cover how to use the above results as a step to establish the theory behind the “magic’’ of learning rate decay in training neural networks, as well as how the identity mapping in ResNet helps in the learning process.

BIO:

Yuanzhi Li is a postdoctoral researcher at the computer science department of Stanford University. Previously, he obtained his Ph.D. at Princeton under the advice of Sanjeev Arora. His research interests include topics in deep learning, non-convex optimization, and online learning.

 

Additional Information

In Campus Calendar
No
Groups

College of Computing, School of Computer Science

Invited Audience
Faculty/Staff, Postdoc, Public, Graduate students, Undergraduate students
Categories
Seminar/Lecture/Colloquium
Keywords
No keywords were submitted.
Status
  • Created By: Tess Malone
  • Workflow Status: Published
  • Created On: Jan 10, 2019 - 2:19pm
  • Last Updated: Jan 10, 2019 - 2:22pm