ISyE Statistic Seminar - David Dunson

*********************************
There is now a CONTENT FREEZE for Mercury while we switch to a new platform. It began on Friday, March 10 at 6pm and will end on Wednesday, March 15 at noon. No new content can be created during this time, but all material in the system as of the beginning of the freeze will be migrated to the new platform, including users and groups. Functionally the new site is identical to the old one. webteam@gatech.edu
*********************************

Event Details
  • Date/Time:
    • Friday March 6, 2020 - Saturday March 7, 2020
      12:00 pm - 12:59 pm
  • Location: ISyE Main 228
  • Phone:
  • URL: ISyE Building
  • Email:
  • Fee(s):
    N/A
  • Extras:
Contact
No contact information submitted.
Summaries

Summary Sentence: Learning & Exploiting Low-Dimensional Structure in High-Dimentional Data

Full Summary:

Abstract:

This talk will focus on the problem of learning low-dimensional geometric structure in high-dimensional data. We allow the lower-dimensional subspace to be non-linear. There are a variety of algorithms available for “manifold learning” and non-linear dimensionality reduction, mostly relying on locally linear approximations and not providing a likelihood-based approach for inferences. We propose a new class of simple geometric dictionaries for characterizing the subspace, along with a simple optimization algorithm and a model-based approach to inference. We provide strong theory support, in terms of tight bounds on covering numbers, showing advantages of our approach relative to local linear dictionaries. These advantages are shown to carry over to practical performance in a variety of settings including manifold learning, manifold de-noising, data visualization, classification (providing a competitor to deep neural networks that requires fewer training examples), and geodesic distance estimation. We additionally provide a Bayesian nonparametric methodology for inference, using a new class of kernels, which is shown to outperform current methods, such as mixtures of multivariate Gaussians.

Title:

Learning & Exploiting Low-Dimensional Structure in High-Dimentional Data

Abstract:

This talk will focus on the problem of learning low-dimensional geometric structure in high-dimensional data. We allow the lower-dimensional subspace to be non-linear. There are a variety of algorithms available for “manifold learning” and non-linear dimensionality reduction, mostly relying on locally linear approximations and not providing a likelihood-based approach for inferences. We propose a new class of simple geometric dictionaries for characterizing the subspace, along with a simple optimization algorithm and a model-based approach to inference. We provide strong theory support, in terms of tight bounds on covering numbers, showing advantages of our approach relative to local linear dictionaries. These advantages are shown to carry over to practical performance in a variety of settings including manifold learning, manifold de-noising, data visualization, classification (providing a competitor to deep neural networks that requires fewer training examples), and geodesic distance estimation. We additionally provide a Bayesian nonparametric methodology for inference, using a new class of kernels, which is shown to outperform current methods, such as mixtures of multivariate Gaussians.

Short Bio:

David Dunson is Arts & Sciences Distinguished Professor of Statistical Science and Mathematics at Duke University.  His research focuses on developing methodology for analysis and interpretation of complex and high-dimensional data, with a particular emphasis on Bayesian and probability modeling approaches.  He is particularly interested in work at the intersection of statistics, differential geometry, and computer science.  Methods development and theory are directly motivated by applications in neuroscience, genomics, environmental health, and ecology among others.  In these settings, it is common for data to have a structured form, consisting of replicated networks/graphs, trees, functions, tensors, etc.  A focus is on developing fundamentally new frameworks for statistical inferences in challenging settings, including improving robustness to modeling assumptions and scalability to large datasets.  He has won numerous awards, including the 2010 COPSS Presidents’ Award, which is widely viewed as the most prestigious award in statistics and represents statistics version of the Field’s Medal, being given to one outstanding researcher under the age of 41 per year internationally.  His work has had substantial impact, with ~48,000 citations on google scholar and an H-index of 75.

Additional Information

In Campus Calendar
Yes
Groups

School of Industrial and Systems Engineering (ISYE)

Invited Audience
Faculty/Staff, Postdoc, Public, Graduate students, Undergraduate students
Categories
Seminar/Lecture/Colloquium
Keywords
No keywords were submitted.
Status
  • Created By: Julie Smith
  • Workflow Status: Published
  • Created On: Mar 2, 2020 - 10:33am
  • Last Updated: Mar 2, 2020 - 10:33am