SCS Seminar Talk: Sepideh Mahabadi

*********************************
There is now a CONTENT FREEZE for Mercury while we switch to a new platform. It began on Friday, March 10 at 6pm and will end on Wednesday, March 15 at noon. No new content can be created during this time, but all material in the system as of the beginning of the freeze will be migrated to the new platform, including users and groups. Functionally the new site is identical to the old one. webteam@gatech.edu
*********************************

Event Details
  • Date/Time:
    • Thursday February 4, 2021 - Friday February 5, 2021
      11:00 am - 11:59 am
  • Location: BlueJeans
  • Phone:
  • URL:
  • Email:
  • Fee(s):
    N/A
  • Extras:
Contact

Tess Malone, Communications Officer

tess.malone@cc.gatech.edu

Summaries

Summary Sentence: Diversity and Fairness in Data Summarization Algorithms

Full Summary: No summary paragraph submitted.

Media
  • Sepideh Mahabadi Sepideh Mahabadi
    (image/jpeg)

TITLE:  Diversity and Fairness in Data Summarization Algorithms

ABSTRACT:

Searching and summarization are two of the most fundamental tasks in massive data analysis. In this talk, I will focus on these two tasks from the perspective of diversity and fairness. Search is often formalized as the (approximate) nearest neighbor problem. Despite an extensive research on this topic, its basic formulation is insufficient for many applications. In this talk, I will describe such applications and our approaches to address them. For example, we show how to incorporate diversity or fairness in the results of a search query.

A prominent approach to summarize the data is to compute a small “core-set”: a subset of the data that is sufficient for approximating the solution of a given task. We introduce the notion of “composable core-sets” as core-sets with the composability property: the union of multiple core-sets should form a good summary for the union of the original data sets. This composability property enables efficient solutions to a wide variety of massive data processing applications, including distributed computation (e.g. Map-Reduce model), streaming algorithms, and similarity search. We show how to produce such efficient summaries of the data while preserving the diversity in the data set. I will describe several metrics for capturing the notion of diversity, and present efficient algorithms for construction of composable core-sets with respect to those metrics.

BIO:

Sepideh Mahabadi is a research assistant professor at the Toyota Technological Institute at Chicago (TTIC).  She received her Ph.D. from MIT, where she was advised by Piotr Indyk. For a year, she was a postdoctoral research scientist at Simons Collaboration on Algorithms and Geometry based at Columbia University. Her research focuses on theoretical foundations of massive data including high dimensional computational geometry, streaming algorithms, and data summarization; as well as social aspects of algorithms for massive data including diversity maximization and algorithmic fairness.

 

WATCH HERE:  https://bluejeans.com/457461626

Additional Information

In Campus Calendar
No
Groups

College of Computing, School of Computer Science

Invited Audience
Faculty/Staff, Postdoc, Public, Graduate students, Undergraduate students
Categories
Seminar/Lecture/Colloquium
Keywords
No keywords were submitted.
Status
  • Created By: Tess Malone
  • Workflow Status: Published
  • Created On: Jan 27, 2021 - 7:00pm
  • Last Updated: Jan 27, 2021 - 7:01pm