ML@GT Virtual Seminar: Towards High Precision Text Generation with Ankur Parikh, Google

Event Details
  • Date/Time:
    • Wednesday November 11, 2020
      12:15 pm - 1:15 pm
  • Location: Virtual - Bluejeans - https://primetime.bluejeans.com/a2m/live-event/vppyhtjh
  • Fee(s):
    N/A
Contact

Allie McFadden | Communications Officer

allie.mcfadden@cc.gatech.edu

Summaries

Summary Sentence: A seminar titled "Towards High Precision Text Generation" with Ankur Parikh from Google

Ankur Parikh, a senior research scientist at Google NYC and adjunct assistant professor at NYU, will give a talk on November 11, 2020, at 12:15 pm ET. This is a virtual event and is open to all Georgia Tech students, faculty, staff, and interested members of the public.

REGISTER HERE

Title: Towards High Precision Text Generation

Abstract:

Despite large advances in the fluency of neural text generation, existing generation techniques are prone to hallucination and often produce output that is unfaithful or irrelevant to the source text. In this talk, we approach this problem from three aspects: data, evaluation, and modeling.

From the data standpoint, we propose ToTTo, a table-to-text dataset with high-quality, annotator-revised references that we hope can serve as a benchmark for high-precision text generation. While the dataset is challenging, existing n-gram-based evaluation metrics are often insufficient to detect hallucinations. To this end, we propose BLEURT, a fully learnt end-to-end metric based on transfer learning that can quickly adapt to measure specific evaluation criteria. Finally, we propose a model based on confidence decoding to mitigate hallucinations.
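
The point about n-gram-based metrics can be illustrated with a small sketch. The code below is not BLEU, BLEURT, or anything from the talk; it is a simplified clipped bigram precision, and the three sentences are made-up examples chosen only to show how surface overlap can reward a factually wrong output over a faithful paraphrase.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def ngram_precision(candidate, reference, n=2):
    """Clipped n-gram precision of a candidate against one reference.

    A simplified stand-in for n-gram overlap metrics: no brevity penalty,
    no smoothing, a single n-gram order, whitespace tokenization.
    """
    cand = ngrams(candidate.split(), n)
    ref = ngrams(reference.split(), n)
    total = sum(cand.values())
    if total == 0:
        return 0.0
    # Each candidate n-gram is credited at most as often as it occurs in the reference.
    overlap = sum(min(count, ref[gram]) for gram, count in cand.items())
    return overlap / total


# Hypothetical sentences, invented for illustration only.
reference = "the team won the championship in 2015"
hallucinated = "the team won the championship in 2012"        # wrong year
faithful = "in 2015 the championship was won by the team"     # correct paraphrase

hallucinated_score = ngram_precision(hallucinated, reference)
faithful_score = ngram_precision(faithful, reference)

print(f"hallucinated: {hallucinated_score:.3f}")
print(f"faithful:     {faithful_score:.3f}")
```

In this toy case the sentence with the wrong year matches 5 of its 6 bigrams, while the faithful paraphrase matches only 3 of its 8, so the overlap score prefers the hallucinated output.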

Collaborators: This is joint work with Thibault Sellam, Ran Tian, Xuezhi Wang, Sebastian Gehrmann, Manaal Faruqui, Bhuwan Dhingra, Diyi Yang, and Dipanjan Das. 

About the speaker:

Ankur Parikh is a senior research scientist at Google NYC and adjunct assistant professor at NYU. His research interests are in natural language processing and machine learning, with a recent focus on high-precision text generation. Ankur received his PhD from Carnegie Mellon in 2015 and has received a best paper runner-up award at EMNLP 2014 and a best paper award in translational bioinformatics at ISMB 2011.

Additional Information

In Campus Calendar
Yes
Groups

College of Computing, Computational Science and Engineering, GVU Center, Machine Learning, ML@GT, OMS, School of Computational Science and Engineering, School of Computer Science, School of Interactive Computing

Invited Audience
Faculty/Staff, Postdoc, Public, Graduate students, Undergraduate students
Categories
Seminar/Lecture/Colloquium
Status
  • Created By: ablinder6
  • Workflow Status: Published
  • Created On: Oct 20, 2020 - 11:51am
  • Last Updated: Oct 20, 2020 - 11:58am