DeepDive: A Data Management System for Machine Learning Workloads

*********************************
There is now a CONTENT FREEZE for Mercury while we switch to a new platform. It began on Friday, March 10 at 6pm and will end on Wednesday, March 15 at noon. No new content can be created during this time, but all material in the system as of the beginning of the freeze will be migrated to the new platform, including users and groups. Functionally the new site is identical to the old one. webteam@gatech.edu
*********************************

Event Details
  • Date/Time:
    • Tuesday February 23, 2016
      10:00 am - 11:30 am
  • Location: Klaus 1116E and W
  • Phone:
  • URL:
  • Email:
  • Fee(s):
    N/A
  • Extras:
Contact
No contact information submitted.
Summaries

Summary Sentence: DeepDive creator and Stanford University postdoctoral researcher, Ce Zhang, will explain DeepDive's framework and underlying techniques to boost the range of machine learning workloads.

Full Summary: Ce Zhang, Stanford University postdoctoral researcher and PaleoDeepDive creator, a program widely used by paleontologists and featured in "Nature" magazine. Zhang will explain PaleoDeepDive's purpose, the framework behind it and the need to build a high-quality KBC system that can deal with information that is both diverse in type and size while boosting the range of machine learning workloads.

Media
  • Ce_Zhang Ce_Zhang
    (image/png)

Event Overview:

Many pressing questions in science are larger than the resource researchers have to solve them. These questions require the ability to consult scattered information, much of which are not organized in a structured form. Enters Ce Zhang, PaleoDeepDive creator, and Stanford University postdoctoral researcher. Zhang developed the program PaleoDeepDive, a machine-reading system for paleontologists. PaleoDeepDrive was featured in Nature magazine and is widely used in fields ranging from paleobiology, genomics, and event in anti-human trafficking.

Zhang research focuses on building a data management system for machine learning workloads with the goal to simplify the complex process of building KBC systems. He will explain PaleoDeepDive's framework, and its ultimate goal to allow scientists to build a KBC system without worrying about any algorithmic, performance, or scalability issues. Zhang will discuss the key challenge in building a high-quality KBC system, dealing with information that is both diverse in type and size, and underlying techniques to boost the range of machine learning workloads. 

About Ce Zhang:

Ce is a postdoctoral researcher in Computer Science at Stanford University. He is working with Christopher Ré on data management and database systems. With the indispensable help of many collaborators, his doctoral work produced the system PaleoDeepDive, a trained data system for automatic knowledge-base construction. As part of his doctoral thesis, he led the research efforts that won the 2014 SIGMOD Best Paper Award and was invited to the “Best of VLDB 2015” special issue; PaleoDeepDive, a machine-reading system for paleontologists, was featured in Nature magazine, and he also led the Stanford team that produced the top-performing machine-reading system for TAC-KBP 2014 slot-filling evaluations using DeepDive. Ce obtained his doctorate. from the University of Wisconsin-Madison, advised by Christopher Ré, and his Bachelor of Science degree from Peking University, advised by Bin Cui.

Additional Information

In Campus Calendar
No
Groups

School of Computer Science

Invited Audience
Undergraduate students, Faculty/Staff, Public, Graduate students
Categories
Seminar/Lecture/Colloquium
Keywords
Ce Zhang, College of Computing, Computer Science, distinguished lecture series, School of Computer Science
Status
  • Created By: Devin Young
  • Workflow Status: Published
  • Created On: Feb 16, 2016 - 1:10pm
  • Last Updated: Apr 13, 2017 - 5:16pm