Ph.D. Dissertation Defense - I-Fan Chen

*********************************
There is now a CONTENT FREEZE for Mercury while we switch to a new platform. It began on Friday, March 10 at 6pm and will end on Wednesday, March 15 at noon. No new content can be created during this time, but all material in the system as of the beginning of the freeze will be migrated to the new platform, including users and groups. Functionally the new site is identical to the old one. webteam@gatech.edu
*********************************

Event Details

Date/Time:
- Tuesday November 17, 2015 - Wednesday November 18, 2015
  2:00 pm - 1:59 pm
Location: Room 5244, Centergy
Phone:
URL:
Email:
Fee(s):
N/A
Extras:

Contact

No contact information submitted.

Summaries

Summary Sentence: ECE PhD Dissertation Defense

Full Summary: No summary paragraph submitted.

Title: Resource-dependent Acoustic and Language Modeling for Spoken Keyword Search

Committee:

Dr. C.-H. Lee , Advisor

Dr. Biing-Hwang Juang, ECE

Dr. Mark Clements, ECE

Dr. Gee-Kung Chang, ECE

Dr. Yao Xie, ISyE

Abstract:

In this dissertation, three research directions were explored to alleviate two major issues, i.e., the use of incorrect models and training/test condition mismatches, in the modeling frameworks of modern spoken keyword search (KWS) systems. Each of the three research directions, which include (i) data-efficient training processes, (ii) system optimization objectives, and (iii) data augmentation, utilizes different types and amounts of training resources in different ways to ameliorate the two issues of acoustic and language modeling in modern KWS systems. To be more specific, resource-dependent keyword modeling, keyword-boosted sMBR (state-level minimum Bayes risk) training, and multilingual acoustic modeling are proposed and investigated for acoustic modeling in this research. For language modeling, keyword-aware language modeling, discriminative keyword-aware language modeling, and web text augmented language modeling are presented and discussed.

The dissertation provides a comprehensive collection of solutions and strategies to the acoustic and language modeling problems in KWS. It also offers insights into the realization of good-performance KWS systems. Experimental results show that the data-efficient training process and data augmentation are the two directions providing the most prominent performance improvement for KWS systems. While modifying system optimization objectives provides smaller yet consistent performance enhancement in KWS systems with different configurations. The effects of the proposed acoustic and language modeling approaches in the three directions are also shown to be additive and can be combined to further improve the overall KWS system performance.

Additional Information

In Campus Calendar

Groups

ECE Ph.D. Dissertation Defenses

Invited Audience

Public

Categories

Other/Miscellaneous

Keywords

graduate students, Phd Defense

Status

Created By: Daniela Staiculescu
Workflow Status: Published
Created On: Nov 15, 2015 - 8:03am
Last Updated: Oct 7, 2016 - 10:14pm

Georgia Tech

Ph.D. Dissertation Defense - I-Fan Chen

Additional Information