Ph.D. Proposal Oral Exam - Desmond Caulley

Event Details
  • Date/Time:
    • Wednesday June 3, 2020
      10:30 am - 12:30 pm
  • Location: https://bluejeans.com/394588141
  • Fee(s):
    N/A
Summaries

Summary Sentence: Improved Automatic Analysis Methods for Lena-Obtained Audio Recordings of Children with Autism Spectrum Disorder

Title:  Improved Automatic Analysis Methods for Lena-Obtained Audio Recordings of Children with Autism Spectrum Disorder

Committee: 

Dr. Anderson, Advisor   

Dr. Lee, Chair

Dr. Inan

Dr. Clements

Abstract:

The objective of the proposed research is to develop novel automatic analysis methods for audio recordings of children with autism spectrum disorder (ASD). The first area of exploration is speaker diarization, which identifies "who spoke when" and thus differentiates the audio segments in which a child vocalizes from those in which a parent speaks. Once diarization is complete, we can compute language behavior statistics that clinicians can use for diagnostic purposes and for tracking treatment effectiveness. Audio recordings of children with ASD are often made in a clinic or at home, and diarization performance is directly affected by the environment in which the audio is obtained. This work will thus center on the development of an environment-aware speaker diarization model. Traditional i-vector-based speaker modeling can be supplemented with environmental factors (e-vectors) for increased accuracy. Additional work on audio diarization will involve time delay neural network (TDNN) techniques, namely x-vectors. Unlike i-vectors, which are trained generatively, x-vectors use speaker labels for direct supervision during training. X-vector modeling addresses the issue of limited training data through transfer learning. Finally, an additional goal of this research is to develop a long short-term memory (LSTM)-based interrogative utterance detector, a tool that can enhance language measure statistics by enabling clinicians to track a child’s response to questions from parents.
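
As an illustration of the kind of language behavior statistics that diarization output makes possible, the following is a minimal sketch and not the method proposed in this research. The segment format, the speaker labels, the `language_stats` helper, and the definition of a conversational turn (a change of speaker within `max_gap` seconds) are all assumptions made for the example.

```python
from collections import defaultdict
from typing import NamedTuple


class Segment(NamedTuple):
    """One diarized span of audio (hypothetical format, times in seconds)."""
    speaker: str
    start: float
    end: float


def language_stats(segments, max_gap=5.0):
    """Summarize diarized segments per speaker and count conversational turns.

    Assumption: a "turn" is a change of speaker where the silence since the
    previous segment is at most max_gap seconds.
    """
    ordered = sorted(segments, key=lambda s: s.start)
    counts = defaultdict(int)       # number of vocalizations per speaker
    durations = defaultdict(float)  # total speaking time per speaker
    turns = 0
    prev = None
    for seg in ordered:
        counts[seg.speaker] += 1
        durations[seg.speaker] += seg.end - seg.start
        if (
            prev is not None
            and seg.speaker != prev.speaker
            and seg.start - prev.end <= max_gap
        ):
            turns += 1
        prev = seg
    return {"counts": dict(counts), "durations": dict(durations), "turns": turns}


# Made-up example segments, for illustration only.
segments = [
    Segment("parent", 0.0, 2.0),
    Segment("child", 2.5, 3.5),
    Segment("parent", 4.0, 6.0),
    Segment("child", 20.0, 21.0),
]
stats = language_stats(segments)
print(stats)
```

In this sketch the last child segment does not count as a turn because it follows the parent by more than `max_gap` seconds; the threshold is an arbitrary choice here.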

Additional Information

In Campus Calendar
No
Groups

ECE Ph.D. Proposal Oral Exams

Invited Audience
Public
Categories
Other/Miscellaneous
Keywords
PhD proposal, graduate students
Status
  • Created By: Daniela Staiculescu
  • Workflow Status: Published
  • Created On: May 26, 2020 - 3:01pm
  • Last Updated: May 26, 2020 - 3:01pm