Title: Some New Understanding of Phase Information and its Application to Speech Processing
Committee:
Dr. Chin-Hui Lee, ECE, Chair, Advisor
Dr. Biing Hwang Juang, ECE
Dr. Mark Clements, ECE
Dr. Geoffrey Li, ECE
Dr. Yao Xie, ISyE
Abstract:
With the rapid growth of deep neural network models, performance on more and more tasks has improved after moving to deep models. Speech processing applications such as speech enhancement, speech bandwidth expansion, and dereverberation have benefited as well. Most deep models focus on improving the estimation of the spectral magnitude. However, there is evidence that the phase spectra are informative as well. Therefore, this dissertation investigates practical approaches to recovering the spectral phase by resolving two inconsistency issues, frame-length inconsistency and frame-overlap inconsistency, leveraging convex programming and alternating projection, respectively. Furthermore, frameworks that integrate both methods are explored. The proposed approaches and frameworks take advantage of some speech signal characteristics and make very few assumptions, and can therefore be applied to various speech processing tasks.