*********************************
There is now a CONTENT FREEZE for Mercury while we switch to a new platform. It began on Friday, March 10 at 6pm and will end on Wednesday, March 15 at noon. No new content can be created during this time, but all material in the system as of the beginning of the freeze will be migrated to the new platform, including users and groups. Functionally the new site is identical to the old one. webteam@gatech.edu
*********************************
Title: Towards Development of No-reference Objective Measures for Perceptual Evaluation of Singing Voice Separation
Committee:
Dr. Elliot Moore, ECE, Chair , Advisor
Dr. Alexander Lerch, Music, Co-Advisor
Dr. Mark Clements, ECE
Dr. Pamela Bhatti, ECE
Dr. Justin Romberg, ECE
Dr. Timothy Hsu, Music
Abstract:
Singing Voice Separation (SVS) uses audio source separation methods to isolate the vocal component from the background accompaniment in a song mix. A key challenge currently associated with evaluation of SVS is a lack of objective measures which correlate consistently with subjective evaluation. Additionally, the current state-of-the-art evaluation measures require the use of unmixed vocal and instrumental tracks which are often not available. The research presented in this thesis is an attempt to address these challenges by introducing two new objective measures for perceptually relevant evaluation of SVS.
The Vocal Isolation Score (VIS) is designed to assess the quality of isolation produced by various SVS algorithms when separating the vocals from the accompaniment. Similarly, the Vocal Intelligibility Preservation Score (VIPS) evaluates the preservation of intelligibility in the separated vocals. Other than an improvement upon the state-of-the-art, both VIS and VIPS have the additional advantage that they do not require references in the form of unmixed vocal or instrumental tracks to perform objective evaluation, unlike the currently popular objective measures used for evaluating audio source separation.