AN SVM BASED CLASSIFICATION APPROACH TO SPEECH SEPARATION
Speech Enhancement
Presented by: Kun Han, Author(s): Kun Han, DeLiang Wang, The Ohio State University, United States
Monaural speech separation is a very challenging task. CASA-based systems utilize acoustic features to produce a time-frequency (T-F) mask. In this study, we propose a classification approach to monaural separation problem. Our feature set consists of pitch-based features and amplitude modulation spectrum features, which can discriminate both voiced and unvoiced speech from nonspeech interference. We employ support vector machines (SVMs) followed by a re-thresholding method to classify each T-F unit as either target-dominated or interference-dominated. An auditory segmentation stage is then utilized to improve SVM-generated results. Systematic evaluations show that our approach produces high quality binary masks and outperforms a previous system in terms of classification accuracy.
Lecture Information
Recorded: | 2011-05-27 16:15 - 16:35, Panorama |
---|---|
Added: | 7. 6. 2011 19:19 |
Number of views: | 48 |
Video resolution: | 1024x576 px, 512x288 px |
Video length: | 0:20:25 |
Audio track: | MP3 [6.98 MB], 0:20:25 |
Comments