AN SVM BASED CLASSIFICATION APPROACH TO SPEECH SEPARATION
Speech Enhancement
Speaker: Kun Han, Authors: Kun Han, DeLiang Wang, The Ohio State University, United States
Monaural speech separation is a very challenging task. Systems based on computational auditory scene analysis (CASA) utilize acoustic features to produce a time-frequency (T-F) mask. In this study, we propose a classification approach to the monaural separation problem. Our feature set consists of pitch-based features and amplitude modulation spectrum features, which can discriminate both voiced and unvoiced speech from nonspeech interference. We employ support vector machines (SVMs) followed by a re-thresholding method to classify each T-F unit as either target-dominated or interference-dominated. An auditory segmentation stage is then utilized to improve the SVM-generated results. Systematic evaluations show that our approach produces high-quality binary masks and outperforms a previous system in terms of classification accuracy.
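The classification step in the abstract can be illustrated with a minimal sketch. It assumes the SVM has already produced a real-valued decision score for every T-F unit, and it takes one plausible reading of "re-thresholding": instead of cutting the scores at 0, pick the threshold that maximizes hit rate minus false-alarm rate on labeled held-out units. The abstract does not state the criterion, so that choice and the names `binary_mask`, `hit_minus_fa`, and `rethreshold` are assumptions for illustration, not the authors' implementation.

```python
def binary_mask(decision_values, threshold=0.0):
    """Label each T-F unit 1 (target-dominated) if its score exceeds
    the threshold, otherwise 0 (interference-dominated).
    decision_values is a list of frames, each a list of per-channel scores."""
    return [[1 if v > threshold else 0 for v in frame]
            for frame in decision_values]


def hit_minus_fa(scores, labels, threshold):
    """Hit rate minus false-alarm rate at a given threshold.
    labels use 1 for target-dominated units and 0 for interference."""
    hits = sum(1 for s, y in zip(scores, labels) if y == 1 and s > threshold)
    false_alarms = sum(1 for s, y in zip(scores, labels) if y == 0 and s > threshold)
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    hit_rate = hits / n_pos if n_pos else 0.0
    fa_rate = false_alarms / n_neg if n_neg else 0.0
    return hit_rate - fa_rate


def rethreshold(scores, labels):
    """Assumed criterion: return the candidate threshold that maximizes
    hit rate minus false-alarm rate on labeled held-out scores."""
    distinct = sorted(set(scores))
    # Candidates: one value below every score, then midpoints between neighbors.
    candidates = [distinct[0] - 1.0]
    candidates += [(a + b) / 2 for a, b in zip(distinct, distinct[1:])]
    return max(candidates, key=lambda t: hit_minus_fa(scores, labels, t))


# Toy held-out scores: the unit scored -0.5 is target-dominated,
# so the default threshold of 0 would misclassify it.
held_out_scores = [-2.0, -1.0, -0.5, 0.5]
held_out_labels = [0, 0, 1, 1]
threshold = rethreshold(held_out_scores, held_out_labels)
mask = binary_mask([[-2.0, -0.5], [-1.0, 0.5]], threshold)
```

In this toy case the selected threshold falls between -1.0 and -0.5, so the unit scored -0.5 is recovered as target-dominated. The segmentation stage mentioned in the abstract would operate on the resulting mask and is not sketched here.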
Lecture information
Recorded: | 2011-05-27 16:15 - 16:35, Panorama |
---|---|
Added: | 2011-06-07 19:19 |
Views: | 48 |
Video resolution: | 1024x576 px, 512x288 px |
Video length: | 0:20:25 |
Audio track: | MP3 [6.98 MB], 0:20:25 |