PHASE-BASED INFORMATION FOR VOICE PATHOLOGY DETECTION
Modeling and Analysis of Speech Production
Presented by: Thomas Drugman
Author(s): Thomas Drugman, Thomas Dubuisson, Thierry Dutoit, University of Mons, Belgium
Most current speech processing approaches extract information from the magnitude spectrum. However, recent perceptual studies have underlined the importance of the phase component. The goal of this paper is to investigate the potential of phase-based features for automatically detecting voice disorders. It is shown that group delay functions are appropriate for characterizing irregularities in phonation. In addition, adherence to the mixed-phase model of speech is discussed. The proposed phase-based features are evaluated and compared with other parameters derived from the magnitude spectrum. The two feature streams are shown to be complementary. Furthermore, phase-based features turn out to convey a great amount of relevant information, leading to high discrimination performance.
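The abstract does not say how the group delay functions are computed. As a minimal sketch, assuming the standard definition of group delay (the negative derivative of the phase spectrum with respect to frequency), the following Python example evaluates it for a single frame without phase unwrapping. The function names `dft` and `group_delay`, the `eps` guard, and the test frame are illustrative assumptions, not the authors' implementation. A naive DFT is used only to keep the example dependency-free; an FFT routine would normally take its place.

```python
import cmath
import math


def dft(signal):
    """Naive O(N^2) discrete Fourier transform (dependency-free, for illustration only)."""
    n_samples = len(signal)
    return [
        sum(signal[n] * cmath.exp(-2j * math.pi * k * n / n_samples)
            for n in range(n_samples))
        for k in range(n_samples)
    ]


def group_delay(frame, eps=1e-12):
    """Group delay (in samples) of a frame at each DFT bin.

    Uses the identity tau(k) = Re(Y(k) * conj(X(k))) / |X(k)|^2,
    where X = DFT(x[n]) and Y = DFT(n * x[n]), so no phase unwrapping is needed.
    """
    spectrum = dft(frame)
    weighted = dft([n * x for n, x in enumerate(frame)])
    delays = []
    for x_k, y_k in zip(spectrum, weighted):
        power = abs(x_k) ** 2
        # eps avoids division by zero at spectral nulls (an assumption of this sketch)
        delays.append((y_k * x_k.conjugate()).real / max(power, eps))
    return delays


# Hypothetical test frame: a unit impulse delayed by 5 samples
frame = [0.0] * 16
frame[5] = 1.0
delays = group_delay(frame)
```

For this pure delay the group delay equals 5 samples at every bin. On real speech frames the function varies strongly with frequency and is sensitive to spectral zeros, which is why a guard such as `eps` (or some form of smoothing) is commonly applied.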
Lecture Information
| Recorded: | 2011-05-27 09:50 - 10:10, Panorama |
|---|---|
| Added: | 2011-06-09 10:42 |
| Number of views: | 27 |
| Video resolution: | 1024x576 px, 512x288 px |
| Video length: | 0:19:54 |
| Audio track: | MP3 [6.80 MB], 0:19:54 |