SuperLectures.com

GAMMATONE SUB-BAND MAGNITUDE-DOMAIN DEREVERBERATION FOR ASR

Full Paper at IEEE Xplore

Robust ASR

Přednášející: Bhiksha Raj, Autoři: Kshitiz Kumar, Rita Singh, Carnegie Mellon University, United States; Bhiksha Raj, Disney Research, United States; Richard Stern, Carnegie Mellon University, United States

We present an algorithm for dereverberation of speech signals for automatic speech recognition (ASR) applications. Often ASR systems are presented with speech that has been recorded in environments that include noise and reverberation. The performance of ASR systems degrades with increasing levels of noise and reverberation. While many algorithms have been proposed for robust ASR in noisy environments, reverberation is still a challenging problem. In this paper, we present an approach for dereverberation that models reverberation as a convolution operation in the speech spectral domain. Using a least-squares error criterion we decompose reverberated spectra into clean spectra convolved with a filter. We incorporate non-negativity and sparsity of the speech spectra as constraints within a non-negative matrix factorization (NMF) framework to achieve the decomposition. In ASR experiments where the system is trained with unreverberated and reverberated speech, we show that the proposed approach can provide upto 40% and 19% relative reduction respectively in performance.


  Přepis řeči

|

  Slajdy

Zvětšit slajd | Zobrazit všechny slajdy

0:00:16

  1. slajd

0:00:37

  2. slajd

0:01:00

  3. slajd

0:01:56

  4. slajd

0:02:31

  5. slajd

0:03:29

  6. slajd

0:05:22

  7. slajd

0:06:00

     7. slajd

0:06:27

  8. slajd

0:07:00

  9. slajd

0:07:29

 10. slajd

0:08:05

 11. slajd

0:08:47

 12. slajd

0:09:14

 13. slajd

0:10:28

 14. slajd

0:11:13

 15. slajd

0:13:19

 16. slajd

0:13:39

 17. slajd

0:14:29

 18. slajd

0:14:48

 19. slajd

0:15:47

 20. slajd

0:16:02

 21. slajd

0:16:18

 22. slajd

0:17:12

 23. slajd

0:17:42

 24. slajd

0:17:53

 25. slajd

0:18:26

 26. slajd

  Komentáře

Please sign in to post your comment!

  Informace o přednášce

Nahráno: 2011-05-26 17:55 - 18:15, Panorama
Přidáno: 15. 6. 2011 19:50
Počet zhlédnutí: 38
Rozlišení videa: 1024x576 px, 512x288 px
Délka videa: 0:24:02
Audio stopa: MP3 [8.15 MB], 0:24:02