SuperLectures.com

SPEAKER AND NOISE FACTORISATION ON THE AURORA4 TASK

Robust ASR

Full Paper at IEEE Xplore

Přednášející: Yongqiang Wang, Autoři: Yongqiang Wang, Mark Gales, University of Cambridge, United Kingdom

For many realistic scenarios, there are multiple factors that affect the clean speech signal. In this work approaches to handling two such factors, speaker and background noise differences, simultaneously are described. A new adaptation scheme is proposed. Here the acoustic models are first adapted to the target speaker via an MLLR transform. This is followed by adaptation to the target noise environment via model-based vector Taylor series (VTS) compensation. These speaker and noise transforms are jointly estimated, using maximum likelihood. Experiments on the AURORA4 task demonstrate that this adaptation scheme provides improved performance over VTS-based noise adaptation. In addition, this framework enables the speech and noise to be factorised, allowing the speaker transform estimated in one noise condition to be successfully used in a different noise condition.


  Přepis řeči

|

  Slajdy

Zvětšit slajd | Zobrazit všechny slajdy

0:00:16

  1. slajd

0:00:51

  2. slajd

0:01:41

  3. slajd

0:02:23

  4. slajd

0:03:08

  5. slajd

0:04:03

  6. slajd

0:05:15

  7. slajd

0:06:28

  8. slajd

0:08:12

  9. slajd

0:10:20

 10. slajd

0:10:55

 11. slajd

0:12:04

 12. slajd

0:14:14

 13. slajd

0:16:22

     8. slajd

0:16:39

     5. slajd

0:16:51

     6. slajd

  Komentáře

Please sign in to post your comment!

  Informace o přednášce

Nahráno: 2011-05-26 16:15 - 16:35, Panorama
Přidáno: 15. 6. 2011 18:50
Počet zhlédnutí: 22
Rozlišení videa: 1024x576 px, 512x288 px
Délka videa: 0:17:59
Audio stopa: MP3 [6.07 MB], 0:17:59