SuperLectures.com

UNSUPERVISED ACOUSTIC SUB-WORD UNIT DETECTION FOR QUERY-BY-EXAMPLE SPOKEN TERM DETECTION

Speaker Diarization

Full Paper at IEEE Xplore

Přednášející: Marijn Huijbregts, Autoři: Marijn Huijbregts, Mitchell McLaren, David van Leeuwen, Radboud University Nijmegen, Netherlands

In this paper we present a method for automatically generating acoustic sub-word units that can substitute conventional phone models in a query-by-example spoken term detection system. We generate the sub-word units with a modified version of our speaker diarization system. Given a speech recording, the original diarization system generates a set of speaker models in an unsupervised manner without the need for training or development data. Modifying the diarization system to process the speech of a single speaker and decreasing the minimum segment duration constraint allows us to detect speaker-dependent sub-word units. For the task of query-by-example spoken term detection, we show that the proposed system performs well on both broadcast and non-broadcast recordings, unlike a conventional phone-based system trained solely on broadcast data. A mean average precision of 0.28 and 0.38 was obtained for experiments on broadcast news and on a set of war veteran interviews, respectively.


  Přepis řeči

|

  Slajdy

Zvětšit slajd | Zobrazit všechny slajdy

0:00:22

  1. slajd

0:00:40

  2. slajd

0:01:55

  3. slajd

0:02:46

  4. slajd

0:03:35

  5. slajd

0:05:06

  6. slajd

0:06:29

  7. slajd

0:07:08

  8. slajd

0:08:24

  9. slajd

0:09:16

 10. slajd

  Komentáře

Please sign in to post your comment!

  Informace o přednášce

Nahráno: 2011-05-24 15:25 - 15:45, Panorama
Přidáno: 15. 6. 2011 13:40
Počet zhlédnutí: 17
Rozlišení videa: 1024x576 px, 512x288 px
Délka videa: 0:11:36
Audio stopa: MP3 [3.87 MB], 0:11:36