SuperLectures.com

SPEAKER DIARIZATION OF HETEROGENEOUS WEB VIDEO FILES: A PRELIMINARY STUDY

Speaker Diarization

Full Paper at IEEE Xplore

Presented by: Pierre Clement
Authors: Pierre Clement, Université d'Avignon, France; Thierry Bazillon, Aix Marseille Université, France; Corinne Fredouille, Université d'Avignon, France

In the last ten years, the Internet and its applications have changed significantly, mainly thanks to the growth in available personal resources. In multimedia, the most striking development is the continuously growing success of video-sharing websites. This success, however, makes it difficult to efficiently search, index, and access relevant information about these documents. Speaker diarization is an important task in the overall information retrieval process. This paper describes an audio/video database built specifically for the speaker diarization task and covering different video genres. Through preliminary experiments, it highlights the difficulties encountered in this context, which stem mainly from the heterogeneity of the database.



  Lecture Information

Recorded: 2011-05-24 15:05 - 15:25, Panorama
Added: 2011-06-15 14:22
Number of views: 30
Video resolution: 1024x576 px, 512x288 px
Video length: 0:15:02
Audio track: MP3 [5.06 MB], 0:15:02