SuperLectures.com

BAYESIAN INTEGRATION OF AUDIO AND VISUAL INFORMATION FOR MULTI-TARGET TRACKING USING A CB-MEMBER FILTER

Joint Audio Visual Processing

Full Paper at IEEE Xplore

Presented by: Reza Hoseinnezhad, Author(s): Reza Hoseinnezhad, RMIT University, Australia; Ba-Ngu Vo, Ba-Tuong Vo, The University of Western Australia, Australia; David Suter, The University of Adelaide, Australia

A new method is presented for integration of audio and visual information in multiple target tracking applications. The proposed approach uses a Bayesian filtering formulation and exploits multi-Bernoulli random finite set approximations. The work presented in this paper is the first principled Bayesian estimation approach to solve the sensor fusion problems that involve intermittent sensory data (e.g. audio data for a person who occasionally speaks.) We have examined our method with case studies from the SPEVI database. The results show nearly perfect tracking of people not only when they are silent but also when they are not visible to the camera (but speaking).


  Speech Transcript

|

  Slides

Enlarge the slide | Show all slides in a pop-up window

0:00:16

  1. slide

0:00:39

  2. slide

0:01:40

  3. slide

0:02:30

     3. slide

0:03:08

     3. slide

0:03:39

  4. slide

0:03:49

  5. slide

0:04:59

  6. slide

0:07:30

  7. slide

0:09:17

  8. slide

0:10:07

  9. slide

0:11:08

 10. slide

0:12:49

 11. slide

0:13:48

 12. slide

0:14:35

 13. slide

0:16:38

 14. slide

0:19:07

 15. slide

0:21:33

 16. slide

  Comments

Please sign in to post your comment!

  Lecture Information

Recorded: 2011-05-24 17:55 - 18:15, Club H
Added: 9. 6. 2011 00:58
Number of views: 35
Video resolution: 1024x576 px, 512x288 px
Video length: 0:22:59
Audio track: MP3 [7.86 MB], 0:22:59