SuperLectures.com

CLASSIFYING SOUNDTRACKS WITH AUDIO TEXTURE FEATURES

Innovative Representations of Audio

Full Paper at IEEE Xplore

Přednášející: Josh McDermott, Autoři: Daniel P.W. Ellis, Xiaohong Zeng, Columbia University, United States; Josh McDermott, New York University, United States

Sound textures may be defined as sounds whose character depends on statistical properties as much as the specific details of each individually-perceived event. Recent work has devised a set of statistics that, when synthetically imposed, cause listeners to identify a wide range of environmental sound textures. In this work, we investigate using these statistics for automatic classification of a set of environmental sound classes defined over a set of web videos depicting ``multimedia events''. We show that the texture statistics perform as well as our best conventional statistics (based on MFCC covariance). We further examine the relative contributions of the different statistics, showing the importance of modulation spectra and cross-band envelope correlations.


  Přepis řeči

|

  Komentáře

Please sign in to post your comment!

  Informace o přednášce

Nahráno: 2011-05-26 10:10 - 10:30, Club D
Přidáno: 15. 6. 2011 18:06
Počet zhlédnutí: 39
Rozlišení videa: 1024x576 px, 512x288 px
Délka videa: 0:23:58
Audio stopa: MP3 [8.12 MB], 0:23:58