COST-SENSITIVE STACKING FOR AUDIO TAG ANNOTATION AND RETRIEVAL
Multimedia Indexing and Retrieval
Presented by: Hung-Yi Lo, Author(s): Hung-Yi Lo, Ju-Chiang Wang, Hsin-Min Wang, Institute of Information Science / Academia Sinica, Taiwan; Shou-De Lin, National Taiwan University, Taiwan
Audio tags correspond to keywords that people use to describe different aspects of a music clip, such as the genre, mood, and instrumentation. Since social tags are usually assigned by people with different levels of musical knowledge, they inevitably contain noisy information. By treating the tag counts as costs, we can model the audio tagging problem as a cost-sensitive classification problem. In addition, tag correlation is another useful information for automatic audio tagging since some tags often co-occur. By considering the co-occurrences of tags, we can model the audio tagging problem as a multi-label classification problem. To exploit the tag count and correlation information jointly, we formulate the audio tagging task as a novel cost-sensitive multi-label (CSML) learning problem. The results of audio tag annotation and retrieval experiments demonstrate that the new approach outperforms our MIREX 2009 winning method.
Lecture Information
Recorded: | 2011-05-25 14:05 - 14:25, Club H |
---|---|
Added: | 9. 6. 2011 00:15 |
Number of views: | 33 |
Video resolution: | 1024x576 px, 512x288 px |
Video length: | 0:14:18 |
Audio track: | MP3 [4.88 MB], 0:14:18 |
Comments