COST-SENSITIVE STACKING FOR AUDIO TAG ANNOTATION AND RETRIEVAL
Multimedia Indexing and Retrieval
Přednášející: Hung-Yi Lo, Autoři: Hung-Yi Lo, Ju-Chiang Wang, Hsin-Min Wang, Institute of Information Science / Academia Sinica, Taiwan; Shou-De Lin, National Taiwan University, Taiwan
Audio tags correspond to keywords that people use to describe different aspects of a music clip, such as the genre, mood, and instrumentation. Since social tags are usually assigned by people with different levels of musical knowledge, they inevitably contain noisy information. By treating the tag counts as costs, we can model the audio tagging problem as a cost-sensitive classification problem. In addition, tag correlation is another useful information for automatic audio tagging since some tags often co-occur. By considering the co-occurrences of tags, we can model the audio tagging problem as a multi-label classification problem. To exploit the tag count and correlation information jointly, we formulate the audio tagging task as a novel cost-sensitive multi-label (CSML) learning problem. The results of audio tag annotation and retrieval experiments demonstrate that the new approach outperforms our MIREX 2009 winning method.
Informace o přednášce
Nahráno: | 2011-05-25 14:05 - 14:25, Club H |
---|---|
Přidáno: | 9. 6. 2011 00:15 |
Počet zhlédnutí: | 33 |
Rozlišení videa: | 1024x576 px, 512x288 px |
Délka videa: | 0:14:18 |
Audio stopa: | MP3 [4.88 MB], 0:14:18 |
Komentáře