SuperLectures.com

COMBINING HMM-BASED MELODY EXTRACTION AND NMF-BASED SOFT MASKING FOR SEPARATING VOICE AND ACCOMPANIMENT FROM MONAURAL AUDIO

Acoustic Source Separation

Full Paper at IEEE Xplore

Přednášející: Yun Wang, Autoři: Yun Wang, Zhijian Ou, Tsinghua University, China

Modern monaural voice and accompaniment separation systems usually consist of two main modules: melody extraction and time-frequency masking. A main distinction between different separation systems lies in what approaches are used for the two modules. Popular techniques for melody extraction include hidden Markov models (HMMs) and non-negative matrix factorization (NMF), and masking includes hard and soft masking. This paper investigates the flaw of NMF-based melody extraction, and proposes the combination of HMM-based melody extraction (equipped with a newly-defined feature) and NMF-based soft masking. Evaluations on two publicly available databases show that the proposed system reaches state-of-the-art performance and outperforms several other combinations.


  Přepis řeči

|

  Slajdy

Zvětšit slajd | Zobrazit všechny slajdy

0:01:08

  1. slajd

0:01:23

  2. slajd

0:02:31

  3. slajd

0:03:17

  4. slajd

0:03:44

  5. slajd

0:05:28

  6. slajd

0:06:36

  7. slajd

0:09:33

  8. slajd

0:10:58

  9. slajd

0:11:48

 10. slajd

0:13:01

 11. slajd

0:13:35

 12. slajd

0:15:32

 13. slajd

0:16:27

 14. slajd

0:18:16

 15. slajd

0:19:35

    14. slajd

  Komentáře

Please sign in to post your comment!

  Informace o přednášce

Nahráno: 2011-05-24 10:15 - 10:35, Club H
Přidáno: 7. 6. 2011 19:19
Počet zhlédnutí: 133
Rozlišení videa: 1024x576 px, 512x288 px
Délka videa: 0:20:41
Audio stopa: MP3 [7.07 MB], 0:20:41