InterSpeech 2021

Single-channel speech enhancement

Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification
(3 minutes introduction)

Aswin Sivaraman (Indiana University, USA), Sunwoo Kim (Indiana University, USA), Minje Kim (Indiana University, USA)

Speech Denoising with Auditory Models
(3 minutes introduction)

Mark R. Saddler (MIT, USA), Andrew Francl (MIT, USA), Jenelle Feather (MIT, USA), Kaizhi Qian (MIT-IBM Watson AI Lab, USA), Yang Zhang (MIT-IBM Watson AI Lab, USA), Josh H. McDermott (MIT, USA)

Human Listening and Live Captioning: Multi-Task Training for Speech Enhancement
(3 minutes introduction)

Sefik Emre Eskimez (Microsoft, USA), Xiaofei Wang (Microsoft, USA), Min Tang (Microsoft, USA), Hemin Yang (Microsoft, USA), Zirun Zhu (Microsoft, USA), Zhuo Chen (Microsoft, USA), Huaming Wang (Microsoft, USA), Takuya Yoshioka (Microsoft, USA)

A Maximum Likelihood Approach to SNR-Progressive Learning Using Generalized Gaussian Distribution for LSTM-Based Speech Enhancement
(3 minutes introduction)

Xiao-Qi Zhang (USTC, China), Jun Du (USTC, China), Li Chai (USTC, China), Chin-Hui Lee (Georgia Tech, USA)

Whisper Speech Enhancement Using Joint Variational Autoencoder for Improved Speech Recognition
(3 minutes introduction)

Vikas Agrawal (Samsung, India), Shashi Kumar (Samsung, India), Shakti P. Rath (Reverie Language Technologies, India)

Speech Denoising without Clean Training Data: A Noise2Noise Approach
(3 minutes introduction)

Madhav Mahesh Kashyap (PES University, India), Anuj Tambwekar (PES University, India), Krishnamoorthy Manohara (PES University, India), S. Natarajan (PES University, India)

Speech Enhancement with Topology-enhanced Generative Adversarial Networks (GANs)
(3 minutes introduction)

Xudong Zhang (CUNY Graduate Center, USA), Liang Zhao (CUNY Lehman College, USA), Feng Gu (CUNY CSI, USA)