InterSpeech 2021

INTERSPEECH is the world’s largest and most comprehensive conference on the science and technology of spoken language processing. INTERSPEECH conferences emphasize interdisciplinary approaches addressing all aspects of speech science and technology, ranging from basic theories to advanced applications.

The theme of INTERSPEECH 2021, held in Brno, Czechia, is "Speech everywhere". Speech is becoming an indispensable part of all AI systems and is no longer considered an isolated block. We are seeing the emergence of larger systems that treat speech, vision, language, interfaces, and external knowledge in an integrated way, and that learn multi-modal embeddings or otherwise jointly optimize performance. "Speech everywhere" also requires speech engineering to become more aware of the principles of human speech communication processes, and we therefore specifically encourage contributions in human speech processing.

In addition to regular oral and poster sessions, INTERSPEECH 2021 featured plenary talks by internationally renowned experts, tutorials, special sessions and challenges, show & tell sessions, and exhibits. A number of satellite events took place around INTERSPEECH 2021.

Website: www.interspeech2021.org, YouTube

Keynotes

Number of recordings: 4

Survey talks

Number of recordings: 4

Acoustic event detection and acoustic scene classification

Number of recordings: 5

Applications in transcription, education and learning

Number of recordings: 8

ASR Technologies and systems

Number of recordings: 1

Assessment of pathological speech and language I

Number of recordings: 4

Assessment of pathological speech and language II

Number of recordings: 13

Automatic Speech Recognition in Air Traffic Management

Number of recordings: 4

Communication and interaction, multimodality

Number of recordings: 8

ConferencingSpeech 2021 challenge: Far-field Multi-Channel Speech Enhancement for Video Conferencing

Number of recordings: 5

Cross/multi-lingual and code-switched ASR

Number of recordings: 7

Disordered speech

Number of recordings: 3

Diverse modes of speech acquisition and processing

Number of recordings: 10

Embedding and Network Architecture for Speaker Recognition

Number of recordings: 3

Emotion and Sentiment Analysis I

Number of recordings: 2

Emotion and Sentiment Analysis II

Number of recordings: 9

Emotion and Sentiment Analysis III

Number of recordings: 4

Feature, Embedding and Neural Architecture for Speaker Recognition

Number of recordings: 8

Graph and End-to-End Learning for Speaker Recognition

Number of recordings: 1

Health and Affect I

Number of recordings: 3

Health and Affect II

Number of recordings: 9

INTERSPEECH 2021 Acoustic Echo Cancellation Challenge

Number of recordings: 3

INTERSPEECH 2021 Deep Noise Suppression Challenge

Number of recordings: 2

Keyword search and spoken language processing

Number of recordings: 3

Language and Accent Recognition

Number of recordings: 3

Language and Lexical Modeling for ASR

Number of recordings: 8

Language Modeling and Text-based Innovations for ASR

Number of recordings: 3

Linguistic Components in end-to-end ASR

Number of recordings: 5

Low-resource speech recognition

Number of recordings: 7

Miscellaneous topics in ASR

Number of recordings: 3

Multi- and cross-lingual ASR, other topics in ASR

Number of recordings: 8

Multi-channel speech enhancement and hearing aids

Number of recordings: 9

Multimodal systems

Number of recordings: 10

Neural Network Training Methods and Architectures for ASR

Number of recordings: 4

Neural network training methods for ASR

Number of recordings: 9

Non-Autoregressive Sequential Modeling for Speech Processing

Number of recordings: 7

Non-native speech

Number of recordings: 5

Novel neural network architectures for ASR

Number of recordings: 8

OpenASR20 and Low Resource ASR Development

Number of recordings: 3

Oriental Language Recognition

Number of recordings: 3

Phonation and voicing

Number of recordings: 4

Phonetics I

Number of recordings: 1

Phonetics II

Number of recordings: 11

Privacy-preserving Machine Learning for Audio & Speech Processing

Number of recordings: 9

Prosodic features and structure

Number of recordings: 8

Resource-constrained ASR

Number of recordings: 8

Robust and Far-field ASR

Number of recordings: 3

Robust Speaker Recognition

Number of recordings: 8

SdSV Challenge 2021: Analysis and Exploration of New Ideas on Short-Duration Speaker Verification

Number of recordings: 2

Search/decoding techniques and confidence measures for ASR

Number of recordings: 6

Self-supervision and semi-supervision for neural ASR training

Number of recordings: 5

Show and Tell 1

Number of recordings: 5

Show and Tell 2

Number of recordings: 5

Show and Tell 3

Number of recordings: 7

Show and Tell 4

Number of recordings: 7

Single-channel speech enhancement

Number of recordings: 7

Source Separation I

Number of recordings: 2

Source Separation II

Number of recordings: 10

Source Separation III

Number of recordings: 3

Source separation, dereverberation and echo cancellation

Number of recordings: 3

Speaker Diarization I

Number of recordings: 3

Speaker Diarization II

Number of recordings: 9

Speaker Recognition: Applications

Number of recordings: 9

Speaker, Language, and Privacy

Number of recordings: 3

Speech and audio analysis

Number of recordings: 4

Speech coding and privacy

Number of recordings: 9

Speech enhancement and coding

Number of recordings: 2

Speech enhancement and intelligibility

Number of recordings: 12

Speech Localization, Enhancement, and Quality Assessment

Number of recordings: 4

Speech perception I

Number of recordings: 2

Speech perception II

Number of recordings: 9

Speech production I

Number of recordings: 4

Speech production II

Number of recordings: 6

Speech Recognition of Atypical Speech

Number of recordings: 11

Speech signal analysis and representation I

Number of recordings: 12

Speech signal analysis and representation II

Number of recordings: 4

Speech Synthesis: Linguistic processing, paradigms and other topics

Number of recordings: 8

Speech Synthesis: Neural Waveform Generation

Number of recordings: 6

Speech Synthesis: Other topics I

Number of recordings: 4

Speech Synthesis: Prosody Modeling I

Number of recordings: 6

Speech Synthesis: Prosody Modeling II

Number of recordings: 3

Speech Synthesis: Singing, Multimodal, Crosslingual Synthesis

Number of recordings: 8

Speech Synthesis: Speaking Style and Emotion

Number of recordings: 7

Speech Synthesis: tools, data, evaluation

Number of recordings: 8

Speech Synthesis: Toward End-to-End Synthesis I

Number of recordings: 7

Speech Synthesis: Toward End-to-End Synthesis II

Number of recordings: 8

Speech type classification and diagnosis

Number of recordings: 8

Spoken Dialogue Systems I

Number of recordings: 2

Spoken Dialogue Systems II

Number of recordings: 5

Spoken Language Processing I

Number of recordings: 7

Spoken Language Processing II

Number of recordings: 2

Spoken Language Understanding I

Number of recordings: 8

Spoken Language Understanding II

Number of recordings: 3

Spoken machine translation

Number of recordings: 12

Spoken Term Detection & Voice Search

Number of recordings: 9

Streaming for ASR/RNN Transducers

Number of recordings: 7

Target speaker detection, localization and separation

Number of recordings: 5

The ADReSSo Challenge: Detecting cognitive decline using speech only

Number of recordings: 7

The First DiCOVA Challenge: Diagnosis of COVid-19 using Acoustics

Number of recordings: 6

The INTERSPEECH 2021 Computational Paralinguistics Challenge (ComParE) - COVID-19 Cough, COVID-19 Speech, Escalation & Primates

Number of recordings: 8

Tools, corpora and resources

Number of recordings: 11

Topics in ASR: Adaptation, transfer learning, children's speech, and low-resource settings

Number of recordings: 9

Topics in ASR: Robustness, feature extraction, and far-field ASR

Number of recordings: 8

Tutorials

Number of recordings: 8

Voice activity detection

Number of recordings: 5

Voice activity detection and keyword spotting

Number of recordings: 10

Voice and voicing

Number of recordings: 6

Voice Anti-Spoofing and Countermeasure

Number of recordings: 11

Voice Conversion and Adaptation I

Number of recordings: 7

Voice Conversion and Adaptation II

Number of recordings: 4

Voice quality characterization for clinical voice assessment: Voice production, acoustics, and auditory perception

Number of recordings: 4

Opening

Number of recordings: 1

Closing

Number of recordings: 3
