The use of deep neural networks in combination with hidden Markov models (DNN-HMMs) has proven highly successful in a number of works covering many applications: mobile voice search, broadcast news transcription, automatic video subtitling, and conversational speech recognition, even in noisy environments.

Assistive speech technology for language learning

Recently, the number of devices equipped with both audio and video sensors (wearable cameras, smartphones) has been growing continuously. This has led to increasing interest in joint audio-video processing, both to address challenges that single-modal approaches face and to open up new application contexts.