Automatic Speech Recognition
- Acoustic modelling
- Feature compensation
In automatic speech recognition, the received signals are often distorted by various interferences, which lead to performance degradation. In order to alleviate the performance deterioration in adverse environments, the distortion in the speech feature can be reduced via feature compensation techniques.
Speech Synthesis
- High quality speech synthesis
- Speaker adpative speech synthesis
Conventional speech synthesis generates machine-like voice with distinctive timbre and accent. High quality speech synthesis aims to synthesize speech with naturalness and intelligibility similar to humans.
The following methods can be employed for high quality speech synthesis:
- High quality vocoder
- Improving acoustic models
- Post processing
- Speaker adaptation to target speaker
- Emotional TTS
- singing voice TTS
Speech Enhancement
- Noise reduction
- Echo cancellation / Dereverberation
- Speech Intelligibility
- Single channel enhancement
- Multi channel enhancement
- Multichannel echo cancellation / Dereverberation
- Residual echo suppression
- Filtering technique
- Band width extension
Speech Coding
Speech coding is an application of data compression of digital audio signals containing speech. The purpose of speech compression is to reduce the number of bits required to represent speech signals in order to minimize the requirement for transmission bandwidth or to reduce the storage costs.
- Source coding(data compression)
-High efficiency audio coding
- Channel coding(error correction)
-Error robust audio coding