Automatic Speech Recognition (ASR) course at Chulalongkorn University, Spring 2018.
The course gives an overview of current speech technologies and discusses the components of a human-machine conversational system. The first part covers mostly signal-processing topics, such as the acoustic theory of human speech production, spectrogram reading, and speech signal representation. The second part covers Dynamic Time Warping (DTW) and HMM-based speech recognizers, including state-of-the-art techniques such as finite-state transducers and deep learning. Other speech-processing topics, such as speaker, language, and emotion recognition and dialogue systems, are also touched upon.
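As a rough illustration of the DTW topic mentioned above, here is a minimal sketch of the classic dynamic-programming recurrence. It is not taken from the course materials: the function name `dtw_distance`, the use of 1-D sequences, and the absolute-difference local cost are all assumptions made for this example. Real speech features (e.g. MFCC frames) would be vectors compared with a vector distance.

```python
import math


def dtw_distance(a, b):
    """Return the DTW alignment cost between two 1-D sequences.

    Illustrative sketch only: the local cost is |a[i] - b[j]| and the
    allowed steps are insertion, deletion, and match (no slope or
    window constraints).
    """
    n, m = len(a), len(b)
    # cost[i][j] = best cumulative cost aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b stays
                cost[i][j - 1],      # b advances, a stays
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


if __name__ == "__main__":
    # The second sequence is a time-stretched copy of the first.
    print(dtw_distance([1, 2, 3], [1, 2, 2, 3]))
```

Because DTW allows one frame to align with several frames of the other sequence, a time-stretched copy of a sequence aligns with zero cost, which is the property that makes it useful for comparing utterances spoken at different speeds.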
- HW1 - Testing ASR capabilities and evaluation
- HW2 - Signal Analysis
- HW3 - Corpus Collection
- HW4 - Spectrogram Analysis
- HW5 - Feature extraction and DTW
- HW6 - Language modeling
Tutorials - Kaldi, grapheme-to-phoneme (G2P) conversion, and real-time decoding
To be added