This shows you the differences between two versions of the page.
| Both sides previous revision Previous revision Next revision | Previous revision | ||
|
kaldi_asr_toolkit [2018/09/17 12:25] mganzeboom |
kaldi_asr_toolkit [2022/09/23 22:06] (current) mvangompel [Recommendation: user your own LaMachine on Ponyland] |
||
|---|---|---|---|
| Line 2: | Line 2: | ||
| Under this topic you can find information about the Kaldi ASR Toolkit, like URLs and paths where to find it. Kaldi is a more recent ASR toolkit compared to SPRAAK. Like SPRAAK, it contains functionality to train different types of GMM-HMM acoustic models, but also various types of Deep Neural Networks (DNNs), the current standard in ASR. This page provides links to Kaldi' | Under this topic you can find information about the Kaldi ASR Toolkit, like URLs and paths where to find it. Kaldi is a more recent ASR toolkit compared to SPRAAK. Like SPRAAK, it contains functionality to train different types of GMM-HMM acoustic models, but also various types of Deep Neural Networks (DNNs), the current standard in ASR. This page provides links to Kaldi' | ||
| Feel free to add experiences which you feel are useful to others (i.e. to not ' | Feel free to add experiences which you feel are useful to others (i.e. to not ' | ||
| + | |||
| + | |||
| + | ===== Recommendation (DEPRECATED!): | ||
| + | |||
| + | It is recommended to use your own LaMachine to use Kaldi on Ponyland instead of the shared one (however, LaMachine is now deprecated!). The instructions to install/ | ||
| + | |||
| + | Please note that you should change ' | ||
| + | |||
| + | Once you have your own LaMachine-CLAM-Kaldi, | ||
| + | |||
| + | $ ssh thunderlane | ||
| + | $ lamachine-lacristianmachine-activate | ||
| + | | ||
| + | |||
| + | Also, // | ||
| + | |||
| ===== Details ===== | ===== Details ===== | ||
| Line 18: | Line 34: | ||
| AlexASR is an incremental online decoder based on Kaldi. It can be used if you'd like to use ASR in a time sensitive context. It immediately decodes the speech as it comes in and only requires some finalization after the last audio packet was received. For example, we used this decoder in the game developed in the CHASING project. To reduce player waiting time on ASR results, AlexASR was used to decode speech as it was being recorded from the player.\\ | AlexASR is an incremental online decoder based on Kaldi. It can be used if you'd like to use ASR in a time sensitive context. It immediately decodes the speech as it comes in and only requires some finalization after the last audio packet was received. For example, we used this decoder in the game developed in the CHASING project. To reduce player waiting time on ASR results, AlexASR was used to decode speech as it was being recorded from the player.\\ | ||
| **Location of sources and info:** [[https:// | **Location of sources and info:** [[https:// | ||
| + | See also [[alex_asr|Alex ASR]]. | ||
| ===== Useful tutorial links ===== | ===== Useful tutorial links ===== | ||
| Line 27: | Line 44: | ||
| * Eleanor Chodroff' | * Eleanor Chodroff' | ||
| * Josh Meyer' | * Josh Meyer' | ||
| - | * John Hopkins' | + | * John Hopkins' |
| + | |||
| + | * The offical Kaldi lectures: http:// | ||
| + | |||
| + | **More practical tutorials**: | ||
| + | * https:// | ||
| + | * https:// | ||
| + | * You might find this repo useful for keighrim tutorial: https:// | ||
| + | * https:// | ||
| + | * http:// | ||
| + | * https:// | ||
| + | * https:// | ||
| + | * https:// | ||
| + | * http:// | ||
| + | * http:// | ||
| **Forced alignment using existing acoustic models** | **Forced alignment using existing acoustic models** | ||
| - | * Eleanor Chodroff' | + | * Eleanor Chodroff' |