Create SpeechRecognizer...

Creates the whisper.cpp speech recognizer.

Settings

Whisper model
determines which Whisper model to use for recognition. The list is populated with the .bin files found in the whispercpp subfolder of the models folder in the Praat preferences folder. Models that contain .en in their name are English-only; all other models are multilingual.
Language
determines the language to be recognized. Choose Autodetect language to let the model detect the language automatically. If you know the language of the audio, selecting it explicitly may improve recognition accuracy. Note that English-only models (those with .en in the name) can only be used with Autodetect language or English.

Installing Whisper models

Before you can use the SpeechRecognizer, you need to install one or more Whisper model files (in GGML format, with extension .bin) into the subfolder whispercpp of the folder models in the Praat preferences folder.

Whisper models come in several sizes, each offering a different trade-off between speed and accuracy. Model names that contain .en are English-only models. All other models are multilingual. Available model sizes are: tiny, base, small, medium, large-v1, large-v2, large-v3, and large-v3-turbo (also known as turbo). Larger models are more accurate but require more memory and processing time.

Model files can be obtained from the Hugging Face repository at https://huggingface.co/ggerganov/whisper.cpp/tree/main.

Links to this page


© Anastasia Shchupak 2026-03-15