Skip to content
Georgy Treshchev edited this page Jun 10, 2023 · 21 revisions

Runtime Speech Recognizer Documentation

Runtime Speech Recognizer is an open-source plugin that enables real-time, offline speech recognition. Based on Whisper OpenAI technology, particularly whisper.cpp library, and supports multiple language models pre-selected in the plugin's settings.

How to install

There're two ways to install the plugin:

  1. Through the marketplace.
  2. Manual installation. Select and download the release for the required engine version, extract the archive into your plugins project folder to get the following path: "[ProjectName] / Plugins / RuntimeSpeechRecognizer".

On first run, install language models (a dialog box will appear asking you to do this automatically).

Basic description

This plugin provides real-time speech recognition using advanced algorithms based on whisper.cpp library. It matches incoming audio data, provided as a stream or non-stream input, against pre-trained language models.

Clone this wiki locally