Speech Recognition Settings

Publication date: 09.05.2024

The accuracy and speed with which your Voice Bot understands the client are key factors for dialogue success. Speech recognition (STT) settings let you control this process, which runs on Google’s speech recognition system. You can configure a primary and an alternative language to handle bilingual calls, and control how quickly the bot reacts to the subscriber’s speech. These settings help the bot respond quickly while keeping recognition as accurate as possible.

Subscriber speech settings are created on the Recognition settings page. Currently, only Google’s recognition system is available.

General settings

Name – the name of the profile
Setting the primary language
Setting up an alternative language
The alternative language is optional. If you set one, choose a language different from the primary language. Recognition of the alternative language is charged separately.
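
As a rough illustration of the general settings, a profile can be thought of as a name, a primary language, and an optional alternative language. The sketch below is not the product's API: the class name, field names, and the BCP-47 language codes are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class RecognitionProfile:
    """Hypothetical model of a recognition profile (names are illustrative)."""
    name: str
    primary_language: str
    alternative_language: Optional[str] = None  # optional, charged separately

    def languages(self) -> List[str]:
        """Return the configured languages, primary first."""
        if self.alternative_language is None:
            return [self.primary_language]
        return [self.primary_language, self.alternative_language]


# A bilingual profile: Ukrainian as primary, English (US) as alternative.
profile = RecognitionProfile("Support line", "uk-UA", alternative_language="en-US")
print(profile.languages())
```

Leaving `alternative_language` unset models a profile that uses only the primary language.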

Google recognition features

Some Google languages support enhanced recognition (a model tailored to phone calls), and others do not.
Languages available for selection in the settings:
– English (US) – supports enhanced recognition
– English (GB) – supports enhanced recognition
– Ukrainian
– russian – supports enhanced recognition
– Polish
If you need a language that is not listed, contact tech support.
During recognition, Google sends intermediate results while the caller is speaking, and the final result some time after the caller stops speaking.
For languages that support enhanced recognition (russian, English), the final result arrives about 2 seconds after the subscriber has finished speaking.
For languages that do not support enhanced recognition (Ukrainian, Polish), the delay is unpredictable: the final result may arrive after 2 seconds or as much as a minute later.
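
The language list and timings above can be summarized as a small lookup. This is only a sketch for reference: the BCP-47 codes and the function name are assumptions, and the delays are the approximate figures quoted in this article.

```python
# Languages and enhanced-recognition support as listed in this article.
# The BCP-47 codes are assumed for illustration.
ENHANCED_SUPPORT = {
    "en-US": True,   # English (US)
    "en-GB": True,   # English (GB)
    "uk-UA": False,  # Ukrainian
    "ru-RU": True,   # russian
    "pl-PL": False,  # Polish
}


def expected_final_delay(language_code: str) -> str:
    """Describe when the final result typically arrives for a language."""
    if language_code not in ENHANCED_SUPPORT:
        raise ValueError(f"{language_code} is not listed; contact tech support")
    if ENHANCED_SUPPORT[language_code]:
        return "about 2 seconds after speech ends"
    return "unpredictable: from 2 seconds up to about a minute"


for code in ENHANCED_SUPPORT:
    print(code, "->", expected_final_delay(code))
```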

Settings:

Use an enhanced recognition model. Available only for languages that support enhanced recognition. The final result arrives about 10% faster, but intermediate results are less precise.
Don’t wait for the final result. If no further intermediate or final result arrives within 2 seconds of an intermediate result, the voice robot accepts that intermediate result without waiting for the final one. This setting was added specifically for languages that don’t support enhanced recognition. We do not recommend enabling it for languages that do support it.
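
To make the “don’t wait for the final result” rule concrete, here is a minimal offline simulation of the behavior described above. It is a sketch under stated assumptions, not the voice robot's actual implementation: the event format `(arrival time, is_final, transcript)` and the function name are hypothetical, and only the 2-second window comes from this article.

```python
from typing import Iterable, Optional, Tuple

INTERIM_TIMEOUT_S = 2.0  # the 2-second window described in the article

# Hypothetical event shape: (arrival time in seconds, is_final, transcript)
Event = Tuple[float, bool, str]


def accepted_result(
    events: Iterable[Event],
    dont_wait_for_final: bool,
    timeout_s: float = INTERIM_TIMEOUT_S,
) -> Optional[Tuple[float, str]]:
    """Return (acceptance time, transcript) for one utterance, or None."""
    last_interim: Optional[Tuple[float, str]] = None
    for arrived_at, is_final, text in events:
        if dont_wait_for_final and last_interim is not None:
            prev_time, prev_text = last_interim
            if arrived_at - prev_time > timeout_s:
                # Nothing arrived within the window, so the previous
                # intermediate result was already accepted at the timeout.
                return (prev_time + timeout_s, prev_text)
        if is_final:
            return (arrived_at, text)
        last_interim = (arrived_at, text)
    # The stream ended without a final result.
    if dont_wait_for_final and last_interim is not None:
        return (last_interim[0] + timeout_s, last_interim[1])
    return None


# A final result that arrives late, as can happen without enhanced recognition.
events = [
    (0.5, False, "hello"),
    (1.0, False, "hello I"),
    (2.0, False, "hello I need help"),
    (30.0, True, "Hello, I need help."),
]
print(accepted_result(events, dont_wait_for_final=True))
print(accepted_result(events, dont_wait_for_final=False))
```

With the setting enabled, the last intermediate transcript is accepted 2 seconds after it arrived; with it disabled, the robot waits for the late final result. This also shows the trade-off: the intermediate text may be less polished than the final one.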

A correctly configured speech recognition profile improves dialogue quality and saves your customers time. The alternative language option lets you serve a bilingual audience, and the “Use Enhanced Model” and “Do not wait for final result” settings let you balance the robot’s reaction speed against recognition accuracy.
