Wake Word is a passive listening system that has only one objective, to detect the keyword assigned to it. The main interest of this tool is to save the resources used by activating the actual listening of the user only when the wake word has been recognized.
Speech-to-Text is a voice technology, based on deep learning language models, that is used to transform audio signals into transcribed text.
Automatic Speech Recognition (ASR)
Automatic Speech Recognition is a technology used to create reliable voice commands. Closely related to speech-to-text, it takes the solution further with specific intent comprehension thanks to a grammar-based design.
Voice biometry is a technology designed to authenticate and identify users. It can be used to grant access to a user, using its voice as a password, or identify someone between a certain population thanks to a unique voiceprint.
Audio Front End (AFE)
Audio Front End is the combination of multiple technologies created to evaluate and improve the signal quality of an audio source. Dividing sound into different criterias such as signal-to-noise ratio, echo/reverb level or signal fiability makes it possible to apply specific filters to the signal so that its quality is perfect for voice-based use cases.
Text-to-Speech, also known as voice synthesis or Text-to-Voice, is a technology used to create real-time voice in order to dictate configurated text. These synthetic voices can be selected according to language, genre and quality.