Speech Recognition by Google : Why is the Wake-Up Word so good?
“Ok Google, why does your wake-up word work so well?” is a question we could ask directly to the voice assistant of this digital giant. However, the answer would be very difficult to obtain from an artificial intelligence because of the complexity of this technology. This is where we come into action!
As a reminder, the Wake-Up Word, also called Hot Word, is a brick, a module, of speech recognition in the global sense of the term. It is actually a word or a series of words more generally that will be used to trigger the voice recording of the user of a speech recognition system. Paradoxically you will tell us, in any case if we do a little retrospective of the Google news we can realize that their assistants like to listen a little more than what is planned. This “keyword” such as the famous “Ok Google” or “Hello Zac” (less known but in the process of being such) are used to activate the system for two reasons:
- Respect the privacy of users so that only the sequence that will follow the Wake-Up Word, i. e. the one that corresponds to the intention, is recorded.
- Optimize system consumption and performance, as constant recording and analysis of audio files is a very heavy task if performed continuously.
Now that you have understood, or at least remembered, what a Wake-Up Word is, we can get to the heart of the matter, which is to know what makes a good Hot Word. Indeed, under its rather simplistic appearance and more of a brand image issue at first glance, it is a real puzzle to find THE right word.
In this sense, Google speech recognition displays a rather exemplary trigger word for several reasons.
First good lesson from Google and this applies mainly to marketing! Indeed, their wake-up word takes the name of their brand which is added to a very common and simple word to say: “ok”. As a result, thousands of users, whether it’s the voice assistant or the connected speaker, keep repeating Google when they interact with the brand’s tools.
In terms of brand awareness, what could be better? Already that Google’s name is certainly one of the most reknown nowadays, we let you imagine the rest.
To this we can add a more pragmatic aspect very well managed by Google’s voice recognition. One of the strengths of its wake-up word is its uniformity: whatever the languages, “ok” and “Google” are said in the same way. This is very practical for several reasons: only one wake-up word is applied to all markets covered by Google! It seems harmless like that, yet it is very complicated to translate a Hot Word into different languages while maintaining consistency and efficiency. Speaking of efficiency, let’s continue on the third point!
To finish with these praises, Google has found a combination of words that is technically very effective in terms of voice recognition. Indeed, when imagining and developing a Wake-Up Word it is very important to be aware of false positives, i.e. unexpected errors that send positive answers when in reality they are negative. Indeed, the phonemes, the smallest elements composing an audio frequency (translating sound), are very close in the latter case, following a mispronunciation, the system can be activated and record the conversation without the users’ knowledge.
Thus, “Ok” and “Google” are composed of very different and distant phonemes, which results in a very low false-positive rate for Google’s speech recognition.
In this article, we have seen, through the example of Google, what makes a good Wake-Up Word and the important aspects to take into account when choosing it. To learn more about the rest of voice technologies, other articles in our blog cover different topics related to this technology.