
Voice for Smart Glasses and XR Wearables
On-device voice technologies that empower XR devices, from Smart Glasses and VR Headsets to any Head-Mounted Display (HMD), with hands-free control, enhanced safety and service reliability.






Made for wearables
Hands-free interactions thanks to embedded voice AI
The main advantage voice brings to XR devices is hands-free interaction. In most of the industrial use cases we cover, this is mandatory: users must be able to operate their equipment at all times while meeting the required minimum safety level.
From simple voice navigation features to a complete voice dictation system, we've got you covered.
Available Features
How do you turn XR Wearables into voice-enabled devices?

Voice Commands
Ability to completely control the device hands-free.
The technology also supports Dynamic Content Consumer, which adds flexibility in the number of recognized voice commands and intents.

Voice Dictation
Text transcription and content spelling (letters/digits…).
Dictate long messages, intervention reports or comments, or spell out a serial key or a licence plate, for instance.

Voice Instructions
Life-like voices to deliver instructions to the device’s user.
Hands-free interactions supported by on-device speech synthesis to provide directions, instructions, reminders…

Microphone Enhancement
Fine-tuning your microphone’s voice recording quality.
The Audio Front-End lets you evaluate the recording quality in specific environments (factories, airports…) and apply corrective filters.
For developers, by developers
Start developing your voice AI solution with the VDK
1
Sign up to request a free trial
Share your information to access the VDK evaluation.
2
Develop and test your use cases
Design, create and try all of your features.
3
Export and integrate
Available for Windows, Android or Linux devices.

Resources
Learn more about Voice and Smart Glasses

Bringing edge voice capabilities to connected AR Smart Glasses
We gathered AR Smart Glasses experts to discuss with our team the close relationship between voice and vision technologies. We condensed the whole conversation into a 30-minute webinar packed with insights, information and data provided by industry specialists.
Smart glasses miss a voice to rise even more
Smart glasses are on the verge of mass adoption, but they still lack something to get there. As AR and wearables enthusiasts, we share our insights.
How voice enhances vision-based wearables?
Head-mounted wearables can’t be used like traditional computers and devices. Voice and vision need to work together to make them usable. Here’s how.
Features and benefits of the VDK
In case you are not yet sure that the VDK is made for you

Internet Independent
With our solution embedded, your device doesn't need an Internet connection to run. The VDK's technologies are designed to operate without connectivity.

Privacy by Design
Our privacy safeguards at the product, algorithm, and business-model level ensure compliance with the strictest data regulations.

Predictable Costs
Control your budget with clear pricing that supports your innovation and scales with your project.

Cross-platform
We adapt to all your use cases, and the VDK is compatible with hardware from the largest microprocessor and device manufacturers.

Cloud-like accuracy
Performance and accuracy that meet DeepTech standards, all on the device.

Multilingual Conversations
Technologies inside the VDK understand and respond to you naturally in over 60 languages.
Good morning, Guten Tag, Bonjour, Bom Dia, Buongiorno…
As we already said, language support is not a problem

English

Swedish

Indonesian

Thai

French

Russian

Dutch

Norwegian

Italian

Portuguese

Korean

Polish

German

Arabic

Danish

Ukrainian

Spanish

Hindi

Greek

Turkish

Japanese

Mandarin

Hebrew

Malay
Technical requirements
Key specifications to use the VDK
Language count | 41 |
Model + Resource Size | 6 to 15MB |
SDK Code Size | 10 to 65MB |
Platform supported | Windows (x86_64); Linux (x86_64, armv7hf, armv8); Android 6.0 (API 23) |
Hardware supported | MPU |
Language count | 5 |
Model + Resource Size | 30 to 50MB |
Platform supported | Windows (x86_64); Linux (x86_64, armv7hf, armv8); Android 6.0 (API 23) |
Hardware supported | MPU |
Language count | 65 |
Model + Resource Size | <1MB up to 500MB per voice depending on the quality |
SDK Code Size | 5 to 65MB |
Platform supported | Windows (x86_64); Linux (x86_64, armv7hf, armv8); Android 6.0 (API 23) |
Hardware supported | MPU |
Language count | Agnostic |
Model + Resource Size | <1MB to 20MB |
SDK Code Size | <1MB = 50KB / user registered; 20MB = 5KB / user registered |
Platform supported | Windows (x86_64); Linux (x86_64, armv7hf, armv8); Android 6.0 (API 23) |
Hardware supported | MPU |
Language count | Agnostic |
Model + Resource Size | None |
SDK Code Size | 3MB |
Platform supported | Windows (x86_64); Linux (x86_64, armv7hf, armv8) |
Hardware supported | MPU |
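To quickly check whether a development machine matches the Windows and Linux targets listed above, a short script along these lines could help. It is only a sketch and not part of the VDK: the function name `is_supported` and the alias table are hypothetical, and the mapping from the names reported by Python's `platform.machine()` (for example `AMD64`, `aarch64`, `armv7l`) to the architectures in the tables is an assumption. In particular, `armv7l` does not guarantee a hard-float ABI. Android is left out, because Python reports it differently depending on the version.

```python
import platform

# Supported targets transcribed from the specification tables above
# (Windows and Linux only; Android is deliberately not handled here).
SUPPORTED = {
    "windows": {"x86_64"},
    "linux": {"x86_64", "armv7hf", "armv8"},
}

# Assumed mapping from platform.machine() values to the table's names.
# "armv7l" -> "armv7hf" is a guess: it does not prove a hard-float ABI.
ARCH_ALIASES = {
    "amd64": "x86_64",
    "x86_64": "x86_64",
    "armv7l": "armv7hf",
    "aarch64": "armv8",
    "arm64": "armv8",
}


def is_supported(system: str, machine: str) -> bool:
    """Return True if the (OS, architecture) pair appears in SUPPORTED."""
    arch = ARCH_ALIASES.get(machine.lower())
    return arch in SUPPORTED.get(system.lower(), set())


if __name__ == "__main__":
    system, machine = platform.system(), platform.machine()
    verdict = "listed" if is_supported(system, machine) else "not listed"
    print(f"{system} / {machine}: {verdict} in the specification tables")
```

A positive result only means that the operating system and CPU architecture appear in the tables. It says nothing about memory, storage or microphone requirements.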
Get started
Complete the form and start your free evaluation of the VDK.

It's always the right time to learn more about voice technologies and their applications
Large Language Models and ChatGPT
Since its launch in November 2022, ChatGPT has become a hot topic and has taken up more and more space in the media sphere. More domains are integrating Large Language Models (LLM) as part of their...

NLU model best practices to improve accuracy

The future of Warehousing: Voice Directed Warehouse Operations

5 business applications to leverage embedded NLU in your products & services

Natural Language Processing – An Overview on what makes an AI “conversational”
