Skyrocketing User Experience thanks to offline ASR
The VDK will allow you to simply enable your products and services with voice commands thanks to a super-accurate, scalable and completely embedded Automatic Speech Recognition (ASR) solution.
A highly accurate Automatic Speech Recognition for you to scale
Automatic Speech Recognition, often referred to as Speech-to-Text or simply Speech Recognition, is the main technology that makes voice a way to interact.
At the pinnacle of voice AI, it merges together complex solutions such as acoustic models, natural language understanding (NLU) or audio signal processing.
This technical stack allows ASR to turn human speech into normalized data (text, intent, values…) that can be processed by complex systems.
Use cases and existing applications
How to leverage ASR in the field?
Wearables
- Voice-enabled AR glasses
- VR voice typing
- Voice control in headphones
Industry 4.0
- Voice picking
- Field report dictation
- Voice-enabled quality control
Appliances
- Smart fridge, oven etc…
- Hands-free Industrial appliance
- Custom/branded voice assistant
Field services
- Maintenance assistant
- Voice reporting
- Voice assisted workforce
Robotics
- Voice-enabled robotics
- Hands-free medical tools
- Human-Machine Interface
Defense
- Hands-free military equipment
- Voice-based training solutions
- Specific MRO workflows
Combining complexity and simplicity
What is the Voice Development Kit made of exactly?
The Studio, an easy-to-use graphical interface
A user-friendly interface that enables to take advantage of the SDK’s capabilities without the coding knowledge it implies. Each technology has its widget and a set of tools to both develop and finetune your bespoke solution.
The SDK, a powerful development tool to create advanced voice-enabled AIs
One SDK that gathers all the technology inside a unique integration platform. Combining R&D and ground-data, we managed to simplify complexity. This is where customization goes far beyond what we can imagine.
Technical requirements
Key specifications to use the VDK
Language count | 41 |
Model + Resource Size | 6 to 15MB |
SDK Code Size | 10 to 65MB |
Platform supported |
Windows – x86_64 Linux – x86_64 | armv7hf | armv8 Android 6.0 (API 23) |
Hardware supported | MPU |
Language count | 5 |
Model + Resource Size | 30 to 50MB |
Platform supported |
Windows – x86_64 Linux – x86_64 | armv7hf | armv8 Android 6.0 (API 23) |
Hardware supported | MPU |
Language count | 41 |
Model + Resource Size | 6 to 15MB |
SDK Code Size | 10 to 65MB |
Platform supported |
Windows – x86_64 Linux – x86_64 | armv7hf | armv8 Android 6.0 (API 23) |
Hardware supported | MPU |
Language count | 65 |
Model + Resource Size | <1MB up to 500MB per voice depending on the quality |
SDK Code Size | 5 to 65MB |
Platform supported |
Windows – x86_64 Linux – x86_64 | armv7hf | armv8 Android 6.0 (API 23) |
Hardware supported | MPU |
Language count | Agnostic |
Model + Resource Size | <1MB to 20MB |
SDK Code Size |
<1MB = 50KB / User registered 20MB = 5KB / User registered |
Platform supported |
Windows – x86_64 Linux – x86_64 | armv7hf | armv8 Android 6.0 (API 23) |
Hardware supported | MPU |
Language count | Agnostic |
Model + Resource Size | None |
SDK Code Size | 3MB |
Platform supported |
Windows – x86_64 Linux – x86_64 | armv7hf | armv8 |
Hardware supported | MPU |
Get started now
Fill out the form and start leveraging voice in your field with a free evaluation.
Benefits of our
Automatic Speech Recognition
Internet Independent
Superior Accuracy
Data Privacy
Real-time Processing
Easy-to-Integrate
Clear Pricing
Companies that have chosen our solution
Our customers' feedbacks regarding our solutions
“Vivoka’s technologies have been integrated into our solutions dedicated to Human Factors in industry, guaranteeing performance and data confidentiality.”
Guillaume Jégou
Head of the Human Factors Technology Laboratory
"Vivoka’s solutions can run offline, on-device, to operate anywhere, anytime and with a broad language support capability which is for our products a distinct competitive advantage."
Paul Travers
Chief Executive Officer (CEO)
"We partnered with Vivoka on several innovative projects on embedded Linux system, and delivered, in a very short time, multi-lingual natural voice interactions fully running “at the edge” to our customers."
Cédric Vincent
VP of Technology
Voice Synthesis
It is the ideal complementary solution as it allows to clear the noisy environment in no time when giving a voice command.