Skyrocketing User Experience thanks to offline ASR

The VDK will allow you to simply enable your products and services with voice commands thanks to a super-accurate, scalable and completely embedded Automatic Speech Recognition (ASR) solution.

GET STARTED

A highly accurate Automatic Speech Recognition for you to scale

Automatic Speech Recognition, often referred to as Speech-to-Text or simply Speech Recognition, is the main technology that makes voice a way to interact.

At the pinnacle of voice AI, it merges together complex solutions such as acoustic models, natural language understanding (NLU) or audio signal processing.

This technical stack allows ASR to turn human speech into normalized data (text, intent, values…) that can be processed by complex systems.

vivoka-automatic-speech-recognition-schema-en

Use cases and existing applications

How to leverage ASR in the field?

Wearables

Voice-enabled AR glasses
VR voice typing
Voice control in headphones

Industry 4.0

Voice picking
Field report dictation
Voice-enabled quality control

Appliances

Smart fridge, oven etc…
Hands-free Industrial appliance
Custom/branded voice assistant

Field services

Maintenance assistant
Voice reporting
Voice assisted workforce

Robotics

Voice-enabled robotics
Hands-free medical tools
Human-Machine Interface

Defense

Hands-free military equipment
Voice-based training solutions
Specific MRO workflows

Combining complexity and simplicity

What is the Voice Development Kit made of exactly?

The Studio, an easy-to-use graphical interface

A user-friendly interface that enables to take advantage of the SDK’s capabilities without the coding knowledge it implies. Each technology has its widget and a set of tools to both develop and finetune your bespoke solution.

The SDK, a powerful development tool to create advanced voice-enabled AIs

One SDK that gathers all the technology inside a unique integration platform. Combining R&D and ground-data, we managed to simplify complexity. This is where customization goes far beyond what we can imagine.

Technical requirements

Key specifications to use the VDK

Language count	41
Model + Resource Size	6 to 15MB
SDK Code Size	10 to 65MB
Platform supported	Windows – x86_64 Linux – x86_64 \| armv7hf \| armv8 Android 6.0 (API 23)
Hardware supported	MPU

Language count	5
Model + Resource Size	30 to 50MB
Platform supported	Windows – x86_64 Linux – x86_64 \| armv7hf \| armv8 Android 6.0 (API 23)
Hardware supported	MPU

Language count	41
Model + Resource Size	6 to 15MB
SDK Code Size	10 to 65MB
Platform supported	Windows – x86_64 Linux – x86_64 \| armv7hf \| armv8 Android 6.0 (API 23)
Hardware supported	MPU

Language count	65
Model + Resource Size	<1MB up to 500MB per voice depending on the quality
SDK Code Size	5 to 65MB
Platform supported	Windows – x86_64 Linux – x86_64 \| armv7hf \| armv8 Android 6.0 (API 23)
Hardware supported	MPU

Language count	Agnostic
Model + Resource Size	<1MB to 20MB
SDK Code Size	<1MB = 50KB / User registered 20MB = 5KB / User registered
Platform supported	Windows – x86_64 Linux – x86_64 \| armv7hf \| armv8 Android 6.0 (API 23)
Hardware supported	MPU

Language count	Agnostic
Model + Resource Size	None
SDK Code Size	3MB
Platform supported	Windows – x86_64 Linux – x86_64 \| armv7hf \| armv8
Hardware supported	MPU

Get started now

Fill out the form and start leveraging voice in your field with a free evaluation.

Benefits of our
Automatic Speech Recognition

Internet Independent

No wifi or network are required to perform advanced speech recognition.

Superior Accuracy

Across different environments, user population or vocabulary complexity.

Data Privacy

User data stay in the device, unexposed to security issues and transfers.

Real-time Processing

Speech processing directly on the device reduces latency to 0.

Easy-to-Integrate

Simple to develop and embed across all platforms (windows, linux, android…).

Clear Pricing

Business models that adapt to yours, without any hidden-fees.

Companies that have chosen our solution

Our customers' feedbacks regarding our solutions

Be part of our happy customers

“Vivoka’s technologies have been integrated into our solutions dedicated to Human Factors in industry, guaranteeing performance and data confidentiality.”

Guillaume Jégou

Head of the Human Factors Technology Laboratory

"Vivoka’s solutions can run offline, on-device, to operate anywhere, anytime and with a broad language support capability which is for our products a distinct competitive advantage."

Paul Travers

Chief Executive Officer (CEO)

"We partnered with Vivoka on several innovative projects on embedded Linux system, and delivered, in a very short time, multi-lingual natural voice interactions fully running “at the edge” to our customers."

Cédric Vincent

VP of Technology

Skyrocketing User Experience thanks to offline ASR

A highly accurate Automatic Speech Recognition for you to scale