Skyrocketing User Experience thanks to offline ASR

The VDK will allow you to simply enable your products and services with voice commands thanks to a super-accurate, scalable and completely embedded Automatic Speech Recognition (ASR) solution.

A highly accurate Automatic Speech Recognition for you to scale

Automatic Speech Recognition, often referred to as Speech-to-Text or simply Speech Recognition, is the main technology that makes voice a way to interact.

At the pinnacle of voice AI, it merges together complex solutions such as acoustic models, natural language understanding (NLU) or audio signal processing.

This technical stack allows ASR to turn human speech into normalized data (text, intent, values…) that can be processed by complex systems.

 

Use cases and existing applications

How to leverage ASR in the field?

Wearables

  • Voice-enabled AR glasses
  • VR voice typing
  • Voice control in headphones

Industry 4.0

  • Voice picking
  • Field report dictation
  • Voice-enabled quality control

Appliances

  • Smart fridge, oven etc…
  • Hands-free Industrial appliance
  • Custom/branded voice assistant

Field services

  • Maintenance assistant
  • Voice reporting
  • Voice assisted workforce

Robotics

  • Voice-enabled robotics
  • Hands-free medical tools
  • Human-Machine Interface

Defense

  • Hands-free military equipment
  • Voice-based training solutions
  • Specific MRO workflows

Combining complexity and simplicity

What is the Voice Development Kit made of exactly?

The Studio, an easy-to-use graphical interface

A user-friendly interface that enables to take advantage of the SDK’s capabilities without the coding knowledge it implies. Each technology has its widget and a set of tools to both develop and finetune your bespoke solution.

The SDK, a powerful development tool to create advanced voice-enabled AIs

One SDK that gathers all the technology inside a unique integration platform. Combining R&D and ground-data, we managed to simplify complexity. This is where customization goes far beyond what we can imagine.

Technical requirements

Key specifications to use the VDK

Language count 41
Model + Resource Size 6 to 15MB
SDK Code Size 10 to 65MB
Platform supported

Windows – x86_64

Linux – x86_64 | armv7hf | armv8

Android 6.0 (API 23)

Hardware supported MPU
Language count 5
Model + Resource Size 30 to 50MB
Platform supported

Windows – x86_64

Linux – x86_64 | armv7hf | armv8

Android 6.0 (API 23)

Hardware supported MPU
Language count 41
Model + Resource Size 6 to 15MB
SDK Code Size 10 to 65MB
Platform supported

Windows – x86_64

Linux – x86_64 | armv7hf | armv8

Android 6.0 (API 23)

Hardware supported MPU
Language count 65
Model + Resource Size <1MB up to 500MB per voice depending on the quality
SDK Code Size 5 to 65MB
Platform supported

Windows – x86_64

Linux – x86_64 | armv7hf | armv8

Android 6.0 (API 23)

Hardware supported MPU
Language count Agnostic
Model + Resource Size <1MB to 20MB
SDK Code Size

<1MB = 50KB / User registered

20MB = 5KB / User registered

Platform supported

Windows – x86_64

Linux – x86_64 | armv7hf | armv8

Android 6.0 (API 23)

Hardware supported MPU
Language count Agnostic
Model + Resource Size None
SDK Code Size 3MB
Platform supported

Windows – x86_64

Linux – x86_64 | armv7hf | armv8

Hardware supported MPU

Get started now

Fill out the form and start leveraging voice in your field with a free evaluation.

Please enable JavaScript in your browser to complete this form.
Name
We are committed to your privacy and will only use the information you provide us to contact you about relevant content about our products and technologies. You can unsubscribe from these at any time. Check our Pivacy Policy.

Internet Independent

No wifi or network are required to perform advanced speech recognition.

Superior Accuracy

Across different environments, user population or vocabulary complexity.

Data Privacy

User data stay in the device, unexposed to security issues and transfers.

Real-time Processing

Speech processing directly on the device reduces latency to 0.

Easy-to-Integrate

Simple to develop and embed across all platforms (windows, linux, android…).

Clear Pricing

Business models that adapt to yours, without any hidden-fees.

Companies that have chosen our solution

Our customers' feedbacks regarding our solutions

“Vivoka’s technologies have been integrated into our solutions dedicated to Human Factors in industry, guaranteeing performance and data confidentiality.”

Guillaume Jégou

Head of the Human Factors Technology Laboratory

"Vivoka’s solutions can run offline, on-device, to operate anywhere, anytime and with a broad language support capability which is for our products a distinct competitive advantage."

Paul Travers

Chief Executive Officer (CEO)

zorabots logo

"We partnered with Vivoka on several innovative projects on embedded Linux system, and delivered, in a very short time, multi-lingual natural voice interactions fully running “at the edge” to our customers."

Cédric Vincent

VP of Technology

Voice Synthesis

It is the ideal complementary solution as it allows to clear the noisy environment in no time when giving a voice command.

Create embedded voice interfaces
powered by top-notch voice AI technology
that respects privacy