Automatic Speech Recognition (ASR) that works offline

Our professional Software Development Kit (SDK) to simply voice-enable products and services with bespoke understanding of any complex vocabulary.

Try the technologyContact us

Getting familiar with the technology

Automatic Speech Recognition (ASR), often referred to as Speech-to-Text or simply Speech Recognition, is the main technology that makes voice a way to interact.

At the pinnacle of voice AI, it merges together complex solutions such as acoustic models, natural language understanding (NLU) or audio signal processing.

This technical stack allows ASR to turn human speech into normalized data (text, intent, values…) that can be processed by complex systems.

With text transformed from speech, such as voice typing, the aim is to give commands to a machine, a way of having voice control over it. Speech is the best natural way to interact between humans. This technology allows to also use speech to interact with technology and then with machines (Human-Machine Interaction / HMI).

 

Use cases and existing applications

How to leverage ASR in the field?

Wearables

  • Voice-enabled AR glasses
  • VR voice typing
  • Voice control in headphones

Industry 4.0

  • Voice picking
  • Field report dictation
  • Voice-enabled quality control

Appliances

  • Smart fridge, oven etc…
  • Hands-free Industrial appliance
  • Custom/branded voice assistant

Field services

  • Maintenance assistant
  • Voice reporting
  • Voice assisted workforce

Robotics

  • Voice-enabled robotics
  • Hands-free medical tools
  • Human-Machine Interface

Defense

  • Hands-free military equipment
  • Voice-based training solutions
  • Specific MRO workflows

Features

How to use offline Automatic Speech Recognition (ASR)?

Grammar-based Recognition

Grammar-based recognition is a technology used to create the most accurate and reliable voice commands. Everything is set, then the users know exactly what answer they will get for what commands. Any spoken word by users that isn’t set up in the software configuration won’t be understood by the machine and then the action will not be fulfilled. 

Continuous ASR (FreeSpeech)

Continuous ASR with rich-text formatting capabilities that converts spoken words and sentences into text, plain text and not punctuated one. Combined with NLU (Natural Language Understanding), the machine understands the intent behind a voice command. It doesn’t have to be strictly what was set for the commands recognition. It’s a first step towards Artificial Intelligence.

Benefits

Why should you choose our
Automatic Speech Recognition ASR?

Internet Independent

No wifi or network are required to perform advanced speech recognition. Our technology is fully embedded.

Superior Accuracy

Goodbye keyboard, your voice is the most accurate way to interact with machine. No matter the vocabulary complexity or the environments.

Data Privacy

User data stay in the device, unexposed to security issues and transfers.

Real-time Processing

Speech processing directly on the device reduces latency to 0.

Clear Pricing

Business models that adapt to yours, without any hidden-fees.

Easy-to-Integrate

Simple to develop and embed across all platforms (windows, linux, android…).

Adopting voice solutions in your business starts here

Get in touch with our team to shift your company in the Voice First world! Or try it now!

Companies that have chosen our solution

Our customers' feedbacks regarding our solutions

“Vivoka’s technologies have been integrated into our solutions dedicated to Human Factors in industry, guaranteeing performance and data confidentiality.”

Guillaume Jégou

Head of the Human Factors Technology Laboratory

"Vivoka’s solutions can run offline, on-device, to operate anywhere, anytime and with a broad language support capability which is for our products a distinct competitive advantage."

Paul Travers

Chief Executive Officer (CEO)

zorabots logo

"We partnered with Vivoka on several innovative projects on embedded Linux system, and delivered, in a very short time, multi-lingual natural voice interactions fully running “at the edge” to our customers."

Cédric Vincent

VP of Technology

For developers, by developers

Try our voice solutions now

1

Sign up first on the Console

Before integrating with VDK, test our online playground: Vivoka Console.

2

Develop and test your use cases

Design, create and try all of your features.

3

Submit your project

Share your project and talk about it with our expert for real integration.

Requirements & Quick-Start

How to develop with our ASR engine?

– On the Device

Fully-embedded voice technology for brands seeking the convenience of a voice user interface without the privacy or connectivity concerns of the internet. Full access to custom commands and the ability to instantly update command codes during development make voice-enabling your product fast and easy.

– On Premise

Get the power of cloud connectivity combined with the reliability of embedded voice technology. On premise (or hybrid) solutions ensure that your device is always-on and responsive to commands. Seamlessly push product updates and deliver a broader voice experience with the level of cloud-connectivity that best matches your product and users.

The Automatic Speech Recognition technology (ASR) we provide are grammar-based recognition engines. It means the recognition’s capabilities are based on a file that includes the different elements to be recognized (numbers, sentences, words, names…).

Our offline Automatic Speech Recognition (ASR) supports both phonetic edition (changing the way words are understood by the engine) and dynamic data (having flexibility with the grammar content).

The engine currently supports 42 languages.

– Model & Resource Size: 6 to 15MB

– SDK Code Size: 10 to 65MB

– Supported Hardware: Microprocessor Units

– Supported Platforms:

  • Windows – x86_64
  • Linux – x86_64 | armv7hf | armv8
  • Android 6.0 (API 23)

Our speech recognition technology leverages cutting-edge artificial intelligence to provide unmatched voice recognition capabilities.

Choose our recognition software for the best speech recognition experience. Whether for spoken word transcription, voice commands, or voice typing, our Automatic Speech Recognition (ASR) stands out in the market.

Explore our solutions today and see how our speech recognition technology can transform your workflows and enhance productivity.

Know more about our ASR Technology

Broad Applications and Superior Performance of Our ASR Technology

To further enhance your understanding of our offerings, it’s important to recognize the broad scope of applications our Automatic Speech Recognition (ASR) technology supports. Leveraging advanced models and machine learning algorithms, our Automatic Speech Recognition (ASR) systems provide seamless and accurate speech-to-text transcription and voice command functionalities. With capabilities to process multiple languages and dialects, our solutions are designed to meet diverse customer needs across various industries. Whether you’re looking for precise dictation, real-time audio processing, or integrating voice technology into video applications, our ASR ensures superior performance and data privacy. Our grammar-based and continuous ASR technologies cater to both structured commands and free-form speech, making it ideal for a wide range of scenarios from industrial automation to hands-free consumer applications. Explore our innovative solutions to experience the future of voice-enabled technology today.

Enhancing Operations with Advanced Offline ASR Technology 

To unlock the full potential of voice technology in your operations, consider the unparalleled advantages our offline ASR solutions bring. By integrating our Automatic Speech Recognition technology (ASR), you can enable real-time speech processing and voice commands across numerous applications, from hands-free industrial tools to voice-enabled AR and VR devices. The advanced acoustic models and natural language understanding capabilities ensure high accuracy, even in complex and noisy environments. With support for 42 languages and robust data privacy features, our ASR technology guarantees that user data remains secure and processed locally. Embrace the future of human-machine interaction and enhance productivity and efficiency with our state-of-the-art speech recognition solutions.

In addition to its versatile integration capabilities, our Automatic Speech Recognition (ASR) technology excels in providing high accuracy for speech recognition and voice command applications across various systems. By leveraging advanced audio processing models, our ASR ensures that spoken words are accurately transcribed into text accurately and effortlessly.

Superior ASR Performance and Integration for Enhanced Productivity and Efficiency

The robust language support and dynamic learning capabilities allow our ASR to handle complex vocabularies and multiple languages, ensuring superior performance in any environment. Our automatic speech recognition systems are designed to operate offline, guaranteeing data privacy and real-time processing without the need for an internet connection. Whether you’re implementing voice technology in video applications, customer service, or industrial automation, our ASR technology offers reliable, high-quality speech recognition that enhances user experience and operational efficiency. Explore how our innovative ASR solutions can revolutionize your workflows and bring the benefits of advanced voice technology to your business.

our Automatic Speech Recognition (ASR) solutions empower businesses to enhance productivity, streamline workflows, and create intuitive human-machine interactions. Embrace the future of voice technology with our state-of-the-art ASR solutions.

If you need any information about our technologies, contact us. Our team is always happy to assist you. You can also check out our free console, to test our voice technologies (voice control and voice dictation technology) directly. Start the voice experience now!

Complementary Technologies to our Automatic Speech Recognition Technology

Discover other technologies in our stack

Wake Word Technology

Trigger speech recognition process by detecting a unique word or sentence.

Voice Biometrics Technology

Seamlessly identify or authenticate users by recognizing their voice pattern.

Text-to-Speech Technology

Automatically generate life-like voices from text data in multiple languages and aspects.

Audio Enhancement Technology

Enhance the audio signal quality from voice to boost speech recognition’s accuracy.

It's always the right time to learn more about voice technologies and their applications