comment on this article

Voice-controlled multimodal AI solution

Renesas Electronics and Syntiant, a deep learning chip technology company, have announced the joint development of a voice-controlled multimodal AI solution.

The AI solution enables low-power contactless operation for image processing in vision AI-based IoT and edge systems, such as self-checkout machines, security cameras, video conference systems and smart appliances such as robotic cleaning devices.

The device combines the Renesas RZ/V Series vision AI microprocessor unit (MPU) and the low-power multimodal, multi-feature Syntiant NDP120 Neural Decision Processor to deliver advanced voice and image processing capabilities.

This joint solution features always-on functionality with quick voice-triggered activation from standby mode to perform object recognition, facial recognition, and other vision-based tasks that are critical functions in security cameras and other systems. For example, while user-defined voice cues drive activation and system operation, vision AI recognition tracks operator behaviour and controls operation or issues a warning when suspicious actions are detected.

The multimodal architecture makes it easier to create contactless user experiences for vision AI-based systems. Using a dedicated, power-efficient chip for voice recognition reduces standby power consumption while speeding up system development because it is possible to develop software independently of the vision AI functionality.

“We anticipate that demand for multimodal systems that use multiple streams of input information will increase moving forward as a way to improve both ease of use and safety,” said Hiroto Nitta, Senior Vice President and Head of SoC Business in the IoT and Infrastructure Business Unit at Renesas.

“Voice-based user interfaces will make it possible for customers to deliver new user experiences that bring the next generation of innovative ideas from concept to reality, added Syntiant CEO Kurt Busch. “We’ve already shipped more than 15 million of our deep learning NDPs globally to enable always-on voice in a wide variety of consumer and industrial IoT applications."

The Renesas RZ/V Series MPU for vision AI incorporates Renesas’ DRP-AI (Dynamically Reconfigurable Processor-AI) accelerator and combines high-precision AI inference with power efficiency which eliminates the need for heat dispersion measures such as heat sinks or cooling fans, which reduces the bill of materials (BOM) cost and makes it possible to integrate vision AI into a wide range of embedded applications.

The Syntiant NDP120 chip incorporates AI capabilities that can be used to implement many high-precision, hands-free voice functions, including speaker recognition, keyword detection, multiple wake words, and local command recognition. Packaged with the Syntiant Core 2 neural network inference engine, the NDP120 can also run multiple applications simultaneously while minimizing power consumption to 1mW battery power.

Author
Neil Tyler

Comment on this article


This material is protected by MA Business copyright See Terms and Conditions. One-off usage is permitted but bulk copying is not. For multiple copies contact the sales team.

What you think about this article:


Add your comments

Name
 
Email
 
Comments
 

Your comments/feedback may be edited prior to publishing. Not all entries will be published.
Please view our Terms and Conditions before leaving a comment.

Related Articles

Production challenges

The challenges associated with meeting the needs of customers are now extending ...

Get to market faster

A quick look at using Vicor's PFM and AIM in VIA packaging for your AC to Point ...

World IoT Day

ByteSnap Design, a specialist in embedded systems design and development, has ...

Digital consciousness

​Would you consider uploading your brain to the cloud if it meant you could ...

End game

How are IoT technologies keeping vaccines safe, in storage and transit, and in ...