SuperFlash technology addresses speech processing challenges

Computing-in-memory technology is poised to eliminate the massive data communications bottlenecks that are associated with performing artificial intelligence (AI) speech processing at the network’s edge.

However, this requires an embedded memory solution that can simultaneously perform neural network computation and store weights.

Microchip Technology, via its Silicon Storage Technology (SST) subsidiary, said that its SuperFlash memBrain neuromorphic memory solution has solved this problem for the WITINMEM neural processing SoC, the first in volume production that enables sub-mA systems to reduce speech noise and recognise hundreds of command words, in real time and immediately after power-up.

Working with WITINMEM

Microchip’s memBrain analogue in-memory computing solution, based on SuperFlash technology, has been incorporated into WITINMEM’s ultra-low-power SoC. The SoC features computing-in-memory technology for neural network processing, including speech recognition, voice-print recognition, deep speech noise reduction, scene detection, and health status monitoring.

WITINMEM, in turn, is working with multiple customers to bring products to market during 2022 based on this SoC.

“WITINMEM is breaking new ground with Microchip’s memBrain solution for addressing the compute-intensive requirements of real-time AI speech at the network edge based on advanced neural network models,” said Shaodi Wang, CEO of WITINMEM. “We were the first to develop a computing-in-memory chip for audio in 2019, and now we have achieved another milestone with volume production of this technology in our ultra-low-power neural processing SoC that streamlines and improves speech processing performance in intelligent voice and health products.”

According to Mark Reiten, vice president of the license division at SST, “The WITINMEM SoC showcases the value of using memBrain technology to create a single-chip solution based on a computing-in-memory neural processor that eliminates the problems of traditional processors that use digital DSP and SRAM/DRAM-based approaches for storing and executing machine learning models.”

Microchip’s memBrain neuromorphic memory product has been optimised to perform vector matrix multiplication (VMM) for neural networks. It enables processors used in battery-powered and deeply embedded edge devices to deliver high AI inference performance per watt. This is accomplished by both storing the neural model weights as values in the memory array and using the memory array as the neural compute element. The result is 10 to 20 times lower power consumption than alternative approaches, along with lower overall processor bill of materials (BOM) costs because external DRAM and NOR flash are not required.
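To make the idea of a memory array doubling as the compute element more concrete, the following is a minimal, idealised sketch in Python. It is not Microchip's or WITINMEM's implementation: the function names, the number of storage levels and the weight range are all hypothetical, and real analogue arrays have noise, drift and signed-weight schemes that are ignored here. It only illustrates that once weights are stored in a grid of cells, applying the inputs to the rows and summing along each column yields a vector matrix multiplication without moving the weights anywhere.

```python
def program_array(weights, levels=16, w_max=1.0):
    """Store weights as cell values, snapped to a fixed number of levels.

    Assumption: each cell holds one of `levels` evenly spaced values in
    [-w_max, w_max]. Both parameters are illustrative, not device specs.
    """
    step = 2 * w_max / (levels - 1)
    stored = []
    for row in weights:
        stored_row = []
        for w in row:
            clipped = max(-w_max, min(w_max, w))           # keep within cell range
            index = round((clipped + w_max) / step)        # nearest storage level
            stored_row.append(index * step - w_max)
        stored.append(stored_row)
    return stored


def in_memory_vmm(inputs, stored):
    """Idealised VMM: each column sums input_i * stored[i][j].

    This mimics row inputs being scaled by the cell values and the
    contributions accumulating on each column line.
    """
    n_cols = len(stored[0])
    return [
        sum(x * row[j] for x, row in zip(inputs, stored))
        for j in range(n_cols)
    ]


if __name__ == "__main__":
    # Weights are "programmed" once; only inputs change per inference.
    array = program_array([[0.5, -1.0], [1.0, 0.0]], levels=5)
    print(in_memory_vmm([1.0, 2.0], array))
```

In this model the weights are written once by `program_array` and then reused for every input vector, which mirrors the article's point that no external DRAM or NOR is needed to fetch the model at inference time.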

Permanently storing neural models inside the memBrain solution’s processing element also supports instant-on functionality for real-time neural network processing. WITINMEM has leveraged the nonvolatility of SuperFlash technology’s floating-gate cells to power down its computing-in-memory macros during the idle state, further reducing leakage power in demanding IoT use cases.