Shunya Labs' Empathetic AI Delivers Unmatched Accuracy for Indian Voice
Born from mental health AI, Shunya Labs delivers unprecedented voice recognition for 32 diverse Indian languages.
July 17, 2025

The AI-driven mental health startup, United We Care, has announced the launch of Shunya Labs, a deep-tech spinout poised to redefine the landscape of voice AI technology with a specialized focus on Indian languages. This new venture emerges from the technology originally built to power Stella, United We Care's empathetic AI wellness coach. Shunya Labs is introducing a sophisticated Automatic Speech Recognition (ASR) infrastructure that promises unprecedented accuracy and accessibility for 32 Indic languages and dialects, a development with profound implications for a multitude of sectors in the Indian market. The company claims its technology has already set numerous global benchmarks in speech and language understanding, signaling a significant step forward in making voice-activated technology a ubiquitous reality in a linguistically diverse India.
At the heart of Shunya Labs' innovation is a powerful ASR engine that was born from an unexpected source: the pursuit of emotionally intelligent AI for mental healthcare.[1][2] The development of Stella required a system that could understand not just words, but also the subtle nuances of human emotion and intent conveyed through speech.[1] This necessity-driven innovation led to the creation of a robust AI infrastructure that has now been spun out to serve a broader enterprise market.[1][3] According to Ritu Mehrotra, founder of United We Care, the goal was not merely to compete with existing benchmarks but to create something entirely new. "We didn't set out to beat the benchmarks — we set out to invent what didn't exist,” Mehrotra stated. “And when we built it, we realized we'd created something the world didn't know it needed: AI voice infrastructure that listens like a human, runs like a machine, and respects the sanctity of privacy."[1] This origin story highlights a unique advantage, suggesting that an AI trained to decipher the complexities of mental wellness conversations may possess a superior ability to handle the subtleties of everyday language in various commercial applications.
Shunya Labs enters a market that has long been a challenge for developers. Creating effective voice AI for India's 22 official languages and hundreds of dialects is a formidable task due to vast linguistic diversity, the prevalence of code-switching (mixing languages like Hindi and English in a single sentence), and a lack of large, standardized datasets for many "low-resource" languages.[4][5][6] While research groups like AI4Bharat have made significant strides in creating benchmarks and models for Indic languages, and toolkits like Vakyansh aim to support low-resource language development, a performance gap has persisted compared to English-language ASR.[7][8][9] Shunya Labs claims to directly address this gap, asserting that its ASR platform outperforms major Big Tech players in multilingual fluency, accuracy, and even cost-efficiency.[1] The platform is purpose-built for the Indian market, supporting languages from Hindi and Marathi to Assamese and Maithili.[1] It boasts a remarkably low average Word Error Rate (WER) of 3.37%, a metric that, if consistently validated against independent benchmarks like AI4Bharat's Vistaar which shows average WERs for other models ranging from 17% to over 25% on some tests, would represent a substantial leap in performance.[1][10]
The technological backbone of Shunya Labs includes innovations such as a Clinical Knowledge Graph with over 230 million nodes and a Spatio-Temporal Graph Attention Network (STGAT), a sophisticated deep learning method for processing data across space and time.[1][3] This technology, originally honed to provide real-time emotional intelligence and clinical reasoning for the Stella AI, gives the ASR its distinctive edge.[1] For enterprise applications, this translates to more effective and empathetic customer interactions. In a call center, for example, an ASR that can detect emotional nuances can help route frustrated customers to specialized agents or provide real-time feedback to improve service quality.[5][11][12] Furthermore, Shunya Labs has focused on computational efficiency, optimizing its engine to run on CPUs instead of expensive GPUs, which it claims can cut enterprise cloud costs by a factor of 20 while ensuring privacy through on-premise deployment capabilities.[1] This combination of high accuracy, emotional nuance, and cost-effectiveness positions Shunya Labs to power a wide range of mission-critical applications in sectors like healthcare, banking, and government services.[1][3]
The launch of Shunya Labs by United We Care is a significant event for the AI industry in India and beyond. It represents a homegrown solution to a uniquely Indian challenge, with the potential to unlock a new wave of voice-first digital transformation. The company has already made bold claims, stating its underlying technology for medical transcription, United-MedASR, achieves an industry-leading 0.5% error rate, a 98% reduction compared to some industry giants.[13][14] While the claim of setting "nine global records" in speech understanding will need independent validation, the stated performance metrics and the unique, human-centric development approach are compelling.[1][2] As Sourav Banerjee, Founder & CTO, puts it, the name "Shunya" is a nod to the Indian discovery of zero, signifying a start from first principles.[1] By building a foundational layer for AI that truly listens and understands, Shunya Labs aims to do for voice what the number zero did for mathematics—provide a fundamental building block for future innovation. If it delivers on its ambitious promises, it could significantly accelerate digital inclusion and create more natural, accessible human-machine interfaces for a billion-plus people.
Sources
[3]
[6]
[7]
[8]
[9]
[10]
[11]
[12]
[13]