Latest Updates
-
Happy Birthday Rashmika Mandanna: Steal Her White Looks For Easter 2026 Festive Parties And Celebrations -
Paneer Paratha Recipe: Crispy Outside, Soft Inside Perfection -
Horoscope for Today April 05, 2026 - Small Choices Guide Calm Momentum -
Happy Easter 2026 Wishes: Top 50+ Messages, Status, Captions And Posts To Share With Family And Friends -
Comfort Style Creamy Blend Tomato Soup Recipe -
Rashmika Mandanna’s “Now It’s Us Three” Post Sparks Speculation Ahead of Anime Awards 2026 Return -
The Softest Ever Homemade Gulab Jamun Recipe -
Where To Eat This Easter 2026: From Chef-Led Experiences To Traditional Feasts Across India -
International Carrot Day 2026: The Hydrating, Skin-Loving Vegetable To Eat More This Summer -
Fluffy Jeera Rice Every Time: The Simple Trick You Need To Know
Sarvam AI Beats Gemini, ChatGPT in Indian Language OCR and Speech Tasks
In a proud moment for India's AI space, Bengaluru-based startup Sarvam AI has taken on global giants, and won where it matters most for Indian users. Its new tools, Sarvam Vision (for reading documents) and Bulbul V3 (for text-to-speech), have outperformed Google Gemini and ChatGPT in handling Indian languages and tricky real-world documents.
What Is Sarvam AI?
Sarvam AI is an Indian artificial intelligence startup focused on building generative AI models that understand India's linguistic diversity and contextual nuances. The company was founded in August 2023 by Dr Vivek Raghavan and Dr Pratyush Kumar, both veteran AI researchers with experience in building systems for Indian language processing.
Right from the beginning, Sarvam has had a single intention: to build tools that work for the multilingual context of India, from OCRs reading complex forms and scripts to voice models that speak naturally across regional languages.
In his post, Union Minister for Electronics and IT, Ashwini Vaishnaw, noted that even critical reviewers are now praising Sarvam's technologically advanced models, adding that India's young engineers are working on innovations that will be noticed by the world as pathbreaking models.
Sarvam Vision: Redefining OCR for Indian Languages
At the heart of Sarvam's recent success is Sarvam Vision, a vision-language model engineered to handle difficult real-world document tasks, things like poorly scanned pages, handwritten notes, complex tables and mixed scripts that many generic AI systems struggle with.
In benchmark tests:
- Sarvam Vision obtained an accuracy of 84.3 % in olmOCR-Bench, which outperformed Gemini 3 Pro and other well-known OCR systems.
- It also scored 93.28 % in OmniDocBench v1.5, being able to read and understand real-world documents with aplomb.
- These results are particularly noteworthy, as global models often focus on broad multilingual capability but falter with messy or varied script layouts-a common scenario in Indian paperwork.
Bulbul V3: A Natural Voice for Indian Languages
Alongside Sarvam Vision, the startup's Bulbul V3 Text-to-Speech model is making waves. Introduced in early February 2026, the model produces high-quality speech, complete with tone and regional variations, in multiple Indian languages. It currently offers over 35 high-quality voices, with further intents to support all 22 Indian scheduled languages.
Why This Win Matters
While global giants, such as Google Gemini and ChatGPT continue to be the leaders in general-purpose AI systems, Sarvam's achievement brings to mind an important fact: while developing an AI system requires a deep knowledge of local languages to make it better than the biggest global players for local issues.
This has implications beyond the issue of prestige. In addition, improved OCR and voice technologies can greatly facilitate access to digital technologies for governance, banking, education, etc., particularly in a linguistically diverse India.



Click it and Unblock the Notifications












