The collaboration aims to combine IBM’s AI expertise in data, governance and model training technology with BharatGen’s national mandate and knowledge to create inclusive, India-centric multimodal and large language models rooted in indigenous context and values.
The initiative focuses on developing and scaling multimodal and language-specific AI models, and expanding their applications across sectors such as education, agriculture, banking, healthcare and citizen services.
“At BharatGen, we have been building sovereign AI models and an ecosystem that reflects the linguistic richness, cultural nuances and diverse needs of our people. This collaboration with IBM allows us to bring cutting-edge global research, scalable architectures and inclusive systems for India,” said Prof Ganesh Ramakrishnan of BharatGen.
The partnership will also build a scalable data pipeline using IBM’s selected open-source tools, enhanced with Indic-specific capabilities to streamline data preparation workflows. It will implement a governance framework from IBM’s enterprise-scale model development methodology to strengthen responsible model development.
Sandip Patel, Managing Director, IBM India and South Asia, said: “Through our collaboration with BharatGen, we aim to advance sovereign AI capabilities that reflect India’s diversity and deliver meaningful impact across sectors.”
BharatGen’s LLM and foundation model roadmap is designed to address both national and commercial needs across agriculture, education, healthcare, national security and finance. A key priority is the inclusion of underserved Indian languages and dialects beyond the top 12–22, ensuring broader digital participation and equity.