Sarvam-30B

Sarvam-30B is an open-source 30B-parameter Mixture-of-Experts reasoning model from Sarvam AI trained from scratch and optimized for Indian languages, coding, and conversational workloads. It uses 128 sparse experts with 2.4B active parameters per token, Grouped Query Attention, and was pre-trained on 16 trillion tokens spanning code, mathematics, multilingual, and web data.

Context

Benchmarks