On June 29, 2026, researchers Avisha Das, Mihir Parmar, Mohana Ramnath, and Pulkit Verma introduced the Indi-RomCoM benchmark aimed at evaluating Large Language Models (LLMs) on Romanized Indic-English instructions. This benchmark is essential for understanding how well LLMs can handle code-mixed communication prevalent in multilingual communities.
Understanding Romanized Code Mixing
Romanized Code Mixing (RCM) is a form of communication where bilingual speakers fluidly blend local languages with English using Roman script. This method has become increasingly common in diverse linguistic environments. Despite the growing use of RCM, the performance of LLMs in this context remains largely unexplored.
The Indi-RomCoM benchmark addresses this gap by providing a systematic evaluation framework that includes seven instruction-following tasks across four widely spoken Indic languages. The benchmark also incorporates three levels of controlled code-mixing intensity, allowing for a comprehensive assessment of LLM capabilities.
Performance Evaluation of LLMs
The researchers conducted extensive evaluations of various LLMs, including proprietary, open-weight, and Indic-focused models. These evaluations were carried out under zero- and few-shot settings. The findings revealed that LLMs consistently underperform when handling RCM instructions, with performance deterioration correlating with increased code-mixing density.



