Mihir Parmar
Mihir Parmar is a Research Scientist at Google, where his work sits at the forefront of artificial intelligence, with a primary emphasis on developing robust Multi-Agent Systems. By leveraging state-of-the-art foundation models, his core research explores how complex AI agents can collaborate, reason, and predict outcomes effectively to solve intricate problems. Beyond his primary focus on Gemini-driven multi-agent environments, Mihir maintains a prolific broader research portfolio dedicated to enhancing model capabilities. His additional work actively explores post-training with Reinforcement Learning (RL) and test-time scaling methods to improve reasoning efficiency and overall model performance. He is also recognized for his foundational work in instruction-tuning, highlighted by his highly cited research on Super-NaturalInstructions and his pioneering development of the first instruction-tuned model for the biomedical domain.