
Abhinav Gupta
I am a Research Scientist at Google DeepMind working with the Gemini post-training team based in London UK. I'm interested in the intersection of reinforcement learning and language focusing on fine-tuning LLMs with machine/execution feedback and building robust evaluation metrics. I received my PhD from MILA where I worked on improving self-play in emergent communication and also hold a Masters degree from NYU. My personal website can be found at guabhinav.com.
Research Areas
Authored Publications
Sort By
Google
Dynamic population-based meta-learning for multi-agent communication with natural language
Marc Lanctot
Angeliki Lazaridou
Advances in Neural Information Processing Systems, Curran Associates, Inc. (2021), pp. 16899-16912