Abhinav Gupta

Abhinav Gupta

I am a Research Scientist at Google DeepMind working with the Gemini post-training team based in London UK. I'm interested in the intersection of reinforcement learning and language focusing on fine-tuning LLMs with machine/execution feedback and building robust evaluation metrics. I received my PhD from MILA where I worked on improving self-play in emergent communication and also hold a Masters degree from NYU. My personal website can be found at guabhinav.com.
Authored Publications
Sort By
  • Title
  • Title, descending
  • Year
  • Year, descending
    Google
Dynamic population-based meta-learning for multi-agent communication with natural language
Marc Lanctot
Angeliki Lazaridou
Advances in Neural Information Processing Systems, Curran Associates, Inc. (2021), pp. 16899-16912