
Arjun Reddy Akula
I am a Research Scientist at Google DeepMind in Mountain View. My research interests are in computer vision, natural language processing (NLP), statistical modeling and inference, and deep learning.
Prior to this, I received my PhD from UCLA in January 2022, advised by Prof. Song-Chun Zhu. During my PhD, I interned at Amazon Alexa AI (Sunnyvale, CA), Google Research (Los Angeles, CA), Amazon AI (Palo Alto, CA), and Mila (Montreal). Before my PhD, I worked as a research software engineer at IBM Research AI (India) for 2.5 years. I received my Bachelor's and Master's degrees in Computer Science and Engineering from IIIT Hyderabad, India. I am an active member of the academic community, serving as a reviewer/program committee member for ACL, CVPR, ARR, EMNLP, ICCV, AAAI, ECCV, NeurIPS, and NAACL. Outside of work, I enjoy hiking, traveling, and playing table tennis. My personal website is www.arjunakula.com.
Authored Publications
PRISM: A New Lens for Improved Color Understanding
Garima Pruthi
Inderjit Dhillon
Varun Jampani
EMNLP (2024)
MetaCLUE: Towards Comprehensive Visual Metaphors Research
Brendan Driscoll
Zhiwei Jia
Garima Pruthi
Leonidas Guibas
Varun Jampani
CVPR (2023)
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
Weixi Feng
Wanrong Zhu
Tsu-Jui Fu
Varun Jampani
Xuehai He
Xin Eric Wang
William Yang Wang
NeurIPS (2023)
KAFA: Rethinking Image Ad Understanding with Knowledge-Augmented Feature Adaptation of Vision-Language Models
Zhiwei Jia
Garima Pruthi
Hao Su
Varun Jampani
ACL (Industry Track) (2023)
Discriminative Diffusion Models as Few-shot Vision and Language Learners
Xuehai He
Weixi Feng
Tsu-Jui Fu
Varun Jampani
William Yang Wang
Xin Eric Wang
ArXiv (2023)
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
Weixi Feng
Xuehai He
Tsu-Jui Fu
Varun Jampani
Xin Eric Wang
William Yang Wang
ICLR (2023)
CPL: Counterfactual Prompt Learning for Vision and Language Models
Xuehai He
Diji Yang
Weixi Feng
Tsu-Jui Fu
Varun Jampani
William Yang Wang
Xin Eric Wang
Conference on Empirical Methods in Natural Language Processing (EMNLP) (2022)
ALFRED-L: Investigating the Role of Language for Action Learning in Interactive Visual Environments
Spandana Gella
Aishwarya Padmakumar
Mahdi Namazifar
Mohit Bansal
Jesse Thomason
Dilek Hakkani-Tur
Conference on Empirical Methods in Natural Language Processing (EMNLP) (2022)