
Soravit (Beer) Changpinyo
Research Areas
Authored Publications
Sort By
Google
PaLI-X: On Scaling up a Multilingual Vision and Language Model
Josip Djolonga
Piotr Padlewski
Basil Mustafa
Carlos Riquelme
Sebastian Goodman
Yi Tay
Siamak Shakeri
Daniel Salz
Michael Tschannen
Hexiang (Frank) Hu
Mandar Joshi
Matthias Minderer
Filip Pavetić
Gang Li
Lucas Beyer
Anurag Arnab
Yuanzhong Xu
Keran Rong
Alexander Kolesnikov
Xiaohua Zhai
Neil Houlsby
Computer Vision and Pattern Recognition Conference (CVPR) (2024)
PaLI: A Jointly-Scaled Multilingual Language-Image Model
Piotr Padlewski
Daniel Salz
Sebastian Alexander Goodman
Basil Mustafa
Lucas Beyer
Alexander Kolesnikov
Keran Rong
Hassan Akbari
Linting Xue
James Bradbury
Chao Jia
Carlos Riquelme
Xiaohua Zhai
Neil Houlsby
International Conference on Learning Representations (ICLR) (2023)
Connecting Vision and Language with Video Localized Narratives
Vittorio Ferrari
IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2023 (to appear)
MetaCLUE: Towards Comprehensive Visual Metaphors Research
Brendan Driscoll
Zhiwei Jia
Garima Pruthi
Leonidas Guibas
Varun Jampani
CVPR (2023)
PreSTU: Pre-Training for Scene-Text Understanding
Jihyung Kil
Hexiang (Frank) Hu
Sebastian Goodman
Wei-Lun Chao
ICCV (2023)
What You See is What You Read? Improving Text-Image Alignment Evaluation
Michal Yarom
Eran Ofek
arXiv (2023)
MaXM: Towards Multilingual Visual Question Answering
Linting Xue
Michal Yarom
Findings of ACL: EMNLP (2023)