
Jordi Pont-Tuset
I am a research scientist at Google Research, Zurich, working in Vittorio Ferrari's team. I am also at the advisory board of Vilynx. Previously, I worked at ETHZ and Disney Research, and I collaborated with Prof. J. Malik’s vision group and with the startup Fezoo. I am a mathematician, engineer, and PhD in computer vision by UPC Barcelonatech.
Authored Publications
Sort By
Google
Rich Human Feedback for Text to Image Generation
Katherine Collins
Nicholas Carolan
Youwei Liang
Peizhao Li
Dj Dvijotham
Gang Li
Sarah Young
Jiao Sun
Kai Kohlhoff
Arseniy Klimovskiy
2024
DOCCI: Descriptions of Connected and Contrasting Images
Garrett Tanzer
Jaemin Cho
Su Wang
Sunayana Rane
Zack Berger
Zarana Parekh
(2024)
Imagen Editor and EditBench: Advancing and Evaluating Text-Guided Image Inpainting
Su Wang
Chitwan Saharia
Shai Noy
Stefano Pellegrini
Sarah Laszlo
Mohammad Norouzi
Peter Anderson
William Chan
CVPR (2023)
Connecting Vision and Language with Video Localized Narratives
Vittorio Ferrari
IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2023 (to appear)
Adversarially Robust Panoptic Segmentation (ARPaS) Benchmark
Laura Alexandra Daza Barragan
Pablo Arbelaez
Adversarial Robustness in the Real World (ECCV 2022 Workshop) (to appear)
Panoptic Narrative Grounding
Cristina González
Nicolas Ayobi Mendoza
Isabela Hernandez
José Hernández
Pablo Arbelaez
ICCV (2021)
PanGEA: The Panoramic Graph Environment Annotation Toolkit
Peter Anderson
2nd Workshop on Advances in Language and Vision Research (ALVR) (2021)
The Open Images Dataset V4: Unified image classification, object detection, and visual relationship detection at scale
Mohamad Hassan Mohamad Rom
Neil Alldrin
Ivan Krasin
Matteo Malloci
Alexander Kolesnikov
Vittorio Ferrari
IJCV (2020) (to appear)