Thomas Mensink

Thomas Mensink

I am a research scientist working on Computer Vision and Deep Learning.

Other research interests include: (learning) image representations, dense prediction tasks, zero-shot learning, metric learning and structured predictions all applied on image classification and retrieval tasks. My work has been awarded -among others- by the ECCV Koenderink Prize (2020), a NWO VENI Grant (2015), the ACM Multimedia Best Paper Award (2014), and the ACM ICMR Best Paper Award (2016).

For a full list of (pre-Google) publications see Google Scholar or personal website
Authored Publications
Sort By
  • Title
  • Title, descending
  • Year
  • Year, descending
    Google
Scaling Vision Transformers to 22 Billion Parameters
Josip Djolonga
Basil Mustafa
Piotr Padlewski
Justin Gilmer
Mathilde Caron
Rodolphe Jenatton
Lucas Beyer
Michael Tschannen
Anurag Arnab
Carlos Riquelme
Matthias Minderer
Gamaleldin Elsayed
Fisher Yu
Avital Oliver
Fantine Huot
Mark Collier
Vighnesh Birodkar
Yi Tay
Alexander Kolesnikov
Filip Pavetić
Thomas Kipf
Xiaohua Zhai
Neil Houlsby
Arxiv (2023)
How (not) to ensemble LVLMs for VQA
Lisa Alazraki
Lluis Castrejon
Fantine Huot
"I Can't Believe It's Not Better: Failure Modes in the Age of Foundation Models" at NeurIPS 2023 Workshops
Multi-Loss Weighting with Coefficient of Variations
Rick Groenendijk
Sezer Karaoglu
Theo Gevers
Winter Conference on Applications of Computer Vision (WACV) (2021)
EDEN: Multimodal Synthetic Dataset of Enclosed Garden Scenes
Hoang-An Le
Partha Das
Sezer Karaoglu
Theo Gevers
Winter Conference on Applications of Computer Vision (WACV) (2021)