
Yonatan Bitton
Yonatan Bitton is a Research Scientist at Google Tel Aviv, working on vision-and-language generalization and multimodal consistency.
Research Areas
Authored Publications
Sort By
Google
ImageInWords: Unlocking Hyper-Detailed Image Descriptions
Andrew Bunner
Ranjay Krishna
(2024)
A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains
Alon Jacovi
Or Honovich
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (2024), pp. 4615–4634
DOCCI: Descriptions of Connected and Contrasting Images
Garrett Tanzer
Jaemin Cho
Su Wang
Sunayana Rane
Zack Berger
Zarana Parekh
(2024)
VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use
Anas Awadalla
Hritik Bansal
Jack Hessel
Josh GARDNER
Ludwig Schmidt
Rohan Taori
Rulin Shao
Wanrong Zhu
NeurIPS 2023, Datasets and Benchmarks (2023)
q2d: Automatic Dialog Generation to Improve Models' Query Generation
Enav Weinreb
Ido Hakimi
Shlomi Cohen-Ganor
Yoad Lewenberg
EMNLP 2023 (2023)
What You See is What You Read? Improving Text-Image Alignment Evaluation
Michal Yarom
Eran Ofek
arXiv (2023)
Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment
Brian Gordon
Dani Lischinski
Daniel Cohen-Or
arXiv (2023)