Google at ECCV 2024
Google at ECCV 2024
Google Research is proud to be a Diamond Sponsor of the European Conference on Computer Vision (ECCV 2024), a biennial premier research conference in Computer Vision and Machine Learning. ECCV 2024 is being held Sunday, September 29th through Friday, October 4th in Milan, Italy. Google has a strong presence at this year’s conference with over 70 accepted papers and active involvement in over 29 workshops and tutorials. We look forward to sharing some of our extensive research and expanding our partnership with the broader computer vision research community.
Attending ECCV 2024? Be sure to visit the Google booth to chat with researchers who are actively pursuing the latest innovations in computer vision, and check out some of the scheduled booth activities (e.g., demos and Q&A sessions listed below). Visit the @GoogleAI X and Google Research LinkedIn accounts to find out more about the Google booth activities at ECCV 2024.
Take a look below to learn more about Google's technical participation at ECCV 2024 (Google affiliations in bold).
All session times are provided in CEST.
Board & Organizing Committee
-
Remi Denton
- Ethics Review Committee
-
Jordi Pont-Tuset
- Workshops and Tutorial Chair & Area Chair
-
Ahmet Iscen
- Area Chair
-
Aishwarya Agrawal
- Area Chair
-
Alireza Fathi
- Area Chair
-
Andre Araujo
- Area Chair
-
Andrew Zisserman
- Area Chair
-
Angela Yao
- Area Chair
-
Arsha Nagrani
- Area Chair
-
Ayan Chakrabarti
- Area Chair
-
Bernt Schiele
- Area Chair
-
Cordelia Schmid
- Area Chair
-
Dan Xu
- Area Chair
-
Daniel Zoran
- Area Chair
-
Deqing Sun
- Area Chair
-
Dima Damen
- Area Chair
-
Du Tran
- Area Chair
-
Evan Shelhamer
- Area Chair
-
Federico Tombari
- Area Chair
-
Golnaz Ghiasi
- Area Chair
-
Joao Carreira
- Area Chair
-
Junhwa Hur
- Area Chair
-
Kenneth Marino
- Area Chair
-
Kevis-Kokitsi Maninis
- Area Chair
-
Krishna Kumar Singh
- Area Chair
-
Liang-Chieh Chen
- Area Chair
-
Long Chen
- Area Chair
-
Mei Chen
- Area Chair
-
Michael Niemeyer
- Area Chair
-
Michael Rubinstein
- Area Chair
-
Ming-Hsuan Yang
- Area Chair
-
Negar Rostamzadeh
- Area Chair
-
Olivia Wiles
- Area Chair
-
Richard Zhang
- Area Chair
-
Rodrigo Benenson
- Area Chair
-
Ryan Farrell
- Area Chair
-
Saining Xie
- Area Chair
-
Sayna Ebrahimi
- Area Chair
-
Tali Dekel
- Area Chair
-
Tatsuya Harada
- Area Chair
-
Thomas Mensink
- Area Chair
-
Timo Bolkart
- Area Chair
-
Vignesh Ramanathan
- Area Chair
-
Xiaoyu Wang
- Area Chair
-
Ying Wu
- Area Chair
Orals
-
Tues, October 1 | 9:00AM — 10:20AM
OmniNOCS: A Unified NOCS Dataset and Model for 3D Lifting of 2D ObjectsAkshay Krishnan, Abhijit Kundu, Kevis-Kokitsi Maninis, James Hays, Matthew Brown
-
Tues, October 1 | 1:30PM — 3:20PM
MobileNetV4: Universal Models for the Mobile EcosystemDanfeng Qin, Chas H Leichner, Manolis Delakis, Marco Fornoni, Shixin Luo, Fan Yang, Weijun Wang, Colby Banbury, Chengxi Ye, Berkin Akin, Vaibhav Aggarwal, Tenghui Zhu, Daniele Moro, Andrew Howard
-
Tues, October 1 | 1:30PM — 3:20PM
ConDense: Consistent 2D/3D Pre-Training for Dense and Sparse Features from Multi-View ImagesXiaoshuai Zhang, Zhicheng Wang, Howard Zhou, Soham Ghosh, Danushen Gnanapragasam, Varun Jampani, Hao Su, Leonidas Guibas
-
Wed, October 2 | 9:00AM — 10:20AM
UniIR: Training and Benchmarking Universal Multimodal Information RetrieversCong Wei, Yang Chen, Haonan Chen, Hexiang Hu, Ge Zhang, Jie Fu, Alan Ritter, Wenhu Chen
-
Thu, October 3 | 9:00AM — 10:20AM
Denoising Vision TransformersJiawei Yang*, Katie Z Luo*, Jiefeng Li, Congyue Deng, Leonidas Guibas, Dilip Krishnan, Kilian Weinberger, Yonglong Tian, Yue Wang
-
Thu, October 3 | 1:30PM — 3:20PM
BRAVE: Broadening the Visual Encoding of Vision-Language ModelsOğuzhan Fatih Kar, Alessio Tonioni, Petra Poklukar, Achin Kulshrestha, Amir Zamir, Federico Tombari
-
Fri, October 4 | 8:30AM — 10:10AM
Flash Cache: Reducing Bias in Radiance Cache Based Inverse RenderingBenjamin Attal, Dor Verbin, Ben Mildenhall, Peter Hedman, Jonathan T Barron, Matthew O'Toole, Pratul Srinivasan
-
Fri, October 4 | 8:30AM — 10:10AM
Parrot: Pareto-Optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image GenerationSeung Hyun Lee*, Yinxiao Li, Junjie Ke, Innfarn Yoo, Han Zhang, Jiahui Yu*, Qifei Wang, Fei Deng*, Glenn Entis, Junfeng He, Gang Li, Sangpil Kim, Irfan Essa, Feng Yang
Accepted papers
Diffusion Bridges for 3D Point Cloud Denoising
Mathias Vogel Hüni, Keisuke Tateno, Marc Pollefeys, Federico Tombari, Marie-Julie Rakotosaona, Francis Engelmann
D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction
Bowen Fu, Gu Wang, Chenyangguang Zhang, Yan Di, Ziqin Huang, Zhiying Leng, Fabian Manhardt, Xiangyang Ji, Federico Tombari
HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning
Zhecan Wang*, Garrett Bingham, Adams Wei Yu, Quoc V. Le, Thang Luong, Golnaz Ghiasi
MagicMirror: Fast and High-Quality Avatar Generation with Constrained Search Space
Armand Comas, Di Qiu, Menglei Chai, Marcel C. Bühler, Amit Raj, Ruiqi Gao, Qiangeng Xu, Mark J Matthews, Paulo Gotardo, Octavia Camps, Sergio Orts-Escolano, Thabo Beeler
Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment
Brian Gordon, Yonatan Bitton, Yonatan Shafir, Roopal Garg, Xi Chen, Dani Lischinski, Daniel Cohen-Or, Idan Szpektor
Nuvo: Neural UV Mapping for Unruly 3D Representations
Pratul Srinivasan, Stephan Garbin, Dor Verbin, Jonathan Barron, Ben Mildenhall
ReNoise: Real Image Inversion Through Iterative Noising
Daniel Garibi, Or Patashnik, Andrey Voynov, Hadar Averbuch-Elor, Danny Cohen-Or
Spatial-Temporal Multi-level Association for Video Object Segmentation
Deshui Miao, Xin Li, Zhenyu He, Huchuan Lu, Ming-Hsuan Yang
Text-Conditioned Resampler for Long Form Video Understanding
Bruno Korbar, Yongqin Xian, Alessio Tonioni, Andrew Zisserman, Federico Tombari
ViC-MAE: Self-Supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders
Jefferson Hernandez, Vicente Ordonez, Ruben Villegas
WordRobe: Text-Guided Generation of Textured 3D Garments
Astitva Srivastava, Pranav Manu, Amit Raj, Varun Jampani, Avinash Sharma
Bad Students Make Great Teachers: Active Learning Accelerates Large-Scale Visual Understanding
Talfan Evans, Shreya Pathak, Hamza Merzic, Jonathan Richard Schwarz*, Ryutaro Tanno, Olivier Henaff
Geometry Fidelity for Spherical Images
Anders Christensen*, Nooshin Mojab, Khushman Patel, Karan Ahuja, Zeynep Akata, Ole Winther, Mar Gonzalez Franco, Andrea Colaco
LookupViT: Compressing Visual Information to a Limited Number of Tokens
Rajat Koner, Gagan Jain, Prateek Jain, Volker Tresp, Sujoy Paul
MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices (see blog post)
Yang Zhao, Zhisheng Xiao, Yanwu Xu, Haolin Jia, Tingbo Hou
MVDD: Multi-View Depth Diffusion Models
Zhen Wang*, Qiangeng Xu, Feitong Tan, Menglei Chai, Shichen Liu, Rohit Pandey, Sean Fanello, Achuta Kadambi, Yinda Zhang
Optimizing Illuminant Estimation in Dual-Exposure HDR Imaging
Mahmoud Afifi, Zhenhua Hu, Liang Liang
PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations
Yang Zheng, Qingqing Zhao, Guandao Yang, Wang Yifan, Donglai Xiang, Florian Dubost, Dmitry Lagun, Thabo Beeler, Federico Tombari, Leonidas Guibas, Gordon Wetzstein
PointNeRF++: A Multi-Scale, Point-Based Neural Radiance Field
Weiwei Sun, Eduard Trulls, Yang-Che Tseng, Sneha Sambandam, Gopal Sharma, Andrea Tagliasacchi, Kwang Moo Yi
Region-Centric Image-Language Pretraining for Open-Vocabulary Detection
Dahun Kim, Anelia Angelova, Weicheng Kuo
ArtVLM: Attribute Recognition Through Vision-Based Prefix Language Modeling
William Yicheng Zhu, Keren Ye, Junjie Ke, Jiahui Yu*, Leonidas Guibas, Peyman Milanfar, Feng Yang
3D Congealing: 3D-Aware Image Alignment in the Wild
Yunzhi Zhang, Zizhang Li, Amit Raj, Andreas Engelhardt, Yuanzhen Li, Tingbo Hou, Jiajun Wu, Varun Jampani
Curved Diffusion: A Generative Model With Optical Geometry Control
Andrey Voynov, Amir Hertz, Moab Arar*, Shlomi Fruchter, Daniel Cohen-Or*
GIVT: Generative Infinite-Vocabulary Transformers
Michael Tschannen, Cian Eastwood*, Fabian Mentzer
IG Captioner: Information Gain Captioners are Strong Zero-shot Classifiers
Chenglin Yang, Siyuan Qiao, Yuan Cao, Yu Zhang, Tao Zhu, Alan Yuille, Jiahui Yu
Improving 2D Feature Representations by 3D-Aware Fine-Tuning
Yuanwen Yue, Anurag Das, Francis Engelmann, Siyu Tang, Jan Eric Lenssen
SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance
Lukas Hoyer*, David Joseph Tan, Muhammad Ferjad Naeem, Luc Van Gool, Federico Tombari
SMooDi: Stylized Motion Diffusion Model
Lei Zhong, Yiming Xie, Varun Jampani, Deqing Sun, Huaizu Jiang
Volumetric Rendering with Baked Quadrature Fields
Gopal Sharma, Daniel Rebain, Kwang Moo Yi, Andrea Tagliasacchi
Optimizing Factorized Encoder Models: Time and Memory Reduction for Scalable and Efficient Action Recognition
Shreyank N Gowda*, Anurag Arnab, Jonathan Huang
SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs
Yang Miao, Francis Engelmann, Olga Vysotska, Federico Tombari, Marc Pollefeys, Daniel Barath
Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection
Tim Salzmann, Markus Ryll, Alex Bewley, Matthias Minderer
SPIRE: Semantic Prompt-Driven Image Restoration
Chenyang QI*, Zhengzhong Tu, Keren Ye, Mauricio Delbracio, Peyman Milanfar, Qifeng Chen, Hossein Talebi
When and How do Negative Prompts Take Effect?
Yuanhao Ban, Ruochen Wang, Tianyi Zhou, Minhao Cheng, Boqing Gong, Cho-Jui Hsieh
AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-Level Retrieval
Pavel Suma, Giorgos Kordopatis-Zilos, Ahmet Iscen, Giorgos Tolias
EchoScene: Indoor Scene Generation via Information Echo Over Scene Graph Diffusion
Guangyao Zhai, Evin Pınar Örnek, Dave Zhenyu Chen, Ruotong Liao, Yan Di, Nassir Navab, Federico Tombari, Benjamin Busam
Finding NeMo: Negative-Mined Mosaic Augmentation for Referring Image Segmentation
Seongsu Ha, Chaeyun Kim, Donghwa Kim, Junho Lee, Sangho Lee, Joonseok Lee
Learned Neural Physics Simulation for Articulated 3D Human Pose Reconstruction
Mykhaylo Andriluka, Baruch Tabanpour, C. Daniel Freeman, Cristian Sminchisescu
NICP: Neural ICP for 3D Human Registration at Scale
Riccardo Marin, Enric Corona, Gerard Pons-Moll
Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation Without Manual Labels
Rui Huang, Songyou Peng, Ayca Takmaz, Federico Tombari, Marc Pollefeys, Shiji Son, Gao Huang, Francis Engelmann
Weakly Supervised 3D Object Detection via Multi-level Visual Guidance
Kuan-Chih Huang, Yi-Hsuan Tsai, Ming-Hsuan Yang
ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs
Viraj Shah*, Nataniel Ruiz, Forrester Cole, Erika Lu, Svetlana Lazebnik, Yuanzhen Li, Varun Jampani
AdaDiff: Accelerating Diffusion Models Through Step-Wise Adaptive Computation
Shengkun Tang, Yaqing Wang, Caiwen Ding, Yi Liang, Yao Li, Dongkuan Xu
Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations
Kilichbek Haydarov, Xiaoqian Shen, Avinash Madasu, Mahmoud Salem, Li-Jia Li, Gamaleldin F Elsayed, Mohamed Elhoseiny
Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Shuangkang Fang, Yufeng Wang, Yi-Hsuan Tsai, Yi Yang, Wenrui Ding, Shuchang Zhou, Ming-Hsuan Yang
DOCCI: Descriptions of Connected and Contrasting Images
Yasumasa Onoe, Sunayana Rane*, Zachary E Berger, Yonatan Bitton, Jaemin Cho*, Roopal Garg, Alexander Ku, Zarana Parekh, Jordi Pont-Tuset, Garrett Tanzer, Su Wang, Jason M Baldridge
GeoGaussian: Geometry-Aware Gaussian Splatting for Scene Rendering
Yanyan Li, Chenyu Lyu, Yan Di, Guangyao Zhai, Gim Hee Lee, Federico Tombari
Improving Point-Based Crowd Counting and Localization Based on Auxiliary Point Guidance
I-HSIANG CHEN, Wei-Ting Chen, Yu-Wei Liu, Ming-Hsuan Yang, Sy-Yen Kuo
Instant 3D Human Avatar Generation Using Image Diffusion Models
Nikos Kolotouros, Thiemo Alldieck, Enric Corona, Eduard Gabriel Bazavan, Cristian Sminchisescu
Lagrangian Hashing for Compressed Neural Field Representations
Shrisudhan Govindarajan, Zeno Sambugaro, Ahan Shabhanov, Towaki Takikawa, Weiwei Sun, Daniel Rebain, Nicola Conci, Kwang Moo Yi, Andrea Tagliasacchi
Photorealistic Video Generation with Diffusion Models
Agrim Gupta*, Lijun Yu, Kihyuk Sohn, Xiuye Gu, Meera Hahn, Li Fei-Fei, Irfan Essa, Lu Jiang, Jose Lezama
3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation
Zihao Xiao, Longlong Jing, Shangxuan Wu, Alex Zihao Zhu, Jingwei Ji, Chiyu Max Jiang, Wei-Chih Hung, Thomas Funkhouser, Weicheng Kuo, Anelia Angelova, Yin Zhou, Shiwei Sheng
SILC: Improving Vision Language Pre-training with Self-Distillation
Muhammad Ferjad Naeem*, Yongqin Xian, Xiaohua Zhai, Lukas Hoyer*, Luc Van Gool, Federico Tombari
TC4D: Trajectory-Conditioned Text-to-4D Generation
Sherwin Bahmani, Xian Liu, Yifan Wang, Ivan Skorokhodov, Victor Rong, Ziwei Liu, Xihui Liu, Jeong Joon Park, Sergey Tulyakov, Gordon Wetzstein, Andrea Tagliasacchi, David B. Lindell
WeConvene: Learned Image Compression with Wavelet-Domain Convolution and Entropy Model
Haisheng Fu, Jie Liang, Zhenman Fang, Jingning Han, Feng Liang, Guohe Zhang
Loc3Diff: Local Diffusion for 3D Human Head Synthesis and Editing
Yushi Lan*, Feitong Tan, Qiangeng Xu, Di Qiu, Kyle Genova, Zeng Huang, Rohit Pandey, Sean Fanello, Thomas Funkhouser, Chen Change Loy, Yinda Zhang
ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion
Daniel Winter, Matan Cohen, Shlomi Fruchter, Yael Pritch, Alex Rav-Acha, Yedid Hoshen
PALM: Predicting Actions Through Language Models
Sanghwan Kim, Daoji Huang, Yongqin Xian, Otmar Hilliges, Luc Van Gool, Xi Wang
Self-Supervised Shape Completion via Involution and Implicit Correspondences
Mengya Liu, Ajad Chhatkuli, Janis Postels, Luc Van Gool, Federico Tombari
Self-Training Room Layout via Geometry-Aware Ray-Casting
Bolivar Solarte, Chin-Hsuan Wu, Jin-Cheng Jhang, Jonathan Lee, Yi-Hsuan Tsai, Min Sun
Score Distillation Sampling with Learned Manifold Corrective
Thiemo Alldieck, Nikos Kolotouros, Cristian Sminchisescu
Taming CLIP for Fine-Grained and Structured Visual Understanding of Museum Exhibits
Ada-Astrid Balauca, Danda Pani Paudel, Kristina Toutanova, Luc Van Gool
Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors
Jae Joong Lee, Bosheng Li, Sara M Beery, Jonathan Huang, Songlin Fei, Raymond A. Yeh, Bedrich Benes
Workshops
-
Sun, September 29 | 8:45AM - 12:30PM, Suite 2
Workshop on Artificial Social IntelligenceOrganizer: Shiry Ginosar
-
Sun, September 29 | 9:00AM - 1:00PM, Amber 3
Recovering 6D Object PoseOrganizer: Martin Sundermeyer Speaker: Martin Sundermeyer
-
Sun, September 29 | 9:00AM - 1:00PM, Space 2
Self-Supervised Learning - What is next?Speaker: Olivier Henaff
-
Sun, September 29 | 9:00AM - 1:00PM, Suite 7
The Second Perception Test ChallengeOrganizers: Joe Heyward, Joao Carreira, Dima Damen, Viorica Pătrăucean
-
Sun, September 29 | 9:00AM - 1:00PM, Brown 2
3D Vision and Modeling Challenges in eCommerceSpeaker: Ira Kemelmacher-Shlizerman
-
Sun, September 29 | 1:00PM - 6:00PM, Suite 5
The Third ROAD Workshop & Challenge: Event Detection for Situation Awareness in Autonomous DrivingOrganizer: Yi-Hsuan Tsai
-
Sun, September 29 | 2:00PM - 6:00PM, Brown 1
Explainable AI for Computer Vision: Where Are We and Where Are We Going?Organizer: Bernt Schiele
-
Sun, September 29 | 2:00PM - 6:00PM, Amber 6
2nd OmniLabel Workshop: Enabling Complex Perception Through Vision and Language Foundational ModelsOrganizer: Long Zhao Speakers: Saining Xie, Anelia Angelova
-
Sun, September 29 | 2:00PM - 5:30PM, Amber 4
OpenSUN3D: 3rd Workshop on Open-Vocabulary 3D Scene UnderstandingOrganizers: Francis Engelmann, Songyou Peng, Johanna Wald, Federico Tombari Speaker: Alex Bewley
-
Sun, September 29 | 2:00PM - 6:00PM, Amber 7 & 8
T-CAP - Towards a Complete Analysis of People: Fine-grained Understanding for Real-World ApplicationsSpeaker: Cristian Sminchisescu
-
Sun, September 29 | 2:00PM - 6:00PM, Amber 2
Traditional Computer Vision in the Age of Deep Learning (TradiCV)Speaker: Richard Szeliski
-
Sun, September 29 | 2:00PM - 6:00PM, Suite 6
Transparent & Reflective objects In the wild Challenges (TRICKY)Speaker: Michael Niemeyer
-
Mon, September 30 | 9:00AM - 1:00PM, Amber 5
Instance-Level Recognition WorkshopOrganizers: Andre Araujo, Bingyi Cao, Kaifeng Chen Speaker: Cordelia Schmid
-
Mon, September 30 | 9:00AM - 1:00PM, Suite 6
Map-Free Visual RelocalizationSpeaker: Simon Lynen
-
Mon, September 30 | 9:00AM - 1:00PM, Brown 2
Uncertainty Quantification for Computer VisionSpeaker: Tal Schuster
-
Mon, September 30 | 9:00AM - 12:30PM, Brown 3
Wild3D: 3D Modeling, Reconstruction, and Generation in the WildSpeaker: Noah Snavely
-
Mon, September 30 | 9:00AM - 1:00PM, Suite 2
2nd Workshop on More Exploration, Less Exploitation (MELEX)Speakers: Dima Damen, Bill Freeman
-
Mon, September 30 | 9:00AM - 1:00PM, Panorama Lounge
1st Workshop on Neural Fields Beyond Conventional CamerasKeynote Speaker: Jon Barron
-
Mon, September 30 | 2:00PM - 6:00PM, Brown 2
AI3DCC: The Second Workshop of AI for 3D Content CreationOrganizer: Leonidas Guibas Spotlight Speaker: Georgios Kopanas
-
Mon, September 30 | 2:00PM - 6:00PM, Amber 5
Emergent Visual Abilities and Limits of Foundation Models (EVAL-FoMo)Organizers: Jindong Gu, Aida Nematzadeh Speaker: Saining Xie
-
Mon, September 30 | 2:00PM - 6:00PM, Brown 3
Geometry in the Large Model EraOrganizers: Yonglong Tian, Leonidas Guibas Keynote Speaker: Thomas Funkhouser
-
Mon, September 30 | 2:00PM - 6:00PM, Amber 2
Sometimes Less is More: The First Dataset Distillation ChallengeOrganizer: Zhiwei Deng
-
Mon, Sep 30 | 2:00PM - 6:00PM, Panorama Lounge
Women in Computer VisionSpeakers: Dima Damen
-
Mon, September 30 | 2:00PM - 6:00PM, Suite 8
7th Workshop and Competition on Affective Behavior Analysis In-the-WildOrganizer: Stefanos Zafeiriou
-
Mon, September 30 | 2:00PM - 6:00PM, Suite 3
Workshop on Visual ConceptsSpeaker: Been Kim
Tutorials
-
Sun, September 29 | 9:00AM - 1:00PM, Amber 3
Efficient Text-to-Image and Text-to-3D ModelingOrganizers: Sadeep Jayasumana, Dilip Krishnan, Srikumar Ramalingam
-
Sun, September 29 | 9:00AM - 1:00PM, Brown 3
Large Multimodal Foundation ModelsSpeaker: Saining Xie
-
Sun, September 29 | 2:00PM - 6:00PM, Amber 3
Third Hands-on Egocentric Research Tutorial with Project Aria, from MetaSpeaker: Dima Damen
-
Mon, September 30 | 9:00AM - 1:00PM, Suite 7
A Bayesian Odyssey in Uncertainty: From Theoretical Foundations to Real-World ApplicationsOrganizer: Alex Immer
-
Mon, September 30 | 9:00AM - 1:00PM, Amber 7 & 8
Time is precious: Self-Supervised Learning Beyond ImagesSpeaker: João Carreira
Demos and Q&A at the Google Booth
*Dates and times may be subject to change. Stop by the Google booth (#41) for more details.
-
Tuesday, October 1 | 10:30AM - 11:00AM
Google DeepMind Media GenerationMiaosen Wang, Hang Qi, Chris Wolff, Siavash Khodadadeh, Abhishek Sharma, Norman Casagrande
-
Tuesday, October 1 | 4:30PM - 5:00PM
Q&A: Building a Career @ GoogleJason Zeidan, Daniel Trifunovich
-
Wednesday, October 2 | 10:30AM - 11:00AM
LookUpVit: Efficient/Flexible ViTRajat Koner, Sujoy Paul, Gagan Jain
-
Wednesday, October 2 | 12:30PM - 1:30PM
Meet the GDM TA teamLaura Giapino, Mike Carne
-
Wednesday, October 2 | 4:30PM - 5:00PM
Open Vocabulary 3D Scene UnderstandingFrancis Engelmann, Federico Tombari
-
Thursday, October 3 | 10:30AM - 11:00AM
PaliGemma: A Versatile 3B VLMAndreas Steiner
-
Thursday, October 3 | 12:30PM - 1:30PM
Q&A: Building a Career @ GoogleJason Zeidan, Daniel Trifunovich
-
Thursday, October 3 | 4:30PM - 5:0PAM
Mismatch Quest: Visual and Textual Feedback for Image-Text MisalignmentYonatan Bitton
* Work done while at Google