Publications

Our teams aspire to make discoveries that impact everyone, and core to our approach is sharing our research and tools to fuel progress in the field.

A Recipe for Improving Remote Sensing Zero Shot Generalization

Aviad Barzilai

Yotam Gigi

Vered Silverman

Yehonathan Refael

Bolous Jaber

Amr Helmy

Tomer Shekel

George Leifman

Genady Beryozkin

3rd ML4RS Workshop at ICLR 2025

Binamix -- A Python Library for Generating Binaural Audio Datasets

Dan Barry

Davoud Shariat Panah

Alessandro Ragano

Jan Skoglund

Andrew Hines

AES 158th Audio Engineering Society Convention (2025)

On the Design of the Binaural Rendering Library for Eclipsa Audio Immersive Audio Container

Tomasz Rudzki

Gavin Kearney

Jan Skoglund

AES 158th Convention of the Audio Engineering Society (2025)

Perceptual Evaluation of a Mix Presentation for Immersive Audio with IAMF

Carlos Tejeda-Ocampo

Toni Hirvonen

Ema Souza-Blanes

Mahmoud Namazi

Jan Skoglund

AES 158th Convention of the Audio Engineering Society (2025)

Bridging Sign and Spoken Languages: Pseudo GlossGeneration for Sign Language Translation

Peike Li

Trevor Cohn

Jianyuan Guo

Advances in Neural Information Processing Systems (NeurIPS) (2025)

Global-to-Local or Local-to-Global? Enhancing Image Retrieval with Efficient Local Search and Effective Global Re-ranking

Dror Aiger

Bingyi Cao

Andre Araujo

Kaifeng Chen

2025

Generating Dialogues from Egocentric Instructional Videos for Task Assistance: Dataset, Method and Benchmark

Lavisha Aggarwal

Vikas Bahirwani

Lin Li

Andrea Colaco

2025

PC-SRIF: Preconditioned Cholesky-based Square Root Information Filter for Vision-aided Inertial Navigation

Weikun Zhen

Yun Zhang

Parth Agrawal

Ryan DuToit

Chao Guo

Tong Ke

Toby Sharp

2025

VIDEOPHY-2: A Challenging Action-Centric Physical Commonsense Evaluation in Video Generation

Kai-Wei Chang

Hritik Bansal

Aditya Grover

Roman Goldenberg

Yonatan Bitton

Clark Peng

(2025)

Text to 3D Object Generation for Scalable Room Assembly

Sonia Laguna

Alberto García García

Marie-Julie Rakotosaona

Stylianos Moschoglou

Leonhard Helminger

Sergio Orts Escolano

2025

LightLab: Controlling Light Sources in Images with Diffusion Models

Nadav Magar

Amir Hertz

Eric Tabellion

Yael Pritch Knaan

Alex Rav Acha

Yedid Hoshen

Arik Shamir

SIGGRAPH Conference Papers '25 (2025)

RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation

Aviv Slobodkin

Hagai Taitelbaum

Yonatan Bitton

Brian Gordon

Michal Sokolik

Almog Gueta

Royi Rassin

Dani Lischinski

Idan Szpektor

2025

The JPEG XL Image Codec: History, Features, Coding Tools, Design Rationale, and Future

Jon Sneyers

Jyrki Alakuijala

Luca Versari

Zoltan Szabadka

Sami Boukortt

Amnon Cohen-Tidhar

Moritz Firsching

Evgenii Kliuchnikov

Tal Lev-Ami

Eric Portis

Thomas Richter

WATANABE Osamu

arxiv (2025) (to appear)

PhoMoH: Implicit Photo-realistic 3D Models of Human Heads

Mihai Zanfir

Thiemo Alldieck

Cristian Sminchisescu

International Conference on 3D Vision (2024)

Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis

Shangbang Long

Siyang Qin

Yasuhisa Fujii

Alessandro Bissacco

Michalis Raptis

Winter Conference on Applications of Computer Vision 2024 (2024) (to appear)

Defining the technology of today and tomorrow.

Philosophy

People

Research areas

Foundational ML & Algorithms

Computing Systems & Quantum AI

Science, AI & Society

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Publications

Learn more about how we conduct our research

Defining the technology of today and tomorrow.

Philosophy

People

Research areas

Foundational ML & Algorithms

Computing Systems & Quantum AI

Science, AI & Society

Projects

Publications

Resources

Shaping the future, together.

Student programs

Faculty programs

Conferences & events

Publications

Filter by:

Year

Team

Research Area

Learn more about how we conduct our research