Unsupervised Monocular Depth Learning in Dynamic Scenes

Hanhan Li; Ariel Gordon; Hang Zhao; Vincent Casser; Anelia Angelova

Unsupervised Monocular Depth Learning in Dynamic Scenes

Hanhan Li

Ariel Gordon

Hang Zhao

Vincent Casser

Anelia Angelova

Conference on Robot Learning (CoRL) (2020)

Download Google Scholar

Abstract

We present a method for jointly training the estimation of depth, ego-motion, and a dense 3D translation field of objects relative to the scene, with monocular photometric consistency being the sole source of supervision. We show that this apparently heavily-underdetermined problem can be regularized by imposing the following prior knowledge about 3D translation fields: they are sparse, since most of the scene is static, and they tend to be constant for rigid moving objects. We show that this regularization alone is sufficient to train monocular depth prediction models that exceed the accuracy achieved in prior work for dynamic scenes, including semantically-aware methods. The code is available at https://github.com/google-research/google-research/tree/master/depth_and_motion_learning.

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Unsupervised Monocular Depth Learning in Dynamic Scenes

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs