Variational Image Compression with a Scale Hyperprior

Johannes Ballé; David Minnen; Saurabh Singh; Sung Jin Hwang; Nick Johnston

Variational Image Compression with a Scale Hyperprior

Johannes Ballé

David Minnen

Saurabh Singh

Sung Jin Hwang

Nick Johnston

6th Int. Conf. on Learning Representations (ICLR) (2018)

Download Google Scholar

Abstract

We describe an end-to-end trainable model for image compression based on variational autoencoders. The model incorporates a hyperprior to effectively capture spatial dependencies in the latent representation. This hyperprior relates to side information, a concept universal to virtually all modern image codecs, but largely unexplored in image compression using artificial neural networks (ANNs). Unlike existing autoencoder compression methods, our model trains a complex prior jointly with the underlying autoencoder. We demonstrate that this model leads to state-of-the-art image compression when measuring visual quality using the popular MS-SSIM index, and yields rate–distortion performance surpassing published ANN-based methods when evaluated using a more traditional metric based on squared error (PSNR). Furthermore, we provide a qualitative comparison of models trained for different distortion metrics.

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Variational Image Compression with a Scale Hyperprior

Abstract

Research Areas

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs