Accelerating Molecular Graph Neural Networks via Knowledge Distillation

Filip Ekström Kelvinius; Dimitar Georgiev; Artur Petrov Toshev; Johannes Gasteiger

Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS) (2023)

Abstract

Recent advances in graph neural networks (GNNs) have enabled molecular simulations with accuracy on par with conventional gold-standard methods at a fraction of the computational cost. However, as the field has moved toward larger and more complex architectures, state-of-the-art GNNs have become prohibitively expensive for many large-scale applications. In this paper, we explore, for the first time, the utility of knowledge distillation (KD) for accelerating molecular GNNs. To this end, we devise KD strategies that facilitate the distillation of hidden representations in directional and equivariant GNNs, and we evaluate their performance on the regression task of energy and force prediction. We validate our protocols across different teacher-student configurations and demonstrate that they can boost the predictive accuracy of student models without altering their architecture. We also comprehensively optimize various components of our framework and investigate the potential of data augmentation to further enhance performance. Overall, we close as much as 59% of the gap in predictive accuracy between models like GemNet-OC and PaiNN at zero additional inference cost.
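
To make the idea of distilling hidden representations concrete, here is a minimal sketch in plain Python. The abstract does not give the exact loss, so everything here is an assumption made for illustration: the mean-squared-error feature term, the linear projection that maps student features to the teacher's width, the weighting factor `kd_weight`, and all function names. It covers only scalar (invariant) per-atom features; vectorial features in equivariant models would need a projection that preserves equivariance, which is not shown. The `gap_closed` helper reflects one plausible reading of "closing the gap", namely the fraction of the error difference between the plain student and the teacher that the distilled student recovers.

```python
from typing import List, Sequence

Vector = Sequence[float]


def mse(a: Sequence[Vector], b: Sequence[Vector]) -> float:
    """Mean squared error over all entries of two equally shaped lists of vectors."""
    total, count = 0.0, 0
    for row_a, row_b in zip(a, b):
        for x, y in zip(row_a, row_b):
            total += (x - y) ** 2
            count += 1
    return total / count


def project(features: Sequence[Vector], weight: Sequence[Vector]) -> List[List[float]]:
    """Map student features (n_atoms x d_s) to the teacher width using weight (d_s x d_t)."""
    d_t = len(weight[0])
    return [
        [sum(f[k] * weight[k][j] for k in range(len(f))) for j in range(d_t)]
        for f in features
    ]


def distillation_loss(
    task_loss: float,
    student_feats: Sequence[Vector],
    teacher_feats: Sequence[Vector],
    weight: Sequence[Vector],
    kd_weight: float = 1.0,
) -> float:
    """Task loss (e.g. energy + force error) plus a weighted feature-matching term.

    Assumption: the feature term is an MSE between projected student features
    and frozen teacher features; the source does not specify this form.
    """
    projected = project(student_feats, weight)
    return task_loss + kd_weight * mse(projected, teacher_feats)


def gap_closed(student_err: float, distilled_err: float, teacher_err: float) -> float:
    """Fraction of the student-teacher error gap recovered by the distilled student."""
    return (student_err - distilled_err) / (student_err - teacher_err)
```

Because the extra term only enters the training objective, the student's architecture and its inference cost are unchanged, which is consistent with the "zero additional inference cost" claim in the abstract.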

Research Areas

Machine intelligence
