Confidential Neural Computing - hosting generative AI model workloads in a Trusted Execution Environment

Joe Woodworth; Zhimin Yao

Confidential Neural Computing - hosting generative AI model workloads in a Trusted Execution Environment

Joe Woodworth

Zhimin Yao

(2024)

Google Scholar

Abstract

As generative AI models grow more & more capable, products increasingly want to leverage these models to provide personalized generative experiences for their users. This personalization relies on fine-tuning and running these models with sensitive user data. The sensitivity of this user data motivates the need to train & run these models in a privacy-safe way that provides strong safety guarantees to the user, and earns user trust.

The Confidential Neural Computing project builds an ML framework focused on enabled generative AI training and inference in secure enclaves. In this talk, we give an overview of some of the core components of the Confidential Neural Computing framework, explain how the framework leverages current CPU & GPU confidential computing technologies, and share updates from our current & on-going areas of investigation.

Explore our many areas of focus

Building a collaborative ecosystem

Shaping the future together

Translating discovery into real-world impact

Confidential Neural Computing - hosting generative AI model workloads in a Trusted Execution Environment

Abstract

Meet the teams driving innovation

Google AI

Google Cloud

Google DeepMind

Google Labs