Accelerate innovation with Google Cloud and NVIDIA

NVIDIA and Google Cloud deliver accelerator-optimized solutions that address your most demanding workloads, including machine learning, high performance computing, data analytics, graphics, and gaming workloads.

Engineering what's next

Google distributed cloud brings models on-prem

See how Google Cloud and NVIDIA are collaborating to bring Gemini and advanced AI to the edge and regulated environments with Google Distributed Cloud.

NVIDIA-accelerated computing on Google Cloud

Google Cloud at NVIDIA GTC 2026-Elite sponsor

Join us at GTC 2026 to see how Google Cloud and NVIDIA are architecting the future of AI scale, security, and agentic workflows. Connect with our industry and product leaders during our exclusive sessions and happy hour to explore the breakthroughs shaping the next frontier of innovation.

Marquee and fireside chats

Infrastructure and scalability

AI agents and applied research

Security and confidential computing

Hands-on labs and tutorials

Networking and ecosystem

  • Cheers to the Future of AI with Google Cloud Happy Hour. Connect with Google Cloud’s AI startup, Industry, and product leaders to discuss generative AI breakthroughs, infrastructure, and the future of the ecosystem. March 17, 2026 5:30 PM (RSVP Required)

“Attending GTC with my team was invaluable, the ability to meet with so many customers alongside our product and engineering partners—it's set us up for success in 2025”

Lauren Kapnick, Head of Sales Hedge Funds FSI, Google Cloud

Google Cloud at NVIDIA GTC 2026-Elite sponsor

Join us at GTC 2026 to see how Google Cloud and NVIDIA are architecting the future of AI scale, security, and agentic workflows. Connect with our industry and product leaders during our exclusive sessions and happy hour to explore the breakthroughs shaping the next frontier of innovation.

Marquee and fireside chats

Infrastructure and scalability

AI agents and applied research

Security and confidential computing

Hands-on labs and tutorials

Networking and ecosystem

  • Cheers to the Future of AI with Google Cloud Happy Hour. Connect with Google Cloud’s AI startup, Industry, and product leaders to discuss generative AI breakthroughs, infrastructure, and the future of the ecosystem. March 17, 2026 5:30 PM (RSVP Required)

“Attending GTC with my team was invaluable, the ability to meet with so many customers alongside our product and engineering partners—it's set us up for success in 2025”

Lauren Kapnick, Head of Sales Hedge Funds FSI, Google Cloud

High Performing GPUs on Google Cloud

Accelerate machine learning, scientific computing, and generative AI with high-performance GPUs on Google Cloud.

Key Benefits:

  • Expedite workloads (generative AI, 3D visualization, HPC) with advanced AI hardware/software
  • Access diverse GPUs for varied performance and pricing
  • Optimize workloads with flexible pricing and machine customizations

Key Features

  • Diverse GPU Offerings: Compute Engine offers NVIDIA GPUs: RTX PRO 6000, GB300, GB200, B200, H200, H100, L4, P100, P4, T4, V100, A100. Options cover various cost/performance needs.
  • Adaptable Performance: Achieve optimal balance of processor, memory, high-performance disk, and up to 8 GPUs per instance. Benefit from per-second billing.
  • Leverage Google Cloud Advantages: Run GPU workloads on Google Cloud and access industry-leading storage, networking, and data analytics.

NVIDIA technologies on Google Cloud

Google Kubernetes Engine (GKE)

Leverage GKE's scalability, NVIDIA Multi-Instance GPU (MIG) support, and GPU time-sharing for efficient generative AI training, inference, and other compute-intensive workloads. Optimize resource utilization and minimize operational costs.

Vertex AI

Combine NVIDIA accelerated computing with Vertex AI, a unified MLOps platform. Utilize NVIDIA GPUs and AI software (such as, Triton™ Inference Server) within Vertex AI Training, Prediction, Pipelines, and Notebooks to accelerate generative AI development and deployment without infrastructure complexities.

Cloud Run

Deploy generative AI faster with NVIDIA NIM on Cloud Run, a fully managed serverless platform. Cloud Run's GPU support allows NIM to optimize performance and accelerate gen AI model deployment in a serverless environment.

Dynamic Workload Scheduler

Access NVIDIA GPU capacity on Google Cloud for short-duration AI workloads (training, fine-tuning, experimentation). Flexible scheduling and atomic provisioning enhance resource utilization and optimize costs across services like GKE, Vertex AI, and Batch.

Google Distributed Cloud

The NVIDIA Blackwell platform on Google Distributed Cloud enables secure, on-premises deployment of advanced agentic AI (including Google Gemini models). This offers breakthrough AI performance and scalability for sensitive, regulated workloads, ensuring data privacy, sovereignty, and compliance.


Technical resources for deploying NVIDIA technologies on Google Cloud

Google Cloud basics

  • GPUs on Compute Engine: Compute Engine provides GPUs that you can add to your virtual machine instances. Learn more
  • Using GPUs for training models in the cloud: Accelerate the training process for many deep learning models, like image classification, video analysis, and natural language processing. Learn more
  • Attaching GPUs to Dataproc clusters: Attach GPUs to the master and worker Compute Engine nodes in a Dataproc cluster to accelerate specific workloads, such as machine learning and data processing. Learn more
  • Using GPUs with Dataflow: Using GPUs in Dataflow jobs lets you accelerate some of your machine learning and other compute intensive data processing tasks. Learn more


Tutorials

  • Learn how to add or remove GPUs from a Compute Engine VM. Learn more
  • Installing GPUs Drivers: This guide shows ways to install NVIDIA proprietary drivers after you’ve created an instance with one or more GPUs. Learn more
  • GPUs on Google Kubernetes Engine: Learn how to use GPU hardware accelerators in your Google Kubernetes Engine clusters’ nodes. Learn more

View all product documentation


Jeff Dean head-shot

Join Google's Deepmind Chief Scientist Jeff Dean for a fireside chat at GTC: Advancing to AI's Next Frontier

Wednesday, March 18 | 4:00 p.m. - 5:00 p.m.

Google Cloud and NVIDIA collaborations

Customer stories

Learn how SandboxAQ accelerates scientific discovery with AI
SANDBOXAQ logo
PUMA logo
Learn how PUMA built an AI jersey designer with Google Cloud and NVIDIA

Augment Code

Augment Code accelerates AI coding with Google Cloud and NVIDIA

Galileo

Galileo: De-risk LLMs and build trustworthy AI apps at scale with Gemini, NVIDIA, and Google Cloud

Galileo + NVIDIA + Google Cloud
LiveX AI + NVIDIA + Google Cloud

Baseten

How Baseten achieves 225% better cost-performance for AI inference with NVIDIA and Google Cloud

LiveX AI

LiveX AI slashes support costs by 85% using GKE and NVIDIA AI-powered agents.

LiveX AI + NVIDIA + Google Cloud
Compared with another inference platform, running on GKE with NVIDIA NIM and GPUs delivered 6.1x acceleration in average answer/response generation speed for the Amazfit AI agent

Jia Li Co-Founder, Chief AI Officer, LiveX AI

Read more

Take the next step

Tell us what you’re solving for. A Google Cloud NVIDIA expert is ready to help you find the best solution.




Google Cloud