Google Cloud Set to Launch NVIDIA-Powered A3 GPU Virtual Machines

Google Cloud Set to Launch NVIDIA-Powered A3 GPU Virtual Machines

The News: Google extended its partnership with NVIDIA during Google Cloud Next ’23, announcing general availability (GA) of its H100-powered A3 GPU virtual machines (VMs) and outlining plans for future collaboration. See the announcement on the Google Cloud blog.

Google Cloud Set to Launch NVIDIA-Powered A3 GPU Virtual Machines

Analyst Take: The big Google- NVIDIA news out of the conference was that the A3 supercomputer VMs will be generally available next month. These VMs use NVIDIA’s H100 Tensor Core GPUs, which are built to train and serve demanding AI workloads and large language models (LLMs). Google claims the A3 instances, combined with Google Cloud infrastructure, can provide 3x faster training and 10x greater networking bandwidth over previous products. A3 VMs can also scale models to tens of thousands of NVIDIA H100 GPUs.

The A3 VM includes dual 4th Gen Intel Xeon scalable processors, 8 NVIDIA H100 GPUs per VM, and 2 TB of host memory. The A3 VM delivers 3.6 TB/s bisectional bandwidth between the eight GPUs via fourth-generation NVIDIA NVLink technology. The bandwidth comes from Google’s Titanium network adapter and NVIDIA Collective Communications Library (NCCL) optimizations.

During a Cloud Next keynote, Google Cloud CEO Thomas Kurian and NVIDIA CEO Jensen Huang spoke of other joint generative AI projects the companies are working on. These projects include:

  • Integrating Google’s serverless Spark with NVIDIA acceleration libraries and GPUs for data science workloads with Google’s Dataproc Hadoop and Spark managed service
  • Plans to put the NVIDIA DGX Cloud on Google Cloud Platform (GCP), so GCP customers can take advantage of NVIDIA’s AI cloud supercomputer
  • Co-engineering chips for data processing, model serving, networking, and software to integrate NVIDIA acceleration into the GCP Vertex AI development environment
  • Working on NVIDIA large-memory AI with DGX GH200 Grace Hopper Superchips and NVIDIA NLink Switch System
  • Enabling NVIDIA GPU acceleration for PaxML framework used by Google to build internal LLMs

Even while launching the next-generation of its own TPU custom chips for accelerating ML, Google made it clear that partnering with NVIDIA acceleration is essential for serious AI companies. That is an easy call, following NVIDIA’s strong earnings report in August when it doubled revenue year-over-year to $13.5 billion on the strength of its AI products and services.

The winners in the generative AI wars will be the companies that can best leverage their NVIDIA acceleration partnerships, and Google is fully engaged. Google Next ’23 featured presentations from General Motors, IHOP, Fox Sports, Six Flags, Wendy’s, Estée Lauder, GE Appliances, and healthcare firms Bayer Pharma, HCA Healthcare, and Meditech detailing their use of AI in the Google Cloud.

Disclosure: The Futurum Group is a research and advisory firm that engages or has engaged in research, analysis, and advisory services with many technology companies, including those mentioned in this article. The author does not hold any equity positions with any company mentioned in this article.

Analysis and opinions expressed herein are specific to the analyst individually and data and other information that might have been provided for validation, not those of The Futurum Group as a whole.

Other insights from The Futurum Group:

Google Cloud’s TPU v53 Accelerates AI Compute War

NVIDIA Generative AI Accelerates Automotive Innovation

Duet AI for Google Workspaces Enhances Google Meet and Google Chat

Author Information

Dave focuses on the rapidly evolving integrated infrastructure and cloud storage markets.

Related Insights
Is AI Ready for Real Work, or Are Enterprises Still Stuck in Experimentation?
July 4, 2026

Is AI Ready for Real Work, or Are Enterprises Still Stuck in Experimentation?

Most enterprises claim advanced AI maturity, but lack governance and deployment strategies. Leading organizations are moving from experimentation to measurable AI impact....
Compliance as Code Is No Longer Optional: Why Manual Reviews Can’t Keep Up
July 4, 2026

Compliance as Code Is No Longer Optional: Why Manual Reviews Can’t Keep Up

Qodo's 'Compliance as Code' framework automates enterprise AI compliance through PR checks, solving the data privacy and security gaps that plague manual reviews at scale....
Databricks AI’s GPU Reliability Push Exposes Hidden Risks for Large-Scale Training
July 3, 2026

Databricks AI’s GPU Reliability Push Exposes Hidden Risks for Large-Scale Training

Databricks AI reveals critical GPU reliability challenges in distributed training environments. Silent slowdowns and numerical corruption pose greater risks than visible failures, threatening model quality and compute efficiency at enterprise...
AI Code Review Hits a Wall: Why Speed Without Trust Risks Engineering Chaos
July 3, 2026

AI Code Review Hits a Wall: Why Speed Without Trust Risks Engineering Chaos

A survey shows 94% of engineering leaders use agentic AI coding tools, but 55% struggle with reliability and hallucinations—revealing a critical gap between development speed and production quality....
Brave's Browser Containers Raise the Bar for Privacy and Workflow Flexibility
July 3, 2026

Brave’s Browser Containers Raise the Bar for Privacy and Workflow Flexibility

As AI platform adoption accelerates to $181.3B projected market size, Brave's v1.92 release introduces native browser containers addressing data privacy concerns for 52.6% of enterprise decision makers managing multi-cloud AI...
Is Self-Healing ITOps Ready to Replace Manual Incident Response?
July 3, 2026

Is Self-Healing ITOps Ready to Replace Manual Incident Response?

LogicMonitor's AI-driven ITOps framework combines root-cause analysis with governed automation to reduce alert fatigue and accelerate issue resolution, as agentic AI reshapes enterprise infrastructure management....

Book a Demo

Welcome

The vision behind everything in Futurum’s Custom Research practice is this: research should show you what is happening, what comes next, and what to do about it. It should be personal to each audience, easy for people to grasp, and structured so LLMs can reason over it accurately. And it should be fast and turnkey; you want answers now, not another project to carry for quarters.

Whether you are defining business, channel, or go-to-market strategy; evaluating vendors or justifying ROI; or commissioning research to fill an emerging market need, we have your back, with a program that answers your questions with the objectivity and credibility to drive real decisions.

To do it, we bring unmatched data to bear: Futurum research, surveys, and market projections; validated market feeds; ETR’s 15 years of insight from 10,000 technology decision-makers; G2’s buyer and user data; and what our analysts hear every day. Add leading primary collection, from AI-moderated voice interviews to surveys and analyst-led interviews, all turnkey, and every project comes out credible, nuanced, and actionable.

And we don’t just drop the results in your lap. For internal work, we provide analyst-led sessions, interactive dashboards, and a range of formats. For market-facing work, Futurum delivers turnkey activation and amplification that actually gets seen, by people and by LLMs, through our media and share of voice. This is research that moves decisions and markets.

We will meet you wherever you are, from a fast-turn brief to a multi-year program, and shape the work to your goals, timeline, and budget. The right program for your moment.

If any of this is useful, I would love to talk.

Benjamin Brown, VP Custom Research, Futurum Research

Benjamin Brown

VP, Custom Research · The Futurum Group

Newsletter Sign-up Form

Get important insights straight to your inbox, receive first looks at eBooks, exclusive event invitations, custom content, and more. We promise not to spam you or sell your name to anyone. You can always unsubscribe at any time.

All fields are required






Thank you, we received your request, a member of our team will be in contact with you.