Unlocking Enterprise AI Ownership: Oracle’s OCI Supercluster

Unlocking Enterprise AI Ownership: Oracle's OCI Supercluster

The News: In the Oracle CloudWorld keynote, Larry Ellison highlighted the Oracle Cloud Infrastructure (OCI) Supercluster as key to Oracle’s AI strategy for customers. Designed for machine learning (ML) training, this architecture scales from 512 to 16,000 NVIDIA H100 graphics processing units (GPUs), all connected by a 200 Gbps remote direct memory access (RDMA) network, boasting low latency. See the complete Press Release on the Oracle website.

Unlocking Enterprise AI Ownership: Oracle’s OCI Supercluster

Analyst Take: Available as a bare-metal service later this year out of the London and Chicago regions of OCI, OCI Supercluster represents a real and practical path forward for enterprise customers looking to “own their own AI.” Do-it-yourself options for AI infrastructure are almost completely cost prohibitive for all but the largest institutions, particularly as materials and supply problems persist in the chip market. But ownership of AI lifecycles is a critical element in enterprise adoption given AI’s particular need for relevant data to ensure accuracy and trust. In the enterprise, the most relevant data is proprietary data, and proprietary data means a requirement for privacy, provenance, and auditing protection.

How do you protect yourself and your partners while using proprietary data to train and use highly effective AI? There are two approaches. Most implementations use retrieval augmented generation (RAG), which adds tokenization of the user prompt and immediate search results to an AI model already trained on publicly available data sets. If the search is an enterprise search, its results can be proprietary and protected while still honing AI inference.

RAG, and RAG-enabling capabilities such as vector database search, have been incorporated throughout the Oracle portfolio. OCI Supercluster is one of the key enablers of this approach, but as a component of other Oracle products and services rather than directly accessed by customers who use this first approach. One such Oracle offering is OCI’s Generative AI service, supporting large language models (LLMs) trained by Cohere.

The second approach to using proprietary data for AI, however, is the only approach if the AI application cannot be based on a readily available ML model. The organization must then train its own models, and to do so, it will need well-architected, private, secure infrastructure. Well-architected infrastructure in this case is a system built for ML training that is highly scalable, has very high bandwidth to the memory and from node to node, and of course, offers low latency to massive quantities of training data available at high throughput. These features are the OCI Supercluster offer.

A Bellwether of the AI Path Forward

AI can only advance by, sooner or later, enabling leading-edge adopters of AI to push outside the boundaries of generative AI and the LLMs that have so captured the market and popular imagination. LLMs are only one kind of AI model, after all. Classification, time-series forecasting, decision trees, recommendation/diagnosis, and predictive regression are examples of modeling opportunities surely less comprehensible to the general public but with great potential, especially in certain industries and lines of scientific research. These kinds of AI will progress and even find their moment sometime, somewhere, and it is infrastructure such as OCI Supercluster, made available as private cloud services, that will allow enterprising organizations to pursue them.

This thinking is in the frame of the market as a whole and the maturing technologies that will enable its growth. Switching frames to the practical matter of an individual organization’s research and development strategy for AI, it is obvious that a supercharged, souped up, super big Supercluster is not needed for every ML model at every stage of its lifecycle. Oracle’s press release introduced two new infrastructure services for AI, only one of which is the bare-metal service mentioned earlier, running on the OCI Supercluster with its NVIDIA H100 Tensor Core GPUs numbering in the tens of thousands.

The other new AI service, also a bare-metal service but in this case suitable for more modest ML training as well as for AI inferencing (in other words, for running a trained AI model), does not run on the OCI Supercluster but rather on more conventional—if still advanced—infrastructure accelerated by NVIDIA L40S GPUs. Both the H100 and the L40S are next-generation NVIDIA GPUs, using the Hopper and Ada Lovelace microarchitectures, respectively. The L40S-based service is planned for launch within the next year, but Oracle did not announce in which regions it will first be available.

It is worth noting that both of these services, as bare-metal services, require significant technical, architectural, and operational expertise by customers to implement and use. What is not required is space, power, cooling, procurement, racking-cabling, system setting and validation, hardware maintenance, or any other resource-sapping activities for IT. OCI Supercluster and these two services are a significant step forward, even if they do not warrant a new acronym like MLtaaS (ML training as a service) or goodness knows what.

Looking Ahead

Oracle’s introduction of the OCI Supercluster significantly elevates its competitive stance, particularly in the arena of AI/ML enterprise solutions. The Supercluster aims to address two pivotal challenges in AI adoption—cost and data privacy—by providing a robust, scalable infrastructure for enterprises to own their own AI. This approach positions Oracle uniquely in the market as it enables a higher degree of customization and control for businesses over their AI initiatives. It also offers a practical solution to the current supply chain issues plaguing the semiconductor industry.

Oracle’s multi-pronged approach—consisting of RAG and bespoke, in-house ML model training—offers flexibility for diverse enterprise needs. The integration of RAG across Oracle’s product line signifies a thoughtful alignment of its AI strategy, further amplified by the Generative AI service supported by Cohere. These capabilities make Oracle more than just an infrastructure provider; it becomes a full-stack AI solution provider.

Moreover, the company’s new offerings are not monolithic but span different scales and requirements. Although the OCI Supercluster focuses on high-end, compute-intensive training tasks with its next-gen NVIDIA H100 GPUs, Oracle also plans another, more modest service based on NVIDIA L40S GPUs. Both are designed as bare-metal services, indicating a shift away from resource-intensive on-premises solutions, which often impede AI implementation due to operational overhead.

This move by Oracle resonates on multiple levels. It not only represents an advance in technological capabilities but also marks a change in the market’s narrative. Previously, the hyperscale cloud provider space has been largely dominated by the likes of AWS, Azure, and Google Cloud in terms of AI/ML capabilities. Oracle’s Supercluster, and its alignment with advanced GPU technology, positions it as a versatile, powerful alternative for enterprises seeking specialized, yet comprehensive, AI/ML solutions.

In summary, Oracle’s strategy and new offerings reflect a keen understanding of the complex challenges that enterprises face in AI adoption. The OCI Supercluster serves as both a technological and strategic milestone, potentially disrupting the current equilibrium among leading cloud service providers and setting a new precedent for customer-centric, versatile AI solutions.

Disclosure: The Futurum Group is a research and advisory firm that engages or has engaged in research, analysis, and advisory services with many technology companies, including those mentioned in this article. The author does not hold any equity positions with any company mentioned in this article.

Analysis and opinions expressed herein are specific to the analyst individually and data and other information that might have been provided for validation, not those of The Futurum Group as a whole.

Other Insights from The Futurum Group:

Oracle Database Analyst Summit: Powering the Multi-Cloud Era and Liberating Developers

Oracle Fiscal Q4 and FY 2023 Results: Oracle Showcases Cloud and AI Mettle in Delivering Record Full-Year Revenue

Oracle Frees Database 23c to Power Universal Modern Apps and Analytics Innovation

Author Information

Guy is the CTO at Visible Impact, responsible for positioning, GTM, and sales guidance across technologies and markets. He has decades of field experience describing technologies, their business and community value, and how they are evaluated and acquired. Guy’s specialty areas include cloud, DevOps/cloud-native/12-factor, enterprise applications, Big Data, governance-risk-compliance, containerization, virtualization, HPC, CPUs-GPUs, and systems lifecycle management.

Guy started his technology career as a research director for technology media company Ziff Davis, with stints at PC Magazine, eWeek, and CIO Insight. Prior to joining Visible Impact, he worked at Dell, including postings in marketing, product, and technical marketing groups for a wide range of products, including engineered systems, cloud infrastructure, enterprise software, and mission-critical cloud services. He lives and works in Austin, TX

Steven engages with the world’s largest technology brands to explore new operating models and how they drive innovation and competitive edge.

Related Insights
Is AI Ready for Real Work, or Are Enterprises Still Stuck in Experimentation?
July 4, 2026

Is AI Ready for Real Work, or Are Enterprises Still Stuck in Experimentation?

Most enterprises claim advanced AI maturity, but lack governance and deployment strategies. Leading organizations are moving from experimentation to measurable AI impact....
Compliance as Code Is No Longer Optional: Why Manual Reviews Can’t Keep Up
July 4, 2026

Compliance as Code Is No Longer Optional: Why Manual Reviews Can’t Keep Up

Qodo's 'Compliance as Code' framework automates enterprise AI compliance through PR checks, solving the data privacy and security gaps that plague manual reviews at scale....
Databricks AI’s GPU Reliability Push Exposes Hidden Risks for Large-Scale Training
July 3, 2026

Databricks AI’s GPU Reliability Push Exposes Hidden Risks for Large-Scale Training

Databricks AI reveals critical GPU reliability challenges in distributed training environments. Silent slowdowns and numerical corruption pose greater risks than visible failures, threatening model quality and compute efficiency at enterprise...
AI Code Review Hits a Wall: Why Speed Without Trust Risks Engineering Chaos
July 3, 2026

AI Code Review Hits a Wall: Why Speed Without Trust Risks Engineering Chaos

A survey shows 94% of engineering leaders use agentic AI coding tools, but 55% struggle with reliability and hallucinations—revealing a critical gap between development speed and production quality....
Brave's Browser Containers Raise the Bar for Privacy and Workflow Flexibility
July 3, 2026

Brave’s Browser Containers Raise the Bar for Privacy and Workflow Flexibility

As AI platform adoption accelerates to $181.3B projected market size, Brave's v1.92 release introduces native browser containers addressing data privacy concerns for 52.6% of enterprise decision makers managing multi-cloud AI...
Is Self-Healing ITOps Ready to Replace Manual Incident Response?
July 3, 2026

Is Self-Healing ITOps Ready to Replace Manual Incident Response?

LogicMonitor's AI-driven ITOps framework combines root-cause analysis with governed automation to reduce alert fatigue and accelerate issue resolution, as agentic AI reshapes enterprise infrastructure management....

Book a Demo

Welcome

The vision behind everything in Futurum’s Custom Research practice is this: research should show you what is happening, what comes next, and what to do about it. It should be personal to each audience, easy for people to grasp, and structured so LLMs can reason over it accurately. And it should be fast and turnkey; you want answers now, not another project to carry for quarters.

Whether you are defining business, channel, or go-to-market strategy; evaluating vendors or justifying ROI; or commissioning research to fill an emerging market need, we have your back, with a program that answers your questions with the objectivity and credibility to drive real decisions.

To do it, we bring unmatched data to bear: Futurum research, surveys, and market projections; validated market feeds; ETR’s 15 years of insight from 10,000 technology decision-makers; G2’s buyer and user data; and what our analysts hear every day. Add leading primary collection, from AI-moderated voice interviews to surveys and analyst-led interviews, all turnkey, and every project comes out credible, nuanced, and actionable.

And we don’t just drop the results in your lap. For internal work, we provide analyst-led sessions, interactive dashboards, and a range of formats. For market-facing work, Futurum delivers turnkey activation and amplification that actually gets seen, by people and by LLMs, through our media and share of voice. This is research that moves decisions and markets.

We will meet you wherever you are, from a fast-turn brief to a multi-year program, and shape the work to your goals, timeline, and budget. The right program for your moment.

If any of this is useful, I would love to talk.

Benjamin Brown, VP Custom Research, Futurum Research

Benjamin Brown

VP, Custom Research · The Futurum Group

Newsletter Sign-up Form

Get important insights straight to your inbox, receive first looks at eBooks, exclusive event invitations, custom content, and more. We promise not to spam you or sell your name to anyone. You can always unsubscribe at any time.

All fields are required






Thank you, we received your request, a member of our team will be in contact with you.