Qualcomm AI 100 Ultra Card

Qualcomm AI 100 Ultra Card

The Six Five team discusses Qualcomm AI 100 Ultra Card.

If you are interested in watching the full episode you can check it out here.

Disclaimer: The Six Five Webcast is for information and entertainment purposes only. Over the course of this webcast, we may talk about companies that are publicly traded and we may even reference that fact and their equity share price, but please do not take anything that we say as a recommendation about what you should do with your investment dollars. We are not investment advisors and we ask that you do not treat us as such.

Transcript:

Daniel Newman: Let’s talk a little bit about some new Qualcomm updates on the AI 100. Did I say AI or IA? It’s AI 100 Ultra Card, Pat, what’s going on?

Patrick Moorhead: So most of you know Qualcomm as a technology provider for smartphones, and then what we’ve learned if you’ve been paying attention is they’ve been increasing their portfolio into areas like automotive where they have a 30 billion backlog and it’s the first time last quarter that it actually made a highlight reel that the business was big enough that said that it helped the business. Also into PCs and also expanding into IoT. But what a lot of people don’t know is that Qualcomm is leveraging their very scalable AI blocks that they use in a lot of different implementations into the data center or the data center edge.

First we saw the Qualcomm AI 100 and that’s currently inside of AWS and not just for automotive customers like BMW who are going with the Qualcomm solutions for their cars, for self-driving and safety, but also are open for anybody who would want to use them. And as you would expect, the solution is very efficient in terms of what it can do, for lack of a better term, TOPS per watt. I think a lot of people are wondering like, Hey, are they going to keep this going? Well, here we go. A couple of days ago they dropped the A 100 Ultra that cranks out even more performance, trillion parameter models here, which is just shocking and excuse me, a hundred billion parameter model on 150 watt card.

The crazy part about this was, it was more than kind of dog with a note here if any of you speak French out there, but this has actually showed up with two customers. The first one is HPE, and the second one is Cerberus. So Cerberus, everybody knows HPE, very successful on the edge, very successful in high performance computing, and they’re offering an AI training as a service, which I’m still waiting details on pricing in GA. They had acquired a company called Cray who is a leader in highest performant supercomputer. We’re going to talk a little bit about that afterwards.

And then Cerberus is this wafer scale, literally the size of the chip is nearly the size of a wafer, and they don’t just sell the chip, they sell the entire system. That company has seen a lot of activity and interest from US departments of X, Y, Z, and you see the success that HPE has in those same just cloud maybe the US military. Then you combine the trust that a company like Qualcomm has on the inference side, it totally makes sense. HPE has already determined what they want to use for training. They didn’t talk about what they’re using for inference. I’m waiting more details for instance, is this the data center edge? We’re going to see this at Tesla or something like that. It’s a lot clearer cut for Cerberus who doesn’t have an inference capability. They’re more of a training. And now Cerberus can come in with an end-to-end solution leveraging Qualcomm, and I’m super interested to see about what the future holds of this business unit.

Daniel Newman: Yeah, Pat, this is really interesting, for lack of a better word, this is not the part of the market that people think about Qualcomm, but it’s a really useful piece of hardware based on my first assessment. Now, again, I’m reading over what Justin Hotard is saying at HPE, I’m looking at the Cerberus commentary. You’re talking about a very low power consumption, but powerful accelerator for these AI workloads. And it really looks like they’re kind of wedging their way into the cloud. They’re wedging their way onto the prem data centers. They’re wedging their way into the hyperscale cloud potentially here to be offering … to putting their, the way ARM has squeaked its way into the PC and how lower power has found its way.

Could this lower power trend in what Qualcomm has … it has quite a pedigree in this particular space, be the beginning of a new business unit for the company. And it sounds like that’s the direction it’s going. Now, it’s early days here, but from generation to generation, you’re seeing some really good improvements. You’re talking about it looks like something, it’s a pretty significant order of magnitude over that original Edge 100 that they had put out. It’s starting to look quite compelling. They’re finding OEM partners now. You’re hearing from cloud companies that are building as well as accelerator companies. What you’re seeing here from Cerberus that are basically saying, we can partner with Qualcomm here and get the types of gains we need.

It is early for me, I got to get a briefing to be candid, to learn a little bit more about this. But Pat, this could be the next IoT business for Qualcomm. This could be the next, where’s the next big growth come from? And you got to say, it would make Qualcomm more attractive. I know there are kind of heads down all in on AI PC, but let’s face it, data center dollars and margins are just better. So if they can really find their way in here and show that there’s a demand for lower power consumption and obviously high output accelerators, this could be an interesting place. And it’s exactly where Qualcomm is known to be able to play well. So early days. But let’s keep an eye on this Pat, because you know what? From FLOPS to TOPS, it’s an AI world.

Author Information

Daniel is the CEO of The Futurum Group. Living his life at the intersection of people and technology, Daniel works with the world’s largest technology brands exploring Digital Transformation and how it is influencing the enterprise.

From the leading edge of AI to global technology policy, Daniel makes the connections between business, people and tech that are required for companies to benefit most from their technology investments. Daniel is a top 5 globally ranked industry analyst and his ideas are regularly cited or shared in television appearances by CNBC, Bloomberg, Wall Street Journal and hundreds of other sites around the world.

A 7x Best-Selling Author including his most recent book “Human/Machine.” Daniel is also a Forbes and MarketWatch (Dow Jones) contributor.

An MBA and Former Graduate Adjunct Faculty, Daniel is an Austin Texas transplant after 40 years in Chicago. His speaking takes him around the world each year as he shares his vision of the role technology will play in our future.

Related Insights
Can Parallel Retrieval Redefine Enterprise AI Search Speed and Quality?
June 6, 2026

Can Parallel Retrieval Redefine Enterprise AI Search Speed and Quality?

Databricks' upgraded Agent Bricks Knowledge Assistant achieves 2x faster answer generation and 3x faster search latency through parallel test-time scaling, redefining enterprise AI search performance....
Will Glean's NVIDIA Nemotron 3 Ultra Integration Shift the Enterprise AI Stack?
June 6, 2026

Will Glean’s NVIDIA Nemotron 3 Ultra Integration Shift the Enterprise AI Stack?

Glean's integration of NVIDIA Nemotron 3 Ultra marks a pivotal moment in enterprise AI, where model flexibility and infrastructure alignment become strategic competitive advantages for buyers seeking cost-effective, high-context solutions....
Zendesk Bets on Embedded AI Support, Can Deep Microsoft 365 Integration Shift Enterprise Workflows?
June 5, 2026

Zendesk Bets on Embedded AI Support, Can Deep Microsoft 365 Integration Shift Enterprise Workflows?

Keith Kirkpatrick, Vice President & Research Director, Enterprise Software & Di at Futurum, Zendesk's new Support Assistant for Microsoft 365 embeds AI-powered support into Teams, Outlook, and Word to streamline...
Marvell’s Teralynx T100 Puts Power Efficiency at the Center of AI Networking
June 5, 2026

Marvell’s Teralynx T100 Puts Power Efficiency at the Center of AI Networking

Tom Hollingsworth, Networking Technology Advisor and Event Lead at Futurum, examines how the Marvell Teralynx T100 addresses AI networking power and latency constraints as hyperscalers build larger AI clusters....
Can Cisco Cloud Control Make AgenticOps Practical for Enterprises
June 5, 2026

Can Cisco Cloud Control Make AgenticOps Practical for Enterprises?

Tom Hollingsworth, Networking Technology Advisor and Event Lead at Futurum, examines how Cisco Cloud Control combines AI agents, operations, security, and resilience into a unified control plane for critical infrastructure....
Can NVIDIA Cosmos 3 Make Open Physical AI a Reality, Or Will Fragmentation Stall Progress?
June 5, 2026

Can NVIDIA Cosmos 3 Make Open Physical AI a Reality, Or Will Fragmentation Stall Progress?

NVIDIA Cosmos 3 launches as the first open omni-model for physical AI, targeting robotics and embodied AI with an open-source approach that challenges proprietary models from OpenAI, Google, and Amazon,...

Book a Demo

Newsletter Sign-up Form

Get important insights straight to your inbox, receive first looks at eBooks, exclusive event invitations, custom content, and more. We promise not to spam you or sell your name to anyone. You can always unsubscribe at any time.

All fields are required






Thank you, we received your request, a member of our team will be in contact with you.