IBM Lakehouse as Part of watsonx.data for AI and Analytics

The News: IBM announced a new lakehouse as part of watsonx.data at its Think conference.

Analyst Take: IBM has announced the IBM Lakehouse, which it positions as an evolution beyond the first-generation lakehouses in use today.

For those not familiar, a lakehouse combines a data lake and a data warehouse and serves as an analytics repository. It is more than a database and more than a data lake of unstructured file data: a lakehouse applies a schema to the data so that potentially massive data sets can be accessed quickly and selectively. Lakehouses are primarily used by query engines such as Presto and Apache Spark. They serve a large and growing analytics market that includes artificial intelligence (AI) and machine learning (ML), and adoption of AI and ML is driving the need for more data with faster access.
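To make "fast selective access" concrete, here is a minimal Python sketch of one common way a table layer over lake files can avoid reading everything: it keeps per-file column statistics and skips files that cannot match a query. The `DataFile` class, the `order_date` column, and the file names are hypothetical illustrations and do not describe IBM's implementation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DataFile:
    """Hypothetical metadata a table layer might keep for each file in the lake."""
    path: str
    min_order_date: str  # smallest order_date value stored in the file
    max_order_date: str  # largest order_date value stored in the file


def files_to_scan(files, start, end):
    """Return only the files whose date range overlaps [start, end]."""
    # ISO-8601 date strings sort in calendar order, so string comparison works.
    return [f for f in files if f.max_order_date >= start and f.min_order_date <= end]


FILES = [
    DataFile("orders-2023-01.parquet", "2023-01-01", "2023-01-31"),
    DataFile("orders-2023-02.parquet", "2023-02-01", "2023-02-28"),
    DataFile("orders-2023-03.parquet", "2023-03-01", "2023-03-31"),
]

# A query for mid-February only needs to open one of the three files.
selected = files_to_scan(FILES, "2023-02-10", "2023-02-20")
print([f.path for f in selected])
```

With raw files and no table metadata, the same query would have to open every file to find the matching rows.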

Key specifics of the IBM Lakehouse announcement include:

  • Can be deployed in less than 10 minutes
  • Will work in public cloud or on-premises
  • Offered also with an on-premises integrated appliance, the IBM Storage Fusion HCI:
      • OpenShift on bare metal
      • Standard x86 servers
      • Nvidia GPUs
      • IBM Storage

The IBM Lakehouse supports the Apache Iceberg table format for query engines. This enables:

  • SQL access for large data sets
  • Multiple simultaneous query engine access
  • ACID transaction support (Atomicity, Consistency, Isolation, Durability)
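As a rough illustration of how several engines can share one table safely, the Python sketch below models the general idea behind snapshot-based table formats such as Iceberg: table state is a series of immutable snapshots, and a writer's commit succeeds only if no other writer has committed since it started. This is a toy in-memory model under stated assumptions, not Iceberg's API or IBM's implementation; `ToyTable`, `CommitConflict`, and the row values are hypothetical.

```python
import threading


class CommitConflict(Exception):
    """Raised when another writer committed first."""


class ToyTable:
    """Toy model of a table whose state is a chain of immutable snapshots."""

    def __init__(self):
        self._lock = threading.Lock()
        self._snapshots = [()]  # snapshot 0 is an empty table

    def current_snapshot_id(self):
        with self._lock:
            return len(self._snapshots) - 1

    def read(self, snapshot_id):
        """Readers see one complete snapshot, never a half-finished write."""
        with self._lock:
            return self._snapshots[snapshot_id]

    def commit(self, base_snapshot_id, new_rows):
        """Publish a new snapshot only if the table is unchanged since base."""
        with self._lock:
            if base_snapshot_id != len(self._snapshots) - 1:
                raise CommitConflict("table changed; re-read and retry")
            self._snapshots.append(self._snapshots[-1] + tuple(new_rows))
            return len(self._snapshots) - 1


table = ToyTable()

# Two hypothetical engines start from the same snapshot.
base_a = table.current_snapshot_id()
base_b = table.current_snapshot_id()

table.commit(base_a, [("order-1", 100)])  # engine A commits first

try:
    table.commit(base_b, [("order-2", 250)])  # engine B's base is now stale
except CommitConflict:
    # Engine B retries against the latest snapshot.
    table.commit(table.current_snapshot_id(), [("order-2", 250)])

print(table.read(table.current_snapshot_id()))
```

Each commit is all-or-nothing, and a reader holding an older snapshot ID keeps seeing a consistent view while new commits land.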

To improve performance, the IBM Lakehouse implements a global persistent cache: every query engine can access the same cache, whether across hybrid clouds or on-premises.
Supported storage includes object storage accessed through the S3 protocol, IBM Cloud, and Google Cloud Storage.
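The benefit of a cache shared by every engine can be sketched in a few lines of Python. The example below is a simplified, in-memory read-through cache placed in front of a stand-in object store; it does not model persistence or multi-site distribution, and `ObjectStore`, `SharedCache`, `QueryEngine`, and the object key are hypothetical names rather than IBM components.

```python
class ObjectStore:
    """Stand-in for remote object storage; counts how often it is read."""

    def __init__(self, objects):
        self._objects = dict(objects)
        self.reads = 0

    def get(self, key):
        self.reads += 1
        return self._objects[key]


class SharedCache:
    """Read-through cache that several query engines can share."""

    def __init__(self, store):
        self._store = store
        self._entries = {}

    def get(self, key):
        if key not in self._entries:
            # Cache miss: fetch from object storage once and keep the result.
            self._entries[key] = self._store.get(key)
        return self._entries[key]


class QueryEngine:
    """Hypothetical engine that reads table data through the shared cache."""

    def __init__(self, name, cache):
        self.name = name
        self._cache = cache

    def scan(self, key):
        return self._cache.get(key)


store = ObjectStore({"sales/part-0": b"rows..."})
cache = SharedCache(store)
engine_a = QueryEngine("engine-a", cache)
engine_b = QueryEngine("engine-b", cache)

engine_a.scan("sales/part-0")  # first read goes to object storage
engine_b.scan("sales/part-0")  # second engine is served from the cache
print(store.reads)
```

If each engine kept its own private cache, the same object would be fetched from storage once per engine instead of once overall.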

Lakehouses have a big future, and IBM has pushed forward with an announcement that delivers key capabilities for customers. IBM has recognized that, given customer concerns about data privacy and storage costs, on-premises deployment matters as well as public cloud. It also understands how complex a container environment can be, and the packaged IBM Storage Fusion HCI is meant to be a rapid-deployment solution. IBM is a major player in data for analytics, AI, and ML, and the IBM Lakehouse will make its offerings more complete.

Disclosure: The Futurum Group is a research and advisory firm that engages or has engaged in research, analysis, and advisory services with many technology companies, including those mentioned in this article. The author does not hold any equity positions with any company mentioned in this article.

Analysis and opinions expressed herein are specific to the analyst individually and data and other information that might have been provided for validation, not those of The Futurum Group as a whole.

Author Information

Randy Kerns

Randy has written numerous industry articles and papers as an educator and presenter, and he is the author of two books: Planning a Storage Strategy and Information Archiving – Economics and Compliance. The latter is the first book of its kind to explore information archiving in depth. Randy regularly teaches classes on Information Management technologies in the U.S. and Europe.
