Databricks Acquires Okera

The News: Databricks announced the acquisition of Okera on May 3, 2023, for an undisclosed sum. For more information, see the blog published on the Databricks website.

Databricks Acquires Okera

Analyst Take: Okera provides secure data access with discovery of private information in data lakes, data warehouses, and lakehouses. Okera also includes a no-code policy engine and self-service portal for data administrators to manage data access. Okera, founded in 2016, had raised $29.6 million in venture funding. Databricks offers a lakehouse platform for unifying analytics data for Artificial Intelligence and Machine Learning. To date, Databricks has raised $3.5 billion in venture funding.

The acquisition gives Databricks a data privacy and governance solution for their unified data platform. As Artificial Intelligence and Machine Learning continue to develop and become mainstream in usage, the need for privacy and security becomes increasingly visible.

Okera brings to Databricks a solution that includes:

  • Discovery and classification of personally identifiable and private information
  • Metadata tagging of information with what Okera terms AI/ML technology
  • No-code policy creation and self-service portal for data administrators
  • Policy enforcement for data access based on metadata and policies
  • Reporting on data usage and access patterns.
  • Ability to scale security without impeding access to data

Databricks will integrate the Okera solution into their Unity Catalog, expanding governance and security for data. The addition of the privacy and security capabilities, especially in the area allowing data administrators to create no-code policy controls, will be highly valuable for AI/ML enterprise deployments and will set Databricks apart from other lakehouse solutions.

It should be noted that Nong Li, the Founder and CEO of Okera is known for creating Apache Parquet, the open-source data storage format which Databricks is based on. Nong also worked briefly at Databricks. This should be an important enabler for successful integration of Okera into Databricks.

Disclosure: The Futurum Group is a research and advisory firm that engages or has engaged in research, analysis, and advisory services with many technology companies, including those mentioned in this article. The author does not hold any equity positions with any company mentioned in this article.

Analysis and opinions expressed herein are specific to the analyst individually and data and other information that might have been provided for validation, not those of The Futurum Group as a whole.

Other insights from The Futurum Group:

NIST Launches the Trustworthy & Responsible Artificial Intelligence Resource Center

GroqDay: Groq Sets its Sights on Accelerating the Speed of Artificial Intelligence

New Software Intelligence Apps Unveiled by Dynatrace

Author Information

Randy Kerns

Randy has written numerous industry articles and papers as an educator and presenter, and he is the author of two books: Planning a Storage Strategy and Information Archiving – Economics and Compliance. The latter is the first book of its kind to explore information archiving in depth. Randy regularly teaches classes on Information Management technologies in the U.S. and Europe.

Related Insights
Can Claude Opus 4.7 and Ensemble AI Models Finally Make Code Review Reliable?
April 18, 2026

Can Claude Opus 4.7 and Ensemble AI Models Finally Make Code Review Reliable?

CodeRabbit's ensemble AI code review system using Claude Opus 4.7 catches subtle bugs and race conditions that single-model systems miss, signaling a major shift in software quality assurance....
Will GPT-Rosalind Redefine AI’s Role in Life Sciences R&D?
April 18, 2026

Will GPT-Rosalind Redefine AI’s Role in Life Sciences R&D?

OpenAI's GPT-Rosalind marks a pivotal shift in enterprise AI, delivering domain-specific reasoning for life sciences while intensifying competition between horizontal and vertical AI specialists....
Can Real-Time Code Quality Tools Like Qodo and Cursor Break the Pull Request Bottleneck?
April 18, 2026

Can Real-Time Code Quality Tools Like Qodo and Cursor Break the Pull Request Bottleneck?

Qodo's integration with Cursor demonstrates how real-time code quality tools are eliminating pull request bottlenecks by surfacing issues as developers write code, not after submission....
Can CodeRabbit's Multi-Repo Analysis End the Microservices Blind Spot in Code Review?
April 18, 2026

Can CodeRabbit’s Multi-Repo Analysis End the Microservices Blind Spot in Code Review?

CodeRabbit's new Multi-Repo Analysis feature surfaces cross-repository breaking changes that traditional code review tools miss, addressing a critical pain point for microservices architectures and distributed teams....
Is PyTorch Europe's Rise a Turning Point for Open Source AI Leadership?
April 17, 2026

Is PyTorch Europe’s Rise a Turning Point for Open Source AI Leadership?

PyTorch Conference Europe 2026 drew 600+ AI leaders to Paris, showing open source AI's growing enterprise influence as organizations shift from proprietary solutions toward agentic AI and hybrid deployments....
Agentic AI or Pipeline AI for Code Reviews? Why the Architecture Decision Now Shapes Dev Velocity
April 17, 2026

Agentic AI or Pipeline AI for Code Reviews? Why the Architecture Decision Now Shapes Dev Velocity

Enterprise leaders face a critical decision: agentic AI versus pipeline AI for code reviews. Futurum Group's latest analysis reveals how this architectural choice directly impacts developer velocity, risk management, and...

Book a Demo

Newsletter Sign-up Form

Get important insights straight to your inbox, receive first looks at eBooks, exclusive event invitations, custom content, and more. We promise not to spam you or sell your name to anyone. You can always unsubscribe at any time.

All fields are required






Thank you, we received your request, a member of our team will be in contact with you.