SAP Deepens Data and AI Strategy with Dremio Acquisition

SAP has announced the acquisition of Dremio, a specialized provider of data integration and analytics solutions. This strategic move aims to extend the capabilities of SAP's analytics and AI agent-building tools, enabling them to access and process external data sources with greater efficiency. The acquisition, made for an undisclosed sum, centers on integrating Dremio's Apache Iceberg-based lakehouse technology.

SAP's primary goal is to help its customers overcome data fragmentation and improve integration across heterogeneous systems. According to the company, the acquisition will complement existing platforms such as SAP Business Data Cloud and SAP HANA Cloud, strengthening its offering in the enterprise analytics and artificial intelligence landscape.

The Central Role of Apache Iceberg and Lakehouse Architecture

With Dremio's integration, SAP has stated that its Business Data Cloud will evolve into an "Apache Iceberg-native enterprise lakehouse," designed to unify data from both SAP and non-SAP systems. SAP presents this architecture as crucial for powering agentic AI at enterprise scale. Apache Iceberg is an open source table format, originally developed by Netflix, which has established itself as an industry standard.

Iceberg competes with Databricks' Delta Lake format, which is also open source and hosted by the Linux Foundation. However, Databricks has recently taken steps to improve interoperability between the two standards, particularly after acquiring Tabular, a company founded by Iceberg's original authors. Both formats promise to enable analytics directly on the data where it resides, eliminating the need to move and convert it, along with the associated costs. This is a crucial aspect for underpinning enterprise analytics, machine learning, and AI agent development. SAP emphasizes that the Business Data Cloud will natively support Apache Iceberg as its foundation, removing the need for data movement or format conversion.

Strategic Context and TCO Implications

This acquisition marks an evolution in SAP's strategy. Approximately three years ago, then-CTO Juergen Mueller pledged to facilitate the integration of SAP data with non-SAP data through a partnership with Databricks, which utilized the Delta Lake format. Last year, ties with Databricks deepened to support bidirectional data sharing between SAP Business Data Cloud and third-party platforms, with Delta Lake as the "initial delivery." Although Databricks later announced support for Iceberg via Delta Sharing, SAP's decision to acquire an Iceberg-focused company suggests a clear strategic direction.

Dremio was valued at $2 billion in a 2022 funding round. SAP's repeated emphasis on the Iceberg format in the acquisition announcement might raise questions about the nature of its pre-existing partnership with Databricks. The integration of Dremio's lakehouse platform promises to "vastly improve the economics of enterprise analytics," offering a serverless and elastic approach that removes both the need for fixed capacity provisioning and the performance ceilings that come with it. This aspect is particularly relevant for organizations evaluating the Total Cost of Ownership (TCO) of their data and AI infrastructures, balancing initial capital expenditures with ongoing operational costs.
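
To make the fixed-versus-elastic comparison concrete, here is a minimal sketch of the arithmetic. Every figure in it is hypothetical: the demand profile, the rates, and the function names are illustrative assumptions, not SAP or Dremio pricing.

```python
def fixed_capacity_cost(hourly_demand, rate_per_unit_hour):
    """Cost when capacity is sized for the peak hour and paid for every hour."""
    return max(hourly_demand) * len(hourly_demand) * rate_per_unit_hour


def elastic_cost(hourly_demand, rate_per_unit_hour):
    """Cost when only the capacity actually used in each hour is billed."""
    return sum(hourly_demand) * rate_per_unit_hour


# Hypothetical daily profile: 8 busy hours at 10 units, 16 quiet hours at 2 units.
demand = [10] * 8 + [2] * 16

# Assumed rates: elastic capacity carries a 50% premium per unit-hour.
fixed = fixed_capacity_cost(demand, rate_per_unit_hour=1.0)
elastic = elastic_cost(demand, rate_per_unit_hour=1.5)

print(f"fixed: {fixed:.2f}, elastic: {elastic:.2f}")
```

Under these assumptions the elastic model is cheaper whenever average demand divided by peak demand is lower than the fixed rate divided by the elastic rate. A workload that runs near its peak around the clock would see the comparison reverse.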

Towards an Open Catalog and Unified Context

With the acquisition of Dremio, SAP will provide customers with an open source catalog built on Apache Polaris and the Apache Iceberg REST Catalog API. This will create a discovery and semantic layer for the SAP Business Data Cloud, offering a single point of access to a unified business context. Such context will include meaning, relationships, access rights, and data lineage across all enterprise data, including data held outside the SAP ecosystem.
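
For readers unfamiliar with the Iceberg REST Catalog API, the sketch below shows how a client might build request URLs for listing namespaces and loading a table. The route shapes follow the public Apache Iceberg REST Catalog specification as commonly documented. The base URL, prefix, namespace, and table names are hypothetical, and nothing here reflects how SAP or Dremio will expose the catalog.

```python
from urllib.parse import quote

# Default separator between levels of a multi-level namespace in the
# Iceberg REST spec: the ASCII unit separator (0x1F).
NAMESPACE_SEPARATOR = "\x1f"


def encode_namespace(levels):
    """Join namespace levels with the unit separator and percent-encode them."""
    return quote(NAMESPACE_SEPARATOR.join(levels), safe="")


def list_namespaces_url(base_url, prefix=""):
    """URL for GET /v1/{prefix}/namespaces (the prefix segment is optional)."""
    parts = [base_url.rstrip("/"), "v1"]
    if prefix:
        parts.append(quote(prefix, safe=""))
    parts.append("namespaces")
    return "/".join(parts)


def list_tables_url(base_url, levels, prefix=""):
    """URL for GET /v1/{prefix}/namespaces/{namespace}/tables."""
    return f"{list_namespaces_url(base_url, prefix)}/{encode_namespace(levels)}/tables"


def load_table_url(base_url, levels, table, prefix=""):
    """URL for GET /v1/{prefix}/namespaces/{namespace}/tables/{table}."""
    return f"{list_tables_url(base_url, levels, prefix)}/{quote(table, safe='')}"


# Hypothetical catalog endpoint and identifiers, for illustration only.
url = load_table_url(
    "https://catalog.example.com/api", ["sales", "emea"], "orders", prefix="prod"
)
print(url)
```

Because the routes are defined by an open specification, any engine that speaks the same API can resolve the same table metadata, which is what lets a catalog act as a single point of access across SAP and non-SAP data.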

This move strengthens companies' ability to maintain data sovereignty by managing data consistently and securely, regardless of its origin or storage location. For organizations considering on-premises or hybrid deployments, the ability to unify and analyze data from diverse sources without complex data movement represents a significant advantage, improving efficiency and compliance.