Databricks Agrees to Acquire Tabular, the Company Founded by the Original Creators of Apache Iceberg

June 05, 2024 02:56 AM AEST | By Cision

Databricks and Tabular will work together towards a joint vision of the open lakehouse

SAN FRANCISCO, June 5, 2024 /PRNewswire/ -- Databricks, the Data and AI company, today announced it has agreed to acquire Tabular, a data management company founded by Ryan Blue, Daniel Weeks, and Jason Reid. By bringing together the original creators of Apache Iceberg™ and Linux Foundation Delta Lake, the two leading open source lakehouse formats, Databricks will lead the way with data compatibility so that organizations are no longer limited by which of these formats their data is in. Databricks intends to work closely with the Delta Lake and Iceberg communities to bring format compatibility to the lakehouse; in the short term, inside Delta Lake UniForm and in the long term, by evolving toward a single, open, and common standard of interoperability. Databricks and Tabular will work together towards a joint vision of the open lakehouse.

The Rise of Lakehouse Architecture and Format Incompatibility

Databricks pioneered the lakehouse architecture in 2020 to enable the integration of traditional data warehousing workloads with AI workloads on a single, governed copy of data. For this to work, all data has to be in an open format so that different workloads, applications, and engines can access the same data. Lakehouse architecture maximizes enterprise productivity by democratizing access to data. This is in contrast to proprietary data warehouses, where only a proprietary SQL engine can read, write, or share the data, and data often has to be copied and exported to be used by other applications, creating a high degree of vendor lock-in. Four years later, 74% of enterprises have deployed a lakehouse architecture.

The foundation of the lakehouse is open source data formats that enable ACID transactions on data stored in object storage. These formats dramatically improve the reliability and performance of data operations on the data lake and were specifically designed for open source engines such as Apache Spark™, Trino and Presto. To address these challenges, Databricks worked with the Linux Foundation to create the Delta Lake project. Since its inception, Delta Lake has attracted over 500 code contributors from a diverse set of organizations, and over 10,000 companies globally use Delta Lake to process 4+ exabytes of data on average each day.

Around the same time Delta Lake was created, Ryan Blue and Daniel Weeks developed the Iceberg project at Netflix and donated it to the Apache Software Foundation. Since then, Delta Lake and Iceberg have emerged as the two leading open source standards for lakehouse formats. Even though both of these formats are based on Apache Parquet and share similar goals and designs, they became incompatible due to independent development. Over time a number of other open source and proprietary engines have adopted these formats. However, they usually adopted only one of the standards and more often than not, only part of that standard, leading to fragmented and siloed enterprise data, undermining the value of the lakehouse architecture.

The Road to Interoperability

Companies need data interoperability to realize the benefits of the lakehouse, and Databricks will work closely with the Delta Lake and Iceberg communities to bring interoperability to the formats over time. This is a long journey, one that will likely take several years to achieve in those communities. That is why last year, Databricks introduced Delta Lake UniForm. UniForm tables provide interoperability across Delta Lake, Iceberg, and Hudi, and support the Iceberg REST catalog interface so companies can use the analytics engines and tools they are already familiar with, across all their data. Generally available today, UniForm allows companies to achieve compatibility. With the addition of the original Iceberg team, Databricks will greatly broaden the ambitions of Delta Lake UniForm.

"Databricks pioneered the lakehouse and over the past four years, the world has embraced the lakehouse architecture, combining the best of data warehouses and data lakes to help customers decrease TCO, embrace openness, and deliver on AI projects faster. Unfortunately, the lakehouse paradigm has been split between the two most popular formats: Delta Lake and Iceberg. Databricks and Tabular will work with the open-source community to bring the two formats closer to each other over time, increasing openness, and reducing silos and friction for customers," said Ali Ghodsi, Co-founder and CEO at Databricks. "Last year, we announced Delta Lake UniForm to bring interoperability to these two formats, and we're thrilled to bring together the foremost leaders in open data lakehouse formats to make UniForm the best way to unify your data for every workload."

A Shared Commitment to Openness

Databricks and Tabular share a history of championing open source formats. Both companies were founded to commercialize open source technologies created by the founders and today, Databricks is the largest and most successful independent open source company by revenue and has donated 12 million lines of code to open source projects. This acquisition highlights Databricks' commitment to open formats and open source data in the cloud, helping ensure that companies are in control of their data and free from the lock-in created by proprietary vendor-owned formats.

"We created Apache Iceberg to solve critical data challenges around correctness, performance, and scalability. It's been amazing to see both Iceberg and Delta Lake grow massively in popularity, largely fueled by the open lakehouse becoming the industry standard. With Tabular joining Databricks, we intend to build the best data management platform based on open lakehouse formats so that companies don't have to worry about picking the 'right' format or getting locked into proprietary data formats," said Ryan Blue, Co-Founder and CEO at Tabular.

To learn more about Databricks and Tabular joining forces, register to attend the Data + AI Summit, June 10-13: databricks.com/dataaisummit

Details Regarding the Proposed Acquisition

The proposed acquisition is subject to customary closing conditions, and is expected to close in Databricks' second fiscal quarter. 

About Tabular

Tabular is the independent data platform built by the original creators of Apache Iceberg. Tabular addresses the pain data engineers and data scientists endure fighting the shortcomings of their data infrastructure. Tabular was founded by Netflix alumni Ryan Blue, Dan Weeks and Jason Reid. Blue also serves as the Iceberg PMC Chair and Weeks is an Iceberg PMC member.

About Databricks

Databricks is the Data and AI company. More than 10,000 organizations worldwide — including Block, Comcast, Condé Nast, Rivian, Shell and over 60% of the Fortune 500 — rely on the Databricks Data Intelligence Platform to take control of their data and put it to work with AI. Databricks is headquartered in San Francisco, with offices around the globe, and was founded by the original creators of Lakehouse, Apache Spark™, Delta Lake and MLflow. To learn more, follow Databricks on LinkedIn, X and Facebook.

Contact: [email protected]

