According to the Linux Foundation, every organization today hopes to get more value from its data through data science, machine learning and analytics, but is greatly hindered by the lack of data reliability in data lakes. Delta Lake addresses this challenge by making transactions ACID-compliant, which enables concurrent reads and writes. Its schema enforcement capability helps ensure that the data lake is free of corrupt and non-conforming data. Since its launch in October 2017, Delta Lake has been adopted by more than 4,000 organizations and processes more than 2 exabytes of data a month.
"Bringing Delta Lake under the neutral home of the Linux Foundation will help the open source community that depends on the project develop the technology for storing and processing big data, both on-premises and in the cloud," said Michael Dolan, vice president of strategic programs at the Linux Foundation.
The co-founders of Databricks are the original creators of the Apache Spark project, which has become the de facto standard for large-scale data processing. Although Delta Lake was originally designed to work with Spark, it has developed a thriving open source community and is adding support for other open source data systems.
Delta Lake has been adopted by thousands of organizations, including Alibaba, Booz Allen Hamilton and Starburst, which are also important contributors to its open source ecosystem. To further promote the growth of that ecosystem, Databricks, the company behind Delta Lake, decided to have the project hosted by the Linux Foundation.
Ali Ghodsi, CEO and co-founder of Databricks, said: "Our team continues to create and contribute to open source projects because we know this is the fastest and most comprehensive way to innovate. To address organizations' data challenges, we want to ensure that this project is open source in the truest sense. Through the strength and contributions of the Linux Foundation community, we believe that Delta Lake will soon become the standard for storing data in data lakes."
Related reading:
Delta Lake was recently named one of the best open source software projects of 2019 by InfoWorld. See:
https://www.oschina.net/news/110451/2019-infoworld-bossie-awards