Skip to content

Releases: lakesoul-io/LakeSoul

Python 1.0.2

26 Sep 00:43
Compare
Choose a tag to compare
  • Fix memory overrun in metadata query result serialization

Python 1.0.1

24 Sep 01:46
Compare
Choose a tag to compare
  • Fix protoc version during build

v3.0.0

05 Sep 10:41
Compare
Choose a tag to compare

LakeSoul 3.0.0 Release

After nearly a year of iterative optimization, LakeSoul 3.0.0 is officially released. This release brings the following key updates:

  1. Core Functionality Updates for the LakeSoul Lake Warehouse Framework
    1. LakeSoul NativeIO performance has been significantly optimized, including adjustments to write file compression and dictionary encoding algorithms, and optimizations to key Merge on Read code paths, resulting in a doubling of both read and write performance(compared to 2.6 version).
    2. LakeSoul NativeIO has added a local hot data caching feature. This allows remote object storage files to be cached on local disk, significantly improving the performance of MPP queries and other queries. Local caching is supported for all types of remote storage.
    3. LakeSoul query partition filter pushdown performance has been significantly optimized. By using metadata index queries, pushdown of equal-value partition filter conditions has been significantly optimized. In actual tests, partition filtering on a single table with millions of partitions took only 50ms.
    4. Flink upgraded to version 1.20.
    5. LakeSoul natively supports the Spark + Gluten vectorization engine, significantly improving batch computing performance.
    6. LakeSoul natively supports the Presto + Velox vectorization engine, providing high-performance MPP on-lake analytics and queries. The Presto engine has added RBAC permissions.
    7. Arrow Flight SQL RPC Service: Provides a high-performance columnar data read and write gateway service based on the Arrow Flight protocol, supporting load balancing, elastic scaling, and RBAC permission verification.
    8. Python packages are now available on PyPi, and the LakeSoul Python package can be directly installed via pip install lakesoul.
  2. LakeSoul Lake Warehouse Maintenance Service
    1. A new generation of size-tiered automatic background compaction service, significantly improving compaction performance and significantly reducing write amplification, thereby lowering compaction resource overhead.
    2. A new generation of automatic asynchronous cleanup service: Asynchronously cleans redundant and expired data by consuming metadata change logs.
    3. Asset Statistics Service: Automatically generates lake warehouse asset statistics by consuming metadata change logs, providing real-time statistics on storage resource consumption across multiple dimensions, including space, namespace, table, partition, and user.

LakeSoul 3.0.0 版本发布

经过近 1 年的迭代优化,LakeSoul 3.0.0 版本正式发布。本次发布带来以下重要更新:

  1. LakeSoul 湖仓框架内核功能更新
    1. LakeSoul NativeIO 性能再次大幅优化,包括调整写文件压缩和字典编码算法、优化 Merge on Read 关键代码路径等,实现读、写性能均提升一倍(对比 2.6 版本)。
    2. LakeSoul NativeIO 新增本地热数据缓存功能。可以支持将远程对象存储文件缓存在本地磁盘,大幅提升 MPP 查询等性能。支持所有类型远程存储的本地缓存。
    3. LakeSoul 查询分区过滤下推性能大幅优化,通过元数据索引查询方式,对等值分区过滤条件下推做了大幅度的性能优化。实测单表百万级分区,分区过滤仅需 50ms。
    4. Flink 升级至 1.20 版本
    5. LakeSoul 原生支持 Spark + Gluten 向量化引擎,实现批计算大幅性能提升
    6. LakeSoul 原生支持 Presto + Velox 向量化引擎,提供高性能 MPP 湖上分析查询。Presto 引擎新增 RBAC 权限功能
    7. Arrow Flight SQL RPC 服务:提供基于 Arrow Flight 协议的高性能列式数据读写网关服务,支持负载均衡、弹性伸缩,支持 RBAC 权限校验
    8. Python 包推送至 PyPi ,支持通过 pip install lakesoul 直接安装 LakeSoul Python 包
  2. LakeSoul 湖仓后台服务
    1. 新一代分层 Size-tiered 自动后台 Compaction 服务,Compaction 性能显著提升并大幅减少写放大,降低 Compaction 资源开销
    2. 新一代自动异步清理服务:通过消费元数据变更日志,实现异步化的自动冗余、过期数据清理
    3. 资产统计服务:通过消费元数据变更日志,自动进行湖仓资产统计,提供空间、namespace、表、分区、用户等多个维度的存储资源消耗实时统计

Python v1.0.0

05 Sep 09:44
Compare
Choose a tag to compare

Release LakeSoul Python package 1.0.0 to PyPi.

v2.6.2

07 Aug 07:32
Compare
Choose a tag to compare

Full Changelog: v2.6.1...v2.6.2

v2.6.1

22 Jul 05:23
Compare
Choose a tag to compare

Full Changelog: v2.6.0...v2.6.1

v2.6.0

17 Jul 05:49
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v2.5.4...v2.6.0

v2.5.4

23 May 06:27
Compare
Choose a tag to compare
  1. Fix class shading in lakesoul common

v2.5.3

29 Mar 08:19
Compare
Choose a tag to compare
  1. Add shaded packages for release
  2. Fix compaction may write to incorrect partition

v2.5.1

29 Jan 10:45
Compare
Choose a tag to compare
  1. Fix Flink sink parallelism for non-primary key table;
  2. Fix native io filter for non-ascii names and nested columns;
  3. Optimize compaction performance.