Click any tag below to further narrow down your results
Links
This article explores a new indexing technique for data lakehouses called OTree, developed by Qbeast. It challenges traditional methods by using adaptive hypercubes to optimize data layout, improving query performance while addressing issues like partition granularity and imbalanced data distribution.
Apache Hudi 1.1 introduces a pluggable table format framework that supports multiple storage formats, enhancing flexibility in data management. The release also includes indexing improvements, faster clustering, and a new storage-based lock provider for better concurrency. These updates aim to make Hudi tables more efficient and easier to operate.