Links
This article discusses how Apache Hudi's Non-Blocking Concurrency Control (NBCC) improves write throughput in data lakehouses by letting concurrent writers append data without conflicting with one another. It contrasts NBCC with Optimistic Concurrency Control (OCC), whose conflict-triggered retries become inefficient in high-frequency streaming scenarios. The piece also explains how to configure NBCC in your data pipelines.
Apache Hudi 1.1 introduces a pluggable table format framework that supports multiple table formats, adding flexibility in data management. The release also includes indexing improvements, faster clustering, and a new storage-based lock provider for better concurrency control. These updates aim to make Hudi tables more efficient and easier to operate.