Links
Apache Flink 2.2.0 enhances real-time data processing by integrating AI capabilities, introducing new functions like ML_PREDICT for large language models and VECTOR_SEARCH for vector similarity searches. The release also improves materialized tables, batch processing, and connector frameworks, addressing over 220 issues.
Apache DataFusion 50.0.0 has been released with significant performance enhancements, including improved dynamic filter pushdown and nested loop join optimizations. The update adds support for the QUALIFY SQL clause and extends window function support. The announcement also highlights community growth and contributions.
Apache Spark 4.0.0 is the first release in the 4.x series and reflects significant community collaboration, with over 5100 resolved tickets. Major enhancements include a new lightweight Python client, expanded features in Spark SQL and PySpark, and improved Structured Streaming capabilities, along with other performance and usability updates.