hoodie-0.4.7
版本发布时间: 2019-05-29 11:05:37
apache/hudi最新发布版本:release-1.0.0-beta2(2024-07-16 15:41:55)
Highlights
- Major releases with fundamental changes to filesystem listing & write failure handling
- Introduced the first version of HoodieTimelineServer that runs embedded on the driver
- With all executors fetching filesystem listing via RPC to timeline server, drastically reduced filesystem listing!
- Failing concurrent write tasks are now handled differently to be robust against spark stage retries
- Bug fixes/clean up around indexing, compaction
Full PR List
- @bvaradar - HUDI-135 - Skip Meta folder when looking for partitions #698
- @bvaradar - HUDI-136 - Only inflight commit timeline (.commit/.deltacommit) must be used when checking for sanity during compaction scheduling #699
- @bvaradar - HUDI-134 - Disable inline compaction for Hoodie Demo #696
- @v3nkatesh - default implementation for HBase index qps allocator #685
- @bvaradar - SparkUtil#initLauncher shoudn't raise when spark-defaults.conf doesn't exist #670HUDI-131 Zero File Listing in Compactor run #693
- @vinothchandar - Fixed HUDI-116 : Handle duplicate record keys across partitions #687
- @leilinen - HUDI-105 : Fix up offsets not available on leader exception #650
- @bvaradar - Allow users to set hoodie configs figs for Compactor, Cleaner and HDFSParquetImporter utility scripts #691
- @bvaradar - Spark Stage retry handling #651
- @pseudomoto - HUDI-113: Use Pair over # delimited string #672
- @bvaradar - Support nested types for recordKey, partitionPath and combineKey #684
- @vinothchandar - Downgrading fasterxml jackson to 2.6.7 to be spark compatible #686
- @bvaradar - Timeline Service with Incremental View Syncing support #600