Highlights
- Support Spark 4.0 (ANSI = OFF, RSS / Lake = OFF)
- JDK21 Support
- Provide basic support for Velox + GPU
- Add Flink accelerator subproject
- Further improve Iceberg / Delta read/write capabilities, along with performance optimizations
- Add a large number of new functions, including multiple performance optimizations and bug fixes
- Discontinue support for Spark 3.2
What's Changed
- [TEST] Disable a gluten test temporarily: cast string to timestamp by @philo-he in #10518
- [CORE] Bump version to 1.6.0-SNAPSHOT by @philo-he in #10517
- [MINOR] Refactor a string concatenation by following scala style by @beliefer in #10520
- [VL][INFRA] Fix docker build error on Centos-7 by @philo-he in #10522
- [GLUTEN-8953][VL] Support Iceberg overwrite table by @Zouxxyy in #10514
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_08_26) by @GlutenPerfBot in #10528
- [GLUTEN-10521][VL] Fall back `to_json` function for uppercase struct field name by @zml1206 in #10523
- [VL] Gluten-it: Simplify CollectionConverter.scala by @zhztheplayer in #10533
- [VL] Fix missing path `package/**` from the Velox backend PR CI path trigger by @zhztheplayer in #10538
- [GLUTEN-10529] Remove unnecessary create for Runtimes by @beliefer in #10530
- [TEST][VL] Reinclude "cast string to timestamp" test by @philo-he in #10532
- [VL] Extend gluten-it to support more data source types by @zhztheplayer in #10554
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_08_27) by @GlutenPerfBot in #10549
- [GLUTEN-10555] Remove unnecessary parameter leafTransformers for WholeStageTransformer by @beliefer in #10556
- [VL] Gluten-it: Clean up Maven dependency relationships by @zhztheplayer in #10563
- [GLUTEN-10552][VL] Fix openEuler compiling issue by @zhouyuan in #10564
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_08_28) by @GlutenPerfBot in #10571
- [FLINK] Add `java-17` profile for Flink build and update project version in flink doc by @zjuwangg in #10561
- [GLUTEN-9671][VL] Fix broadcast exchange stackoverflow due to Kryo serialization by @felixloesing in #10541
- [VL] Separate filesystem configuration initialization by @marin-ma in #10540
- [MINOR] Remove unnecessary fields by @beliefer in #10560
- [VL] Support independent Gluten CPP build by @kerwin-zk in #10575
- [GLUTEN-10107][CH] Decouple Celeborn-related code from CH backend module by @zjuwangg in #10537
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_08_29) by @GlutenPerfBot in #10580
- [GLUTEN-10578] Remove unnecessary numaBindingInfo by @beliefer in #10579
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_08_30) by @GlutenPerfBot in #10588
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_08_31) by @GlutenPerfBot in #10589
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_01) by @GlutenPerfBot in #10592
- [GLUTEN-10107][INFRA] Deprecate isUseUniffleShuffleManager from glutenConfig by @zjuwangg in #10558
- [VL] Gluten-it: Support using Delta tables in TPC-H and TPC-DS benchmarks by @zhztheplayer in #10562
- [GLUTEN-10582][VL] Add Cudf memory resource mode and percent parameters by @jinchengchenghh in #10583
- [GLUTEN-8852][VL] Update package script for spark-400 by @zhouyuan in #10584
- [GLUTEN-8821][VL] Weekly Update Velox function support docs (2025_09_01) by @GlutenPerfBot in #10590
- [GLUTEN-10387][VL] Set ANSI mode for Velox according to Spark's configuration by @philo-he in #10385
- [VL] Gluten-it: Update Delta versions, and other minors by @zhztheplayer in #10594
- [VL] Update Velox branch by @rui-mo in #10597
- [GLUTEN-10524] Remove unnecessary `outputAttributes` from `BasicScanExecTransformer` by @beliefer in #10525
- [GLUTEN-10595][VL] Separate cpp test utils from the utils directory by @marin-ma in #10596
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_02) by @GlutenPerfBot in #10601
- [DOC][FLINK] Update flink build command to skip gpg and spotless check by @zjuwangg in #10604
- [Minor] Refactor test utility to let users compare the query result by @jinchengchenghh in #10565
- [VL][Minor] Remove unused code for shuffle compression mode by @marin-ma in #10609
- [GLUTEN-10599][VL] Fix Centos dev docker image build by @zhouyuan in #10600
- [GLUTEN-10599][VL] Followup to enable git in CI scripts by @zhouyuan in #10610
- [Minor] Add enhanced features runtime config by @jinchengchenghh in #10608
- [VL] Update class duplication list in Maven enforcer by @zhztheplayer in #10536
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_03) by @GlutenPerfBot in #10612
- [GLUTEN-9335][VL] Support iceberg partition write by @jinchengchenghh in #10497
- [GLUTEN-10607][MINOR] Fix: Use `setSafe` in DateWriter to avoid overflow by @jiangjiangtian in #10581
- [GLUTEN-10577][CELEBORN] Refactor `CelebornShuffleManager` to load factory in a better way by @zjuwangg in #10591
- [CORE] Merge SubstraitUtil classes by @kevinwilfong in #10587
- [GLUTEN-10546][FLINK] Support all flink operators for nexmark by @shuai-xu in #10548
- [GLUTEN-10566][VL] Add Spark unix_timestamp support with timestamp and format arguments by @nimesh1601 in #10567
- [Minor] Fix the velox target duplicated include VELOX_BUILD_PATH by @jinchengchenghh in #10615
- [GLUTEN-10570][FLINK] Add `--add-opens` options to MAVEN_OPTS for Java 17 compatibility by @KevinyhZou in #10572
- [GLUTEN-10605][VL] Rewrite unbounded window to an equivalent aggregate join by @zml1206 in #10606
- [GLUTEN-10013][FLINK] Support function reinterpret by @KevinyhZou in #10022
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_04) by @GlutenPerfBot in #10626
- [VL] Refactor gluten-it to pass structured query information to runner by @zhztheplayer in #10623
- [GLUTEN-10210][VL] Enable tpcds tests for Spark-400 in CI by @zhouyuan in #10633
- [VL] Fix arrow url typo by @liujiayi771 in #10641
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_06) by @GlutenPerfBot in #10643
- [GLUTEN-10214][VL] Merge inputstream for shuffle reader by @marin-ma in #10499
- [MINOR] Add `.java-version` to `.gitignore` by @Zouxxyy in #10642
- [GLUTEN-10618][VL] Update input iterator metrics name to include more details by @marin-ma in #10619
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_08) by @GlutenPerfBot in #10653
- [GLUTEN-10635][VL] bugfix: file INSTALL cannot set permissions by @beliefer in #10638
- [GLUTEN-10361][FLINK] Fix UT failure between the conversion of `BinaryRowData` and `StatefulRecord` by @KevinyhZou in #10362
- [GLUTEN-8889][VL] Fix Spark-355 download in GHA by @zhouyuan in #10655
- [GLUTEN-10450][VL] Reclassify internal/public configs and remove internal configs from doc by @zjuwangg in #10603
- [GLUTEN-9366][VL] Support Iceberg functions by @jinchengchenghh in #10285
- [GLUTEN-10544] Remove unnecessary method separateScanRDD by @beliefer in #10545
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_09) by @GlutenPerfBot in #10656
- [VL] Support generating DV-enabled TPC-DS Delta table in gluten-it by @zhztheplayer in #10639
- [GLUTEN-10630][FLINK] Support processing time for q12 by @shuai-xu in #10631
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_10) by @GlutenPerfBot in #10665
- [GLUTEN-10056][FLINK] Support function split_index by @KevinyhZou in #10057
- Add kevinwilfong as a collaborator by @FelixYBW in #10673
- [GLUTEN-10671][VL] Ignore datetime-legacy.sql by @rui-mo in #10672
- [GLUTEN-10664] Improve Gluten UI enabled check by @beliefer in #10657
- [GLUTEN-10675][CELEBORN] Bump Celeborn version to 0.6.1 by @SteNicholas in #10677
- [VL][CI] Fix JVM heap OOM in TPC-DS baseline test by @philo-he in #10681
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_11) by @GlutenPerfBot in #10685
- [VL] Respect `INSTALL_PREFIX` when building arrow by @Zouxxyy in #10650
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_12) by @GlutenPerfBot in #10690
- [VL] Fix decimal arithmetic offload when allowPrecisionLoss=false by @zhli1142015 in #10689
- [CORE] Defer Protobuf serialization of SplitInfos in GlutenPartitions by @kevinwilfong in #10662
- [GLUTEN-10687] Refactor `expressionBlacklist` to follow Scala style by @beliefer in #10688
- [MINOR] Refactor a collection operation to follow Scala style by @beliefer in #10682
- [GLUTEN-10678] Refactor `transformSortDirection` in `SortExecTransformer` by @beliefer in #10679
- [GLUTEN-10472][VL] feat: Support columnar partial generate for HiveUDTF by @jiangjiangtian in #10475
- [MINOR][CH] Refactor extraExpressionConverter for CHSparkPlanExecApi by @beliefer in #10694
- [GLUTEN-10668] Refactor to share WindowGroupLimitExecShim across different Spark shims by @beliefer in #10669
- [GLUTEN-10613][VL] Add `experimental` method to ConfigBuilder and document experimental config by @zjuwangg in #10659
- [GLUTEN-10625][CORE] Add RewriteCreateTableAsSelect for spark34 by @Zouxxyy in #10646
- [CORE] Minor cleanups for Delta support code by @zhztheplayer in #10695
- [VL] Example workload for benchmarking Gluten + Delta on TPC-DS datasets by @zhztheplayer in #10614
- [GLUTEN-8821][VL] Weekly Update Velox function support docs (2025_09_15) by @GlutenPerfBot in #10700
- [VL] Support Velox's preferred_output_batch_bytes config by @kevinwilfong in #10661
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_15) by @GlutenPerfBot in #10711
- [MINOR] Refactor genFinalStageIterator to follow Scala style by @beliefer in #10709
- [HOTFIX][PR-10695] Fix compile error for the CH backend by @zzcclp in #10718
- [GLUTEN-10715][CORE] Bump Spark version from 4.0.0 to 4.0.1 by @Zouxxyy in #10716
- [GLUTEN-10691] Improve getAndCheckSubstraitName for ExpressionConverter by @beliefer in #10692
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_16) by @GlutenPerfBot in #10720
- [GLUTEN-10703] Refactor PlanNode to avoid addAll calls by @beliefer in #10704
- [VL] Remove unused-function warning by @boneanxs in #10721
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_17) by @GlutenPerfBot in #10730
- [VL] Enable passing CSV tests by @acvictor in #10732
- [CORE] Fix FallbackByNativeValidation attempts to offload a whole tree than a single node by @zhztheplayer in #10696
- [GLUTEN-10701] Refactor SortRelNode to avoid addAll calls by @beliefer in #10702
- [CORE][VL] Bump the default spark version from 3.4.4 to 3.5.5 by @kerwin-zk in #10729
- [GLUTEN-10731][VL]Remove the sort window in Velox backend by @JkSelf in #10734
- [GLUTEN-10629][FLINK] Support udf count_char by @shuai-xu in #10674
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_18) by @GlutenPerfBot in #10749
- [GLUTEN-10683][VL] Add lock to restrict single task for one GPU by @jinchengchenghh in #10684
- [VL] Fix weekly build validation by @philo-he in #10750
- [MINOR] Fix pom.xml code format by @liujiayi771 in #10744
- [VL] Gluten-it: Fix inverted expected vs actual in result mismatch er… by @zhztheplayer in #10751
- [VL] Enable passing JSON tests by @acvictor in #10743
- [GLUTEN-8851][VL] Fix the TakeOrderedAndProjectExecTransformer does not respect cudf tag by @jinchengchenghh in #10754
- [GLUTEN-10713] Eliminate the c2r before iceberg partition write by @Zouxxyy in #10714
- [VL][CI] Add a format checker for scala code to enable early failure reporting by @philo-he in #10747
- [MINOR] Enable pom spotless format check by @liujiayi771 in #10755
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_21) by @GlutenPerfBot in #10762
- [GLUTEN-10736] Refactor SparkDirectoryUtil with ConcurrentHashMap by @beliefer in #10737
- [GLUTEN-10550][FLINK] Add nexmark tests for q18-q21 by @shuai-xu in #10757
- [GLUTEN-8072][VL] Add close method to close the hdfs connection by @JkSelf in #10593
- [GLUTEN-10632][FLINK] Add nexmark test for q11, q12 and q22 by @shuai-xu in #10735
- [GLUTEN-10741][CH] Invalid results: sort keys contains partition keys when get windows' topK by @lgbo-ustc in #10746
- [GLUTEN-10772][CH] Fix crash caused by not releasing `QueryContext` resources properly. by @lgbo-ustc in #10775
- [MINOR] Refactor parseLocalFileNodes for VeloxPlanConverter by @beliefer in #10739
- [GLUTEN-10766][VL] Fix the incorrect log msg for VeloxRuntime by @beliefer in #10767
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_23) by @GlutenPerfBot in #10788
- [VL] Add datasource add-split and read metrics by @rui-mo in #10787
- [MINOR] Override `outputPartitioning` in `GenerateExecTransformerBase` by @jiangjiangtian in #10768
- [GLUTEN-10770][VL] Improve std::vector usage in `toVeloxPlan` by @beliefer in #10771
- [VL] Promote SharedLibraryLoader to SPI by @yaooqinn in #10774
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_24) by @GlutenPerfBot in #10792
- [GLUTEN-10671][VL] Enable datetime-legacy.sql by @rui-mo in #10789
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_25) by @GlutenPerfBot in #10800
- [VL] Add convenient C2R API by @zhztheplayer in #10796
- [VL] Add metrics for lazy vector load by @marin-ma in #10726
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_26) by @GlutenPerfBot in #10808
- [GLUTEN-10628][FLINK] Add decimal type support to `greaterthan` function by @shuai-xu in #10769
- [CORE] Fix iceberg version to 1.10.0 for Spark 4.0 by @philo-he in #10809
- [VL] Fix flaky columnar cache tests by clearing the cache serializer states before the suites are run by @zhztheplayer in #10812
- [VL] Delta: Add Delta Lake write unit test for Spark 3.5 + Delta 3.3 by @zhztheplayer in #10802
- [GLUTEN-8821][VL] Weekly Update Velox function support docs (2025_09_29) by @GlutenPerfBot in #10816
- [VL] Add Spark array_sort support with lambda function by @boneanxs in #10780
- [GLUTEN-10660][VL] Adding configuration for hash table build by @zhouyuan in #10634
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_09_27) by @GlutenPerfBot in #10814
- [GLUTEN-8851][VL] Fix the TakeOrderedAndProjectExecTransformer does not respect cudf tag by @jinchengchenghh in #10821
- [VL] Make VeloxResizeBatchesExec inherit from ColumnarToColumnarExec to simplify the code by @Zouxxyy in #10763
- [GLUTEN-10621][VL] feat: Support cudf parquet connector by @jinchengchenghh in #10622
- [DOC] Improve and simplify README.md and NewToGluten.md by @philo-he in #10793
- [VL] tools/workload/tpch: Delete temporary files generated by dbgen by @xinghuayu007 in #10783
- [GLUTEN-9615][INFRA] Automate release process with build script and GHA workflow by @philo-he in #10807
- [GLUTEN-10648][VL] Support Iceberg overwrite partitions dynamic by @Zouxxyy in #10823
- [VL] Delta: Fix input file expressions not getting pushed to native scan when column mapping is applicable by @zhztheplayer in #10822
- [GLUTEN-9860][VL] Update nightly job to delete old packages by @zhouyuan in #10819
- [VL][MINOR] Prevent selected CI jobs from running on forks by @philo-he in #10828
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_01) by @GlutenPerfBot in #10826
- [VL] Following #10823, correct the config option key by @zhztheplayer in #10830
- [VL] Minor refactoring of get_velox.sh for better readability by @Zouxxyy in #10647
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_02) by @GlutenPerfBot in #10832
- [CORE] Refactor configuration API to allow explicit `getAllEntries` calls on specific configuration registry objects by @zhztheplayer in #10835
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_06) by @GlutenPerfBot in #10844
- [GLUTEN-10621] Fix GPU connector CudfHiveTableHandle is not used by @jinchengchenghh in #10843
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_08) by @GlutenPerfBot in #10849
- [CI] Check out PR branch in the weekly CentOS 7 build workflow to ease PR verification by @Zouxxyy in #10839
- [CORE] Optimize the verboseString of ColumnarShuffleExchangeExec by @Zouxxyy in #10846
- [GLUTEN-10773][VL] Add support for Velox `expression.max_compiled_regexes` configuration by @NEUpanning in #10776
- [GLUTEN-10774][VL][FOLLOWUP] Strip quotes for osName and osVersion by @liujiayi771 in #10850
- [VL] Add golden files for TPC-H Spark 3.5 test by @pgaref in #10848
- [GLUTEN-10247][FLINK] Support date_format function by @KevinyhZou in #10248
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_09) by @GlutenPerfBot in #10854
- [VL] Fix script for packaging releases by @philo-he in #10853
- [GLUTEN-10574] Correct Gluten versions in documents and jupyter notebooks by @philo-he in #10858
- [MINOR] Optimize code style for test of backend-velox by @zml1206 in #10857
- [GLUTEN-10797][CH] Reject all JNI calls after native destroy is invoked by @lgbo-ustc in #10815
- [GLUTEN-10781][VL] Add months_between support by @zml1206 in #10782
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_10) by @GlutenPerfBot in #10862
- [GLUTEN-10574] Include copyright for Hudi and Paimon in NOTICE-binary by @philo-he in #10855
- [GLUTEN-9525][FLINK]Support tps metric for source task by @KevinyhZou in #10023
- Revert "[GLUTEN-8966][VL] Propagate HashAggregate's ignoreNullKeys when possible" by @zml1206 in #10852
- [VL] Check VELOX_HOME variable in build_arrow to allow custom Velox path by @liujiayi771 in #10865
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_11) by @GlutenPerfBot in #10866
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_12) by @GlutenPerfBot in #10869
- [VL] tools/workload/tpch: Close sql files to avoid resources leak by @xinghuayu007 in #10785
- [INFRA] Link referenced issues in PR description and unify with label pull requests job by @philo-he in #10863
- [GLUTEN-10818][VL] Implement KnownSizeEstimation for buildSideRelation by @zjuwangg in #10817
- [GLUTEN-10707] Improve SparkDirectoryUtil by reduce synchronized by @beliefer in #10708
- [GLUTEN-10574] Add missing license headers by @philo-he in #10872
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_13) by @GlutenPerfBot in #10880
- [VL] Update user-facing metric name for `wallNanos` in batch scan by @Zouxxyy in #10870
- [GLUTEN-8821][VL] Weekly Update Velox function support docs (2025_10_13) by @GlutenPerfBot in #10871
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_14) by @GlutenPerfBot in #10882
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_15) by @GlutenPerfBot in #10890
- [GLUTEN-10875][CH] Bump io.minio:minio from 8.5.9 to 8.6.0 in /backends-clickhouse by @dependabot[bot] in #10884
- [VL] Reduce vector copy when doing c2r by @zhli1142015 in #10825
- [VL] Support mapping columns by position index for ORC and Parquet files by @kevinwilfong in #10697
- [GLUTEN-10215][VL] Delta: Native write support for Delta 3.3.1 / Spark 3.5 by @zhztheplayer in #10801
- [GLUTEN-10878][INFRA] Maturity CS50: Synchronize GitHub discussions to the dev mailing list by @zhztheplayer in #10896
- [CI] Ensure Velox Enhanced tests use Spark 3.5 home by @pgaref in #10897
- [GLUTEN-10834][CORE] Prefer hyphens over underscores in shell script names by @Zouxxyy in #10836
- [CORE] Make LeafTransformSupport's getPartitions return Seq[Partition] by @Zouxxyy in #10838
- [GLUTEN-10887][CORE] Bump Delta 3.3.x version to 3.3.2 for Spark 3.5.x by @zhztheplayer in #10894
- [MINOR] Update GPU document by @jinchengchenghh in #10888
- [GLUTEN-10833][VL] Reduce memcpy in ColumnToRow by @zhouyuan in #10824
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_16) by @GlutenPerfBot in #10899
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_17) by @rui-mo in #10902
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_20) by @GlutenPerfBot in #10913
- [GLUTEN-8851][VL] cuDF: Validate the plan before execution by @jinchengchenghh in #10889
- [GLUTEN-10874][INFRA] Maturity RE50: In documentation, add a section for the release process by @zhztheplayer in #10883
- [MINOR][CORE] Fix StaticInvoke fallback reason by @wForget in #10914
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_21) by @GlutenPerfBot in #10917
- [VL] Make PartialProject support struct with null fields by @jiangjiangtian in #10706
- [GLUTEN-10911][CH] Enable `get_json_object` rewrite in `explode` by @lgbo-ustc in #10912
- [GLUTEN-10761] Add iceberg write metrics by @Zouxxyy in #10908
- [GLUTEN-8969][VL] feat: Enable iceberg test TestPartitionValues by @jinchengchenghh in #10910
- [CORE] Doc: Remove duplicated release page by @zhztheplayer in #10905
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_22) by @GlutenPerfBot in #10921
- [VL] Add page load time metric by @rui-mo in #10294
- [GLUTEN-10920][VL] Allow disabling hash/sort shuffle reader buffer by @wForget in #10922
- [VL] Validate Iceberg write support before executing offload by @Zouxxyy in #10900
- [GLUTEN-10915][VL] Fix dynamic offheap sizing feature by setting default offheap size by @zhouyuan in #10916
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_23) by @GlutenPerfBot in #10927
- [GLUTEN-10621][VL] cuDF: Fix cudf table scan disable mode by @jinchengchenghh in #10925
- [VL] Fix setup scripts with multiple commands to run by @marin-ma in #10930
- [GLUTEN-9106][VL] Add support for staticInvoke CharVarcharCodegenUtils. #9107 by @Yifeng-Wang in #9107
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_24) by @GlutenPerfBot in #10941
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_25) by @GlutenPerfBot in #10947
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_26) by @GlutenPerfBot in #10949
- [GLUTEN-10932][VL] Pass memory changes to taskMemoryManager for dynamic off heap by @wForget in #10936
- [MINOR][CORE] Remove unused class DummyPairRDDWithPartitions by @jinchengchenghh in #10931
- [VL] Validate runtime compatibility using reflection before registering Gluten components by @philo-he in #10942
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_28) by @GlutenPerfBot in #10962
- [VL] Add dlerror message for library load failures by @liujiayi771 in #10952
- [MINOR][VL] Improve temp parquet directory in FunctionsValidateSuite by @wangyum in #10955
- [GLUTEN-10956][VL] Reduce repeated construction of ArrowProjection in ColumnarPartialProjectExec by @wForget in #10957
- [GLUTEN-10966][VL] Support codegen for ArrowProjection by @wForget in #10968
- [GLUTEN-10963][VL] Fix the validation for merge_extract companion function by @rui-mo in #10976
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_29) by @GlutenPerfBot in #10974
- [GLUTEN-10950][VL] Add time unit to 'spark.hadoop.fs.s3a.connection.timeout' by @zml1206 in #10951
- [GLUTEN-10969][FLINK] Set timeout for waiting for task output by @xinghuayu007 in #10970
- [GLUTEN-10933][VL] cuDF: Move lock to ShuffleReader by @jinchengchenghh in #10934
- [GLUTEN-10964][VL] Fix CI docker image build by @zhouyuan in #10953
- [VL][MINOR] Replace NULL with nullptr by @beliefer in #10958
- [GLUTEN-10980][CH] Fix unix_timestamp diff when format is not `yyyy-MM-dd` or `yyyy-MM-dd HH:mm:ss` by @KevinyhZou in #10981
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_30) by @GlutenPerfBot in #10985
- [GLUTEN-10971][FLINK] Support nexmark q15-17 by @shuai-xu in #10972
- [GLUTEN-10966][VL][FOLLOWUP] Fix UnsupportedOperationException when the result type of expressions in GenerateArrowProjection is array by @jiangjiangtian in #10986
- [VL] Base Delta read support for Spark 4.0 + Delta 4.0 by @zhztheplayer in #10973
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_10_31) by @GlutenPerfBot in #10987
- [INFRA] Fix Apache Software Foundation organization URL by @philo-he in #10998
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_01) by @GlutenPerfBot in #10996
- [VL] Update setup-rhel.sh by @marin-ma in #10935
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_03) by @GlutenPerfBot in #11000
- [GLUTEN-10933][VL] Introduce GPU ShuffleWriterType kGpuHashShuffle by @jinchengchenghh in #10984
- [CORE] Add a test case where dependencies of a component are not registered by @zhztheplayer in #11006
- [GLUTEN-10961][CH] Support joins without any join clause by @lgbo-ustc in #10967
- [MINOR] Add Gluten-1.5 and spark-4.0.x option to bug issue template by @wForget in #11019
- [GLUTEN-10963][VL] Fix the validation for merge_extract companion function by @rui-mo in #11007
- [CORE] Follow-up: switch to direct class references guarded by component's runtime-compatibility validation by @philo-he in #10999
- [VL] Remove support for BNLJ full outer join without condition by @zml1206 in #11021
- [CORE] Fix `ColumnarBatches#select` incorrect comments description by @zjuwangg in #11011
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_04) by @GlutenPerfBot in #11014
- [CH] Include iceberg module in spotless plugin and apply formatting fixes by @philo-he in #11015
- [GLUTEN-11016][VL] Validate regex pattern of split function by @wForget in #11017
- [VL] Add waitForPreloadSplitNanos in metric by @rui-mo in #11024
- [CORE] ComponentSuite: Add a test case against dependency cycles by @zhztheplayer in #11008
- [GLUTEN-10959][FLINK] Support function decimal_notequalto by @xinghuayu007 in #10960
- [GLUTEN-10794][VL] Register AzureClientProviderFactory on ABFS write by @zhouyuan in #10795
- [MINOR][DOC] Fix default value of velox spillPartitionBits by @wForget in #11026
- [GLUTEN-10988][VL] Do not resize batches for sort-based/rss-sort shuffle by @wForget in #10991
- [CORE] Exclude dependent components of incompatible components by @zhztheplayer in #11009
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_05) by @GlutenPerfBot in #11030
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_06) by @GlutenPerfBot in #11039
- [Minor][VL] Remove unused valueStreamNodeFactory by @jinchengchenghh in #11031
- [GLUTEN-10660][VL] Fix passing configurations for Velox backend for hashmap dedup by @zhouyuan in #11020
- [VL] Configure C++20 standard for dependency libraries by @philo-he in #11005
- [GLUTEN-11037][VL] Corrected logic for updating uniffle shuffle metrics by @wForget in #11038
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_07) by @GlutenPerfBot in #11047
- [GLUTEN-10926][VL] Add GCC-13 image for CI by @zhouyuan in #10993
- [CORE] Avoid getting duplicate resource files for component discovery by @philo-he in #11043
- [GLUTEN-11012][CH] Support arabic-indic digits in `unix_timestamp` by @lgbo-ustc in #11013
- [GLUTEN-10926][VL] Fix arm pipeline to use centos9 image by @zhouyuan in #11061
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_10) by @GlutenPerfBot in #11065
- [VL] Refactor getAggRelInternal in HashAggregateExecTransformer by @Zouxxyy in #11040
- [GLUTEN-11048][VL] Implement OrderPreservingNode/PartitioningPreservingNode for ColumnarPartialProjectExec by @wForget in #11049
- [GLUTEN-10636][VL]Use backend validation to find all unsupported expression by @jiangjiangtian in #10637
- [MINOR][VL] Fix unused variable when GPU was not enabled by @wForget in #11067
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_11) by @GlutenPerfBot in #11066
- [VL] Gluten-it: Allow empty configurations in dimension pattern by @zhztheplayer in #11042
- [GLUTEN-11106][VL] Spark 3.5 / Delta 3.3: Add DeltaSuite / UpdateSQLSuite test suites by @zhztheplayer in #11041
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_12) by @GlutenPerfBot in #11071
- [GLUTEN-11050] Regenerate input partitions for small files by @marin-ma in #11051
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_13) by @GlutenPerfBot in #11075
- [GLUTEN-9785][VL] Pass additional configuration options programmatically from Java to C++ by @jinchengchenghh in #11032
- [VL] Make VeloxResizeBatchesExec not inherit ColumnarToColumnarTransition by @zhztheplayer in #11072
- [GLUTEN-9034][VL] Fix the VeloxResizeBatch not add for ReusedExchange by @jinchengchenghh in #11069
- [VL] Remove unused memkind dependency after HBM removal by @philo-he in #11087
- [GLUTEN-10207][VL] adding spark40 unit test framework by @zhouyuan in #10725
- [GLUTEN-9577] Add macOS Velox build workflow by @wangyum in #11077
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_14) by @GlutenPerfBot in #11079
- [CORE] Use Substrait timestamp_tz for Spark TimestampType to preserve timezone-aware semantics by @liujiayi771 in #11074
- [MINOR] Replace match plan with startsWith by case match by @beliefer in #11084
- [VL] Remove buffering of sorted partitions in RSS writer to prevent OOM by @boneanxs in #11059
- [GLUTEN-11091][VL] Fix weekly build jobs on CentOS/Ubuntu platform by @zhouyuan in #11092
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_15) by @GlutenPerfBot in #11095
- [GLUTEN-11102][VL] Ban flaky CSV tests in Spark-4.0 by @zhouyuan in #11104
- [GLUTEN-11103][VL] Optimize Celeborn tests by @kerwin-zk in #11105
- [VL] Upgrade vcpkg builtin-baseline by @philo-he in #11004
- [GLUTEN-10926][VL] followup to fix docker image conflict by @zhouyuan in #11064
- [VL] Remove the test configurations related to Celeborn 0.4 by @kerwin-zk in #11114
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_17) by @GlutenPerfBot in #11110
- [Gluten-11099] Refactor the sameRows for GlutenQueryTest by @beliefer in #11081
- [INFRA][VL] Fix build failures for docker images by @philo-he in #11122
- [VL] Refactor: move arenas_ from VeloxShuffleWriter to VeloxHashShuffleWriter by @zhli1142015 in #11111
- [VL] Delta: Mark Delta unit tests as slow tests by @zhztheplayer in #11116
- [INFRA][VL] Follow-up: Fix build failures for docker images by @philo-he in #11124
- [VL] Fix overflow of pageNumber in VeloxSortShuffleWriter by @zhli1142015 in #11101
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_19) by @GlutenPerfBot in #11119
- [GLUTEN-10926][VL] Refine docker image build by @zhouyuan in #11126
- [VL] [Minor] fix install script by @FelixYBW in #11131
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_20) by @GlutenPerfBot in #11129
- [GLUTEN-10159][VL] Fix json tuple with dot-containing field names by @lyy-pineapple in #10939
- [VL] Suppress "uninitialized" warnings treated as errors in GCC 13 builds by @philo-he in #11118
- [GLUTEN-11106][VL] Spark 3.5 / Delta 3.3: Add DeltaInsertIntoTableSuite, DeltaDDLSuite by @zhztheplayer in #11107
- [GLUTEN-11133][VL] Refactor batch serialization API to defer the buffer copy from C++ code to Java code by @zhztheplayer in #11127
- [GLUTEN-11088][VL] Fix MiscOperatorSuite by @marin-ma in #11147
- [VL] RSS client should push complete rows by @wecharyu in #11123
- [GLUTEN-11108][CORE] Upgrade iceberg version to 1.10.0 for spark 3.4 and 3.5 by @jinchengchenghh in #11115
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_21) by @GlutenPerfBot in #11141
- [GLUTEN-10926][VL] Update Spark mirror to speed up downloading in CI docker image by @zhouyuan in #11143
- [GLUTEN-11062][CORE] Supports mixed input format for partitioned Hive table by @beliefer in #11113
- [GLUTEN-11088][VL] Fix ArithmeticAnsiValidateSuite by @marin-ma in #11150
- [MINOR] Rename checkGlutenOperatorMatch and checkGlutenOperatorCount by @beliefer in #11085
- [VL] Hot Fix CI failure by @marin-ma in #11165
- [GLUTEN-11062][FOLLOW-UP][TEST] Add test cases for other Spark version by @beliefer in #11154
- [GLUTEN-11088][VL] Fix NativeBenchmarkPlanGenerator by @marin-ma in #11144
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_22) by @GlutenPerfBot in #11152
- [GLUTEN-11155][CORE] Code cleanup for BasicScanExecTransformer by @beliefer in #11156
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_24) by @GlutenPerfBot in #11168
- [GLUTEN-11161][CORE] Refactor findAllLeafTransformers for WholeStageTransformer by @beliefer in #11162
- [GLUTEN-11159][CORE] Code cleanup for generateWholeStageTransformContext by @beliefer in #11160
- [VL] Allow Celeborn client compression to be configurable by @kerwin-zk in #11177
- [GLUTEN-11142][VL] Add timestampadd support for Spark 4.0 by @zml1206 in #11171
- [GLUTEN-11088] Fix GlutenDateExpressionsSuite in Spark-4.0 by @zml1206 in #11172
- [GLUTEN-10933][VL] Support resize GPU batch and shuffle reader outputs cudf::table by @jinchengchenghh in #11090
- [GLUTEN-11157][CORE] Replace NullPointerException with IllegalStateException for WholeStageTransformer by @beliefer in #11158
- [GLUTEN-11182][VL] Refactor genSplitInfo for VeloxIteratorApi by @beliefer in #11183
- [VL] Fallback Parquet scan if legacy timezone is found in file metadata by @zhztheplayer in #11117
- [GLUTEN-11088][VL] Fix try_cast in ScalarFunctionsValidateSuite for Spark 4.0 by @marin-ma in #11180
- [GLUTEN-11088][VL] Add raise_error restriction in spark 4.0 and enable ScalarFunctionsValidateSuite by @marin-ma in #11170
- [GLUTEN-11163][CORE] Refactor dynamicallySelectedPartitions for FileSourceScanExecShim by @beliefer in #11164
- [GLUTEN-11191][CORE] Refactor leafInputMetricsUpdater for WholeStageTransformer by @beliefer in #11192
- [GLUTEN-11088][VL] fix GlutenQueryExecutionErrorsSuite by @marin-ma in #11185
- [GLUTEN-11133][VL] Reduce on-heap memory allocation and memory copy when off-heap BHJ is enabled by @zhztheplayer in #11148
- [GLUTEN-11088][VL] Add config in GlutenCastSuite to cast to char/varchar by @marin-ma in #11181
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_11_26) by @GlutenPerfBot in #11193
- [GLUTEN-11175] Refactor generateMetadataColumns for SparkShims by @beliefer in #11176
- [GLUTEN-11088][VL] Fix Spark 4.0 exception wrap by @jinchengchenghh in #11200
- [GLUTEN-11088] Fix GlutenTakeOrderedAndProjectSuite in Spark-4.0 by @zml1206 in #11194
- Bump org.apache.hive:hive-llap-common from 2.3.10 to 4.0.0 in /gluten-ut/spark40 by @dependabot[bot] in #11188
- [GLUTEN-11052][CH] Disable ARM CI temporary by @zhouyuan in #11187
- [GLUTEN-11088][VL] Support test Spark-4.0 in intellij by @jinchengchenghh in #11198
- [GLUTEN-11088][VL] Fix the Spark4.0 storage partition join by @jinchengchenghh in #11184
- [GLUTEN-11088][VL] Fix Spark4.0 varchar type check suites by @jinchengchenghh in #11202
- [GLUTEN-11088][VL] Fix GlutenDeltaBasedMergeIntoTableSuite in Spark-4.0 by @zhli1142015 in #11204
- [VL] Simplify code in VeloxBloomFilter by @marin-ma in #11209
- [VL] Upgrade Uniffle version to 0.10.0 by @zuston in #10806
- [GLUTEN-11088][VL] fix CoalesceShufflePartitionsSuite by @marin-ma in #11201
- [GLUTEN-11088] Fix GlutenCollapseProjectExecTransformerSuite in Spark-4.0 by @zml1206 in #11206
- [GLUTEN-11052][CH] Fix CI trigger by @zhouyuan in #11214
- [GLUTEN-11088][VL] fix `bloomFilter` in GlutenDataFrameStatSuite by @marin-ma in #11211
- [GLUTEN-11088][VL] Fix GlutenDatasetSuite in Spark-4.0 by @zhli1142015 in #11197
- [GLUTEN-11088] Fix GlutenDataFrameFunctionsSuite in Spark-4.0 by @zml1206 in #11195
- [GLUTEN-8782][VL] Add correct suffix to the name of merge_extract companion function by @rui-mo in #11210
- [CORE] Refactor push down filter to scan by @Zouxxyy in #11153
- [GLUTEN-11169][VL] Fix OOM error not thrown when global off-heap memory is running out by off-heap BHJ by @zhztheplayer in #11208
- [GLUTEN-11221][VL] Remove JArrayList, JHashMap, JMap from VeloxIteratorApi by @beliefer in #11222
- [GLUTEN-9860][VL] Bump Hudi/Paimon version for Spark-4.0, adding nightly package for Spark-4.0 by @zhouyuan in #11189
- [GLUTEN-10748][FLINK] Fix functions `in/is_not_null/greaterthan/lessthan` can not be mapped to velox backend by @KevinyhZou in #10764
- [VL] Add virtual destructor in RowVectorStream by @Zouxxyy in #11228
- [VL] Extract common createHiveConnectorSessionConfig method and use it for Iceberg write by @Zouxxyy in #11227
- [VL] Clean up vcpkg overlay ports by @philo-he in #11121
- [VL] Disable Parquet metadata validation by default due to performance regression by @zhztheplayer in #11233
- [GLUTEN-11088][CORE] Fix Spark 4.0 GlutenDynamicPartitionPruningV1SuiteAEOn by @jinchengchenghh in #11212
- [GLUTEN-11166][TEST] Open check HiveTableScanExecTransformer by @beliefer in #11173
- [VL] Enable div function by @zhli1142015 in #11230
- [GLUTEN-11088][VL] Enable some ignored tests in Spark-4.0 by @jinchengchenghh in #11235
- [VL] Fast fail or resend data to uniffle ASSP by @zuston in #11213
- [GLUTEN-11224][VL] FileSourceScan check codec by @jinchengchenghh in #11225
- [GLUTEN-10892][VL] Use `veloxPreferredBatchBytes` to control the max size of memory of batches combined by @jiangjiangtian in #11140
- [VL] Support pass unmapped dynamic and static configs to Velox by @Yohahaha in #11229
- [VL] Update shuffle write metrics per batch by @Yohahaha in #11083
- [GLUTEN-11231][CORE] Improve regeneratePartition for PartitionsUtil by @beliefer in #11232
- [GLUTEN-11088][VL] Fix GlutenStreamingQuerySuite by @marin-ma in #11223
- [VL] Bump Folly to 2025.05.19 in VCPKG by @zhouyuan in #11186
- [GLUTEN-11203][CORE] Validate input format for Hive table's partition by @beliefer in #11205
- [GLUTEN-11174][FLINK] Support function `is null` by @xinghuayu007 in #11174
- [GLUTEN-11237][CORE] Refactor the implementation of TaskResources by @beliefer in #11238
- Bump fonttools from 4.52.4 to 4.61.0 in /tools/workload/benchmark_velox/analysis by @dependabot[bot] in #11241
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_12_03) by @GlutenPerfBot in #11247
- [VL] Use clean docker image for Celeborn tests by @zhouyuan in #11250
- [GLUTEN-11243][CORE] Refactor parseConfig for GlutenConfigUtil by @beliefer in #11244
- [GLUTEN-11012][CH]Support more local digits date by @lgbo-ustc in #11073
- [GLUTEN-11239][VL] Deprecate parquet encrypted check config by @jinchengchenghh in #11240
- [GLUTEN-10660][VL] Fix passing Velox session configurations on hash build optimizations by @zhouyuan in #11134
- [VL] Enable `SIMDJSON_SKIPUTF8VALIDATION` for simdjson build in setup script by @philo-he in #11257
- [VL] free disk space for GPU CI by @zhouyuan in #11258
- [GLUTEN-8821][VL] Weekly Update Velox function support docs (2025_12_08) by @GlutenPerfBot in #11265
- [GLUTEN-11245][CORE] Refactor getNativeSessionConf and getNativeBackendConf for GlutenConfig by @beliefer in #11246
- [INFRA] Add `--no-check-certificate` to `wget_opts` by @Zouxxyy in #11263
- [CORE] Enrich the scan node name with class and table name by @Zouxxyy in #11248
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_12_08) by @GlutenPerfBot in #11266
- [GLUTEN-11259][VL][CORE] Reserve memory for some vector by @beliefer in #11260
- [VL] Allow the use of Gzip codec for shuffle compression by @philo-he in #11242
- [VL] Delta: Support overwrite mode in write by @zhztheplayer in #11226
- [GLUTEN-10660][VL][FOLLOWUP] Simplify getNativeSessionConf for GlutenConfig by @beliefer in #11268
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_12_09) by @GlutenPerfBot in #11270
- [VL] Document enhanced features by @FelixYBW in #11272
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_12_10) by @GlutenPerfBot in #11273
- [GLUTEN-11274][VELOX] Refactor Code by @xinghuayu007 in #11274
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_12_11) by @GlutenPerfBot in #11280
- [GLUTEN-11088][VL] Add GlutenTests for get_json_object and schema merging by @baibaichen in #11276
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_12_12) by @GlutenPerfBot in #11286
- [VL] Update dedup hash build configuration name by @marin-ma in #11289
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_12_13) by @GlutenPerfBot in #11291
- [VL] Remove unused parameters in shuffle reader by @xinghuayu007 in #11285
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_12_14) by @GlutenPerfBot in #11293
- [VL] Correct the conversion from `AggregateRel.groupings` to `AggregationNode.groupingKeys` by @Zouxxyy in #11264
- [GLUTEN-11169][VL] Trigger GC before throwing OOM to ensure unused off-heap broadcasted relations are correctly released by @zhztheplayer in #11236
- [GLUTEN-11283][VL] Respect spark default zstd compression level by @wecharyu in #11284
- [CORE] Get correct fallback reason on nodes without logicalLink by @liujiayi771 in #11295
- [GLUTEN-11088][VL] Add compatibility layer for StructsToJson and StaticInvoke expressions across Spark versions by @baibaichen in #11294
- [GLUTEN-11088][VL] Spark 4.0: Fix ArrowEvalPythonExecSuite by @zhztheplayer in #11288
- [GLUTEN-10215][VL] Delta: Support update command in write by @zhztheplayer in #11271
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_12_16) by @GlutenPerfBot in #11301
- [GLUTEN-11088][VL] Fix GlutenParquetIOSuite compatibility issues for Spark 4.0 by @baibaichen in #11281
- [CORE] Update default Iceberg and Delta versions by @zml1206 in #11303
- [MINOR] Fix script name typos for TPC-H/DS data generation by @philo-he in #11305
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_12_17) by @GlutenPerfBot in #11306
- [VL] Disable parquet file metadata validation by default by @Yohahaha in #11307
- [Substrait] Remove unused enable_row_group_maxmin_index option by @EpsilonPrime in #11277
- [GLUTEN-9553][FLINK] Support kafka connector for data source by @KevinyhZou in #11312
- [GLUTEN-9456][VL] Release Velox memory manager when the reservedBytes become empty by @rui-mo in #11249
- [GLUTEN-11302][VL] Fix gpu build by bumping to cuda-13.1 by @zhouyuan in #11275
- [DOC] Update docs to remove references to non-existent directory by @tomscut in #11310
- [INFRA] Remove TPP.txt by @yaooqinn in #11314
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_12_19) by @rui-mo in #11311
- [CORE] Replace Component BuildInfo with String Map by @Zouxxyy in #11292
- [VL] Minor refactor to replace START_TIMEING with SCOPE_TIMER by @Yohahaha in #11320
- [GLUTEN-11316][VL] Enable JDK-21 for Spark-4.0 by @zhouyuan in #11304
- [GLUTEN-11088][VL] Adding SQLQueryTestSuite for Spark-4.0 by @zhouyuan in #11136
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_12_22) by @zhouyuan in #11323
- Update license wording in README.md by @yaooqinn in #11324
- Add a project-wide mvn build script by @yaooqinn in #11322
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_12_24) by @zhouyuan in #11327
- [GLUTEN-10064][FLINK] Support filesystem sink by @KevinyhZou in #11300
- [GLUTEN-10727][CH]Daily Update Clickhouse Version (20250916) by @zzcclp in #11299
- [GLUTEN-11340][CORE][VL][CH] Fix Compatibility issues addressed in Spark 4.1 by @baibaichen in #11313
- [GLUTEN-11341][CORE] Support Scala 2.13 with IntelliJ IDE by @baibaichen in #11342
- Bump org.apache.kafka:kafka_2.12 from 3.4.0 to 3.9.1 in /gluten-flink/ut by @dependabot[bot] in #11317
- [MINOR] Refactor Spark 40 Test suite by @baibaichen in #11344
- [GLUTEN-6887][VL] Daily Update Velox Version (2025_12_29) by @jinchengchenghh in #11337
- [GLUTEN-11330][VL] Make PartialProject support array and map with null values by @jiangjiangtian in #11331
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_01_02) by @GlutenPerfBot in #11349
- INFRA: Open rebase for large commit by @baibaichen in #11360
- [GLUTEN-11088][VL] Remove excluded tests in VeloxTestSettings following Velox upgrade by @baibaichen in #11359
- Use build/mvn wrapper in velox_backend_x86 workflow by @yaooqinn in #11332
- [GLUTEN-11357][FLINK] Bump velox4j by @lgbo-ustc in #11361
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_01_05) by @GlutenPerfBot in #11356
- [GLUTEN-11145][VL] Iceberg test supports Spark3.5 and Spark4.0 by @jinchengchenghh in #11146
- [VL] Fix memory leak caused by hard references on keys of TreeMemoryConsumers#FACTORIES by @zhztheplayer in #11362
- [GLUTEN-10933][VL]fix: Fix the CudfVector childreSize is not correct by @jinchengchenghh in #11326
- Revert "[GLUTEN-11145][VL] Iceberg test supports Spark3.5 and Spark40" by @jinchengchenghh in #11374
- [GLUTEN-11346][CORE][VL] Add Spark 4.1 Shim Layer by @baibaichen in #11347
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_01_07) by @GlutenPerfBot in #11375
- [VL] Add config spark.gluten.sql.columnar.backend.velox.maxExtendedPartialAggregationMemory by @Zouxxyy in #11354
- [MINOR] Code cleanup: Remove deprecated files and fix typos by @baibaichen in #11370
- [GLUTEN-11088][VL] Follow-up: Spark 4.0: Fix remaining error in ArrowEvalPythonExecSuite by @zhztheplayer in #11377
- [GLUTEN-11251] Fix incorrect whole stage id in WholeStageTransformerExec by @marin-ma in #11252
- [GLUTEN-11343][CORE][VL] Support Spark 4.1 UT by @baibaichen in #11353
- [GLUTEN-11251] Follow-up: Enable ut SPARK-47939 for spark 4.1 by @marin-ma in #11381
- [GLUTEN-11088][VL] Mark test case `SPARK-47289: extended explain info` as non-fixable by @zhztheplayer in #11382
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_01_08) by @GlutenPerfBot in #11378
- [GLUTEN-11341][VL] Add iceberg shim to fix deprecated method by @jinchengchenghh in #11367
- [MINOR] Improve logs for TPC-H/DS by @tomscut in #11384
- [GLUTEN-10950][VL][FOLLOWUP] Fix missing time unit in hadoop-aws time configuration (pre-3.4 versions) by @xumanbu in #11329
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_01_10) by @GlutenPerfBot in #11389
- [TEST] Fix includeAllGlutenTests/excludeAllGlutenTests to use prefix matching by @Angryrou in #11345
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_01_12) by @GlutenPerfBot in #11393
- [GLUTEN-11027][VL][CI] Add clang-tidy to check cpp code by @xinghuayu007 in #11120
- [Fix] Update artifact download path for velox-native-lib for spark 41 by @baibaichen in #11399
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_01_13) by @GlutenPerfBot in #11396
- [GLUTEN-11355][UT] Add new Spark 4.1 tests by @baibaichen in https://github.com/apache/incubator-gluten/pull/11380
- [VL] Do not rewriteAggBufferAttributes for Complete agg mode by @Zouxxyy in https://github.com/apache/incubator-gluten/pull/11358
- [GLUTEN-11364][FLINK] Improve the job graph generation code by @lgbo-ustc in https://github.com/apache/incubator-gluten/pull/11365
- [CORE][VL] Handle exception in Schema validation for unsupported join types by @Surbhi-Vijay in https://github.com/apache/incubator-gluten/pull/11387
- Bump dawidd6/action-download-artifact from 3 to 6 in /.github/workflows by @dependabot[bot] in https://github.com/apache/incubator-gluten/pull/11409
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_01_14) by @GlutenPerfBot in https://github.com/apache/incubator-gluten/pull/11408
- [GLUTEN-11401][VL] Add missing scenario for unsupported agg function by @marin-ma in https://github.com/apache/incubator-gluten/pull/11418
- [VL] Delta write / Spark 3.2, 3.3 write: Fix null sink memory pool causing crash with remote file systems by @zhztheplayer in https://github.com/apache/incubator-gluten/pull/11410
- [VL] Delta write / Spark 3.2, 3.3 write: Fix too-small partition number limit in dynamic partition writer by @zhztheplayer in https://github.com/apache/incubator-gluten/pull/11420
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_01_15) by @GlutenPerfBot in https://github.com/apache/incubator-gluten/pull/11424
- [GLUTEN-11368][VL] Enable failed AQE UT by @marin-ma in https://github.com/apache/incubator-gluten/pull/11411
- [VL] Allow calling TaskResources.runUnsafe in Spark task code by @zhztheplayer in https://github.com/apache/incubator-gluten/pull/11405
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_01_16) by @GlutenPerfBot in https://github.com/apache/incubator-gluten/pull/11433
- [GLUTEN-11432][VL] Bump Maven from 3.9.2 to 3.9.12 by @zhouyuan in https://github.com/apache/incubator-gluten/pull/11431
- [VL] Restrict `-Wno-class-memaccess` flag to GNU compiler by @liujiayi771 in https://github.com/apache/incubator-gluten/pull/11429
- [GLUTEN-11413][VL] Add a patch for Arrow to support IBM power by @marin-ma in https://github.com/apache/incubator-gluten/pull/11415
- Fix CH CI build by @jinchengchenghh in https://github.com/apache/incubator-gluten/pull/11439
- [GLUTEN-11088][VL] Fall back CSV reader by @jinchengchenghh in https://github.com/apache/incubator-gluten/pull/11190
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_01_19) by @GlutenPerfBot in https://github.com/apache/incubator-gluten/pull/11443
- [GLUTEN-11425][CORE] Upgrade Spark 4.1.0 to 4.1.1 by @baibaichen in https://github.com/apache/incubator-gluten/pull/11426
- [GLUTEN-11412][FLINK] Refine watermark assigner by @lgbo-ustc in https://github.com/apache/incubator-gluten/pull/11414
- [GLUTEN-11106][VL] Spark 3.5 / Delta 3.3: Add several suites for delta optimizations by @zhztheplayer in https://github.com/apache/incubator-gluten/pull/11430
- [GLUTEN-10903][VL] Install cudf libraries in docker to simplify link and accelerate build by @jinchengchenghh in https://github.com/apache/incubator-gluten/pull/11386
- [CORE] Deprecate and remove Spark 3.2 support by @QCLyu in https://github.com/apache/incubator-gluten/pull/11351
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_01_20) by @GlutenPerfBot in https://github.com/apache/incubator-gluten/pull/11447
- [CORE] Fix numFiles metric not being populated in FileSourceScanExecTransformer by @acvictor in https://github.com/apache/incubator-gluten/pull/11459
- [GLUTEN-11316][VL] Add Spark 4.1 with JDK 21 to nightly builds by @Mariamalmesfer in https://github.com/apache/incubator-gluten/pull/11442
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_01_22) by @GlutenPerfBot in https://github.com/apache/incubator-gluten/pull/11467
- [VL][CI] Fix CI reporting success despite command failures by adding 'set -e' by @jinchengchenghh in https://github.com/apache/incubator-gluten/pull/11462
- [GLUTEN-11445][CORE] Fix dynamicOffHeapSizingEnabled config not takin… by @zhli1142015 in https://github.com/apache/incubator-gluten/pull/11448
- [GLUTEN-10215][VL] Delta Write: Offload DeltaOptimizedWriterExec by @zhztheplayer in https://github.com/apache/incubator-gluten/pull/11461
- [GLUTEN-11400][CORE] Implement partitioning-aware union for ColumnarUnionExec by @baibaichen in https://github.com/apache/incubator-gluten/pull/11455
- [GLUTEN-11390][VL] Align aarch64 compiler flags with Velox to fix xsimd initialization errors by @xumanbu in https://github.com/apache/incubator-gluten/pull/11457
- [CORE] Consolidate duplicate logDebug statements in OffloadSingleNodeRules by @liujiayi771 in https://github.com/apache/incubator-gluten/pull/11398
- [GLUTEN-11406][VL][UT][MINOR] Fix unstable memory test in UnsafeColumnarBuildSideRelationTest by @baibaichen in https://github.com/apache/incubator-gluten/pull/11475
- [VL] Fix typo of `spark.gluten.auto.AdjustStageResource.enabled=true` by @taiyang-li in https://github.com/apache/incubator-gluten/pull/11486
- [GLUTEN-11432][FOLLOWUP] Bump project level Maven to 3.9.12 by @zhouyuan in https://github.com/apache/incubator-gluten/pull/11470
- Cleanup the GPU docker script by @FelixYBW in https://github.com/apache/incubator-gluten/pull/11482
- [GLUTEN-11473][VL] Add null check for array_ when releasing JavaRssClient by @wForget in https://github.com/apache/incubator-gluten/pull/11474
- [GLUTEN-10215][VL] Delta Write: Fix redundant C2R2C transition by @zhztheplayer in https://github.com/apache/incubator-gluten/pull/11478
- [INFRA] Minor follow-up for PR #11482, remove SPARK_HOME setting from cudf docker build by @philo-he in https://github.com/apache/incubator-gluten/pull/11490
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_01_26) by @GlutenPerfBot in https://github.com/apache/incubator-gluten/pull/11487
- [GLUTEN-11451][CH] Fix CH BNLJ left outer build side validation by @zhanglistar in https://github.com/apache/incubator-gluten/pull/11466
- Bump wheel from 0.44.0 to 0.46.2 in /tools/workload/benchmark_velox/analysis by @dependabot[bot] in https://github.com/apache/incubator-gluten/pull/11480
- [VL] Adding RHEL 9.7 support by @zhouyuan in https://github.com/apache/incubator-gluten/pull/11460
- [INFRA] Add whether it is an AI-generated tag in the PR and issue template by @zml1206 in https://github.com/apache/incubator-gluten/pull/11492
- [GLUTEN-11497][FLINK] Refine watermark process by @lgbo-ustc in https://github.com/apache/incubator-gluten/pull/11477
- [CORE] Add newline before top-level import statements in scalafmt by @liujiayi771 in https://github.com/apache/incubator-gluten/pull/11502
- [GLUTEN-10933][VL] BroadcastExchange outputs cudf::table by @jinchengchenghh in https://github.com/apache/incubator-gluten/pull/11441
- [VL] Fix object leak on `TreeMemoryConsumers#FACTORIES` by @zhztheplayer in https://github.com/apache/incubator-gluten/pull/11500
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_01_27) by @GlutenPerfBot in https://github.com/apache/incubator-gluten/pull/11498
- [CI] Fix clang-tidy workflow by @philo-he in https://github.com/apache/incubator-gluten/pull/11507
- [GLUTEN-11485][VL] Fix the race condition in ArrowMemoryPool by @marin-ma in https://github.com/apache/incubator-gluten/pull/11493
- Bump org.assertj:assertj-core from 3.26.3 to 3.27.7 in /backends-velox by @dependabot[bot] in https://github.com/apache/incubator-gluten/pull/11499
- [VL] Use build/mvn wrapper in velox_backend_enhanced and velox_backend_arm workflows by @yaooqinn in https://github.com/apache/incubator-gluten/pull/11496
- [VL] Enable CSV test in Spark41 by @jinchengchenghh in https://github.com/apache/incubator-gluten/pull/11449
- [MINOR] Add missing profiles in `format-scala-code.sh` by @liujiayi771 in https://github.com/apache/incubator-gluten/pull/11506
- [VL] Fix stoi issue when get parquet write options by @boneanxs in https://github.com/apache/incubator-gluten/pull/11504
- [GLUTEN-9456][VL] Add custom direct buffered input by @rui-mo in https://github.com/apache/incubator-gluten/pull/11452
- [MINOR] Correct the native write check in VeloxParquetWriteSuite by @Zouxxyy in https://github.com/apache/incubator-gluten/pull/11391
- Bump org.assertj:assertj-core from 3.24.2 to 3.27.7 in /gluten-flink/ut by @dependabot[bot] in https://github.com/apache/incubator-gluten/pull/11517
- [VL] Make the filename written by Iceberg Native consistent with in Java by @Zouxxyy in https://github.com/apache/incubator-gluten/pull/11435
- [UT] Add missing Gluten test suites for Spark 4.0 and 4.1 by @baibaichen in https://github.com/apache/incubator-gluten/pull/11512
- [GLUTEN-11012][CH] Support Thai/Khmer digit dates in CH by @zhanglistar in https://github.com/apache/incubator-gluten/pull/11476
- [GLUTEN-9619][VL] Add HDFS integration tests to gluten-it by @infvg in https://github.com/apache/incubator-gluten/pull/11373
- [CORE] Refactor `ArrowColumnarArray` to reduce code duplication by @philo-he in https://github.com/apache/incubator-gluten/pull/11525
- [MINOR] Add GCS Maven Central mirror for faster dependency downloads by @yaooqinn in https://github.com/apache/incubator-gluten/pull/11545
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_01_30) by @GlutenPerfBot in https://github.com/apache/incubator-gluten/pull/11529
- [DOC] Add Mac development document by @jinchengchenghh in https://github.com/apache/incubator-gluten/pull/11547
- [GLUTEN-11403][VL] Validate attribute existence before binding in ExpressionConverter by @baibaichen in https://github.com/apache/incubator-gluten/pull/11546
- [MINOR] Gitignore GitHub Copilot config files by @liuneng1994 in https://github.com/apache/incubator-gluten/pull/11557
- [CORE] Use build/mvn wrapper for scheduled jobs and in Dockerfiles by @yaooqinn in https://github.com/apache/incubator-gluten/pull/11515
- [GLUTEN-11555][VL] Fix cudf validation by @marin-ma in https://github.com/apache/incubator-gluten/pull/11556
- [VL] Fix raise_error support for constant-folded MapData in Spark 4.x by @acvictor in https://github.com/apache/incubator-gluten/pull/11558
- [GLUTEN-11514][VL] Refactor plan execution by adding `addIteratorSplits` and `noMoreSplits` methods to the plan execution API by @zhztheplayer in https://github.com/apache/incubator-gluten/pull/11527
- [GLUTEN-11509][VL] Make TreeMemoryConsumer thread-safe by @malinjawi in https://github.com/apache/incubator-gluten/pull/11553
- [CORE][VL] Fix protobuf memory leak in JNI_OnUnload by @clee704 in https://github.com/apache/incubator-gluten/pull/11532
- [VL] Fix parquet write document by @FelixYBW in https://github.com/apache/incubator-gluten/pull/11536
- [VL] Support Iceberg WriteToDataSourceV2 by @Zouxxyy in https://github.com/apache/incubator-gluten/pull/11533
- [GLUTEN-11550][UT] Fix GlutenLogQuerySuite by @kapilks in https://github.com/apache/incubator-gluten/pull/11565
- [GLUTEN-11550][CORE][UT] Fix GlutenSingleJoinSuite for Spark 4.0+ by @baibaichen in https://github.com/apache/incubator-gluten/pull/11577
- [GLUTEN-11383][VL] Allow bloom filter pushdown in hash probe by @infvg in https://github.com/apache/incubator-gluten/pull/11392
- [GLUTEN-11402][VL] fix reading decimal partition key by @zhouyuan in https://github.com/apache/incubator-gluten/pull/11518
- [GLUTEN-11569][VL] Following #11527, fix the error calling #noMoreSplits on cuDF value stream nodes by @zhztheplayer in https://github.com/apache/incubator-gluten/pull/11572
- Bump pip from 24.2 to 26.0 in /tools/workload/benchmark_velox/analysis by @dependabot[bot] in https://github.com/apache/incubator-gluten/pull/11564
- [VL][CI] Migrate Spark 4.1 tests to CentOS 9 by @ReemaAlzaid in https://github.com/apache/incubator-gluten/pull/11519
- [MINOR] Remove scan filter extraction logs from `BatchScanExecTransformerBase` by @Zouxxyy in https://github.com/apache/incubator-gluten/pull/11573
- [MINOR] Fix deprecated Delta Lake API usages in velox test code by @baibaichen in https://github.com/apache/incubator-gluten/pull/11594
- [GLUTEN-11559][Build] Improve incremental build time for test-compile phase by @baibaichen in https://github.com/apache/incubator-gluten/pull/11560
- [GLUTEN-6887][VL] Daily Update Velox Version (2026_02_06) by @GlutenPerfBot in https://github.com/apache/incubator-gluten/pull/11583
- [Tools] Add explicit HDFS URI scheme support for qualification-tool by @tomscut in https://github.com/apache/incubator-gluten/pull/11602
- [Tools] Improve error handling and data sanitization in UnsupportedOperators report by @tomscut in https://github.com/apache/incubator-gluten/pull/11601
- [GLUTEN-11417][VL] Enhance qualification tool to support lakehouse format detection by @ruoliu2 in https://github.com/apache/incubator-gluten/pull/11436
- [VL] Enhance VeloxIceberg documentation with SQL options by @FelixYBW in https://github.com/apache/incubator-gluten/pull/11586
- [VL] Add S3 integration gluten tests by @Mariamalmesfer in https://github.com/apache/incubator-gluten/pull/11516
- [INFRA] Release 1.6: Port "Cleanup for TLP release process (#11696)" by @zhztheplayer in https://github.com/apache/incubator-gluten/pull/11701
- [CORE] Release 1.6: Bump version to 1.6.0 (RC1) by @zhztheplayer in https://github.com/apache/incubator-gluten/pull/11702
New Contributors
- @pgaref made their first contribution in #10848
- @Yifeng-Wang made their first contribution in #9107
- @xumanbu made their first contribution in #11329
- @Angryrou made their first contribution in #11345
- @QCLyu made their first contribution in https://github.com/apache/incubator-gluten/pull/11351
- @Mariamalmesfer made their first contribution in https://github.com/apache/incubator-gluten/pull/11442
- @infvg made their first contribution in https://github.com/apache/incubator-gluten/pull/11373
- @malinjawi made their first contribution in https://github.com/apache/incubator-gluten/pull/11553
- @ruoliu2 made their first contribution in https://github.com/apache/incubator-gluten/pull/11436
Full Changelog: v1.5.0...v1.6.0