[mt] Split up gradient types for the GPU hist. #11798
Conversation
- Use reduced gradient for tree structure exploration. The PR adds a gradient container that holds two different gradient types: one for tree splits and the other for leaf values. This is an optimization for vector-leaf to reduce the overhead of finding the tree structure.
Pull Request Overview
This PR introduces support for reduced gradients in multi-target tree construction, enabling different gradients for split evaluation versus leaf value calculation. This is part of an experimental feature for improving multi-target learning.
Key changes:
- Introduces a new GradientContainer struct to hold both split gradients and optional value gradients
- Adds an experimental TreeObjective Python class with a split_grad method for custom gradient reduction (a hedged sketch follows this list)
- Refactors tree updaters and booster code to use GradientContainer instead of raw gradient matrices
- Renames SetLeaf to SetRoot for multi-target trees to clarify semantics
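To make the two gradient types concrete, here is a minimal sketch of a custom tree objective that reduces the per-target gradient for split evaluation. The class and method names follow the summary above (TreeObjective, split_grad), but the exact signatures and call convention are assumptions rather than the finalized API, and the class is written standalone instead of subclassing the new module.

```python
import numpy as np

# Hypothetical custom objective mirroring the experimental TreeObjective
# interface described above (signatures assumed, not the finalized API).
# The regular gradient/hessian pair is used for the leaf values, while
# split_grad returns a reduced gradient used only for tree-structure search.
class ReducedGradSquaredError:
    def __call__(self, predt: np.ndarray, y: np.ndarray):
        # Full per-target gradient and hessian, shape (n_samples, n_targets).
        grad = predt - y
        hess = np.ones_like(grad)
        return grad, hess

    def split_grad(self, grad: np.ndarray, hess: np.ndarray):
        # Collapse the per-target gradients into a single column so split
        # evaluation only handles one gradient value per sample.
        return grad.sum(axis=1, keepdims=True), hess.sum(axis=1, keepdims=True)
```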
Reviewed Changes
Copilot reviewed 54 out of 54 changed files in this pull request and generated 2 comments.
| File | Description |
|---|---|
| include/xgboost/gradient.h | New header defining GradientContainer struct for holding split and value gradients |
| include/xgboost/tree_updater.h | Updates Update signature to accept GradientContainer* instead of gradient matrix |
| include/xgboost/tree_model.h | Renames SetLeaf to SetRoot and adds SetLeaves method for batch leaf updates |
| include/xgboost/multi_target_tree_model.h | Updates multi-target tree API to use SetRoot and adds SetLeaves |
| include/xgboost/learner.h | Updates BoostOneIter signature to accept GradientContainer |
| include/xgboost/gbm.h | Updates DoBoost signature to accept GradientContainer |
| include/xgboost/objective.h | Changes parameter type from std::int32_t to bst_target_t for consistency |
| include/xgboost/linalg.h | Adds EmptyLike utility function for tensor creation |
| src/tree/updater_gpu_hist.cuh | Implements reduced gradient support with separate split/value quantizers |
| src/tree/updater_gpu_hist.cu | Updates GPU histogram builder to handle gradient container |
| src/tree/leaf_sum.cuh | New file implementing leaf gradient sum calculations for GPU |
| src/tree/leaf_sum.cu | Implementation of leaf weight calculation from value gradients |
| src/tree/multi_target_tree_model.cc | Refactors weight setting with SetRoot and implements SetLeaves |
| src/tree/tree_model.cc | Adds SetLeaves wrapper method |
| src/tree/updater_*.cc | Updates all tree updaters to accept GradientContainer |
| src/gbm/gbtree.cc | Refactors boosting logic to handle gradient containers |
| src/gbm/gblinear.cc | Updates linear booster to use gradient container |
| src/learner.cc | Updates learner to use gradient containers throughout |
| src/c_api/c_api.cc | Adds experimental XGBoosterTrainOneIterWithObj API and renames function |
| src/c_api/c_api.cu | Renames CopyGradientFromCUDAArrays to CopyGradientFromCudaArrays |
| src/common/device_helpers.cuh | Removes deprecated DebugSyncDevice function |
| src/common/device_debug.cuh | New file with debug utilities moved from device_helpers |
| src/common/algorithm.h | Removes incorrect GPU check from ArgSort |
| python-package/xgboost/objective.py | New module with experimental objective interface |
| python-package/xgboost/core.py | Implements support for tree objectives with split gradients |
| python-package/xgboost/testing/multi_target.py | Adds comprehensive tests for reduced gradient feature |
| python-package/xgboost/testing/__init__.py | Fixes ls_obj to handle multi-dimensional arrays |
| tests/python-gpu/test_gpu_multi_target.py | Adds test for reduced gradient on GPU |
| tests/cpp/**/test_*.cc | Updates test files to use GradientContainer |
cc @rongou
The PR adds a gradient container with two different gradient types: one for tree splits and the other for leaf values. This is an optimization for vector-leaf to reduce the overhead of finding the tree structure. Currently, the interface is exposed through a custom objective function by creating a specialized tree objective.
An alternative approach would be to decouple the dimension-reduction function from the objective function by introducing an additional parameter, such as grad_reducer. This is actually the first approach I tried. I don't have a strong opinion on which one we should choose. The objective approach makes it easier to associate the reduced gradient with the prediction and label, but may be less modular. A sketch of this alternative is included below the reference.
ref #9043
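For comparison, the following is a minimal sketch of the decoupled alternative, where the reduction is passed separately from the objective. The grad_reducer parameter is hypothetical and was not adopted in this PR; it only illustrates the trade-off discussed above.

```python
import numpy as np

# Hypothetical standalone reducer for the alternative design: the objective
# stays unchanged and only produces per-target gradients, and this reducer
# collapses them for tree-structure search.
def sum_reducer(grad: np.ndarray, hess: np.ndarray):
    return grad.sum(axis=1, keepdims=True), hess.sum(axis=1, keepdims=True)

# The `grad_reducer` argument in the call below is an assumption, not an
# actual xgboost.train parameter:
# booster = xgboost.train(
#     params, dtrain, obj=squared_error_obj, grad_reducer=sum_reducer
# )
```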