Pull in intel/llvm changes to main - Fri 21st Feb #2719

kbenzie · 2025-02-21T14:59:38Z

We were reading the kernel arguments at kernel execution time, but kernel arguments are allowed to change between enqueuing and executing. Make sure to create a copy of kernel arguments ahead of time. This was previously approved as a unified-runtime PR: oneapi-src#2700

Update UMF to the commit: ``` commit 5a515c56c92be75944c8246535c408cee7711114 Author: Lukasz Dorau <[email protected]> Date: Mon Feb 17 10:56:05 2025 +0100 Merge pull request oneapi-src#1086 from vinser52/svinogra_l0_linking ``` to fix the issue in LLVM (SYCL/CUDA): intel/llvm#16944 [SYCL][CUDA] Nsys profiling broken after memory providers change Moved from: oneapi-src#2708 Fixes: intel/llvm#16944 Signed-off-by: Lukasz Dorau <[email protected]>

Adds implements calls shared between command buffer and queue in unified-runtime level-zero v2 adapter and moves the shared code to `command_list_manager.cpp`

As discussed in oneapi-src#2670 (comment) the `pCommandBufferDesc` parameter to `urCommandBufferCreateExp` is optional. However, the UR spec doesn't state what the configuration of the created command-buffer is when this isn't passed, and being optional is also inconsistent with the description parameters to urSamplerCreate & urMemImageCreate which are not optional. This PR updates the descriptor parameter to command-buffer creation to be mandatory to address these concerns. Closes oneapi-src#2673 **Note**: This UR patch was previously approved and ready-to-merge in oneapi-src#2676 prior to the repo move

After the [spec bump of cl_khr_command_buffer to 0.9.7](https://github.com/KhronosGroup/OpenCL-Docs/), in the OpenCL adapter we no longer need to worry about the in-order/out-of-order property of the internal queue used on command-command-buffer creation matching the queue used to enqueue the command-buffer. We can therefore take advantage of the in-order flag passed on UR command-buffer creation to use an in-order queue for command-buffer creation, and omit using sync points. **Note:** This UR patch was previously approved and ready-to-merge prior to the UR repo move in oneapi-src#2681

Fixes #16677 by only setting `-pie` linker option in Release builds on executables rather than on any type of target.

…free related error (#16706) UR: oneapi-src#2592 --------- Co-authored-by: Kenneth Benzie (Benie) <[email protected]>

… and improve its conformance test (#17067) Migrated from oneapi-src#2533 This patch turns all of the values returned by urEventGetProfilingInfo to be optional and updates adapters to handle this by returning the appropriate enum when it is not supported. The tests have also been updated, to ensure that returning a counter of "0" or values equal to the previous profiling event is no longer considered a failure.

- Fix group count not being recalculated when a user only passes a new local work size and no new global size - Remove CTS test skips for local update on L0

MSVC warns about a possible uninitialized variable. This is a false positive but explicitly initializing always is harmless, so do this.

This is a first step towards reenabling UR performance testing CI. This introduces the reusable yml workflow and a way to trigger it manually. Here's an example how it looks: pbalcer/llvm#2 (comment)

Use UMF Proxy pool manager with UMF CUDA memory provider in UR. UMF Proxy pool manager is just a wrapper for the UMF memory provider (CUDA memory provider in this case) plus it adds also tracking of memory allocations. Moved from: oneapi-src#2659 Signed-off-by: Lukasz Dorau <[email protected]>

hvdijk and others added 13 commits February 21, 2025 14:56

Support device memory sanitizer for DG2 GPU device (#16619)

dcb022f

Add remaining calls shared with queue in level-zero v2 adapter (#17061)

441720f

Adds implements calls shared between command buffer and queue in unified-runtime level-zero v2 adapter and moves the shared code to `command_list_manager.cpp`

Don't set -pie on shared objects (#16880)

a239c8f

Fixes #16677 by only setting `-pie` linker option in Release builds on executables rather than on any type of target.

Fix throw "UR_RESULT_ERROR_INVALID_ARGUMENT" exception when catching …

a6e7ea7

…free related error (#16706) UR: oneapi-src#2592 --------- Co-authored-by: Kenneth Benzie (Benie) <[email protected]>

Fix issue with command-buffer local mem update (#17069)

fd4508a

- Fix group count not being recalculated when a user only passes a new local work size and no new global size - Remove CTS test skips for local update on L0

Fix MSVC build. (#17090)

cf84b75

MSVC warns about a possible uninitialized variable. This is a false positive but explicitly initializing always is harmless, so do this.

add manually triggered benchmark action (#17088)

99fa00b

This is a first step towards reenabling UR performance testing CI. This introduces the reusable yml workflow and a way to trigger it manually. Here's an example how it looks: pbalcer/llvm#2 (comment)

martygrant approved these changes Feb 21, 2025

View reviewed changes

kbenzie merged commit dd7d5c6 into oneapi-src:main Feb 21, 2025
10 checks passed

kbenzie deleted the benie/main branch February 21, 2025 16:08

kbenzie temporarily deployed to github-pages February 21, 2025 16:24 — with GitHub Actions Inactive

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Pull in intel/llvm changes to main - Fri 21st Feb #2719

Pull in intel/llvm changes to main - Fri 21st Feb #2719

Uh oh!

kbenzie commented Feb 21, 2025

Uh oh!

Uh oh!

Uh oh!

Pull in intel/llvm changes to main - Fri 21st Feb #2719

Pull in intel/llvm changes to main - Fri 21st Feb #2719

Uh oh!

Conversation

kbenzie commented Feb 21, 2025

Uh oh!

Uh oh!

Uh oh!