Update alpaka to 2.1.1 "Fixed Transformation" [16.0.x] by fwyzard · Pull Request #10260 · cms-sw/cmsdist

fwyzard · 2025-12-19T10:20:20Z

This release implements a new interface, similar to std::transform, that simplifies writing asynchronous parallel algorithms across all back-ends. SYCL support is extended to NVIDIA and AMD GPUs.
The release introduces unified memory and expands asynchronous memory allocation to buffers of any dimension. Interoperability with standard C++ is improved through std::span support: alpaka buffers expose a span interface, and any std::span can be used as an alpaka view. It adds compile-time warp-size definitions, extends atomic increment and decrement operations and fixes their behaviour on CPU back-end; it introduces a C++ concept for alpaka accelerators together with new type traits, along with many smaller fixes and improvements. The CI has been updated to test newer operating systems and compilers, including Clang 20 and ROCm 6.3, 6.4, and 7.0.

The full list of changes is available in the ChangeLog.

This release implements a new interface, similar to std::transform, that simplifies writing asynchronous parallel algorithms across all back-ends. SYCL support is extended to NVIDIA and AMD GPUs. The release introduces unified memory and expands asynchronous memory allocation to buffers of any dimension. Interoperability with standard C++ is improved through std::span support: alpaka buffers expose a span interface, and any std::span can be used as an alpaka view. It adds compile-time warp-size definitions, extends atomic increment and decrement operations and fixes their behaviour on CPU back-end; it introduces a C++ concept for alpaka accelerators together with new type traits, along with many smaller fixes and improvements. The CI has been updated to test newer operating systems and compilers, including Clang 20 and ROCm 6.3, 6.4, and 7.0. The full list of changes is available in the ChangeLog.

fwyzard · 2025-12-19T10:20:29Z

enable gpu

fwyzard · 2025-12-19T10:20:32Z

please test

cmsbuild · 2025-12-19T10:20:45Z

A new Pull Request was created by @fwyzard for branch IB/CMSSW_16_0_X/master.

@akritkbehera, @iarspider, @raoatifshad, @smuzaffar can you please review it and eventually sign? Thanks.
@ftenchini, @mandrenguyen, @sextonkennedy you are the release manager for this.
cms-bot commands are listed here

Backported from Update alpaka to 2.1.1 "Fixed Transformation" #10250

cmsbuild · 2025-12-19T10:20:46Z

cms-bot internal usage

fwyzard · 2025-12-19T10:21:05Z

backport #10250

cmsbuild · 2025-12-21T00:02:42Z

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-927186/50364/summary.html
COMMIT: e3c6e9d
CMSSW: CMSSW_16_0_X_2025-12-18-2300/el8_amd64_gcc13
Additional Tests: GPU,AMD_MI300X,AMD_W7900,NVIDIA_H100,NVIDIA_L40S,NVIDIA_T4
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmsdist/10260/50364/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

You potentially added 20 lines to the logs
ROOTFileChecks: Some differences in event products or their sizes found
Reco comparison results: 6 differences found in the comparisons
Reco comparison had 4 failed jobs
DQMHistoTests: Total files compared: 53
DQMHistoTests: Total histograms compared: 4280393
DQMHistoTests: Total failures: 73
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 4280300
DQMHistoTests: Total skipped: 20
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 52 files compared)
Checked 227 log files, 198 edm output root files, 53 DQM output files
TriggerResults: no differences found

AMD_MI300X Comparison Summary

Summary:

You potentially removed 1 lines from the logs
Reco comparison results: 257 differences found in the comparisons
Reco comparison had 6 failed jobs
DQMHistoTests: Total files compared: 11
DQMHistoTests: Total histograms compared: 149371
DQMHistoTests: Total failures: 29404
DQMHistoTests: Total nulls: 11
DQMHistoTests: Total successes: 119956
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 10 files compared)
Checked 42 log files, 45 edm output root files, 11 DQM output files
TriggerResults: no differences found

AMD_W7900 Comparison Summary

Summary:

You potentially removed 8 lines from the logs
Reco comparison results: 240 differences found in the comparisons
Reco comparison had 6 failed jobs
DQMHistoTests: Total files compared: 11
DQMHistoTests: Total histograms compared: 149371
DQMHistoTests: Total failures: 30232
DQMHistoTests: Total nulls: 10
DQMHistoTests: Total successes: 119129
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 10 files compared)
Checked 42 log files, 45 edm output root files, 11 DQM output files
TriggerResults: no differences found

NVIDIA_H100 Comparison Summary

There are some workflows for which there are errors in the baseline:
29834.402 step 2
29834.403 step 2
29834.404 step 2
29834.751 step 2
The results for the comparisons for these workflows could be incomplete
This means most likely that the IB is having errors in the relvals.The error does NOT come from this pull request

Summary:

You potentially added 120 lines to the logs
ROOTFileChecks: Some differences in event products or their sizes found
Reco comparison results: 200 differences found in the comparisons
Reco comparison had 6 failed jobs
DQMHistoTests: Total files compared: 8
DQMHistoTests: Total histograms compared: 87401
DQMHistoTests: Total failures: 9316
DQMHistoTests: Total nulls: 0
DQMHistoTests: Total successes: 78085
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 7 files compared)
Checked 36 log files, 41 edm output root files, 8 DQM output files
TriggerResults: no differences found

NVIDIA_L40S Comparison Summary

Summary:

You potentially added 12 lines to the logs
Reco comparison results: 220 differences found in the comparisons
Reco comparison had 6 failed jobs
DQMHistoTests: Total files compared: 11
DQMHistoTests: Total histograms compared: 149371
DQMHistoTests: Total failures: 35333
DQMHistoTests: Total nulls: 5
DQMHistoTests: Total successes: 114033
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 10 files compared)
Checked 42 log files, 45 edm output root files, 11 DQM output files
TriggerResults: no differences found

NVIDIA_T4 Comparison Summary

Summary:

You potentially added 1 lines to the logs
Reco comparison results: 248 differences found in the comparisons
Reco comparison had 6 failed jobs
DQMHistoTests: Total files compared: 11
DQMHistoTests: Total histograms compared: 149371
DQMHistoTests: Total failures: 29208
DQMHistoTests: Total nulls: 10
DQMHistoTests: Total successes: 120153
DQMHistoTests: Total skipped: 0
DQMHistoTests: Total Missing objects: 0
DQMHistoSizes: Histogram memory added: 0.0 KiB( 10 files compared)
Checked 42 log files, 45 edm output root files, 11 DQM output files
TriggerResults: no differences found

smuzaffar · 2026-01-06T11:33:28Z

@gartung , can you please check the max memory comparison job. It failed for h100 gpu . the link for max memory failed job is broken at https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-927186/50364/summary.html . It points to missing https://cmssdt.cern.ch/SDT/jenkins-artifacts/baseLineComparisonsNVIDIA_H100/CMSSW_16_0_X_2025-12-18-2300+927186/72479/maxmem-comparison/maxmem_summary.html file. In the log file https://cmssdt.cern.ch/SDT/jenkins-artifacts/baseLineComparisonsNVIDIA_H100/CMSSW_16_0_X_2025-12-18-2300+927186/72479/maxmem-comparison/maxmem_summary.log I see messages like

/data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/results/maxmem-comparison/29834.402.err:KeyError: 'step3'
/data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/results/maxmem-comparison/29834.403.err:KeyError: 'step3'
/data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/results/maxmem-comparison/29834.404.err:KeyError: 'step3'
/data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/results/maxmem-comparison/29834.751.err:KeyError: 'step3'

smuzaffar · 2026-01-06T11:34:22Z

@fwyzard , is this ready to go in or do you want to run more local tests?

smuzaffar · 2026-01-06T11:35:48Z

ah this is for 16.0.X, lets wait for this to be integrated in 16.1.X first

fwyzard · 2026-01-06T12:32:41Z

Both 16.1.x and 16.0.x PRs are good to go for me.

cmsbuild · 2026-01-19T09:38:04Z

REMINDER @mandrenguyen, @ftenchini, @sextonkennedy: This PR was tested with cms-sw/cmssw#49848, please check if they should be merged together

smuzaffar · 2026-01-19T15:28:48Z

+externals

@cms-sw/orp-l2 feel free to include it for next 16.0.X IB/release. Note that, to avoid compilation warnings, we also need cms-sw/cmssw#49848 to go with this change

cmsbuild · 2026-01-19T15:29:15Z

This pull request is fully signed and it will be integrated in one of the next IB/CMSSW_16_0_X/master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @mandrenguyen, @sextonkennedy, @ftenchini (and backports should be raised in the release meeting by the corresponding L2)

mandrenguyen · 2026-01-19T18:45:29Z

+1

cmsbuild added externals-pending pending-signatures tests-started orp-pending labels Dec 19, 2025

cmsbuild added the backport label Dec 19, 2025

cmsbuild added tests-approved and removed tests-started labels Dec 21, 2025

fwyzard mentioned this pull request Dec 21, 2025

Update alpaka to 2.1.1 "Fixed Transformation" #10250

Merged

cmsbuild added backport-ok and removed backport labels Jan 7, 2026

fwyzard mentioned this pull request Jan 16, 2026

Use the alpaka Acc concept [16.0.x] cms-sw/cmssw#49848

Merged

cmsbuild added externals-approved fully-signed and removed externals-pending pending-signatures labels Jan 19, 2026

cmsbuild added orp-approved and removed orp-pending labels Jan 19, 2026

cmsbuild merged commit 4ed7b22 into cms-sw:IB/CMSSW_16_0_X/master Jan 19, 2026
45 of 47 checks passed

fwyzard deleted the IB/CMSSW_16_0_X/master_alpaka_210 branch January 19, 2026 22:06

Conversation

fwyzard commented Dec 19, 2025

Uh oh!

fwyzard commented Dec 19, 2025

Uh oh!

fwyzard commented Dec 19, 2025

Uh oh!

cmsbuild commented Dec 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cmsbuild commented Dec 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fwyzard commented Dec 19, 2025

Uh oh!

cmsbuild commented Dec 21, 2025

Comparison Summary

AMD_MI300X Comparison Summary

AMD_W7900 Comparison Summary

NVIDIA_H100 Comparison Summary

NVIDIA_L40S Comparison Summary

NVIDIA_T4 Comparison Summary

Uh oh!

smuzaffar commented Jan 6, 2026

Uh oh!

smuzaffar commented Jan 6, 2026

Uh oh!

smuzaffar commented Jan 6, 2026

Uh oh!

fwyzard commented Jan 6, 2026

Uh oh!

cmsbuild commented Jan 19, 2026

Uh oh!

smuzaffar commented Jan 19, 2026

Uh oh!

cmsbuild commented Jan 19, 2026

Uh oh!

mandrenguyen commented Jan 19, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

cmsbuild commented Dec 19, 2025 •

edited

Loading

cmsbuild commented Dec 19, 2025 •

edited

Loading