Skip to content

Update alpaka to 2.1.1 "Fixed Transformation" [16.0.x]#10260

Merged
cmsbuild merged 1 commit intocms-sw:IB/CMSSW_16_0_X/masterfrom
fwyzard:IB/CMSSW_16_0_X/master_alpaka_210
Jan 19, 2026
Merged

Update alpaka to 2.1.1 "Fixed Transformation" [16.0.x]#10260
cmsbuild merged 1 commit intocms-sw:IB/CMSSW_16_0_X/masterfrom
fwyzard:IB/CMSSW_16_0_X/master_alpaka_210

Conversation

@fwyzard
Copy link
Contributor

@fwyzard fwyzard commented Dec 19, 2025

This release implements a new interface, similar to std::transform, that simplifies writing asynchronous parallel algorithms across all back-ends. SYCL support is extended to NVIDIA and AMD GPUs.
The release introduces unified memory and expands asynchronous memory allocation to buffers of any dimension. Interoperability with standard C++ is improved through std::span support: alpaka buffers expose a span interface, and any std::span can be used as an alpaka view. It adds compile-time warp-size definitions, extends atomic increment and decrement operations and fixes their behaviour on CPU back-end; it introduces a C++ concept for alpaka accelerators together with new type traits, along with many smaller fixes and improvements. The CI has been updated to test newer operating systems and compilers, including Clang 20 and ROCm 6.3, 6.4, and 7.0.

The full list of changes is available in the ChangeLog.

This release implements a new interface, similar to std::transform, that
simplifies writing asynchronous parallel algorithms across all back-ends.
SYCL support is extended to NVIDIA and AMD GPUs.
The release introduces unified memory and expands asynchronous memory allocation
to buffers of any dimension. Interoperability with standard C++ is improved
through std::span support: alpaka buffers expose a span interface, and any
std::span can be used as an alpaka view. It adds compile-time warp-size
definitions, extends atomic increment and decrement operations and fixes their
behaviour on CPU back-end; it introduces a C++ concept for alpaka accelerators
together with new type traits, along with many smaller fixes and improvements.
The CI has been updated to test newer operating systems and compilers, including
Clang 20 and ROCm 6.3, 6.4, and 7.0.

The full list of changes is available in the ChangeLog.
@fwyzard
Copy link
Contributor Author

fwyzard commented Dec 19, 2025

enable gpu

@fwyzard
Copy link
Contributor Author

fwyzard commented Dec 19, 2025

please test

@cmsbuild
Copy link
Contributor

cmsbuild commented Dec 19, 2025

A new Pull Request was created by @fwyzard for branch IB/CMSSW_16_0_X/master.

@akritkbehera, @iarspider, @raoatifshad, @smuzaffar can you please review it and eventually sign? Thanks.
@ftenchini, @mandrenguyen, @sextonkennedy you are the release manager for this.
cms-bot commands are listed here

@cmsbuild
Copy link
Contributor

cmsbuild commented Dec 19, 2025

cms-bot internal usage

@fwyzard
Copy link
Contributor Author

fwyzard commented Dec 19, 2025

backport #10250

@cmsbuild
Copy link
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-927186/50364/summary.html
COMMIT: e3c6e9d
CMSSW: CMSSW_16_0_X_2025-12-18-2300/el8_amd64_gcc13
Additional Tests: GPU,AMD_MI300X,AMD_W7900,NVIDIA_H100,NVIDIA_L40S,NVIDIA_T4
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmsdist/10260/50364/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

  • You potentially added 20 lines to the logs
  • ROOTFileChecks: Some differences in event products or their sizes found
  • Reco comparison results: 6 differences found in the comparisons
  • Reco comparison had 4 failed jobs
  • DQMHistoTests: Total files compared: 53
  • DQMHistoTests: Total histograms compared: 4280393
  • DQMHistoTests: Total failures: 73
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 4280300
  • DQMHistoTests: Total skipped: 20
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 52 files compared)
  • Checked 227 log files, 198 edm output root files, 53 DQM output files
  • TriggerResults: no differences found

AMD_MI300X Comparison Summary

Summary:

  • You potentially removed 1 lines from the logs
  • Reco comparison results: 257 differences found in the comparisons
  • Reco comparison had 6 failed jobs
  • DQMHistoTests: Total files compared: 11
  • DQMHistoTests: Total histograms compared: 149371
  • DQMHistoTests: Total failures: 29404
  • DQMHistoTests: Total nulls: 11
  • DQMHistoTests: Total successes: 119956
  • DQMHistoTests: Total skipped: 0
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 10 files compared)
  • Checked 42 log files, 45 edm output root files, 11 DQM output files
  • TriggerResults: no differences found

AMD_W7900 Comparison Summary

Summary:

  • You potentially removed 8 lines from the logs
  • Reco comparison results: 240 differences found in the comparisons
  • Reco comparison had 6 failed jobs
  • DQMHistoTests: Total files compared: 11
  • DQMHistoTests: Total histograms compared: 149371
  • DQMHistoTests: Total failures: 30232
  • DQMHistoTests: Total nulls: 10
  • DQMHistoTests: Total successes: 119129
  • DQMHistoTests: Total skipped: 0
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 10 files compared)
  • Checked 42 log files, 45 edm output root files, 11 DQM output files
  • TriggerResults: no differences found

NVIDIA_H100 Comparison Summary

There are some workflows for which there are errors in the baseline:
29834.402 step 2
29834.403 step 2
29834.404 step 2
29834.751 step 2
The results for the comparisons for these workflows could be incomplete
This means most likely that the IB is having errors in the relvals.The error does NOT come from this pull request

Summary:

  • You potentially added 120 lines to the logs
  • ROOTFileChecks: Some differences in event products or their sizes found
  • Reco comparison results: 200 differences found in the comparisons
  • Reco comparison had 6 failed jobs
  • DQMHistoTests: Total files compared: 8
  • DQMHistoTests: Total histograms compared: 87401
  • DQMHistoTests: Total failures: 9316
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 78085
  • DQMHistoTests: Total skipped: 0
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 7 files compared)
  • Checked 36 log files, 41 edm output root files, 8 DQM output files
  • TriggerResults: no differences found

NVIDIA_L40S Comparison Summary

Summary:

  • You potentially added 12 lines to the logs
  • Reco comparison results: 220 differences found in the comparisons
  • Reco comparison had 6 failed jobs
  • DQMHistoTests: Total files compared: 11
  • DQMHistoTests: Total histograms compared: 149371
  • DQMHistoTests: Total failures: 35333
  • DQMHistoTests: Total nulls: 5
  • DQMHistoTests: Total successes: 114033
  • DQMHistoTests: Total skipped: 0
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 10 files compared)
  • Checked 42 log files, 45 edm output root files, 11 DQM output files
  • TriggerResults: no differences found

NVIDIA_T4 Comparison Summary

Summary:

  • You potentially added 1 lines to the logs
  • Reco comparison results: 248 differences found in the comparisons
  • Reco comparison had 6 failed jobs
  • DQMHistoTests: Total files compared: 11
  • DQMHistoTests: Total histograms compared: 149371
  • DQMHistoTests: Total failures: 29208
  • DQMHistoTests: Total nulls: 10
  • DQMHistoTests: Total successes: 120153
  • DQMHistoTests: Total skipped: 0
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 10 files compared)
  • Checked 42 log files, 45 edm output root files, 11 DQM output files
  • TriggerResults: no differences found

@smuzaffar
Copy link
Contributor

@gartung , can you please check the max memory comparison job. It failed for h100 gpu . the link for max memory failed job is broken at https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-927186/50364/summary.html . It points to missing https://cmssdt.cern.ch/SDT/jenkins-artifacts/baseLineComparisonsNVIDIA_H100/CMSSW_16_0_X_2025-12-18-2300+927186/72479/maxmem-comparison/maxmem_summary.html file. In the log file https://cmssdt.cern.ch/SDT/jenkins-artifacts/baseLineComparisonsNVIDIA_H100/CMSSW_16_0_X_2025-12-18-2300+927186/72479/maxmem-comparison/maxmem_summary.log I see messages like

/data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/results/maxmem-comparison/29834.402.err:KeyError: 'step3'
/data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/results/maxmem-comparison/29834.403.err:KeyError: 'step3'
/data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/results/maxmem-comparison/29834.404.err:KeyError: 'step3'
/data/cmsbld/jenkins/workspace/compare-root-files-short-matrix/results/maxmem-comparison/29834.751.err:KeyError: 'step3'

@smuzaffar
Copy link
Contributor

@fwyzard , is this ready to go in or do you want to run more local tests?

@smuzaffar
Copy link
Contributor

ah this is for 16.0.X, lets wait for this to be integrated in 16.1.X first

@fwyzard
Copy link
Contributor Author

fwyzard commented Jan 6, 2026

Both 16.1.x and 16.0.x PRs are good to go for me.

@cmsbuild
Copy link
Contributor

REMINDER @mandrenguyen, @ftenchini, @sextonkennedy: This PR was tested with cms-sw/cmssw#49848, please check if they should be merged together

@smuzaffar
Copy link
Contributor

+externals

@cms-sw/orp-l2 feel free to include it for next 16.0.X IB/release. Note that, to avoid compilation warnings, we also need cms-sw/cmssw#49848 to go with this change

@cmsbuild
Copy link
Contributor

This pull request is fully signed and it will be integrated in one of the next IB/CMSSW_16_0_X/master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @mandrenguyen, @sextonkennedy, @ftenchini (and backports should be raised in the release meeting by the corresponding L2)

@mandrenguyen
Copy link

+1

@cmsbuild cmsbuild merged commit 4ed7b22 into cms-sw:IB/CMSSW_16_0_X/master Jan 19, 2026
45 of 47 checks passed
@fwyzard fwyzard deleted the IB/CMSSW_16_0_X/master_alpaka_210 branch January 19, 2026 22:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants