Add AMD ROCm GPU build and test CI infrastructure #1

Open
Geramy wants to merge 9 commits into master from geramy/rocm-build-tests-matrix

@Geramy (Member) commented Mar 3, 2026

  • Add ROCm chipsets to Matrix and create per chipset builds and CI Tests
  • Add self-hosted runner test jobs (test-rocm-linux, test-rocm-windows) for gfx1151/gfx1150
  • Add cleanup composite actions for Linux and Windows runners
  • Add runner heartbeat monitoring workflow
  • Configure ci/run.sh with ROCm environment (HIP_PLATFORM, LD_LIBRARY_PATH, cmake flags)
  • Add Windows ROCm build support to build.yml
  • Fix conditional expression syntax warnings in build.yml

@Geramy Geramy self-assigned this Mar 3, 2026
@Geramy Geramy requested a review from ramkrishna2910 March 3, 2026 18:50
@ramkrishna2910 ramkrishna2910 left a comment

Added comments.

ci/run.sh Outdated

This will affect the behavior of all builds, even non-ROCm ones. Please add a note about it.

Looks like this will apply to CUDA/Vulkan/CPU as well. Is that intended?
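
One way to address the concern raised here is to scope the ROCm settings to ROCm builds only. The following is a minimal sketch under stated assumptions, not the contents of the PR's ci/run.sh: the function name `configure_backend`, its arguments, the `CMAKE_EXTRA` variable, the `ROCM_PATH` fallback to `/opt/rocm`, and the choice of `AMDGPU_TARGETS` as the cmake variable are all hypothetical. Only `HIP_PLATFORM`, `LD_LIBRARY_PATH`, and the gfx1151/gfx1150 chipset names come from the PR description.

```shell
#!/usr/bin/env bash
# Illustrative sketch only: configure_backend, its arguments, and CMAKE_EXTRA
# are hypothetical names, not taken from the PR's ci/run.sh.

# Apply backend-specific settings. Only the "rocm" branch touches the
# environment, so CUDA/Vulkan/CPU builds are left as they were.
configure_backend() {
    local backend="$1"
    local gpu_target="${2:-gfx1151}"   # assumed default chipset
    CMAKE_EXTRA=""

    case "$backend" in
        rocm)
            # Assumes ROCm lives under ROCM_PATH, falling back to /opt/rocm.
            local rocm_root="${ROCM_PATH:-/opt/rocm}"
            export HIP_PLATFORM=amd
            export LD_LIBRARY_PATH="${rocm_root}/lib${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}"
            # AMDGPU_TARGETS is an assumed choice of cmake variable.
            CMAKE_EXTRA="-DAMDGPU_TARGETS=${gpu_target}"
            ;;
        *)
            # Non-ROCm backends: no ROCm environment, no extra flags.
            ;;
    esac
}

configure_backend rocm gfx1150
echo "cmake flags: ${CMAKE_EXTRA}"
```

Keeping the ROCm exports inside a single `case` branch means a reviewer can see at a glance that the other backends are untouched.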

Member Author

I am going to remove this; it serves no real purpose.

Member Author

It was originally added to make this more compatible with Windows.

* Introduce null-check to prevent SEGV

* Fix error message
@Geramy Geramy requested a review from ramkrishna2910 March 10, 2026 02:05
Geramy added 8 commits March 9, 2026 20:55
- Add self-hosted runner test jobs (test-rocm-linux, test-rocm-windows) for gfx1151/gfx1150
- Add cleanup composite actions for Linux and Windows runners
- Add runner heartbeat monitoring workflow
- Configure ci/run.sh with ROCm environment (HIP_PLATFORM, LD_LIBRARY_PATH, cmake flags)
- Add Windows ROCm build support to build.yml
- Fix conditional expression syntax warnings in build.yml
… should_build outputs to be specific. I have removed outputs.rocm_version from both CI steps and extracted resolve_rocm into a shared script for both jobs to use. Fixed the matrix and removed the FGGML_ROCM=1 flag from both ubuntu-rocm and windows-rocm, since it isn't a real flag. Also commented out the heartbeat runners.
…[^<]*\)<\/Key>.*/\1/gp'. This works on both Linux and Windows Git Bash.
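
The commit message above is truncated, so the exact expression is not recoverable from this page. As a rough illustration of the general technique only (pulling the text out of `<Key>` elements with tools available on both Linux and Git Bash), here is a sketch. The helper name `extract_keys`, the sample XML, and the grep-plus-sed pattern are assumptions and should not be read as the expression used in the PR.

```shell
#!/usr/bin/env bash
# Illustrative sketch only: extract_keys is a hypothetical helper, and this
# pattern is not the (truncated) expression from the commit message.

# Read XML on stdin and print the text of every <Key>...</Key> element,
# one per line.
extract_keys() {
    # grep -o isolates each element; sed then strips the surrounding tags.
    grep -o '<Key>[^<]*</Key>' | sed 's/<Key>\([^<]*\)<\/Key>/\1/'
}

# Made-up sample input with two keys on a single line.
printf '<List><Key>a/b.zip</Key><Key>c.tar.gz</Key></List>\n' | extract_keys
```

Isolating each element with `grep -o` first means the sketch handles several keys on one line, which a single greedy `sed` substitution would not.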
@Geramy Geramy force-pushed the geramy/rocm-build-tests-matrix branch from 6f435d0 to cd3b5fc Compare March 10, 2026 03:55
3 participants