Skip to content

cutlass profiler - align emitted SFA/SFB kernel naming with typical convention#2942

Open
aidando73 wants to merge 1 commit intoNVIDIA:mainfrom
aidando73:aidand-swap-vecs
Open

cutlass profiler - align emitted SFA/SFB kernel naming with typical convention#2942
aidando73 wants to merge 1 commit intoNVIDIA:mainfrom
aidando73:aidand-swap-vecs

Conversation

@aidando73
Copy link
Contributor

This one tripped me up a bit - typically we denote A.shape=m x k and B.shape=k x n, but the scales in SFB are ordered backwards (n x k)

This PR swaps the SFA/SFB naming order so it follows the standard m × k, k × n convention.

Note: This is a backward-incompatible change for kernel filters but my sense is that SFA/SFB isn’t commonly used for filtering, but let me know if we should add any compatibility handling.

cc @hwu36 @depaulmillz

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant