RFC: Better API and control over deciding which worker is selected. #5465
michaelfeil
started this conversation in
General
Replies: 1 comment
-
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
We are looking to improve the routing logic.
Specifically the lines:
dynamo/lib/llm/src/kv_router/scheduler.rs
Lines 470 to 521 in 0b7e127
Understood, there is a full API for the
worker_selector, but the e.g. softmax sample and the following lines are helpful.If I want to implement various versions of it, it would be helpful to
Beta Was this translation helpful? Give feedback.
All reactions