text-generation-inference/router/src/infer/v3/mod.rs
OlivierDehaene 184c89fd55 feat: add SchedulerV3 (#1996)
- Refactor code to allow supporting multiple versions of the
generate.proto at the same time
- Add v3/generate.proto (identical to generate.proto for now, but allows
future changes without impacting v2 backends)
- Add Schedule trait to abstract queuing and batching mechanisms that
will be different in the future
- Add SchedulerV2/V3 impl
2024-09-24 03:28:31 +00:00

mod queue;
mod scheduler;
pub(crate) use scheduler::SchedulerV3;
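
The layout above keeps `queue` and `scheduler` private to the `v3` module and re-exports only `SchedulerV3` to the rest of the crate. The sketch below illustrates that pattern together with the kind of trait the commit message describes. It is a minimal illustration, not the router's actual code: the `Schedule` trait's methods, the `Queue` type, and the use of `String` as a request are all assumptions made for the example.

```rust
/// Hypothetical stand-in for the `Schedule` trait named in the commit
/// message. The real trait's methods are not shown in this file.
pub trait Schedule {
    /// Enqueue one request (a plain `String` here, as an assumption).
    fn schedule(&mut self, request: String);
    /// Remove and return up to `max_size` queued requests, oldest first.
    fn next_batch(&mut self, max_size: usize) -> Vec<String>;
}

mod v3 {
    // Private submodule: only code inside `v3` can name `queue::Queue`.
    mod queue {
        use std::collections::VecDeque;

        /// Hypothetical FIFO queue of pending requests.
        #[derive(Default)]
        pub(crate) struct Queue {
            entries: VecDeque<String>,
        }

        impl Queue {
            pub(crate) fn push(&mut self, request: String) {
                self.entries.push_back(request);
            }

            /// Drain at most `max_size` entries from the front.
            pub(crate) fn pop_batch(&mut self, max_size: usize) -> Vec<String> {
                let n = max_size.min(self.entries.len());
                self.entries.drain(..n).collect()
            }
        }
    }

    // Private submodule holding the scheduler implementation.
    mod scheduler {
        use super::super::Schedule;
        use super::queue::Queue;

        #[derive(Default)]
        pub(crate) struct SchedulerV3 {
            queue: Queue,
        }

        impl Schedule for SchedulerV3 {
            fn schedule(&mut self, request: String) {
                self.queue.push(request);
            }

            fn next_batch(&mut self, max_size: usize) -> Vec<String> {
                self.queue.pop_batch(max_size)
            }
        }
    }

    // Same re-export as in the file above: the rest of the crate sees
    // `v3::SchedulerV3` without knowing about `queue` or `scheduler`.
    pub(crate) use scheduler::SchedulerV3;
}
```

Callers elsewhere in the crate would then write `v3::SchedulerV3::default()` and interact with it only through the trait, so a different queuing or batching strategy could be swapped in behind the same interface.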