text-generation-inference/router/src/infer/v2
Nicolas Patry 7a48a84784
Using an enum for flash backens (paged/flashdecoding/flashinfer) (#2385)
* Using an enum for flash backens (paged/flashdecoding/flashinfer)

* Early exit on server too.

* Clippy.

* Fix clippy and fmt.
2024-08-09 16:41:17 +02:00
..
mod.rs Rebase TRT-llm (#2331) 2024-07-31 10:33:10 +02:00
queue.rs Pr 2352 ci branch (#2382) 2024-08-09 10:54:32 +02:00
scheduler.rs Using an enum for flash backens (paged/flashdecoding/flashinfer) (#2385) 2024-08-09 16:41:17 +02:00