text-generation-inference/backends
Nicolas Patry 849bd93dc3 Using an enum for flash backens (paged/flashdecoding/flashinfer) (#2385)
* Using an enum for flash backens (paged/flashdecoding/flashinfer)

* Early exit on server too.

* Clippy.

* Fix clippy and fmt.
2024-09-25 06:04:51 +00:00
..
client Rebase TRT-llm (#2331) 2024-09-25 05:55:39 +00:00
grpc-metadata Rebase TRT-llm (#2331) 2024-09-25 05:55:39 +00:00
trtllm Rebase TRT-llm (#2331) 2024-09-25 05:55:39 +00:00
v3 Using an enum for flash backens (paged/flashdecoding/flashinfer) (#2385) 2024-09-25 06:04:51 +00:00