text-generation-inference/backends
Funtowicz Morgan c3401e0b99 More fixes trtllm (#2342)
* (backend) use parking_lot crate for RwLock fairness

* (docker) let's put rust in the TRTLLM folder when building

* (docker) build ompi with SLURM support

* (launcher) default new server::run parameters to false for now

* (chore) fmt ... why?
2024-09-25 06:08:00 +00:00
..
client Add support for prefix caching to the v3 router (#2392) 2024-09-25 06:05:08 +00:00
grpc-metadata Rebase TRT-llm (#2331) 2024-09-25 05:55:39 +00:00
trtllm More fixes trtllm (#2342) 2024-09-25 06:08:00 +00:00
v3 Keeping the benchmark somewhere (#2401) 2024-09-25 06:05:43 +00:00