text-generation-inference/router/client
Karol Damaszke bf5263b88b
Disable watermark with FP8 quantization (#114)
Co-authored-by: Karol Damaszke <kdamaszke@habana.ai>
2024-03-27 13:32:20 +01:00
src         Disable watermark with FP8 quantization (#114)            2024-03-27 13:32:20 +01:00
build.rs    feat: add distributed tracing (#62)                       2023-02-13 13:02:45 +01:00
Cargo.toml  Add warmup for all possible shapes for prefill #49 (#81)  2024-02-28 10:40:13 +01:00