text-generation-inference/backends/v3/src
2025-05-05 17:34:03 -04:00
client              feat: support logit bias in chat request                                    2025-05-05 17:34:03 -04:00
backend.rs          Add backend name to telemetry (#2962)                                       2025-01-28 16:53:16 +01:00
block_allocator.rs  Warmup gaudi backend (#3172)                                                2025-04-24 09:57:08 +02:00
lib.rs              Choosing input/total tokens automatically based on available VRAM? (#2673)  2024-10-28 04:59:49 +01:00
main.rs             Add option to configure prometheus port (#3187)                             2025-04-23 20:43:25 +05:30
queue.rs            feat: support logit bias in chat request                                    2025-05-05 17:34:03 -04:00
radix.rs            Add property-based testing for RadixAllocator (#3068)                       2025-03-04 15:09:46 +01:00