text-generation-inference/backends/v3/src
Nicolas Patry 0a495ad118
Updating the kvrouter to support roundrobin
for comparison, still with the overloading checks
2025-01-31 09:08:55 +01:00
client Revert "feat: improve qwen2-vl startup " (#2924) 2025-01-17 12:09:05 -05:00
backend.rs Add backend name to telemetry (#2962) 2025-01-28 16:53:16 +01:00
block_allocator.rs Upgrading our rustc version. (#2908) 2025-01-15 17:04:03 +01:00
lib.rs Choosing input/total tokens automatically based on available VRAM? (#2673) 2024-10-28 04:59:49 +01:00
main.rs feat: add payload limit (#2726) 2024-11-21 18:20:15 +00:00
queue.rs Upgrading our rustc version. (#2908) 2025-01-15 17:04:03 +01:00
radix.rs Updating the kvrouter to support roundrobin 2025-01-31 09:08:55 +01:00