text-generation-inference/backends/v3/src
2024-08-21 09:06:54 +02:00
..
client Add support for prefix caching to the v3 router (#2392) 2024-08-12 14:59:17 +02:00
backend.rs Making prefix/flashinfer the default and testing the full release tests. 2024-08-21 09:06:54 +02:00
block_allocator.rs Keeping the benchmark somewhere (#2401) 2024-08-12 15:22:02 +02:00
lib.rs Keeping the benchmark somewhere (#2401) 2024-08-12 15:22:02 +02:00
main.rs Pr 2352 ci branch (#2382) 2024-08-09 10:54:32 +02:00
queue.rs Prefix caching (#2402) 2024-08-20 11:15:30 +02:00
radix.rs Prefix caching (#2402) 2024-08-20 11:15:30 +02:00