text-generation-inference/backends/v3/src
Wang, Yi d658b5def3
Deepseek R1 for Gaudi backend (#3211)
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
2025-05-19 16:36:39 +02:00
..
client Revert "feat: improve qwen2-vl startup " (#2924) 2025-01-17 12:09:05 -05:00
backend.rs Add backend name to telemetry (#2962) 2025-01-28 16:53:16 +01:00
block_allocator.rs Warmup gaudi backend (#3172) 2025-04-24 09:57:08 +02:00
lib.rs Choosing input/total tokens automatically based on available VRAM? (#2673) 2024-10-28 04:59:49 +01:00
main.rs Add option to configure prometheus port (#3187) 2025-04-23 20:43:25 +05:30
queue.rs Deepseek R1 for Gaudi backend (#3211) 2025-05-19 16:36:39 +02:00
radix.rs Add property-based testing for RadixAllocator (#3068) 2025-03-04 15:09:46 +01:00