text-generation-inference/backends/client/src/v3
2024-12-23 13:47:18 -05:00
..
client.rs Choosing input/total tokens automatically based on available VRAM? (#2673) 2024-10-28 04:59:49 +01:00
mod.rs feat: support video input chunks and enable qwen2 vl to process video 2024-12-23 13:47:18 -05:00
sharded_client.rs Choosing input/total tokens automatically based on available VRAM? (#2673) 2024-10-28 04:59:49 +01:00