text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-10-11 07:55:24 +00:00

OlivierDehaene 757223b352 feat: add SchedulerV3 (#1996)	2024-06-04 15:56:56 +02:00
- Refactor code to allow supporting multiple versions of the generate.proto at the same time
- Add v3/generate.proto (ISO to generate.proto for now but allow for future changes without impacting v2 backends)
- Add Schedule trait to abstract queuing and batching mechanisms that will be different in the future
- Add SchedulerV2/V3 impl
pb	feat: add SchedulerV3 (#1996)	2024-06-04 15:56:56 +02:00
client.rs	feat: add SchedulerV3 (#1996)	2024-06-04 15:56:56 +02:00
mod.rs	feat: add SchedulerV3 (#1996)	2024-06-04 15:56:56 +02:00
sharded_client.rs	feat: add SchedulerV3 (#1996)	2024-06-04 15:56:56 +02:00