text-generation-inference/router/client/src
OlivierDehaene 532146338b
feat(router): add max_batch_size (#1542)
Some hardware requires a maximum batch size.
2024-02-09 12:38:41 +01:00
pb Init 2022-10-08 12:30:12 +02:00
client.rs feat(router): add max_batch_size (#1542) 2024-02-09 12:38:41 +01:00
lib.rs Speculative (#1308) 2023-12-11 12:46:30 +01:00
sharded_client.rs feat(router): add max_batch_size (#1542) 2024-02-09 12:38:41 +01:00