text-generation-inference/router/src
Mohit Sharma 73e797528d
L4 fixes (#3161)
add fix
2025-04-14 22:13:53 +05:30
..
infer Fix tool call4 (#3094) 2025-03-12 09:28:47 +01:00
chat.rs Fix tool call4 (#3094) 2025-03-12 09:28:47 +01:00
config.rs L4 fixes (#3161) 2025-04-14 22:13:53 +05:30
kserve.rs fix: include add_special_tokens in kserve request (#2859) 2024-12-19 16:55:17 -05:00
lib.rs L4 fixes (#3161) 2025-04-14 22:13:53 +05:30
logging.rs Rebase TRT-llm (#2331) 2024-07-31 10:33:10 +02:00
sagemaker.rs feat: allow any supported payload on /invocations (#2683) 2024-10-23 11:26:01 +00:00
server.rs Fixing tokenization like https://github.com/huggingface/text-embeddin… (#3156) 2025-04-09 18:42:25 +02:00
usage_stats.rs Gaudi: clean cuda/rocm code in hpu backend, enable flat_hpu (#3113) 2025-04-14 15:58:13 +02:00
validation.rs L4 fixes (#3161) 2025-04-14 22:13:53 +05:30
vertex.rs Improve tool call message processing (#3036) 2025-02-21 10:30:29 +01:00