text-generation-inference/router/src
Alex Weston 58a65f7914
Add request parameters to OTel span for /v1/chat/completions endpoint (#3000)
Record request parameters in OTel span for /v1/chat/completions endpoint
2025-03-10 12:26:57 +01:00
..
infer Fix tool call2 (#3076) 2025-03-07 19:45:57 +01:00
config.rs feat: add initial qwen2.5-vl model and test (#2971) 2025-02-19 12:38:20 +01:00
kserve.rs fix: include add_special_tokens in kserve request (#2859) 2024-12-19 16:55:17 -05:00
lib.rs Fix tool call2 (#3076) 2025-03-07 19:45:57 +01:00
logging.rs Rebase TRT-llm (#2331) 2024-07-31 10:33:10 +02:00
sagemaker.rs feat: allow any supported payload on /invocations (#2683) 2024-10-23 11:26:01 +00:00
server.rs Add request parameters to OTel span for /v1/chat/completions endpoint (#3000) 2025-03-10 12:26:57 +01:00
usage_stats.rs feat: Add the parsing of HF_HUB_USER_AGENT_ORIGIN environment variable for telemetry (#3027) 2025-02-19 21:09:12 +01:00
validation.rs feat: add initial qwen2.5-vl model and test (#2971) 2025-02-19 12:38:20 +01:00
vertex.rs Improve tool call message processing (#3036) 2025-02-21 10:30:29 +01:00