text-generation-inference/router/src
regisss f208ba6afc
Fix HF_HUB_OFFLINE=1 for Gaudi backend (#3193)
* Fix `HF_HUB_OFFLINE=1` for Gaudi backend

* Fix HF cache default value in server.rs

* Format
2025-05-06 10:47:53 +02:00
infer Skip {% generation %} and {% endgeneration %} template handling (#3204) 2025-05-01 12:13:17 +02:00
chat.rs Fix tool call4 (#3094) 2025-03-12 09:28:47 +01:00
config.rs Fixing the router + template for Qwen3. (#3200) 2025-04-29 16:29:26 +02:00
kserve.rs fix: include add_special_tokens in kserve request (#2859) 2024-12-19 16:55:17 -05:00
lib.rs Pr 2982 ci branch (#3046) 2025-05-01 10:17:16 -04:00
logging.rs Get opentelemetry trace id from request headers instead of creating a new trace (#2648) 2025-04-18 09:06:41 +02:00
sagemaker.rs Fixing CI (#3184) 2025-04-18 13:07:18 +02:00
server.rs Fix HF_HUB_OFFLINE=1 for Gaudi backend (#3193) 2025-05-06 10:47:53 +02:00
usage_stats.rs Gaudi: clean cuda/rocm code in hpu backend, enable flat_hpu (#3113) 2025-04-14 15:58:13 +02:00
validation.rs Pr 2982 ci branch (#3046) 2025-05-01 10:17:16 -04:00
vertex.rs Get opentelemetry trace id from request headers instead of creating a new trace (#2648) 2025-04-18 09:06:41 +02:00