text-generation-inference/router/src
drbh 73ebbd05f8 Pr 2451 ci branch (#2454)
* fix[router]: Fix tools not passed in chat template

Signed-off-by: GitHub <noreply@github.com>

* feat: improve default tool serialization and lints

* feat: refactor tool logic to include notify_error in prompt and adjust typing

* fix: adjust non tool template apply

* fix: simplify tool grammar logic and improve schema

* feat: avoid skip tool test and avoid empty tool prompts

* fix: increase test client timeout for grammar compilation tests

---------

Signed-off-by: GitHub <noreply@github.com>
Co-authored-by: Simone Rossi <simone.rossi.93@gmail.com>
2024-09-25 06:10:59 +00:00
..
infer Pr 2451 ci branch (#2454) 2024-09-25 06:10:59 +00:00
config.rs add gptj modeling in TGI #2366 (CI RUN) (#2372) 2024-09-25 05:55:39 +00:00
kserve.rs fix: simplify kserve endpoint and fix imports (#2119) 2024-09-24 03:55:04 +00:00
lib.rs Pr 2451 ci branch (#2454) 2024-09-25 06:10:59 +00:00
logging.rs Rebase TRT-llm (#2331) 2024-09-25 05:55:39 +00:00
main.rs.back Rebase TRT-llm (#2331) 2024-09-25 05:55:39 +00:00
server.rs Pr 2451 ci branch (#2454) 2024-09-25 06:10:59 +00:00
usage_stats.rs refactor usage stats (#2339) 2024-09-25 05:55:39 +00:00
validation.rs feat: validate template variables before apply and improve sliding wi… (#2403) 2024-09-25 06:05:43 +00:00