text-generation-inference/router/src
2024-08-09 14:52:59 +00:00
infer Using an enum for flash backens (paged/flashdecoding/flashinfer) (#2385) 2024-08-09 16:41:17 +02:00
config.rs add gptj modeling in TGI #2366 (CI RUN) (#2372) 2024-08-07 21:32:37 -04:00
kserve.rs fix: simplify kserve endpoint and fix imports (#2119) 2024-06-25 19:30:10 -04:00
lib.rs Using an enum for flash backens (paged/flashdecoding/flashinfer) (#2385) 2024-08-09 16:41:17 +02:00
logging.rs Rebase TRT-llm (#2331) 2024-07-31 10:33:10 +02:00
main.rs.back Rebase TRT-llm (#2331) 2024-07-31 10:33:10 +02:00
server.rs feat: return the generated text when parsing fails (#2353) 2024-08-06 13:10:19 -04:00
usage_stats.rs refactor usage stats (#2339) 2024-07-31 16:29:07 +02:00
validation.rs Prefix caching WIP 2024-08-09 14:52:59 +00:00