Commit Graph

8 Commits

Author SHA1 Message Date
Morgan Funtowicz
003163a2b9 backend(vllm): map ResultOutput to InferStreamResponse to stream back to the client 2025-01-30 16:12:52 +01:00
Morgan Funtowicz
32dffcff60 backend(vllm): expose FFI for CompletionOutput and RequestOutput on Rust side 2025-01-30 13:35:21 +01:00
Morgan Funtowicz
7028f5bce2 backend(vllm): make v1 the default 2025-01-29 17:01:20 +01:00
Morgan Funtowicz
dc5addae81 backend(vllm): remove python print stmt 2025-01-27 22:43:16 +01:00
Morgan Funtowicz
a7c2a470d6 backend(vllm): submit new request to vLLM engine 2025-01-27 22:39:35 +01:00
Morgan Funtowicz
02e4b9ab32 backend(vllm): plug in the tokio server and CLI 2025-01-24 10:41:07 +01:00
Morgan Funtowicz
bd2ec03d53 backend(vllm): statically allocate LLMEngine 2025-01-22 22:15:33 +01:00
Morgan Funtowicz
cfd22726c9 backend(vllm): initial commit 2025-01-21 23:37:56 +01:00