Morgan Funtowicz
|
003163a2b9
|
backend(vllm): map ResultOutput to InferStreamResponse to stream back to the client
|
2025-01-30 16:12:52 +01:00 |
|
Morgan Funtowicz
|
32dffcff60
|
backend(vllm): expose FFI for CompletionOutput and RequestOutput on Rust side
|
2025-01-30 13:35:21 +01:00 |
|
Morgan Funtowicz
|
7028f5bce2
|
backend(vllm): make v1 the default
|
2025-01-29 17:01:20 +01:00 |
|
Morgan Funtowicz
|
dc5addae81
|
backend(vllm): remove python print stmt
|
2025-01-27 22:43:16 +01:00 |
|
Morgan Funtowicz
|
a7c2a470d6
|
backend(vllm): submit new request to vLLM engine
|
2025-01-27 22:39:35 +01:00 |
|
Morgan Funtowicz
|
02e4b9ab32
|
backend(vllm): plug in the tokio server and CLI
|
2025-01-24 10:41:07 +01:00 |
|
Morgan Funtowicz
|
bd2ec03d53
|
backend(vllm): statically allocate LLMEngine
|
2025-01-22 22:15:33 +01:00 |
|
Morgan Funtowicz
|
cfd22726c9
|
backend(vllm): initial commit
|
2025-01-21 23:37:56 +01:00 |
|