Morgan Funtowicz
|
cdac4b0058
|
chore(looper): cleanup a bit more
|
2024-10-22 09:52:05 +02:00 |
|
Morgan Funtowicz
|
04c6f51258
|
feat(trtllm): rewrite health to not account for current state
|
2024-10-22 09:52:05 +02:00 |
|
Morgan Funtowicz
|
d73401ac73
|
chore(rebase): fix invalid references
|
2024-10-21 21:44:28 +02:00 |
|
Morgan Funtowicz
|
8d1c3c8ad4
|
feat(trtllm): do not tokenize twice
|
2024-10-21 15:06:54 +02:00 |
|
Morgan Funtowicz
|
1a3da05f34
|
misc(router): remove SchedulingError
|
2024-10-21 14:57:19 +02:00 |
|
Morgan Funtowicz
|
6687c06a21
|
feat(looper): minor optimizations to avoid growing too much the containers
|
2024-10-21 10:00:27 +02:00 |
|
Morgan Funtowicz
|
629153b44b
|
feat(looper): check engine and executorWorker paths exist before creating the backend
|
2024-10-21 10:00:27 +02:00 |
|
Morgan Funtowicz
|
c8a99af6c9
|
(fix): do not recreate the stateful hashmap at every it
|
2024-10-21 10:00:27 +02:00 |
|
Morgan Funtowicz
|
213acc6e34
|
(misc) move to latest trtllm
|
2024-10-21 10:00:27 +02:00 |
|
Morgan Funtowicz
|
984ae9798f
|
(post) impl postprocessing
|
2024-10-21 10:00:27 +02:00 |
|
Morgan Funtowicz
|
fa63db0d07
|
(scheduler) rework submit/pull logic
|
2024-10-21 10:00:27 +02:00 |
|
Morgan Funtowicz
|
b1846fb4e6
|
(backend) refactor & cleanup
|
2024-10-21 10:00:27 +02:00 |
|
Morgan Funtowicz
|
8e648ce425
|
(ffi) fix usage of wrong vector constructor making a capacity fill call
|
2024-10-21 10:00:27 +02:00 |
|
Morgan Funtowicz
|
291eaa99fb
|
use blocking_recv in looper to consume awaiting_requests at max before pulling in a single step
|
2024-10-21 10:00:27 +02:00 |
|
Morgan Funtowicz
|
7bebc629af
|
(misc) missing Result types for Rust
|
2024-10-21 10:00:27 +02:00 |
|
Morgan Funtowicz
|
c2e21d8725
|
(backend) implement the post_processor background thread
|
2024-10-21 10:00:27 +02:00 |
|
Morgan Funtowicz
|
0dca168bcb
|
(misc) change scope identifiers
|
2024-10-21 10:00:27 +02:00 |
|
Morgan Funtowicz
|
933ab67aa1
|
(ffi) encode the provided user prompt within each request thread
|
2024-10-21 10:00:27 +02:00 |
|
Morgan Funtowicz
|
fb759bdd2a
|
(looper) new looper initial implementation
|
2024-10-21 10:00:27 +02:00 |
|