Wang, Yi
74edda9c23
update to metrics 0.23.0 or could work with metrics-exporter-promethe… ( #2190 )
...
update to metrics 0.23.0 or could work with metrics-exporter-prometheus 0.15.1
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
2024-09-25 05:21:34 +00:00
drbh
381c5c02a6
fix: prefer serde structs over custom functions ( #2127 )
...
* fix: prefer enum for chat object
* fix: adjust typo
* fix: enum CompletionType not ObjectType
* fix: adjust typo
* feat: leverage serde for conditional deser
* fix: adjust HubTokenizerConfig after rebase
* fix: update create_post_processor logic for token type
* fix: adjust unwrap syntax in template
* Fixing the post processor.
---------
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
2024-09-24 03:57:32 +00:00
drbh
d0a1d50fd3
PR #2049 CI run ( #2054 )
...
* Use minijinja's pycompat mode for python methods
* fix: cargo fmt lint for pre commit
---------
Co-authored-by: Armin Ronacher <armin.ronacher@active-4.com>
2024-09-24 03:42:29 +00:00
OlivierDehaene
184c89fd55
feat: add SchedulerV3 ( #1996 )
...
- Refactor code to allow supporting multiple versions of the
generate.proto at the same time
- Add v3/generate.proto (ISO to generate.proto for now but allow for
future changes without impacting v2 backends)
- Add Schedule trait to abstract queuing and batching mechanisms that
will be different in the future
- Add SchedulerV2/V3 impl
2024-09-24 03:28:31 +00:00