Author | Commit | Message | Date
OlivierDehaene | 4acc42a605 | fix(server): better handling of inference mode (#57) | 2023-02-07 15:38:22 +01:00
OlivierDehaene | 20c3c5940c | feat(router): refactor API and add openAPI schemas (#53) | 2023-02-03 12:43:37 +01:00
OlivierDehaene | df227ac20d | fix(server): allow greedy repetition penalty (#51) | 2023-02-02 10:34:35 +01:00
OlivierDehaene | 775115e3a5 | feat(server): allow the server to use a local weight cache (#49) | 2023-02-01 16:22:10 +01:00
OlivierDehaene | 313194f6d7 | feat(server): support repetition penalty (#47) | 2023-02-01 15:58:42 +01:00
OlivierDehaene | 2ad895a6cc | feat(server): allow gpt-neox models with odd vocab sizes to be sharded (#48) | 2023-02-01 14:43:59 +01:00
OlivierDehaene | f830706b21 | feat(server): Support GPT-Neox (#39) | 2023-01-31 18:53:56 +01:00
OlivierDehaene | 54fec93193 | fix(server): fix seeding with multiple shards (#44) | 2023-01-31 16:01:15 +01:00
OlivierDehaene | 03bdf18290 | fix(server): fix seeding on gpu (#42) | 2023-01-31 14:30:33 +01:00
OlivierDehaene | cd298bc5e5 | feat: Support sampling seeding (#37) Co-authored-by: Yannic Kilcher <yk@users.noreply.github.com> | 2023-01-30 15:36:16 +01:00
OlivierDehaene | 15511edc01 | feat(server): Support SantaCoder (#26) | 2023-01-20 12:24:39 +01:00
Nick Hill | e6d3eb5d5d | fix(server): Minor refactorization using new_zeros (#24) - Fix some type hints, in particular base tokenizer class - Make use of `tensor.new_zero/empty` methods - Simplify env var string parsing in launcher | 2023-01-17 09:10:22 +01:00
OlivierDehaene | 611e21cb13 | fix(server): Fix stop sequences (#11) | 2022-12-16 16:03:39 +01:00
OlivierDehaene | 32a253063d | feat: Return logprobs (#8) | 2022-12-15 17:03:56 +01:00
OlivierDehaene | 718096f695 | feat: Support stop sequences (#7) | 2022-12-12 18:25:22 +01:00
OlivierDehaene | a2985036aa | feat(server): Add model tests (#6) | 2022-12-08 18:49:33 +01:00
OlivierDehaene | daa1d81d5e | feat(server): Support Galactica (#4) | 2022-12-01 19:31:54 +01:00
OlivierDehaene | c5665f5c8b | feat(server): Support generic AutoModelForCausalLM | 2022-11-04 14:22:47 +01:00
OlivierDehaene | b3b7ea0d74 | feat: Use json formatter by default in docker image | 2022-11-02 17:29:56 +01:00
OlivierDehaene | 3cf6368c77 | feat(server): Support all AutoModelForCausalLM on a best effort basis | 2022-10-28 19:24:00 +02:00