Default Branch

8f8819795f · Fixing CI (#3184) · Updated 2025-04-18 11:07:18 +00:00

Branches

5b27307438 · Don't error on OpenAI valid top_p values. · Updated 2024-07-12 20:22:23 +00:00    Leaf

492
1

5c69639f74 · add condition different than PR · Updated 2024-07-12 11:19:52 +00:00    Leaf

500
8

4dfdb481fb · Version 2.1.1 · Updated 2024-07-04 10:39:07 +00:00    Leaf

517
1

fe3991e857 · feat: add simple ttft load_test · Updated 2024-07-02 15:57:01 +00:00    Leaf

522
1

cb232a35a9 · feat: add test to view batch speedup amount · Updated 2024-07-02 13:33:26 +00:00    Leaf

522
1

0a5b19a3ed · updated doc · Updated 2024-07-02 13:10:26 +00:00    Leaf

566
22

dea9c0dc74 · Fixing rocm. (#2164) · Updated 2024-07-02 10:01:08 +00:00    Leaf

524
0
Included

88e2a6a23a · fix: avoid loading mistral adapters in mixtral · Updated 2024-07-01 19:49:05 +00:00    Leaf

530
1

9815feb2e3 · Revert "Update devcontainer to use correct update content command path" · Updated 2024-06-28 13:26:45 +00:00    Leaf

547
13

192d49af0b · 2.1.0 names for release. · Updated 2024-06-28 06:20:59 +00:00    Leaf

541
1

02ac45131f · some cleaning · Updated 2024-06-27 13:33:35 +00:00    Leaf

547
4

2bcc87bb02 · add dummy backend · Updated 2024-06-26 13:39:28 +00:00    Leaf

547
5
ci2

c45551cfc4 · Using new cache. · Updated 2024-06-26 13:21:03 +00:00    Leaf

547
1

0dcf31a749 · Fixing gemma2. · Updated 2024-06-26 13:02:56 +00:00    Leaf

547
1

7947c347b7 · exl2 phi does not use packed QKV/gate-up projections · Updated 2024-06-26 08:38:08 +00:00    Leaf

547
1

a7556ba800 · fix: refactors and helpful comments · Updated 2024-06-24 13:39:56 +00:00    Leaf

569
36

65506e19bf · update dockerfile · Updated 2024-06-20 15:36:46 +00:00    Leaf

671
1

56b16614de · continue refactoring · Updated 2024-06-20 14:59:38 +00:00    Leaf

566
2

48010f14b5 · fix: re update the docs · Updated 2024-06-20 01:05:47 +00:00    Leaf

568
8

c1125781e0 · Try something · Updated 2024-06-19 07:33:45 +00:00    Leaf

569
1