Commit Graph

3 Commits

Author SHA1 Message Date
drbh
3f12750a18 fix: marlin repeat scale for fp8 and bump snapshots 2024-08-12 11:48:07 -04:00
OlivierDehaene
4844ff790a
fix(server): fix fp8 weight loading (#2268)
* fix(server): fix fp8 weight loading

* fixed scales loading

* update snap

* revert default dtype
2024-07-22 15:51:32 +00:00
Daniël de Kok
e5c1d6d611
Add FP8 release test (#2261) 2024-07-20 10:26:06 +00:00