text-generation-inference/integration-tests/models/__snapshots__/test_flash_llama_fp8
OlivierDehaene 4844ff790a
fix(server): fix fp8 weight loading (#2268)
* fix(server): fix fp8 weight loading

* fixed scales loading

* update snap

* revert default dtype
2024-07-22 15:51:32 +00:00
..
test_flash_llama_fp8_all_params.json fix(server): fix fp8 weight loading (#2268) 2024-07-22 15:51:32 +00:00
test_flash_llama_fp8_load.json Add FP8 release test (#2261) 2024-07-20 10:26:06 +00:00
test_flash_llama_fp8.json Add FP8 release test (#2261) 2024-07-20 10:26:06 +00:00