text-generation-inference/integration-tests/models/__snapshots__/test_flash_llama_fp8
OlivierDehaene a7515b8af1 fix(server): fix fp8 weight loading (#2268)
* fix(server): fix fp8 weight loading

* fixed scales loading

* update snap

* revert default dtype
2024-09-25 05:31:08 +00:00
..
test_flash_llama_fp8_all_params.json fix(server): fix fp8 weight loading (#2268) 2024-09-25 05:31:08 +00:00
test_flash_llama_fp8_load.json Add FP8 release test (#2261) 2024-09-25 05:29:35 +00:00
test_flash_llama_fp8.json Add FP8 release test (#2261) 2024-09-25 05:29:35 +00:00