mirror of
				https://github.com/huggingface/text-generation-inference.git
				synced 2025-10-20 12:25:23 +00:00 
			
		
		
		
	| * fix(server): fix fp8 weight loading * fixed scales loading * update snap * revert default dtype | ||
|---|---|---|
| .. | ||
| test_flash_llama_fp8_all_params.json | ||
| test_flash_llama_fp8_load.json | ||
| test_flash_llama_fp8.json | ||