text-generation-inference/integration-tests/models/__snapshots__/test_flash_mixtral_gptq
Daniël de Kok 7f54b7336a
Test Marlin MoE with desc_act=true (#2622)
Update the Mixtral GPTQ test to use a model with `desc_act=true` and
`group_size!=-1` to ensure that we are checking activation
sorting/non-full K (with tensor parallelism). The `desc_act=false` case
is already checked by the Mixtral AWQ test.
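
The condition named above can be written as a small check. This is a minimal sketch, not code from the repository: the helper name `exercises_act_order_path` and the sample dictionaries are hypothetical, and it assumes the quantization settings are available as a dict with the usual GPTQ `desc_act` and `group_size` keys.

```python
def exercises_act_order_path(quantize_config: dict) -> bool:
    """Return True when a config matches the case the commit message describes:
    desc_act enabled together with a group size other than -1.

    Hypothetical helper for illustration only; it is not part of the test suite.
    """
    desc_act = bool(quantize_config.get("desc_act", False))
    group_size = int(quantize_config.get("group_size", -1))
    return desc_act and group_size != -1


# Hypothetical sample configs (values are assumptions, not read from any model).
with_act_order = {"bits": 4, "group_size": 128, "desc_act": True}
without_act_order = {"bits": 4, "group_size": 128, "desc_act": False}
full_k_groups = {"bits": 4, "group_size": -1, "desc_act": True}

print(exercises_act_order_path(with_act_order))
print(exercises_act_order_path(without_act_order))
print(exercises_act_order_path(full_k_groups))
```

Only the first sample satisfies both conditions; the second corresponds to the `desc_act=false` case that the commit message says the Mixtral AWQ test already covers.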
2024-10-21 12:50:35 +02:00
test_flash_mixtral_gptq_all_params.json Test Marlin MoE with desc_act=true (#2622) 2024-10-21 12:50:35 +02:00
test_flash_mixtral_gptq_load.json Test Marlin MoE with desc_act=true (#2622) 2024-10-21 12:50:35 +02:00
test_flash_mixtral_gptq.json Test Marlin MoE with desc_act=true (#2622) 2024-10-21 12:50:35 +02:00