text-generation-inference/launcher/src
Nicolas Patry 57b3495823
Fixing exl2 and other quantize tests again. (#2419)
* Fixing exl2 and other quantize tests again.

* Mark exl2 as non release (so CI tests them, needs to be removed later).

* Fixing exl2 (by disabling cuda graphs)

* Fix quantization defaults without cuda graphs on exl2 (linked to new
issues with it).

* Removing serde override.

* Go back to released exl2 and remove log.

* Adding warnings for deprecated bitsandbytes + upgrade info to warn.
2024-08-15 11:12:51 +02:00
..
env_runtime.rs add intel xpu support for TGI (#1475) 2024-04-26 15:48:58 +02:00
main.rs Fixing exl2 and other quantize tests again. (#2419) 2024-08-15 11:12:51 +02:00