text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-10-08 22:45:23 +00:00

OlivierDehaene 8a8f43410d chore(docker): use nvidia base image (#318)	2023-05-12 17:32:40 +02:00
__init__.py	feat(server): flash santacoder (#153)	2023-04-03 19:06:42 +02:00
flash_llama_modeling.py	feat(server): GPTQ quantization (step1) (#277)	2023-05-12 14:46:41 +02:00
flash_neox_modeling.py	feat(server): GPTQ quantization (step1) (#277)	2023-05-12 14:46:41 +02:00
flash_santacoder_modeling.py	chore(docker): use nvidia base image (#318)	2023-05-12 17:32:40 +02:00