text-generation-inference/server/text_generation_server/models/custom_modeling
2023-05-12 17:32:40 +02:00
__init__.py                   feat(server): flash santacoder (#153)            2023-04-03 19:06:42 +02:00
flash_llama_modeling.py       feat(server): GPTQ quantization (step1) (#277)   2023-05-12 14:46:41 +02:00
flash_neox_modeling.py        feat(server): GPTQ quantization (step1) (#277)   2023-05-12 14:46:41 +02:00
flash_santacoder_modeling.py  chore(docker): use nvidia base image (#318)      2023-05-12 17:32:40 +02:00