huggingface/text-generation-inference
Mirror of https://github.com/huggingface/text-generation-inference.git, synced 2025-04-21 14:52:20 +00:00
0ff8219fdb
text-generation-inference/server/text_generation_server/utils/gptq
Felix Marty
ee7ba48b9a
add exllama gptq kernel
2023-07-05 15:43:42 +00:00
custom_autotune.py
feat(server): Add inference support for GPTQ (llama + falcon tested) + Quantization script (#438)
2023-06-26 12:27:01 +02:00
quant_linear.py
add exllama gptq kernel
2023-07-05 15:43:42 +00:00
quantize.py
feat(server): Add inference support for GPTQ (llama + falcon tested) + Quantization script (#438)
2023-06-26 12:27:01 +02:00