This website requires JavaScript.
Explore
Help
Sign In
huggingface
/
text-generation-inference
Watch
5
Star
0
Fork
0
You've already forked text-generation-inference
mirror of
https://github.com/huggingface/text-generation-inference.git
synced
2025-04-26 12:32:10 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
16162602c2
text-generation-inference
/
server
/
text_generation_server
/
layers
/
moe
History
Mohit Sharma
16162602c2
Add fp8 support moe models
2025-01-20 13:55:54 +00:00
..
__init__.py
Add fp8 support moe models
2025-01-20 13:55:54 +00:00
fp8.py
Add fp8 support moe models
2025-01-20 13:55:54 +00:00
gptq_marlin.py
Add support for fused MoE Marlin for AWQ (
#2616
)
2024-10-08 11:56:41 +02:00
unquantized.py
Add fp8 support moe models
2025-01-20 13:55:54 +00:00