text-generation-inference/server
Mohit Sharma 87a0af4ec2
Update transformers to 4.51 (#3148)
* update transformres

* Upgrading the nix deps too.

* Forcing torchvision to be in there.

* Fixing bug in mllama.

* Those tests cannot be run in CI.

* Lint.

---------

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
Co-authored-by: Nicolas Patry <patry.nicolas@protonmail.com>
2025-04-07 12:55:43 +02:00
..
custom_kernels
exllama_kernels
exllamav2_kernels
tests
text_generation_server Update transformers to 4.51 (#3148) 2025-04-07 12:55:43 +02:00
.gitignore
bounds-from-nix.py
kernels.lock
Makefile
Makefile-awq
Makefile-eetq
Makefile-exllamav2
Makefile-flash-att
Makefile-flash-att-v2
Makefile-flashinfer
Makefile-lorax-punica
Makefile-selective-scan
Makefile-vllm
pyproject.toml Update transformers to 4.51 (#3148) 2025-04-07 12:55:43 +02:00
README.md
req.txt
requirements_cuda.txt
requirements_gen.txt
requirements_intel.txt
requirements_rocm.txt
uv.lock Update transformers to 4.51 (#3148) 2025-04-07 12:55:43 +02:00

Text Generation Inference Python gRPC Server

A Python gRPC server for Text Generation Inference

Install

make install

Run

make run-dev