mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-10-10 23:45:23 +00:00

History

Nick Hill a172430d8b fix: Some small fixes - Avoid theoretical hang in batcher loop - Avoid a couple of clones in py server generate method - Keep attention mask tensors as integers		2022-12-01 16:19:12 -08:00
..
text_generation	fix: Some small fixes	2022-12-01 16:19:12 -08:00
.gitignore	feat(server): Support all AutoModelForCausalLM on a best effort basis	2022-10-28 19:24:00 +02:00
Makefile	feat(server): Support Galactica (#4 )	2022-12-01 19:31:54 +01:00
poetry.lock	feat(server): Improved doc	2022-11-07 12:53:56 +01:00
pyproject.toml	feat(server): Improved doc	2022-11-07 12:53:56 +01:00
README.md	feat(server): Use safetensors	2022-10-22 20:00:15 +02:00

BLOOM Inference Python gRPC Server

A Python gRPC server for BLOOM Inference

Install

make install

make run-dev