text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-04-22 15:32:08 +00:00

Vidya Galli ca1b2f4994 Updated kv cache for starcoder (#128)	2024-06-14 22:36:44 +02:00
test_bloom.py	feat: add more latency metrics in forward (#1346)	2024-04-19 13:41:34 +03:00
test_causal_lm.py	[Torch.compile] Enable llama-2-7b (#157)	2024-06-14 15:56:23 +02:00
test_grammar.py	Add grammar support (#140)	2024-05-20 11:16:34 +02:00
test_model.py	fix(server): fix decode token (#334)	2023-05-16 23:23:27 +02:00
test_santacoder.py	feat: add more latency metrics in forward (#1346)	2024-04-19 13:41:34 +03:00
test_seq2seq_lm.py	feat: add more latency metrics in forward (#1346)	2024-04-19 13:41:34 +03:00
test_starcoder.py	Updated kv cache for starcoder (#128)	2024-06-14 22:36:44 +02:00