text-generation-inference/server/tests/models
2024-06-14 22:36:44 +02:00
File                Last commit                                        Date
test_bloom.py       feat: add more latency metrics in forward (#1346)  2024-04-19 13:41:34 +03:00
test_causal_lm.py   [Torch.compile] Enable llama-2-7b (#157)           2024-06-14 15:56:23 +02:00
test_grammar.py     Add grammar support (#140)                         2024-05-20 11:16:34 +02:00
test_model.py       fix(server): fix decode token (#334)               2023-05-16 23:23:27 +02:00
test_santacoder.py  feat: add more latency metrics in forward (#1346)  2024-04-19 13:41:34 +03:00
test_seq2seq_lm.py  feat: add more latency metrics in forward (#1346)  2024-04-19 13:41:34 +03:00
test_starcoder.py   Updated kv cache for starcoder (#128)              2024-06-14 22:36:44 +02:00