mirror of
https://github.com/huggingface/text-generation-inference.git
synced 2025-11-18 23:15:59 +00:00
* Attempt for cleverer auto batch_prefill values (some simplifications). * Less flaky tests. * Fixing typo insertion. * Update launcher/src/main.rs Co-authored-by: Daniël de Kok <me@danieldk.eu> * Adding small comment for source of calculation. * Adding L40. * Adding L40s. --------- Co-authored-by: Daniël de Kok <me@danieldk.eu> |
||
|---|---|---|
| .. | ||
| custom_modeling | ||
| __init__.py | ||
| bloom.py | ||
| causal_lm.py | ||
| flash_causal_lm.py | ||
| galactica.py | ||
| globals.py | ||
| idefics_causal_lm.py | ||
| mamba.py | ||
| metadata_kernels.py | ||
| mllama_causal_lm.py | ||
| model.py | ||
| pali_gemma.py | ||
| seq2seq_lm.py | ||
| types.py | ||
| vlm_causal_lm.py | ||