text-generation-inference

mirror of https://github.com/huggingface/text-generation-inference.git synced 2025-04-24 00:12:08 +00:00

History

Daniël de Kok 4727a3af67 launcher: correctly get the head dimension for VLMs For most (?) VLMs, the head dimension is in the `text_config` configuration section. However, since we only queried the top-level `head_dim` (which typically doesn't exist in VLMs), we would never use flashinfer. This change adds a method that gets the head dimension from the top-level `Config` struct or `text_config` when that fails.		2025-03-17 10:07:39 +00:00
..
src	launcher: correctly get the head dimension for VLMs	2025-03-17 10:07:39 +00:00
build.rs	chore(github): add templates (#264 )	2023-05-02 15:43:19 +02:00
Cargo.toml	feat: add support for HF_HUB_USER_AGENT_ORIGIN to add user-agent Origin field in Hub requests. (#3061 )	2025-03-04 16:43:50 +01:00