text-generation-inference/docs/source
drbh da5ab46705
Improve vlm support (add idefics3 support) (#2437)
* feat: expand vlm support and add image token logic and tests

* fix: avoid unused perceiver config

* feat: integrate image tokens into inputs embeds

* feat: add simple idefics3 test

* feat: update docs, image token logic and weight names

* fix: improve image processing

* feat: improve prefix for idefics3

* fix: bump idefics3 tests and snapshots

* fix: improve text model loading

* feat: consolidate changes with existing vlms and add support and test for smolvlm

* fix: create new idefic3 file, simplify logic and adjust llama weight loading

* fix: lint with ruff

* fix: clean up idefics 3 and improve prefix handling

* fix: improve typing

* fix: improve prompt_split_image with ref to original impl

* fix: adjust ruff lints and small refactors

* fix: adjust FlashLlamaModel prefix logic
2025-01-09 10:35:32 -05:00
..
backends fix: lint backend and doc files (#2850) 2024-12-16 16:12:34 -05:00
basic_tutorials Prepare patch release. (#2829) 2024-12-11 21:03:50 +01:00
conceptual Prepare patch release. (#2829) 2024-12-11 21:03:50 +01:00
reference Prepare patch release. (#2829) 2024-12-11 21:03:50 +01:00
_toctree.yml TensorRT-LLM backend bump to latest version + misc fixes (#2791) 2024-12-13 15:50:59 +01:00
architecture.md TensorRT-LLM backend bump to latest version + misc fixes (#2791) 2024-12-13 15:50:59 +01:00
index.md Removing ../ that broke the link (#2789) 2024-12-02 05:48:55 +01:00
installation_amd.md Prepare patch release. (#2829) 2024-12-11 21:03:50 +01:00
installation_gaudi.md MI300 compatibility (#1764) 2024-05-17 15:30:47 +02:00
installation_inferentia.md MI300 compatibility (#1764) 2024-05-17 15:30:47 +02:00
installation_intel.md Prepare patch release. (#2829) 2024-12-11 21:03:50 +01:00
installation_nvidia.md Prepare patch release. (#2829) 2024-12-11 21:03:50 +01:00
installation.md MI300 compatibility (#1764) 2024-05-17 15:30:47 +02:00
multi_backend_support.md fix: lint backend and doc files (#2850) 2024-12-16 16:12:34 -05:00
quicktour.md Prepare patch release. (#2829) 2024-12-11 21:03:50 +01:00
supported_models.md Improve vlm support (add idefics3 support) (#2437) 2025-01-09 10:35:32 -05:00
usage_statistics.md feat: allow any supported payload on /invocations (#2683) 2024-10-23 11:26:01 +00:00