text-generation-inference/docs/source
Wang, Yi ebb26f0ccd
[gaudi] Deepseek v2 mla and add ep to unquantized moe (#3287)
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
2025-07-07 11:29:39 +02:00
Name | Last commit | Date
backends | [gaudi] Deepseek v2 mla and add ep to unquantized moe (#3287) | 2025-07-07 11:29:39 +02:00
basic_tutorials | Neuron backend fix and patch version 3.3.4 (#3273) | 2025-06-19 10:52:41 +02:00
conceptual | Neuron backend fix and patch version 3.3.4 (#3273) | 2025-06-19 10:52:41 +02:00
reference | Neuron backend fix and patch version 3.3.4 (#3273) | 2025-06-19 10:52:41 +02:00
_toctree.yml | Release of Gaudi Backend for TGI (#3091) | 2025-03-13 10:56:01 +01:00
architecture.md | Avoid running neuron integration tests twice (#3054) | 2025-02-26 12:15:01 +01:00
index.md | Removing ../ that broke the link (#2789) | 2024-12-02 05:48:55 +01:00
installation_amd.md | Neuron backend fix and patch version 3.3.4 (#3273) | 2025-06-19 10:52:41 +02:00
installation_gaudi.md | Release of Gaudi Backend for TGI (#3091) | 2025-03-13 10:56:01 +01:00
installation_inferentia.md | Avoid running neuron integration tests twice (#3054) | 2025-02-26 12:15:01 +01:00
installation_intel.md | Neuron backend fix and patch version 3.3.4 (#3273) | 2025-06-19 10:52:41 +02:00
installation_nvidia.md | Neuron backend fix and patch version 3.3.4 (#3273) | 2025-06-19 10:52:41 +02:00
installation_tpu.md | Fix typo in TPU docs (#2911) | 2025-01-15 18:32:07 +01:00
installation.md | MI300 compatibility (#1764) | 2024-05-17 15:30:47 +02:00
multi_backend_support.md | Avoid running neuron integration tests twice (#3054) | 2025-02-26 12:15:01 +01:00
quicktour.md | Neuron backend fix and patch version 3.3.4 (#3273) | 2025-06-19 10:52:41 +02:00
supported_models.md | Add llama4 (#3145) | 2025-04-06 10:20:22 +02:00
usage_statistics.md | fix: Telemetry (#2957) | 2025-01-28 10:29:18 +01:00