text-generation-inference/docs/source
Wang, Yi 778b61c0da
[gaudi] Remove unnecessary reinitialize to HeterogeneousNextTokenChooser to make sampling output correct (#3284)
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Co-authored-by: regisss <15324346+regisss@users.noreply.github.com>
2025-07-03 10:03:16 +02:00
..
backends [gaudi] Remove unnecessary reinitialize to HeterogeneousNextTokenChooser to make sampling output correct (#3284) 2025-07-03 10:03:16 +02:00
basic_tutorials Neuron backend fix and patch version 3.3.4 (#3273) 2025-06-19 10:52:41 +02:00
conceptual Neuron backend fix and patch version 3.3.4 (#3273) 2025-06-19 10:52:41 +02:00
reference Neuron backend fix and patch version 3.3.4 (#3273) 2025-06-19 10:52:41 +02:00
_toctree.yml Release of Gaudi Backend for TGI (#3091) 2025-03-13 10:56:01 +01:00
architecture.md Avoid running neuron integration tests twice (#3054) 2025-02-26 12:15:01 +01:00
index.md Removing ../ that broke the link (#2789) 2024-12-02 05:48:55 +01:00
installation_amd.md Neuron backend fix and patch version 3.3.4 (#3273) 2025-06-19 10:52:41 +02:00
installation_gaudi.md Release of Gaudi Backend for TGI (#3091) 2025-03-13 10:56:01 +01:00
installation_inferentia.md Avoid running neuron integration tests twice (#3054) 2025-02-26 12:15:01 +01:00
installation_intel.md Neuron backend fix and patch version 3.3.4 (#3273) 2025-06-19 10:52:41 +02:00
installation_nvidia.md Neuron backend fix and patch version 3.3.4 (#3273) 2025-06-19 10:52:41 +02:00
installation_tpu.md Fix typo in TPU docs (#2911) 2025-01-15 18:32:07 +01:00
installation.md MI300 compatibility (#1764) 2024-05-17 15:30:47 +02:00
multi_backend_support.md Avoid running neuron integration tests twice (#3054) 2025-02-26 12:15:01 +01:00
quicktour.md Neuron backend fix and patch version 3.3.4 (#3273) 2025-06-19 10:52:41 +02:00
supported_models.md Add llama4 (#3145) 2025-04-06 10:20:22 +02:00
usage_statistics.md fix: Telemetry (#2957) 2025-01-28 10:29:18 +01:00