text-generation-inference/launcher/src
2024-08-12 23:59:03 +02:00
..
env_runtime.rs Integrate flash attention for starcoder2 tgi through habana and some fixes, enabling (#198) 2024-08-07 22:06:05 +02:00
main.rs Merge pull request #187 from yuanwu2017/v2.0.4 2024-08-12 23:59:03 +02:00