Change batch size sort in Neuron config

Fixing https://github.com/huggingface/text-generation-inference/issues/3299
This commit is contained in:
Jim Burtoft 2025-07-18 22:39:45 -04:00 committed by GitHub
parent 24c2bff659
commit 3c55c053ee
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194

View File

@ -112,7 +112,7 @@ def neuron_config_to_env(neuron_config):
def sort_neuron_configs(dictionary):
return -dictionary["tp_degree"], -dictionary["batch_size"]
return -dictionary["tp_degree"], dictionary["batch_size"]
def lookup_compatible_cached_model(