mirror of https://github.com/huggingface/text-generation-inference.git
synced 2025-09-11 12:24:53 +00:00
update partition size
parent ca071bdd1d
commit 07e9ec2b66
@@ -179,7 +179,7 @@ def paged_attention(
 1.0,
 1.0,
 None,
-512,
+_PARTITION_SIZE,
 )

 return out
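For context on why a hard-coded `512` in this call can matter: partitioned paged-attention kernels typically split each sequence into fixed-size chunks, and the size passed to the kernel has to agree with the size used when the intermediate buffers were allocated. The following is a minimal sketch, assuming `_PARTITION_SIZE` is the number of tokens per partition. The value 512 is only the old literal from this diff, and `num_partitions` is a hypothetical helper, not a function from the repository.

```python
# Hypothetical illustration; not code from text-generation-inference.
_PARTITION_SIZE = 512  # assumed value, taken from the literal this commit replaces


def num_partitions(max_seq_len: int, partition_size: int = _PARTITION_SIZE) -> int:
    """Return how many fixed-size partitions cover max_seq_len tokens."""
    if max_seq_len < 0 or partition_size <= 0:
        raise ValueError("max_seq_len must be >= 0 and partition_size must be > 0")
    # Ceiling division: a partially filled last partition still counts.
    return (max_seq_len + partition_size - 1) // partition_size
```

If the sizing logic and the kernel call both read the same named constant, changing the partition size in one place cannot leave the other stale, which is presumably the motivation for replacing `512` with `_PARTITION_SIZE` here.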