fix: adds causal to attention params to check when using flash attn v1

This commit is contained in:
drbh 2024-08-13 00:56:15 +00:00
parent 8a7749b8fb
commit 519e5ac05b

View File

@ -293,6 +293,7 @@ else:
max_s,
softmax_scale,
window_size_left=-1,
causal=None,
softcap=None,
):
if window_size_left != -1: