I am trying to finetune LLaVA on a custom dataset, but flash attention is not supported by my GPU. Do I need it to finetune?

Replies: 1 comment
No, you do not need flash attention to finetune. In the finetuning script, use train.py instead of train_mem.py. It worked for me, and I didn't see much difference in results.
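For context on why the swap works: train_mem.py enables the FlashAttention kernels, which need a fairly recent GPU, while train.py runs the standard attention path, so you mainly give up some speed and memory savings rather than result quality. Outside the LLaVA scripts, here is a minimal sketch of the same idea when loading a model with Hugging Face transformers; it assumes a recent transformers version, and the helper name and checkpoint below are placeholders, not from the LLaVA code:

```python
# Minimal sketch (assumptions: transformers recent enough to accept the
# attn_implementation argument; "your-base-model" is a placeholder checkpoint).
import importlib.util

import torch
from transformers import AutoModelForCausalLM


def pick_attn_implementation() -> str:
    """Use FlashAttention 2 only if the package is installed and the GPU supports it."""
    has_flash_pkg = importlib.util.find_spec("flash_attn") is not None
    # FlashAttention 2 requires an Ampere-or-newer GPU (compute capability >= 8.0).
    has_supported_gpu = (
        torch.cuda.is_available() and torch.cuda.get_device_capability()[0] >= 8
    )
    return "flash_attention_2" if (has_flash_pkg and has_supported_gpu) else "sdpa"


model = AutoModelForCausalLM.from_pretrained(
    "your-base-model",  # placeholder; substitute your actual checkpoint
    torch_dtype=torch.float16,
    attn_implementation=pick_attn_implementation(),
)
```

The fallback path ("sdpa" here, or the default attention in train.py) produces the same outputs, just more slowly and with higher memory use.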