Skip to content

Commit f796954

Browse files
committed
Revamp FFN down and attn_k
And complete FFN up Shrink a bit more non GQA models
1 parent 596a4ae commit f796954

File tree

1 file changed

+231
-158
lines changed

1 file changed

+231
-158
lines changed

0 commit comments

Comments
 (0)