CUDA: lower GPU latency + fix Windows performance

9268745
Merged

CUDA: refactor ggml_cuda_op + lower GPU latency via quantization on main GPU and tiling #3110

