Compile llama.cpp with CUDA 12 #13403
I am building llama.cpp with CUDA 12 support (RTX 5000). How can I add support for the RTX 4000 and RTX 5000? For example:

cmake -B build -DGGML_CUDA=ON -DCMAKE_CUDA_ARCHITECTURES="86;89"

The range of acceptable values for the CMAKE_CUDA_ARCHITECTURES parameter is not specified in the documentation. Should I specify CUDA 12 as 120 or as 12?
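A note on the parameter before the replies: CMAKE_CUDA_ARCHITECTURES takes CUDA compute capabilities, not CUDA toolkit versions (see the first reply below). As a sketch, assuming the cards in question are the GeForce RTX 40 series (compute capability 8.9) and RTX 50 series (compute capability 12.0, which requires a CUDA 12.8 or newer toolkit), an explicit build for both would look like:

```
cmake -B build -DGGML_CUDA=ON -DCMAKE_CUDA_ARCHITECTURES="89;120"
cmake --build build --config Release
```

If the Turing-era Quadro RTX 4000/5000 are meant instead, both are compute capability 7.5 ("75").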
-
Those are CUDA compute capabilities, basically device architectures, not CUDA versions. See https://developer.nvidia.com/cuda-gpus
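If it helps to confirm what a given card actually reports, here is a minimal sketch (plain CUDA runtime API, nothing llama.cpp-specific) that prints each visible device's compute capability:

```
// query_cc.cu -- print the compute capability of each visible CUDA device.
// Build with: nvcc query_cc.cu -o query_cc
#include <cstdio>
#include <cuda_runtime.h>

int main() {
    int count = 0;
    if (cudaGetDeviceCount(&count) != cudaSuccess || count == 0) {
        std::fprintf(stderr, "no CUDA devices found\n");
        return 1;
    }
    for (int i = 0; i < count; ++i) {
        cudaDeviceProp prop;
        cudaGetDeviceProperties(&prop, i);
        // prop.major/prop.minor map directly to CMAKE_CUDA_ARCHITECTURES
        // values, e.g. 8.9 -> "89", 12.0 -> "120".
        std::printf("device %d: %s, compute capability %d.%d\n",
                    i, prop.name, prop.major, prop.minor);
    }
    return 0;
}
```

Recent drivers also expose the same value via `nvidia-smi --query-gpu=compute_cap --format=csv`.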
-
@0cc4m OK. How do I set this parameter correctly so that the RTX 4000 and RTX 5000 can be used?
-
Don't modify CMAKE_CUDA_ARCHITECTURES; the llama.cpp CUDA backend should run on newer GPUs without explicitly setting the architectures.
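In other words, the stock build commands from the llama.cpp docs should be sufficient; when CMAKE_CUDA_ARCHITECTURES is left unset, the build's default architecture list typically includes a virtual (PTX) architecture that the driver can JIT-compile for newer GPUs:

```
cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release
```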
-
@CommanderLake I will try, thanks. I'll come back with feedback.