stable-diffusion.cpp: Extremely slow performance on Ryzen 7950X3D

Running the line from the readme, I get this:

step 1 sampling completed, taking 50.97s

Compiled with CMake on Windows. Shouldn’t it be a little bit faster?

About this issue

  • State: closed
  • Created a year ago
  • Comments: 19 (11 by maintainers)

Most upvoted comments

Works. Still very slow, but I guess that’s expected: about 7 sec per step with cuBLAS, 30 sec without.
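To put those per-step numbers in perspective, here is a rough back-of-the-envelope sketch. The 7 s and 30 s figures come from the comment above. The step count of 20 is only an assumed value for illustration, and the helper names are made up for this example; nothing here is part of stable-diffusion.cpp.

```python
# Rough estimate of total sampling time from the per-step timings reported above.
# ASSUMED_STEPS is a hypothetical value chosen for illustration, not taken from the thread.

CPU_ONLY_SEC_PER_STEP = 30.0  # reported without cuBLAS
CUBLAS_SEC_PER_STEP = 7.0     # reported with cuBLAS
ASSUMED_STEPS = 20            # assumption: number of sampling steps


def total_sampling_time(seconds_per_step: float, steps: int) -> float:
    """Total sampling time in seconds, assuming every step takes the same time."""
    return seconds_per_step * steps


def speedup(baseline_sec: float, accelerated_sec: float) -> float:
    """How many times faster the accelerated run is per step."""
    return baseline_sec / accelerated_sec


print(f"CPU only: {total_sampling_time(CPU_ONLY_SEC_PER_STEP, ASSUMED_STEPS):.0f} s")
print(f"cuBLAS:   {total_sampling_time(CUBLAS_SEC_PER_STEP, ASSUMED_STEPS):.0f} s")
print(f"Speedup:  {speedup(CPU_ONLY_SEC_PER_STEP, CUBLAS_SEC_PER_STEP):.1f}x")
```

So under these assumptions cuBLAS gives a bit more than a 4x speedup per step, but a full image would still take minutes rather than seconds.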

Very funky. @leejet, I will probably make a PR later with an improved CMake setup (by copying from llama.cpp).

The latest GGML code has already fixed this issue. I will rebase my code onto the latest GGML code.