stable-diffusion.cpp: Extremely slow performance on Ryzen 7950X3D
Running the command from the README, I get this:
step 1 sampling completed, taking 50.97s
Compiled with CMake on Windows. Shouldn’t it be a little bit faster?
About this issue
- State: closed
- Created a year ago
- Comments: 19 (11 by maintainers)
Works. Still very slow, but I guess that’s expected: about 7 sec per step with cuBLAS, 30 sec without.
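To put the numbers quoted in this thread side by side, here is a rough Python sketch of the arithmetic. The per-step timings come from the comments above. The step count of 20 and the helper names `speedup` and `total_seconds` are assumptions made only for illustration; they are not settings or functions from stable-diffusion.cpp.

```python
# Per-step timings quoted in this thread, in seconds per sampling step.
SECONDS_PER_STEP = {
    "first report (CPU)": 50.97,
    "without cuBLAS": 30.0,
    "with cuBLAS": 7.0,
}


def speedup(slow: float, fast: float) -> float:
    """Return how many times faster `fast` is than `slow`."""
    return slow / fast


def total_seconds(per_step: float, steps: int) -> float:
    """Estimate sampling time, assuming every step costs the same."""
    return per_step * steps


if __name__ == "__main__":
    steps = 20  # hypothetical step count, not taken from the thread
    for label, per_step in SECONDS_PER_STEP.items():
        print(f"{label}: ~{total_seconds(per_step, steps):.0f} s for {steps} steps")
    print(f"cuBLAS vs. no cuBLAS: ~{speedup(30.0, 7.0):.1f}x faster per step")
```

With those figures, cuBLAS is a bit more than four times faster per step than the build without it, which still adds up to minutes per image at any typical step count.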
Very funky. @leejet, I will probably make a PR later with an improved CMake setup (by copying from llama.cpp).
The latest GGML code has already fixed this issue. I will rebase my code onto the latest GGML code.