Tensile: Tensile won't produce backend libraries for archs without optimized logic files when using --separate-architectures
Issue
Tensile won’t produce backend libraries for archs without optimized logic files when using --separate-architectures.
Description
According to https://github.com/ROCmSoftwarePlatform/Tensile/issues/1165#issuecomment-1094556880, “gfx1010 has been enabled by default in rocBLAS builds since ROCm 4.3.0.” However, because rocBLAS does not have optimized logic files for Navi 10, no library is produced for gfx1010:
$ drun --rm rocm/dev-ubuntu-22.04:5.6-complete
root@ftl:/# ls -1 /opt/rocm/lib/rocblas/library/TensileLibrary_lazy_gfx*
/opt/rocm/lib/rocblas/library/TensileLibrary_lazy_gfx1030.dat
/opt/rocm/lib/rocblas/library/TensileLibrary_lazy_gfx1100.dat
/opt/rocm/lib/rocblas/library/TensileLibrary_lazy_gfx1101.dat
/opt/rocm/lib/rocblas/library/TensileLibrary_lazy_gfx1102.dat
/opt/rocm/lib/rocblas/library/TensileLibrary_lazy_gfx803.dat
/opt/rocm/lib/rocblas/library/TensileLibrary_lazy_gfx900.dat
/opt/rocm/lib/rocblas/library/TensileLibrary_lazy_gfx906.dat
/opt/rocm/lib/rocblas/library/TensileLibrary_lazy_gfx908.dat
/opt/rocm/lib/rocblas/library/TensileLibrary_lazy_gfx90a.dat
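To check an installation for the same gap, a small script can compare the architectures you requested against the files that are actually present. This is only a sketch: the file name pattern is inferred from the listing above, and the helper names `installed_archs` and `missing_archs` and the example request list are assumptions for illustration, not part of rocBLAS or Tensile.

```python
import re
from pathlib import Path

# Pattern inferred from the listing above,
# e.g. TensileLibrary_lazy_gfx1030.dat
LAZY_LIBRARY_RE = re.compile(r"^TensileLibrary_lazy_(gfx[0-9a-f]+)\.dat$")


def installed_archs(library_dir):
    """Return the set of gfx targets that have a lazy Tensile library file."""
    found = set()
    for entry in Path(library_dir).iterdir():
        match = LAZY_LIBRARY_RE.match(entry.name)
        if match:
            found.add(match.group(1))
    return found


def missing_archs(library_dir, requested):
    """Return the requested gfx targets that have no library file, sorted."""
    return sorted(set(requested) - installed_archs(library_dir))


if __name__ == "__main__":
    # Assumption: the install location shown in the listing above.
    library_dir = "/opt/rocm/lib/rocblas/library"
    # Assumption: illustrative list; substitute the targets you built for.
    requested = ["gfx900", "gfx906", "gfx1010", "gfx1030"]
    if Path(library_dir).is_dir():
        print(missing_archs(library_dir, requested))
```

Any architecture the script prints was requested but has no per-architecture library file in that directory.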
Expected
Tensile should produce libraries for all requested architectures, using the fallback logic files for archs missing optimized logic files.
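The expected behaviour can be sketched as a simple selection rule. This is not Tensile's actual code: the function `select_logic_files`, its arguments, and the file names in the usage example are all hypothetical and only illustrate the idea that every requested architecture ends up with some set of logic files.

```python
def select_logic_files(requested_archs, optimized, fallback):
    """Pick logic files for each requested architecture (hypothetical helper).

    optimized: dict mapping gfx target -> list of optimized logic files
    fallback:  list of fallback logic files usable by any architecture
    """
    selection = {}
    for arch in requested_archs:
        arch_files = optimized.get(arch)
        # An architecture with no optimized logic still gets an entry,
        # so a library can be produced for it from the fallback files.
        selection[arch] = list(arch_files) if arch_files else list(fallback)
    return selection


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    optimized = {"gfx1030": ["optimized_gfx1030.yaml"]}
    fallback = ["fallback.yaml"]
    print(select_logic_files(["gfx1010", "gfx1030"], optimized, fallback))
```

Under this rule gfx1010 is never dropped from the build; it simply maps to the fallback files instead of to nothing.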
Workaround
Building rocBLAS with --merge-architectures --no-lazy-library-loading seems to avoid the issue.
Patch
About this issue
- State: closed
- Created 10 months ago
- Reactions: 8
- Comments: 33 (28 by maintainers)
Commits related to this issue
- Use fallback libraries for archs without optimized logic Fixes #1757. Enables architectures that don't have optimized logic files to also produce libraries when `--separate-architectures` or `--lazy... — committed to GZGavinZhao/Tensile by GZGavinZhao 6 months ago
- Use fallback libraries for archs without optimized logic (#1862) Fixes #1757. Enables architectures that don't have optimized logic files to also produce libraries when `--separate-architectures` or ... — committed to ROCm/Tensile by GZGavinZhao 5 months ago
- Use fallback libraries for archs without optimized logic (#1897) Fixes #1757. Enables architectures that don't have optimized logic files to also produce libraries when `--separate-architectures`... — committed to ROCm/Tensile by GZGavinZhao 4 months ago
- rocBLAS: Tensile: Fix fallback architecture Patch taken from https://github.com/ROCm/Tensile/issues/1757 — committed to T0mstone/void-packages by T0mstone 4 months ago
Although this is probably not the right place, I really needed to say thank you! I’ve struggled with this basically since my card was released, and I was finally able to fix it because of you.
Doing compute stuff is just a nightmare with AMD, really.
I will implement a workaround for this failure. To make it work now,