ROCm: RX 470 card no longer recognized by clinfo after 4.5 update

Card was working fine with 4.3.

I uninstalled my previous version (4.3) and installed 4.5.

output from /opt/rocm/opencl/bin/clinfo :

Number of platforms:				 1
  Platform Profile:				 FULL_PROFILE
  Platform Version:				 OpenCL 2.2 AMD-APP (3361.0)
  Platform Name:				 AMD Accelerated Parallel Processing
  Platform Vendor:				 Advanced Micro Devices, Inc.
  Platform Extensions:				 cl_khr_icd cl_amd_event_callback 


  Platform Name:				 AMD Accelerated Parallel Processing
Number of devices:				 0
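
For anyone who wants to check for this condition from a script, here is a minimal sketch. It assumes the clinfo output format shown above, with a "Number of devices:" line per platform. The function name `count_opencl_devices` and the `sample` string are made up for illustration and are not part of ROCm.

```python
import re


def count_opencl_devices(clinfo_text: str) -> int:
    """Sum every 'Number of devices:' value found in clinfo text output."""
    counts = re.findall(
        r"^\s*Number of devices:\s*(\d+)\s*$",
        clinfo_text,
        flags=re.MULTILINE,
    )
    return sum(int(c) for c in counts)


# Hypothetical sample, trimmed from the output pasted in this report.
sample = (
    "Number of platforms:\t\t\t\t 1\n"
    "  Platform Name:\t\t\t\t AMD Accelerated Parallel Processing\n"
    "Number of devices:\t\t\t\t 0\n"
)

if __name__ == "__main__":
    n = count_opencl_devices(sample)
    print("OpenCL devices found:", n)
```

To run it against a live system, the text could come from `subprocess.run(["/opt/rocm/opencl/bin/clinfo"], capture_output=True, text=True).stdout` instead of the sample string.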

output from rocminfo

hsa api call failure at: /long_pathname_so_that_rpms_can_package_the_debug_info/src/rocminfo/rocminfo.cc:1143
Call returned HSA_STATUS_ERROR_OUT_OF_RESOURCES: The runtime failed to allocate the necessary resources. This error may also occur when the core runtime library needs to spawn threads or create internal OS-specific events.

output from rocm-smi

======================= ROCm System Management Interface =======================
================================= Concise Info =================================
GPU  Temp   AvgPwr   SCLK     MCLK    Fan     Perf  PwrCap  VRAM%  GPU%  
0    47.0c  23.116W  1169Mhz  300Mhz  19.22%  auto  92.0W    16%   0%    
================================================================================
============================= End of ROCm SMI Log ==============================

About this issue

  • State: closed
  • Created 3 years ago
  • Reactions: 3
  • Comments: 30

Most upvoted comments

There’s a difference between “not supported” as in “we don’t provide support”, and “not supported” as in “we don’t enable that feature in our binaries”.

I had hoped that “not supported” in ROCm meant the first case, similar to ECC RAM on consumer Ryzen platforms, which receives no help or assistance from AMD, yet works at your own risk.

It’s sad to see those Polaris cards now apparently being dropped from ROCm binary releases.

@ROCmSupport Can’t there be a policy similar to Ryzen and ECC, in that you don’t provide assistance for that feature, but don’t actively prevent it from running either?

@ROCmSupport absolutely, all cards have an EOL, but 6 years is not very long IMHO.

The 590 is not even 4 years old yet I believe.

@ROCmSupport @johnbridgman Polaris 10 cards have worked fine with ROCm since the library was launched in 2016. It would be a shame, in my opinion, to disable these cards at this stage, unless there’s a good reason for doing so.

Where can we find these business standards for the software support duration of AMD cards?

I’ve opened a new issue for this situation

https://github.com/RadeonOpenCompute/ROCm/issues/1659

Is there any reason why you need 4.5 instead of 4.3.1?

Not really. It’s just that it’s usually better to run newer rather than older software, and nobody knows how long 4.3.1 will keep working on modern systems.

Compiling this beast is daunting even for people experienced in building software (not me), so I can’t say if it’s an issue with the binaries or the rocm source…

Would be great to know if this issue also happens with self-compiled builds.

@ROCmSupport absolutely, all cards have an EOL, but 6 years is not very long IMHO.

@clavinet on the one hand there are a ton of Polaris cards out there; on the other hand, 5 years is a reasonable time to support this card. Is there any reason why you need 4.5 instead of 4.3.1?

I tried both the bundled clinfo that comes with ROCm and the system clinfo. My distro is openSUSE Tumbleweed 20211102.