core: [ZHA] Integration randomly stops working, sits in 'initialising' state. (still)

The problem

As in the previous issue (https://github.com/home-assistant/core/issues/105445), my ZHA integration randomly becomes completely unresponsive, and the integration then sits in the “initialising” state.

What version of Home Assistant Core has the issue?

core-2024.1.2

What was the last working version of Home Assistant Core?

core-2024.1.1

What type of installation are you running?

Home Assistant Container

Integration causing the issue

ZHA

Link to integration documentation on our website

No response

Diagnostics information

config_entry-zha-5fb366dc2478313fb3cb2b29c52254af.json.txt

Example YAML snippet

No response

Anything in the logs that might be useful for us?

[home-assistant_zha_2024-01-07T19-41-14.250Z.log.zip](https://github.com/home-assistant/core/files/13854715/home-assistant_zha_2024-01-07T19-41-14.250Z.log.zip)

Additional information

No response

About this issue

  • State: open
  • Created 6 months ago
  • Reactions: 9
  • Comments: 43 (10 by maintainers)

Most upvoted comments

Same issue here.

Whatever was changed in 2024.1.3 has made it even worse. What used to happen once a week has now happened about 4 times in 2 days.

Let me set one up to test. I’ve been running my home network on a Silvercrest gateway without issues for the past day so perhaps it’s something specific to the Sonoff.

Unlike others here, my coordinator is currently a Sonoff bridge flashed with Tasmota.

Is there a way of restarting HA as part of an automation, i.e. if it detects that the integration (or a device) isn’t available, restart HA completely? (I don’t seem to be able to restart the integration manually, even when I have access to the GUI, because it needs to be “up” before it can be restarted, so I’m guessing restarting the integration itself from an automation wouldn’t work either, although I don’t mind trying.) I’m after a way of getting around this issue for now: I’m out of the house for a few days while my wife and family are at home, and their life is going to be marred with frustration when it inevitably craps out multiple times during the day and night and I’m not around to sort it.

Thanks

I’ve set up an automation:

  • Trigger: one of my plugged-in Zigbee devices becomes unavailable
  • Condition: all of my plugged-in Zigbee devices are in the unavailable state
  • Action: restart HA core

But I’ve since disabled it because ZHA keeps reinitializing every 5 minutes and it takes 10 to boot up again.
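For anyone wanting to try the same workaround, the trigger/condition/action automation described above could look roughly like the sketch below. This is only an illustration, not the poster's actual config: the two `switch.example_zigbee_plug_*` entity IDs are hypothetical placeholders for your own mains-powered Zigbee devices, and the 5-minute `for:` hold is an assumption added so that a brief dropout does not trigger a restart.

```yaml
# Sketch only. The entity IDs are hypothetical placeholders; replace them
# with your own always-powered Zigbee devices.
automation:
  - alias: "Restart Home Assistant when ZHA devices go unavailable"
    trigger:
      # Fires when any listed device has been unavailable for 5 minutes
      # (the hold time is an assumption, tune it to your setup).
      - platform: state
        entity_id:
          - switch.example_zigbee_plug_1
          - switch.example_zigbee_plug_2
        to: "unavailable"
        for: "00:05:00"
    condition:
      # With a list of entities, a state condition only passes when
      # every listed entity is in the given state.
      - condition: state
        entity_id:
          - switch.example_zigbee_plug_1
          - switch.example_zigbee_plug_2
        state: "unavailable"
    action:
      # Restarts Home Assistant core.
      - service: homeassistant.restart
```

Checking all devices in the condition is what separates "ZHA is down" from "one plug got unplugged". Note that this does nothing to prevent a restart loop if ZHA keeps failing shortly after every boot.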

Any solution for non-SkyConnect users in the works? ZHA is now reinitializing 5 minutes after reboot which takes about 10 minutes each time so it’s basically unusable. It used to only have this issue once or twice a day.

so https://github.com/home-assistant/addons/issues/3408 is not an exact match either.

The firmware is identical for both so it’s very likely the same issue.

if I can figure out how to do it without breaking anything.

There are documented steps here: https://yellow.home-assistant.io/guides/disable-multiprotocol/. It will migrate your network back to Zigbee-only.

Once that’s done, plug in the SkyConnect, install the OpenThread Border Router addon: it’ll flash your SkyConnect with Thread firmware. You can then push your preferred dataset to the same border router from the Thread configuration and replicate your multi-PAN setup with two stable radios.

Wife Acceptance Factor is dropping rapidly.

Know what you mean. Home automations (including things we have come to rely on) being broken for months is not winning me any points. I had to revert a bunch of things to failsafe mode and find workarounds for a bunch of other things. Overall this is causing me a significant amount of work and effort.

Same issue here.

Same thing here:

I run HAOS on a NUC and have the SkyConnect connected via a USB extension cable (like you’re supposed to). I got the Silicon Labs Multiprotocol update to 2.4.0 and things started breaking. Hours later I updated to 2.4.1: still broken. Another few hours later 2.4.2 was pushed and I upgraded.

Since 2.4.2 it stays up for about a day, then the ZHA integration randomly goes back to “Failed Setup Will Retry”.

What’s worse, I’m running both Zigbee AND Thread on the SkyConnect, so BOTH types of devices (85 of them) are broken, including lights.

Wife Acceptance Factor is dropping rapidly.

How do I downgrade back to 2.3.2??