csi-driver: Random volume mount error
This error follows me for the last year.
tax-mysql-0 0/2 Init:0/1 0 3m5s
Warning FailedAttachVolume 119s attachdetach-controller AttachVolume.Attach failed for volume "pvc-b269c782-6c14-41d8-a043-9b8c890e485a" : rpc error: code = Internal desc = failed to publish volume: Volume assigning step has failed due to an unknown error. (unknown_error)
Pod randomly can’t create volume mount. And stays in Init state forever. Sometimes cloud node on which pod was running goes offline, but not always. I need to go to Hetzner console and turn node on manually. Workaround is deleting pod and wait for another try. csi version 1.6.0 kubernetes 1.19.9
About this issue
- Original URL
- State: closed
- Created 2 years ago
- Reactions: 5
- Comments: 22 (4 by maintainers)
@apricote thanks for offering to help!
We have similar issues. Sometimes volumes take forever to mount to the pod which results in startup times of 10min+:
The volume cannot be mounted but according to Hetzner CSI Controller the disk is attached, opened (LUKS) and ready.
Sometimes we also get errors, that state that the server is locked.
This is very annoying and we would be very happy to assist finding the issue!
We already had a look into our debug logs and were not able to find any issues there. These logs are generated close to pod scheduling:
Then we see nothing for a long time (thats probably the time during which the pod isn’t able to mount the volume). At some point, when the volume was successfully mounted, we see these logs:
We don’t know what changed or what made the volume suddenly mountable.
Sometimes node gets turned off after failed volume mount, from logs it seems like powerloss, with no error in syslog at all. So I need to go to hetzner web console and turn it on manually. Indeed maybe it is related to virtualization errors somehow. But this volume mount problem happens very often.