cert-manager: Leader election timeout (?) causes exit
I’ve been seeing this over a few clusters (v1.6.1)
E0304 04:54:00.561791 1 leaderelection.go:367] Failed to update lock: Put "https://10.128.0.1:443/api/v1/namespaces/kube-system/configmaps/cert-manager-controller": context deadline exceeded
I0304 04:54:00.564601 1 leaderelection.go:283] failed to renew lease kube-system/cert-manager-controller: timed out waiting for the condition
E0304 04:54:00.841843 1 leaderelection.go:306] Failed to release lock: Operation cannot be fulfilled on configmaps "cert-manager-controller": the object has been modified; please apply your changes to the latest version and try again
I0304 04:54:00.843278 1 controller.go:126] cert-manager/controller/certificaterequests-issuer-ca "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.843603 1 controller.go:126] cert-manager/controller/certificaterequests-issuer-acme "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.843694 1 controller.go:126] cert-manager/controller/certificates-request-manager "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.843746 1 controller.go:126] cert-manager/controller/certificates-issuing "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.843912 1 controller.go:126] cert-manager/controller/certificaterequests-issuer-vault "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.843968 1 controller.go:126] cert-manager/controller/certificates-trigger "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.844003 1 controller.go:126] cert-manager/controller/certificaterequests-issuer-selfsigned "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.844055 1 controller.go:126] cert-manager/controller/certificates-revision-manager "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.844112 1 controller.go:126] cert-manager/controller/certificates-metrics "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.846530 1 controller.go:126] cert-manager/controller/issuers "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.846724 1 controller.go:126] cert-manager/controller/challenges "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.846835 1 controller.go:126] cert-manager/controller/orders "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.846897 1 controller.go:126] cert-manager/controller/ingress-shim "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.846945 1 controller.go:126] cert-manager/controller/certificates-key-manager "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.847006 1 controller.go:126] cert-manager/controller/certificates-readiness "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.847049 1 controller.go:126] cert-manager/controller/certificaterequests-approver "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.847207 1 controller.go:126] cert-manager/controller/clusterissuers "msg"="shutting down queue as workqueue signaled shutdown"
I0304 04:54:00.847269 1 controller.go:126] cert-manager/controller/certificaterequests-issuer-venafi "msg"="shutting down queue as workqueue signaled shutdown"
E0304 04:54:00.852721 1 main.go:39] cert-manager "msg"="error while executing" "error"="error starting controller: leader election lost"
It seems to corrospond to periods of time in which the kube api server is taking longer than usual to respond. These usually last a couple of minutes at most, People using managed clusters (LKE, GKE, etc) don’t tend to have control over this as the master node is provided by their cloud vendor.
There seems to have been a few issues that have also had this (or similar) including most similarly #2362
I suspect a portion of api servers high load is caused by #3766 (amongst other things which should be resolved in v1.18 😃 )
Is there anything further that a more experienced (with cert-manager) eye spots here?
About this issue
- Original URL
- State: closed
- Created 2 years ago
- Comments: 26 (3 by maintainers)
I’m receiving a similar error on GKE 1.22 using cert-manager
v1.8.2
:I’m using k3s 1.23.x and experiencing the same issue since v.1.6.0. I also tried setting extraArgs and related hotfixes. I’ve updated to v1.8.0 and without success.
Can I provide you any more info for investigating this issue?