rancher: Failed to list *: context deadline exceeded seen in rancher/rancher container logs
What kind of request is this (question/bug/enhancement/feature request): bug
Steps to reproduce (least amount of steps as possible):
Run rancher/rancher:v2.4.5
or v2.4-head
, create a custom cluster and add a node with all roles. Watch Rancher container logs.
Result: Following lines are seen:
Failed to list *v1.X: Get X: context deadline exceeded
gzrancher/rancher#11544
gzrancher/rancher#11329
About this issue
- Original URL
- State: closed
- Created 4 years ago
- Reactions: 11
- Comments: 20 (7 by maintainers)
Same issue here after upgrading to v2.4.5
This seems to be related to
resourceVersion
changes and the watcher/informer using an old/too highresourceVersion
causing the logging. When I restart thekube-apiserver
usingdocker restart kube-apiserver
on the control plane nodes of the cluster (or possibly just the one which address is being logged in thehttps://
URL), it seems to stop.Now figuring out how to reset/fix this.
This was introduced by https://github.com/rancher/norman/pull/367/
Hi @superseb ,
Not sure whether it is related (and hopefully this helps), but we were getting these errors too, and noticed on the affected control plane node containers running the version of the rancher-agent we upgraded from (v2.3.6, with image ID beginning with 697), instead of the v2.4.5 image
Also, to clarify, share-mnt was already running on the v2.4.5 image by this stage.
Following your workaround to restart the apiserver, and we see the v2.4.5 image correctly being used:
Encountered with Rancher running on a
1.15.2
cluster.No impact has been reported, if you experience any, please report back. It seems to be related to 1.18 k8s go client vs older Kubernetes versions.