kubernetes: Kubelet CNI nsenter failure
Kubernetes version (use kubectl version
):
Client Version: version.Info{Major:"1", Minor:"5", GitVersion:"v1.5.2", GitCommit:"08e099554f3c31f6e6f07b448ab3ed78d0520507", GitTreeState:"clean", BuildDate:"2017-01-12T04:57:25Z", GoVersion:"go1.7.4", Compiler:"gc", Platform:"linux/amd64"}
Server Version: version.Info{Major:"1", Minor:"5", GitVersion:"v1.5.2", GitCommit:"08e099554f3c31f6e6f07b448ab3ed78d0520507", GitTreeState:"clean", BuildDate:"2017-01-12T04:52:34Z", GoVersion:"go1.7.4", Compiler:"gc", Platform:"linux/amd64"}
Environment:
- Cloud provider or hardware configuration: Bare metal
- OS (e.g. from /etc/os-release): Ubuntu precise (12.04.4 LTS)
- Kernel (e.g.
uname -a
): 3.13.0-55-generic - Install tools: N/A
- Others: N/A
What happened:
During an otherwise-normal rollingupdate for a replication controller, we see this error in the logs:
E0307 21:43:51.235958 3650 docker_manager.go:373] NetworkPlugin cni failed on the status hook for pod 'foo-r3rg5' - Unexpected command output nsenter: cannot open /proc/5875/ns/net: No such file or directory\n with error: exit status 1
What you expected to happen:
Expected that successful rollingupdate wouldn’t generate error-level logs without an error-level problem – trying to determine what the error-level problem is.
How to reproduce it (as minimally and precisely as possible):
During our prod upgrade of various services, this occurred during 3 out of 70 rollingupdates.
Anything else we need to know:
Perhaps this is the same general issue as #25281? But I couldn’t find any references to the “Unexpected command output nsenter” variation seen above, thus filing separately.
About this issue
- Original URL
- State: closed
- Created 7 years ago
- Comments: 27 (16 by maintainers)
I’m also getting this every now and then, and when it starts happening, the only workaround seems to be tearing down the cluster and bringing it back up. At least in my case it seems to be related to resource limits. I’m trying to run a pod both with and without a resource limit - every time I specify a memory limit, the pod is stuck at
ContainerCreating
, and every time I comment them out it startsRunning
without a hitch.Here’s some logs from the kubelet trying to run the resource limited pods. I’m running with a kubeadm install + flannel.
EDIT: kubeadm, kubectl version 1.6.4
https://github.com/kubernetes/kubernetes/pull/43879 may help this, though it has been reverted and needs some fixups.