kubernetes: Missing taint: node.kubernetes.io/unreachable:NoExecute when nodes enter NotReady state on new installs of 1.19+

What happened: I have been testing some Ansible roles that deploy Kubernetes with kubeadm on Ubuntu virtual machines. While testing, I noticed that pods were not being terminated and re-created on other nodes when the node they were running on went into a NotReady state. The node that goes NotReady never gets the node.kubernetes.io/unreachable:NoExecute taint applied, which prevents the pods from being evicted (terminated) and re-created elsewhere. I confirmed the same behaviour when installing manually (without Ansible), with both dockershim and containerd as the CRI, and with both Flannel and Calico as the CNI.

This occurs on new installations of Kubernetes with any version greater than 1.18.12, i.e. 1.19.x. It has been tested and confirmed with kubeadm on Ubuntu and with kind on macOS.

Strangely, I can confirm that if kubeadm/kubelet 1.18.12 is installed first and then upgraded to 1.19.x on Ubuntu using APT, the taint is added to the NotReady node and the pods are evicted (terminated) and re-created on the other nodes. (I have not confirmed this with kind.)

What you expected to happen: I expected any pods running on a node that enters the NotReady state to be terminated and re-created on another available node, because the node.kubernetes.io/unreachable:NoExecute taint should be applied to that node.
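
A quick way to check whether the eviction taints are present on a given node (the jsonpath query below is my own addition, not part of the original report; substitute the node name as needed):

kubectl get node kind-worker -o jsonpath='{range .spec.taints[*]}{.key}{"="}{.effect}{"\n"}{end}'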

How to reproduce it (as minimally and precisely as possible):

Using kind
kind create cluster --config - <<EOF
kind: Cluster
apiVersion: kind.x-k8s.io/v1alpha4
nodes:
- role: control-plane
- role: worker
- role: worker
EOF
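
Optionally, wait until all nodes report Ready before applying the test Deployment (a convenience step I added, not part of the original repro):

kubectl wait --for=condition=Ready nodes --all --timeout=180s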

kubectl apply -f - <<EOF
apiVersion: apps/v1
kind: Deployment
metadata:
  name: test
spec:
  replicas: 1
  selector:
    matchLabels:
      srv: test
  template:
    metadata:
      labels:
        srv: test
    spec:
      containers:
      - image: nginx
        name: test
        ports:
        - containerPort: 80
        resources: {}
      tolerations:
      - effect: NoExecute
        key: node.kubernetes.io/unreachable
        operator: Exists
        tolerationSeconds: 10
      - effect: NoExecute
        key: node.kubernetes.io/not-ready
        operator: Exists
        tolerationSeconds: 10
EOF
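
With tolerationSeconds set to 10 on both NoExecute tolerations, the pod should be marked for deletion roughly 10 seconds after the control plane taints its node. A simple way to observe this (watch command added by me for convenience) is to keep the pods under watch in one terminal while stopping the worker container in another:

kubectl get pods -o wide --watch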

Working/Tested on k8s 1.18.2 (kind v0.8.1):

kubectl get nodes
NAME                 STATUS   ROLES    AGE   VERSION
kind-control-plane   Ready    master   13m   v1.18.2
kind-worker          Ready    <none>   13m   v1.18.2
kind-worker2         Ready    <none>   13m   v1.18.2

kubectl get pods -o wide
NAME                    READY   STATUS    RESTARTS   AGE   IP           NODE          NOMINATED NODE   READINESS GATES
test-67bff98d4b-ch2ng   1/1     Running   0          25s   10.244.2.3   kind-worker   <none>           <none>

docker stop kind-worker
kind-worker

kubectl get nodes
NAME                 STATUS     ROLES    AGE   VERSION
kind-control-plane   Ready      master   23m   v1.18.2
kind-worker          NotReady   <none>   22m   v1.18.2
kind-worker2         Ready      <none>   23m   v1.18.2

kubectl get pods -o wide
NAME                    READY   STATUS        RESTARTS   AGE     IP           NODE           NOMINATED NODE   READINESS GATES
test-67bff98d4b-ch2ng   1/1     Terminating   0          2m38s   10.244.2.3   kind-worker    <none>           <none>
test-67bff98d4b-kqkcv   1/1     Running       0          37s     10.244.3.3   kind-worker2   <none>           <none>

kubectl describe node kind-worker | grep node.kubernetes.io
Taints:             node.kubernetes.io/unreachable:NoExecute
                    node.kubernetes.io/unreachable:NoSchedule

Not Working/Tested on k8s 1.19.1 (kind v0.9.0):

kubectl get nodes
NAME                 STATUS   ROLES    AGE     VERSION
kind-control-plane   Ready    master   4m37s   v1.19.1
kind-worker          Ready    <none>   4m1s    v1.19.1
kind-worker2         Ready    <none>   4m1s    v1.19.1

kubectl get pods -o wide
NAME                    READY   STATUS    RESTARTS   AGE   IP           NODE           NOMINATED NODE   READINESS GATES
test-5f8dd86d7c-hrlwg   1/1     Running   0          48s   10.244.1.2   kind-worker2   <none>           <none>

docker stop kind-worker2
kind-worker2

kubectl get nodes
NAME                 STATUS     ROLES    AGE     VERSION
kind-control-plane   Ready      master   5m46s   v1.19.1
kind-worker          Ready      <none>   5m10s   v1.19.1
kind-worker2         NotReady   <none>   5m10s   v1.19.1

kubectl get pods -o wide
NAME                    READY   STATUS    RESTARTS   AGE     IP           NODE           NOMINATED NODE   READINESS GATES
test-5f8dd86d7c-hrlwg   1/1     Running   0          3m36s   10.244.1.2   kind-worker2   <none>           <none>

kubectl describe node kind-worker2 | grep node.kubernetes.io
Taints:             node.kubernetes.io/unreachable:NoSchedule
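
The NoExecute taint is added by the node lifecycle controller in kube-controller-manager, so its logs are worth inspecting when debugging this; a sketch assuming kind's default static pod name for the controller manager:

kubectl -n kube-system logs kube-controller-manager-kind-control-plane | grep -iE 'taint|node_lifecycle'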

Using kubeadm

Please see my comment in another issue

Anything else we need to know?:

Please see the comment from @adampl confirming the bug here

Environment:

  • Kubernetes version (use kubectl version): 1.19.x
  • Cloud provider or hardware configuration: Bare-metal/Mac
  • OS (e.g: cat /etc/os-release): Ubuntu 20.04 / macOS Catalina
  • Kernel (e.g. uname -a): Linux captain1.k8s.someadmin.com 5.4.0-56-generic #62-Ubuntu SMP Mon Nov 23 19:20:19 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux
  • Install tools: kubeadm/kind
  • Network plugin and version (if this is a network-related bug): calico
  • Others:

About this issue

  • Original URL
  • State: closed
  • Created 4 years ago
  • Reactions: 4
  • Comments: 30 (7 by maintainers)

Most upvoted comments

I also encountered the same problem. This bug has the same root cause as #94183 and #96183, and there is an open PR to fix it: #96876.

I can confirm that on a new install of 1.20.1 using kubeadm the issue still occurs:

kubectl get nodes
NAME       STATUS     ROLES                  AGE   VERSION
captain1   Ready      control-plane,master   23m   v1.20.1
node1      Ready      <none>                 22m   v1.20.1
node2      Ready      <none>                 22m   v1.20.1
node3      NotReady   <none>                 22m   v1.20.1

kubectl describe node node3 | grep node.kubernetes.io
Taints:             node.kubernetes.io/unreachable:NoSchedule

@guzmonne It will appear in 1.21 and there are also backporting PRs for 1.19 (#98140) and 1.20 (#98168).

It will be fixed in 1.20.5 and 1.19.9 if those get merged. The target date is 2021-03-17 according to the detailed release history for active branches.

One thing I would like to re-mention: when I upgraded a kubeadm install from 1.18.12 to 1.19.4 on Ubuntu, the NoExecute taint was still added, but as described here, on new installs it is not. That seems rather strange.

I would also consider this quite a serious bug. I imagine most people are not impacted because they use 1.18 or the versions provided by cloud providers, but this breaks HA in some ways. Glad to see there is a PR to fix it, though.

I can reproduce it on my Ubuntu server, but not on the Mac.

I use kind with Docker 19.03.13:

root@ubuntu-master:~# kubectl get pods -o wide
NAME                    READY   STATUS    RESTARTS   AGE     IP           NODE           NOMINATED NODE   READINESS GATES
test-645b75c986-dlqsw   1/1     Running   0          86s     10.244.1.2   kind-worker2   <none>           <none>
test-645b75c986-ph2lv   1/1     Running   0          6m29s   10.244.2.2   kind-worker    <none>           <none>
root@ubuntu-master:~# kubectl get node
NAME                 STATUS     ROLES    AGE   VERSION
kind-control-plane   Ready      master   28m   v1.19.1
kind-worker          Ready      <none>   28m   v1.19.1
kind-worker2         NotReady   <none>   28m   v1.19.1
root@ubuntu-master:~# kubectl describe node kind-worker2 | grep node.kubernetes.io
Taints:             node.kubernetes.io/unreachable:NoSchedule
root@ubuntu-master:~#

After upgrading to v1.19.9 everything is normal now:

System Info:
  Machine ID:                 0865d62057404583a528f06969d7aa8d
  System UUID:                b2450642-5e01-3b93-e39f-9d937b0ee927
  Boot ID:                    ad14c78f-fc72-4095-a5d5-94e19ca68b2f
  Kernel Version:             5.4.0-65-generic
  OS Image:                   Ubuntu 20.04.2 LTS
  Operating System:           linux
  Architecture:               amd64
  Container Runtime Version:  cri-o://1.20.0
  Kubelet Version:            v1.19.9
  Kube-Proxy Version:         v1.19.9
root@20-81-31-k8s:~/k8s-cni-dashboard-metrics# kubectl get node -o wide
NAME           STATUS     ROLES           AGE   VERSION   INTERNAL-IP   EXTERNAL-IP   OS-IMAGE             KERNEL-VERSION     CONTAINER-RUNTIME
20-81-31-k8s   Ready      master,worker   11d   v1.19.9   10.20.81.31   <none>        Ubuntu 20.04.2 LTS   5.4.0-65-generic   cri-o://1.20.0
20-81-32-k8s   Ready      master,worker   11d   v1.19.9   10.20.81.32   <none>        Ubuntu 20.04.2 LTS   5.4.0-65-generic   cri-o://1.20.0
20-81-33-k8s   NotReady   master,worker   11d   v1.19.9   10.20.81.33   <none>        Ubuntu 20.04.2 LTS   5.4.0-65-generic   cri-o://1.20.0
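
To double-check the taint behaviour after the upgrade, the same command used earlier in this issue can be run against the NotReady node (it should now show the NoExecute taint as well):

kubectl describe node 20-81-33-k8s | grep node.kubernetes.io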


Until PR #98168 is merged, current releases of 1.20 won't include the fix. It should get done on the upcoming patch release dates.

1.18.12 has this issue too; can you cherry-pick the fix to 1.18?