kubeadm: CoreDNS Crashloop Error - 1.12.1

Is this a BUG REPORT or FEATURE REQUEST?

BUG REPORT

Versions

kubeadm version (use kubeadm version): 1.12.1

Environment:

  • Kubernetes version (use kubectl version): 1.12.1
  • Cloud provider or hardware configuration: Local Laptop
  • OS (e.g. from /etc/os-release): Ubuntu 18.04
  • Kernel (e.g. uname -a): 4.15.0-36-generic
  • Others:

What happened?

After kubeadm init and installing Calico, the CoreDNS pods never recover from CrashLoopBackOff and settle into an Error state.

What you expected to happen?

The same installation method works on 1.11.0, and all pods reach the Running state.

How to reproduce it (as minimally and precisely as possible)?

Install the latest Kubernetes (1.12.1) via kubeadm and install Calico, as sketched below.
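A minimal sketch of the setup, assuming Calico's default pod CIDR; the Calico manifest path is a placeholder for whatever your Calico release documents:

$ sudo kubeadm init --pod-network-cidr=192.168.0.0/16
$ mkdir -p $HOME/.kube
$ sudo cp /etc/kubernetes/admin.conf $HOME/.kube/config
$ sudo chown $(id -u):$(id -g) $HOME/.kube/config
$ kubectl apply -f <calico-manifest.yaml>
$ kubectl get pods -n kube-system -w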

Anything else we need to know?

About this issue

  • State: closed
  • Created 6 years ago
  • Reactions: 2
  • Comments: 45 (9 by maintainers)

Most upvoted comments

@bravinash, please confirm that the issue is the same on your setup.
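A quick way to check (a sketch; the k8s-app=kube-dns label is the default for the kubeadm-deployed CoreDNS): on Ubuntu 18.04, systemd-resolved usually puts its 127.0.0.53 stub resolver into /etc/resolv.conf, which makes CoreDNS forward queries back to itself.

$ cat /etc/resolv.conf
$ kubectl -n kube-system logs -l k8s-app=kube-dns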

If it’s the same, you can try pointing the kubelet at the original resolv.conf this way:

$ echo -e '[Service]\nEnvironment="KUBELET_EXTRA_ARGS=--resolv-conf=/run/systemd/resolve/resolv.conf"\n' | sudo tee /etc/systemd/system/kubelet.service.d/99-local.conf
$ sudo systemctl daemon-reload
$ sudo systemctl restart kubelet

and remove coredns pods:

$ kubectl get pods -n kube-system -o name | grep coredns | xargs kubectl delete -n kube-system

They will be recreated and pick up the new configuration.

At least this fixed the issue in my setup:

$ kubectl get pods -n kube-system
NAME                                READY   STATUS    RESTARTS   AGE
calico-node-cpsqz                   2/2     Running   0          5m20s
coredns-576cbf47c7-2prpv            1/1     Running   0          18s
coredns-576cbf47c7-xslbx            1/1     Running   0          18s
etcd-ed-ipv6-1                      1/1     Running   0          17m
kube-apiserver-ed-ipv6-1            1/1     Running   0          17m
kube-controller-manager-ed-ipv6-1   1/1     Running   0          17m
kube-proxy-8dkqt                    1/1     Running   0          17m
kube-scheduler-ed-ipv6-1            1/1     Running   0          17m

I shared the solution that has worked for me here: https://stackoverflow.com/a/53414041/1005102

Try CoreDNS 1.4.0 to see if it behaves any differently. This could be related to the glog/klog issue we fixed in 1.4.0.
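One quick way to test that (a sketch; in a default kubeadm install both the Deployment and its container are named coredns) is to bump the image in place and watch the rollout:

$ kubectl -n kube-system set image deployment/coredns coredns=coredns/coredns:1.4.0
$ kubectl -n kube-system rollout status deployment/coredns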

@utkuozdemir

Run ps auxww | grep kubelet and you might see a line like: /usr/bin/kubelet … --resolv-conf=/run/systemd/resolve/resolv.conf

Alternatively, cat /var/lib/kubelet/kubeadm-flags.env; kubeadm writes the --resolv-conf flag to that file each time it runs init and join.
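For illustration, the file looks roughly like this (the exact flags vary by setup and kubeadm version):

$ cat /var/lib/kubelet/kubeadm-flags.env
KUBELET_KUBEADM_ARGS=--cgroup-driver=systemd --network-plugin=cni --resolv-conf=/run/systemd/resolve/resolv.conf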

I will close this issue for now as this is not reproducible anymore.

I’d propose closing this issue for two reasons:

  • it’s more of a CoreDNS issue than a kubeadm issue
  • it’s no longer reproducible

@bravinash you can reopen this issue if you see it again

On my system, the original resolv.conf also triggered the same issue. It started working after I copied it to another file, removed one of the three nameserver lines, and changed the --resolv-conf option in /var/lib/kubelet/kubeadm-flags.env to point at the copy.
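Roughly, as a sketch (the /etc/kubelet-resolv.conf path is just an example; back up the files before editing):

$ sudo cp /run/systemd/resolve/resolv.conf /etc/kubelet-resolv.conf

Then delete one of the nameserver lines from /etc/kubelet-resolv.conf, point the kubelet at the copy, and restart it:

$ sudo sed -i 's|--resolv-conf=[^ "]*|--resolv-conf=/etc/kubelet-resolv.conf|' /var/lib/kubelet/kubeadm-flags.env
$ sudo systemctl restart kubelet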

Anyway, we need confirmation from @bravinash that the issue is the same. It doesn’t look like a kubeadm issue to me.