kops: etcd-manager seems to pick up incorrect etcdv3 version
1. What kops version are you running? The command kops version, will display
this information.
kops 1.12
2. What Kubernetes version are you running? kubectl version will print the
version if a cluster is running or provide the Kubernetes version specified as
a kops flag.
1.12.7
3. What cloud provider are you using?
AWS
4. What commands did you run? What is the simplest way to reproduce this issue?
First created a cluster with etcd 3.1.11. Then, per the upgrade docs, migrated to 3.2.18. Confirmed the etcd pods were indeed running 3.2.18. Rolled the masters fast, following the guide.
5. What happened after the commands executed?
The masters did not come online. In /var/log/etcd.log I see entries like this:
I0515 10:04:21.595477 3021 etcdserver.go:553] overriding clientURLs with [http://etcd-a.internal.foo.bar:4001] (state had [http://0.0.0.0:4001])
I0515 10:04:21.595491 3021 etcdserver.go:557] overriding quarantinedClientURLs with [http://etcd-a.internal.foo.bar:3994] (state had [http://0.0.0.0:3994])
W0515 10:04:21.595499 3021 pki.go:46] not generating peer keypair as peers-ca not set
W0515 10:04:21.595504 3021 pki.go:84] not generating client keypair as clients-ca not set
W0515 10:04:21.595527 3021 etcdserver.go:92] error running etcd: unknown etcd version v3.1.11: not found in [/opt/etcd-v3.1.11-linux-amd64]
Again, I confirm that the previous version running was 3.2.18; this is also mentioned in older etcd.log files.
The masters come back online when changing from etcd-manager to legacy.
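To see at a glance which version etcd-manager is complaining about, the error line can be pulled out of the log. This is only a sketch: the function name `unknown_etcd_versions` is made up for illustration, and it assumes the message format is exactly the one shown in the log excerpt above.

```shell
#!/usr/bin/env bash
# Print the distinct etcd versions reported as "unknown etcd version vX.Y.Z".
# The log path is an argument so the function can be tried on a copy of the log.
# Assumption: the message format matches the excerpt quoted in this issue.
unknown_etcd_versions() {
  local log_file="$1"
  # Match e.g. "unknown etcd version v3.1.11:" and keep only "3.1.11".
  sed -n 's/.*unknown etcd version v\([0-9][0-9.]*\):.*/\1/p' "$log_file" | sort -u
}

# Example: unknown_etcd_versions /var/log/etcd.log
```

If this prints 3.1.11 while the pods were running 3.2.18, it matches the mismatch described in this report.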
6. What did you expect to happen?
I expected etcd-manager to recognize that 3.2.18 is in use and to complete the migration to etcd-manager.
About this issue
- State: closed
- Created 5 years ago
- Comments: 33 (19 by maintainers)
Commits related to this issue
- Support restoring from compatible minor version We don't ship every version of etcd; but we can usually use a version with the same major/minor to restore. We start a "safelist" for using one of the... — committed to justinsb/etcd-manager by justinsb 5 years ago
- Support restoring from compatible minor version We don't ship every version of etcd; but we can usually use a version with the same major/minor to restore. We start a "safelist" for using one of the... — committed to justinsb/etcdadm by justinsb 5 years ago
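The idea in the commit message (fall back to a shipped version with the same major/minor) can be sketched roughly as below. This is not etcd-manager's code: `pick_compatible_etcd` is a hypothetical helper, the version numbers in the example call are illustrative only, and a plain major.minor match stands in for the "safelist" the commit mentions.

```shell
#!/usr/bin/env bash
# Hypothetical helper, not taken from etcd-manager: pick a shipped etcd version
# that can stand in for the requested one.
# Usage: pick_compatible_etcd WANTED SHIPPED...
pick_compatible_etcd() {
  local wanted="$1"; shift          # e.g. 3.1.11
  local wanted_mm="${wanted%.*}"    # strip the patch part, e.g. 3.1
  local candidate
  # Prefer an exact match when it is available.
  for candidate in "$@"; do
    if [ "$candidate" = "$wanted" ]; then
      printf '%s\n' "$candidate"
      return 0
    fi
  done
  # Otherwise fall back to the first version with the same major.minor.
  for candidate in "$@"; do
    if [ "${candidate%.*}" = "$wanted_mm" ]; then
      printf '%s\n' "$candidate"
      return 0
    fi
  done
  # Nothing compatible is shipped.
  return 1
}

# Example with made-up shipped versions:
# pick_compatible_etcd 3.1.11 2.2.1 3.1.12 3.2.18
```

With the made-up list in the example, the call prints 3.1.12; with no 3.1.x entry in the list it returns a non-zero status, which corresponds to the "not found" error reported in this issue.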
I have the same problem.
Interestingly, we upgraded around 10 small pre-production clusters without a single issue. When we got around to upgrading clusters in production, we hit this problem on the first one.
We had EBS snapshots and were able to roll back, and on a second attempt it worked.
Answering my own question: once you upgrade all master nodes simultaneously, all you need to do is:
cd /mnt/master-vol-02e4f7fb71a78b634/
rm -rf state data-trashcan
reboot
Issue
err (dial tcp 127.0.0.1:4001: connect: connection refused)
Solution
Same as quoted at the beginning, with the following versions though:
👉 on a second attempt it worked ✔️ 🏁
Considerations
>= 1.12.x.
>= 1.13.x), most probably has already been solved.
Ref-Link: https://github.com/kubernetes/kops/blob/master/docs/etcd3-migration.md
CC: @diego-ojeda-binbash @gdmlnx
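For anyone wary of running `rm -rf` on a production master volume, a more cautious variant of the cleanup quoted above is to move the two directories aside before rebooting. This is a sketch under assumptions: `set_aside_etcd_manager_state` is a made-up name, moving instead of deleting is a substitution for the commands in the comment, and the thread does not say what `state` and `data-trashcan` contain, so take an EBS snapshot first as mentioned earlier.

```shell
#!/usr/bin/env bash
# Move the directories named in the comment out of the way instead of deleting them.
# volume_dir is the mounted master volume on the affected master
# (the comment above uses /mnt/master-vol-02e4f7fb71a78b634).
set_aside_etcd_manager_state() {
  local volume_dir="$1"
  local backup_dir
  backup_dir="$volume_dir/backup-$(date +%Y%m%d%H%M%S)"
  local name
  mkdir -p "$backup_dir" || return 1
  for name in state data-trashcan; do
    # Skip entries that do not exist on this volume.
    if [ -e "$volume_dir/$name" ]; then
      mv "$volume_dir/$name" "$backup_dir/" || return 1
    fi
  done
  # Print where the old directories went so they can be restored or removed later.
  printf '%s\n' "$backup_dir"
}

# Example (run as root on the master, then reboot as in the comment):
# set_aside_etcd_manager_state /mnt/master-vol-02e4f7fb71a78b634
```

The function leaves every other entry on the volume untouched and does not reboot by itself.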