kubernetes: Investigate node-kubelet-conformance test failures
Which jobs are failing:
node-kubelet-conformance suite
Which test(s) are failing:
The tests are failing in the BeforeSuite.
Since when has it been failing:
10/23/2020
Testgrid link:
https://testgrid.k8s.io/sig-node-kubelet#node-kubelet-conformance
Reason for failure:
It started failing at commit 237dae5a5, but it’s unknown if it’s related to that.
Anything else we need to know:
We discussed in the Kubernetes SIG-Node CI subgroup and will start taking a look at it.
About this issue
- State: closed
- Created 4 years ago
- Comments: 28 (14 by maintainers)
looking green!
I can repro the issue now, and I think the line was moved upward to generate the token before starting the kubelet. Moving it up out of the if-statement may help partially, but in the conformance test the kubelet is manually started ahead of this code, so it may not matter.
You’ll also need to use the --test-suite=conformance flag to make sure it runs in Docker, I believe.
Quick interruption for instructions on how to reproduce this one. I think this may work
Focus and skip
The FOCUS and SKIP arguments I'm getting from https://github.com/kubernetes/test-infra/blob/48f2834380836b283d700833975a8162b392dfe4/config/jobs/kubernetes/sig-node/node-kubelet.yaml#L95
Image config file
The job also uses the following config file:
https://github.com/kubernetes/test-infra/blob/48f2834380836b283d700833975a8162b392dfe4/config/jobs/kubernetes/sig-node/node-kubelet.yaml#L91
https://github.com/kubernetes/test-infra/blob/master/jobs/e2e_node/image-config.yaml
Make sure to copy it and name it node-test.yaml (the argument to IMAGE_CONFIG_FILE above).
Additional arguments
The test args to this job are https://github.com/kubernetes/test-infra/blob/48f2834380836b283d700833975a8162b392dfe4/config/jobs/kubernetes/sig-node/node-kubelet.yaml#L92
I think these are the defaults already, but just in case, it may be useful to add the following to the above command
Miscellaneous
If you run into trouble with the above commands, try going through https://github.com/contributing-to-kubernetes/gnosis/tree/master/stories/e2e-node-tests
The info here came from https://github.com/kubernetes/community/blob/master/contributors/devel/sig-node/e2e-node-tests.md
Catching up on this. qq: has anyone vetted https://github.com/kubernetes/kubernetes/pull/94723/files ? I see that PR went in around the time this job began to fail: https://github.com/kubernetes/kubernetes/compare/1fcd02cc2...237dae5a5
The BeforeSuite (AI: we should list out what it does) fails to communicate with nodes.
The mention of port 6443 looks interesting here.