kubernetes: [Failing Test] ci-kubernetes-e2e-gci-gce-serial is timing out

Which jobs are failing: ci-kubernetes-e2e-gci-gce-serial

Which test(s) are failing: Job is timing out.

Since when has it been failing: The timeouts started around 8/6 at 13:00 PDT k/k diff for context https://github.com/kubernetes/kubernetes/compare/16d9a659d...6049253aa

Testgrid link: https://testgrid.k8s.io/sig-release-master-blocking#gce-cos-master-serial

Reason for failure: These are the args passed to kubetest

Args: --job=ci-kubernetes-e2e-gci-gce-serial --service-account=/etc/service-account/service-account.json --upload=gs://kubernetes-jenkins/logs --timeout=520 --bare --scenario=kubernetes_e2e -- --check-leaked-resources --env=NODE_LOCAL_SSDS_EXT=1,scsi,fs --extract=ci/latest --gcp-master-image=gci --gcp-node-image=gci --gcp-zone=us-west1-b --provider=gce '--test_args=--ginkgo.focus=\[Serial\]|\[Disruptive\] --ginkgo.skip=\[Flaky\]|\[Feature:.+\] --minStartupPods=8' --timeout=500m

There is a timeout of 500m (~8 hours). The latest runs have latest 8+ hours https://prow.k8s.io/view/gcs/kubernetes-jenkins/logs/ci-kubernetes-e2e-gci-gce-serial/1159223580651687936

Everything else seems ok, all tests passed, https://prow.k8s.io/view/gcs/kubernetes-jenkins/logs/ci-kubernetes-e2e-gci-gce-serial/1159223580651687936#0:build-log.txt%3A45205

/milestone v1.16 /priority critical-urgent /kind failing-test /sig sig-storage /cc @kubernetes/sig-storage-test-failures /cc @Verolop @jimangel @soggiest @alenkacz

About this issue

  • Original URL
  • State: closed
  • Created 5 years ago
  • Comments: 16 (16 by maintainers)

Most upvoted comments

Looks like we’re under the 8hrs now that https://github.com/kubernetes/kubernetes/pull/81375 has merged

reducing the number of permutations of test cases

PR to reduce number of test permutations here https://github.com/kubernetes/kubernetes/pull/81375