spark-on-k8s-operator: Job never comes up and fails with driver pod not found
Sometimes the creation request succeeds and then the job never comes up and doing a describe on the spark application shows
SparkApplicationFailed 6m54s spark-operator SparkApplication log-validation failed: Driver Pod not found
We’ve seen this happen about 25% of the time. Any advice ?
About this issue
- Original URL
- State: closed
- Created 4 years ago
- Comments: 38 (19 by maintainers)
Commits related to this issue
- Fixed the spec update issue in #795 — committed to liyinan926/spark-on-k8s-operator by liyinan926 4 years ago
- Fixed the spec update issue in #795 (#804) — committed to GoogleCloudPlatform/spark-on-k8s-operator by liyinan926 4 years ago
- Revert "Fixed the spec update issue in #795 (#804)" This reverts commit 1687c0647c11a5c64d6eeb808522fae75051ad6a. — committed to GoogleCloudPlatform/spark-on-k8s-operator by liyinan926 4 years ago
- Revert "Fixed the spec update issue in #795 (#804)" (#805) This reverts commit 1687c0647c11a5c64d6eeb808522fae75051ad6a. — committed to GoogleCloudPlatform/spark-on-k8s-operator by liyinan926 4 years ago
- Fixed the spec update issue in #795 (#804) — committed to nokia/spark-on-k8s-operator by liyinan926 4 years ago
- Revert "Fixed the spec update issue in #795 (#804)" (#805) This reverts commit 1687c0647c11a5c64d6eeb808522fae75051ad6a. — committed to nokia/spark-on-k8s-operator by liyinan926 4 years ago
- Fixed the spec update issue in #795 (#804) — committed to nokia/spark-on-k8s-operator by liyinan926 4 years ago
- Revert "Fixed the spec update issue in #795 (#804)" (#805) This reverts commit 1687c0647c11a5c64d6eeb808522fae75051ad6a. — committed to nokia/spark-on-k8s-operator by liyinan926 4 years ago
Still seeing the same issue.