cloud-builders: Ignore failed step?

Hi all,

The first step of my build is a docker pull of an image to be used in the --cache-from of the docker build step. If the docker pull fails because the image is not present the whole build will fail. That first step is really optional, the docker build will just run as normal without the cached image.

Is there any way to make the builder ignore a failed step which is not critical? In bash I would just write docker pull || true but I can’t seem to make it work here. Thanks!

About this issue

Original URL
State: closed
Created 6 years ago
Reactions: 10
Comments: 27 (6 by maintainers)

Most upvoted comments

Not only would we consider it, it is currently on our roadmap for prioritization.

+26

bendory on Apr 16, 2018

Maybe this issue should remain open until the feature is implemented?

+20

acasademont on Oct 23, 2018

@bendory is it still on your roadmap? Any updates on the story?

+16

selslack on Aug 6, 2019

Hey Albert,

We don’t have an integrated solution to skip a failed step, but you can always use bash:

- name: 'gcr.io/cloud-builders/docker'
  entrypoint: 'bash'
  args:
  - '-c'
  - |
    docker pull gcr.io/$PROJECT_ID/my-image || exit 0

+13

Philmod on Apr 16, 2018

Hey. I would like to add an use case for this feature. We’re using caching feature from Docker. To do that, we need to pull an image first. But it doesn’t exist for new images. So simply speaking, it’s a “chicken & egg” problem. Example below:

steps:
  - name: gcr.io/cloud-builders/docker
    id: pull_image_A_cache
    args:
      - pull
      - gcr.io/${PROJECT_ID}/image_A:latest

  - name: gcr.io/cloud-builders/docker
    id: build_image_A
    args:
      - build
      - --file=Dockerfile
      - --tag=gcr.io/${PROJECT_ID}/image_A:latest
      - --cache-from
      - gcr.io/${PROJECT_ID}/image_A:latest
      - .

If we could allow the first step to fail, that would be cool 😃

gruzewski on Dec 7, 2018

This would indeed be a nice feature. Also what I’d love to see implemented is a post build action (like https://jenkins.io/doc/book/pipeline/syntax/#post ) which would allow to run some steps after the build has finished/failed (e.g. uploading the test reports to object storage so the failure could be better analyzed).

severi on Mar 4, 2019

I will add my perspective on the utility of allowing a failed step.

Here is a practical real-world example:

Build infrastructure
Run infrastructure tests
Tear down infrastructure

If I fail the test step I won’t remove the infrastructure I created in the first step and will continue being billed. I need to have a step that tears the deployed resources down regardless of the result of the prior steps. Without allowing the tests to fail as a warning, and drawing attention to that step in the process, I end up in a lose-lose scenario. I am either hiding an important piece of information to get the utility I need, or I am being billed for things that are broken. I am sure this extends to applications as well and not just the infrastructure example.

I propose a config setting allow_failure at the step level to treat a failure as a warning and continue the pipeline which should also change the visual language of the resulting step in the logs. Ideally we could also have a way to force a step to always run.

Gitlab-CI has an example of this capability in multiple forms: Allow a step to fail and treat it as a warning: https://docs.gitlab.com/ee/ci/yaml/README.html#allow_failure Always run a step regardless of prior steps (treats a failure as a failure instead of a warning, but allows cleanup type operations): https://docs.gitlab.com/ee/ci/yaml/README.html#when

tfsparks on Feb 18, 2019

This should be handled as a feature request.

niedbalski on Mar 28, 2019

Would love to have this feature built in. Unfortunately, don’t see a lot of activity happening here…

0x962 on Feb 13, 2019

I found a simple (but a bit violent) workaround - build cancel itself if some condition matches.

  - name: 'gcr.io/cloud-builders/gcloud'
    id: 'Cancel current build if on master'
    entrypoint: 'sh'
    args:
      - '-c'
      - |
        test $BRANCH_NAME = "master" && gcloud builds cancel $BUILD_ID > /dev/null || true

Note the service account that runs the builds (xxx@cloudbuild.gserviceaccount.com) should have appropriate permissions to cancel build.

ReallyLiri on Nov 13, 2019

Is this tracked in Google’s issuetracker somewhere?

bbhoss on Feb 25, 2019

I’ve been working around this with the above suggestions. We have multiple different test steps (unit, intergration etc) that we run in parallel and if we don’t use the workaround of returning a exit code of 0 the pipeline simply exits without waiting for the rest of the parallel steps to return an exit code themselves.

An unfortunate side affect of the workaround is that I have to have a step that ‘checks’ the test result, and it becomes verbose and hard for people to work out what has actually failed.

Really wish there were some configuration options around this as others have mentioned.

NullPrice on Jan 21, 2019

That’s a nice workaround indeed, thanks @Philmod

I’m wondering if this could be something you would consider as a feature request maybe? That is, the possibility of defining an allow_failure: true option on a build step.

- name: 'gcr.io/cloud-builders/docker'
  args: ['pull', 'gcr.io/$PROJECT_ID/my-image']
  allow_failure: true

I believe this should be pretty easy to implement

acasademont on Apr 16, 2018

@bbhoss I just saw your question and created one here: https://issuetracker.google.com/issues/128353446

Feel free to star it as it doesn’t appear Github issues are getting much love. Re-reading the contribution info it looks like issues on Github are supposed to be specifically about the builder containers, not the Cloud Build tool itself.

tfsparks on Mar 12, 2019

no updates; still on the list, but given the ease of work-around, it hasn’t yet bubbled up as a priority

bendory on Jun 13, 2018

Any progress / updates on this matter? The workaround isn’t really an option for my use case…

gijswobben on Jan 31, 2019

@bendory I know the “entrypoint” key is documented on this page, but I think it would be helpful if this workaround were explicitly mentioned near the top of https://cloud.google.com/cloud-build/docs/create-custom-build-steps, before it talks about building custom images. This should be the first thing most people turn to when they want to execute a bit of bash, and I discovered it here in this issue (though, on retrospect, it is obvious).

jeremyjh on Sep 11, 2018

@bendory this workaround doesn’t allow one to see how the build went if you’re using cloud builder for CI. It would be great to see that the build failed without stopping all the other steps. Is the plan when skipping failed steps to show the build as failed? Do you have any recommendation for running tests on many packages and seeing which packages failed without short circuiting?

emarx on Jul 19, 2018