vision: CI fails on windows: ci/circleci: unittest_windows_cpu_pyX.Y

Tests on Windows have started failing:

test/test_models.py::test_classification_model[cpu-regnet_y_16gf] PASSED [ 79%]
test/test_models.py::test_classification_model[cpu-regnet_y_32gf] PASSED [ 79%]
test/test_models.py::test_classification_model[cpu-regnet_y_128gf] 

Exited with code exit status 127

CircleCI received exit code 127

It started appearing on PyTorch core nightly 20220622

cc @pmeier @seemethere

About this issue

  • State: closed
  • Created 2 years ago
  • Comments: 18 (13 by maintainers)

Most upvoted comments

@vfdev-5 the failure is suspicious because it’s on a very large model. Can you try skipping the specific test to see whether this is related to issues on the CircleCI side rather than in core? Another thing we can do to confirm that core is not the issue is to pin the nightly to the one before and rerun the job. If it still fails, we will know it’s CircleCI.

@YosuaMichael just to confirm: if we run everything locally it does not fail, right? Only CircleCI is failing every time?

On my MacBook it does not fail, but I think this is expected (on CircleCI only the Windows job is failing, probably because of a resource problem such as memory).

That’s the problem with very large models like that. They often cause random memory issues. If you send a PR that adds a list of such models and skips them (similar to what you have for the GPU), I’ll be happy to review it. Basically we should turn off the specific test and recover our CI.
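A minimal sketch of what such a skip list could look like, assuming a hypothetical set `BIG_MODELS_SKIPPED_ON_CPU_CI` and a hypothetical helper `skip_reason`. Neither name comes from torchvision, and the single entry is just the model from the failing log above, not the list that was actually merged.

```python
import sys

# Hypothetical skip list: illustrative only, seeded with the model from the log.
BIG_MODELS_SKIPPED_ON_CPU_CI = {"regnet_y_128gf"}


def skip_reason(model_name, device, on_ci, platform=sys.platform):
    """Return a reason string if the test should be skipped, else None."""
    # Local runs keep full coverage; only the CI workers are resource-limited.
    if not on_ci:
        return None
    # Only the Windows CPU job was reported as failing.
    if device != "cpu" or not platform.startswith("win"):
        return None
    if model_name in BIG_MODELS_SKIPPED_ON_CPU_CI:
        return f"{model_name} is too large for the Windows CPU CI worker"
    return None
```

Inside a parametrized test, the returned reason could be passed to `pytest.skip(reason)` when it is not `None`, so the skip shows up in the CI report with an explanation.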

I have confirmed that skipping the big models does indeed make the CI green again. Now, in the same PR #6195, I am using the older nightly version (torch==1.13.0.dev20220621) to check whether this issue is caused by core or by CircleCI.
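When comparing runs like this, it can help to check which nightly a job actually installed. The sketch below is one possible way to do that, assuming the version string follows the `X.Y.Z.devYYYYMMDD` pattern quoted in this thread. `SUSPECT_NIGHTLY`, `nightly_date`, and `predates_suspect` are hypothetical names used only for illustration.

```python
import re

# First nightly on which the failure was reported in this issue.
SUSPECT_NIGHTLY = 20220622


def nightly_date(version):
    """Extract YYYYMMDD from a version like '1.13.0.dev20220621', else None."""
    match = re.search(r"\.dev(\d{8})", version)
    return int(match.group(1)) if match else None


def predates_suspect(version):
    """True if the version is a nightly built before the suspect nightly."""
    date = nightly_date(version)
    return date is not None and date < SUSPECT_NIGHTLY
```

In a CI job, the value of `torch.__version__` could be passed to `predates_suspect` to confirm that the pin took effect before interpreting the test results.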

@YosuaMichael actually, tests are still failing on #5009. I am reopening the issue.

Ah yeah, previously I just reran the test and it seemed green. But it got the error after I updated the branch. Sorry for the false negative @vfdev-5!
