moby: [1.13.0-rc3 -- Compose V3] 'network not found' on re-creating previously created stack

Steps to reproduce the issue:

$ docker deploy --compose-file docker-compose.yml pyweb
Creating network pyweb_pyweb
Creating network pyweb_default
Creating service pyweb_redis
Creating service pyweb_web

$ docker stack rm pyweb
Removing service pyweb_web
Removing service pyweb_redis
Removing network pyweb_default
Removing network pyweb_pyweb

$ docker deploy --compose-file docker-compose.yml pyweb
Creating network pyweb_default
Creating service pyweb_redis
Error response from daemon: network pyweb_pyweb not found

Ouch, that’s odd, and worse yet, if you just wait a while it seems to work again.

$ docker deploy --compose-file docker-compose.yml pyweb
Creating network pyweb_pyweb
Creating service pyweb_web
Creating service pyweb_redis

Does not exactly inspire me with operational confidence…

Describe the results you expected:

It should not complain about network not being found… It’s intended to create it

Additional information you deem important (e.g. issue happens only occasionally):

Output of docker version:

$ docker version
Client:
 Version:      1.13.0-rc3
 API version:  1.25
 Go version:   go1.7.3
 Git commit:   4d92237
 Built:        Tue Dec  6 01:15:44 2016
 OS/Arch:      darwin/amd64

Server:
 Version:      1.13.0-rc3
 API version:  1.25 (minimum version 1.12)
 Go version:   go1.7.3
 Git commit:   4d92237
 Built:        Tue Dec  6 01:15:44 2016
 OS/Arch:      linux/amd64
 Experimental: true

Output of docker info:

$ docker info
Containers: 54
 Running: 2
 Paused: 0
 Stopped: 52
Images: 146
Server Version: 1.13.0-rc3
Storage Driver: aufs
 Root Dir: /var/lib/docker/aufs
 Backing Filesystem: extfs
 Dirs: 330
 Dirperm1 Supported: true
Logging Driver: json-file
Cgroup Driver: cgroupfs
Plugins:
 Volume: local
 Network: bridge host macvlan null overlay
Swarm: active
 NodeID: ahf9hjo0vkjx9fb63rvne5350
 Is Manager: true
 ClusterID: jn37ze7eaxcwe0e313t4jsvbn
 Managers: 1
 Nodes: 1
 Orchestration:
  Task History Retention Limit: 5
 Raft:
  Snapshot Interval: 10000
  Number of Old Snapshots to Retain: 0
  Heartbeat Tick: 1
  Election Tick: 3
 Dispatcher:
  Heartbeat Period: 5 seconds
 CA Configuration:
  Expiry Duration: 3 months
 Node Address: 192.168.65.2
 Manager Addresses:
  192.168.65.2:2377
Runtimes: runc
Default Runtime: runc
Init Binary: docker-init
containerd version: 03e5862ec0d8d3b3f750e19fca3ee367e13c090e
runc version: 51371867a01c467f08af739783b8beafc154c4d7
init version: 949e6fa
Security Options:
 seccomp
  Profile: default
Kernel Version: 4.8.12-moby
Operating System: Alpine Linux v3.4
OSType: linux
Architecture: x86_64
CPUs: 4
Total Memory: 1.952 GiB
Name: moby
ID: JNQG:SSZK:VLFH:2ZGU:YLS5:NGPA:R6Z6:Z2PO:HJOA:V5YI:2AYZ:5JRX
Docker Root Dir: /var/lib/docker
Debug Mode (client): false
Debug Mode (server): true
 File Descriptors: 58
 Goroutines: 166
 System Time: 2016-12-10T00:44:17.58671518Z
 EventsListeners: 3
No Proxy: *.local, 169.254/16
Username: nathanleclaire
Registry: https://index.docker.io/v1/
Experimental: true
Insecure Registries:
 127.0.0.0/8
Live Restore Enabled: false

Additional environment details (AWS, VirtualBox, physical, etc.):

Docker For Mac latest beta

About this issue

Original URL
State: open
Created 8 years ago
Reactions: 37
Comments: 41 (6 by maintainers)

Commits related to this issue

Waits until the service & network are deleted Avoids problem reported in https://github.com/moby/moby/issues/29293 Uses the solution from https://github.com/moby/moby/issues/29293#issuecomment-318... — committed to Archi-Lab/archilab-jenkins by rudolfgrauberger 5 years ago

Most upvoted comments

Is still happening in docker version 17.09.0-ce

+19

alexrun on Oct 22, 2017

@biels You just need to explicitly wait for the services and networks to be removed before deploying again. Here’s an example of how I’m doing i (from #30942).

until [ -z "$(docker service ls --filter label=com.docker.stack.namespace=$COMPOSE_PROJECT_NAME -q)" ] || [ "$limit" -lt 0 ]; do
  sleep 1;
done

until [ -z "$(docker network ls --filter label=com.docker.stack.namespace=$COMPOSE_PROJECT_NAME -q)" ] || [ "$limit" -lt 0 ]; do
  sleep 1;
done

The $COMPOSE_PROJECT_NAME is whatever you named the stack. The two loops run until there are no services and networks belonging to the named stack. It’s generally safe to re-deploy once that’s true.

+14

kinghuang on Jul 29, 2017

This issue is over two years old, curious why it gets no attention or replies from code owners?

+10

thernstig on Feb 15, 2019

I have the same issue with docker 18.06.0-ce

alelk on Aug 6, 2018

Still in Docker version 18.09.1

javieryanez on Jan 16, 2019

I have my vm restarted and the problem solved.

ijse on Dec 20, 2017

It seems like this is probably because swarm mode is eventually consistent. When you run the network rm it returns before the network is fully removed. So if you attempt to deploy again, you might find the network that is being removed.

Do you think that’s the case?

Could be, but why would a rm request update “desired” state and return immediately, whereas a create request would actually refuse to go through if the network already exists (i.e., check actual state)?

Shouldn’t the creation of a network also modify only “desired” state and let swarm mode converge the actual state in the background?

nathanleclaire on Dec 16, 2016

Still in Docker version 18.09.0, build 4d60db4

storytime on Nov 8, 2018

Reproduced on RC5

docker stack rm project
Removing service project_service_a
Removing service project_service_b
Removing service project_service_c
Removing network project_default

docker stack deploy --compose-file stack.yml project
Creating service project_service_a
Error response from daemon: network project_default not found

As trying again work that’s clear to me the creation check is done before network actually deleted. Anything I could try to help?

mostolog on Jan 12, 2017

Is still happening in docker version 18.03.1-ce

jortkoopmans on Jul 2, 2018

@dnephin - there are cases in which we would like to rm + redeploy for example on DEV and QA environments (as oppose to production in which we redeploy on top of existing deployment). The result is interrupted work processed for developers and automated testing.

eyal-lupu on Feb 17, 2017

I tried this, it worked for me

Inspect the network which we are unable to delete

docker network inspect <id> or <name>

Disconnect the network

docker network disconnect -f <networkID> <endpointName> or <endpointId>

Delete unused networks

docker network prune

JinnaBalu on Mar 10, 2021

I am using Docker for Mac, this consistently happens for a year in my dev env.

overmike on Feb 27, 2019

This happens consistently on Linux with docker-ce 19.03.2

alexdelorenzo on Sep 27, 2019

Just saw this on Windows 2019 with Docker 18.09.6. I’m going to try to create the networks before deploying the stack with ‘docker network create’ and reference them as external in the stack file.

drnybble on Jun 28, 2019

Yes i have taken a look in the logs of docker for windows and the docker client send the rest call /DELETE network and get the response but the deletion is asynchronous. I think the network is removed when all services are removed

sylvainmouquet on Aug 31, 2017

Thanks @kinghuang. Just implemented this logic in a gradle buildscript. I share it in case it can help someone because I think it is a sensible thing to want to do (be able to redeploy your dev environments in one command):

task undeployDev(type: Exec, group: 'application'){
    commandLine 'docker', 'stack', 'rm', 'sm'
}

private void waitForRemoval(String dockerEntity) {
    while (true) {
        ByteArrayOutputStream stdout = new ByteArrayOutputStream()
        exec {
            commandLine 'docker', dockerEntity, 'ls', '--filter', 'label=com.docker.stack.namespace=sm', '-q'
            standardOutput = stdout;
        }
        def length = stdout.toString().length()
        if (length == 0) break;
    }
}

task deployDev(type: Exec, group: 'application'){
    dependsOn ':sm-manager:buildDocker', ':sm-worker:buildDocker'
    mustRunAfter undeployDev
    doFirst{
        println 'Waiting for stack to finish undeploying...'
        waitForRemoval('service')
        waitForRemoval('network')
        println 'Stack undeploy finished'
    }
    commandLine 'docker', 'stack', 'deploy', '-c', 'docker-compose.yml', 'sm'
}

task redeployDev(group: 'application'){
    dependsOn undeployDev, deployDev
}

In this case it works with a stack named sm declared in a docker-compose.yml file at the root of the project but you can adapt it to your needs.

biels on Jul 29, 2017

Why are you using stack rm for deployments? It should be just another stack deploy to send updates.

dnephin on Feb 15, 2017

@eyal-lupu In my automated deployments, I explicitly wait for services and networks to be removed before redeploying. There’s a basic example of this in #30942 (the two until blocks). I have them between the docker stack rm … and docker stack deploy … commands.

kinghuang on Feb 15, 2017