ClickHouse: Integration tests fail due to docker-compose pull timeout
Sometimes, 900 seconds is not enough for docker to pull the images, so:
- maybe there are some problems with http://dockerhub-proxy.dockerhub-proxy-zone:5000/?
- or just with CI infrastructure?
- or maybe it is worth simply increasing this timeout and enabling --debug mode for dockerd (maybe it will produce more logs, e.g. about retries or something)? But I doubt that this is a good idea, since the tests would then take even longer.
@Felixoid what do you think?
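To make the timeout trade-off above concrete, here is a minimal sketch of a pull wrapper that keeps the per-attempt timeout at 900 seconds but retries a few times instead of failing on the first timeout. This is not how the ClickHouse CI actually runs the pull; the helper name `run_with_retries`, the attempt count, the backoff, and the `compose.yml` path are all assumptions for illustration.

```python
import subprocess
import time


def run_with_retries(cmd, timeout_s=900, attempts=3, backoff_s=5.0):
    """Run cmd; retry on timeout or non-zero exit.

    Returns the 1-based number of the attempt that succeeded.
    Raises RuntimeError if every attempt fails.
    """
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            # check=True raises CalledProcessError on a non-zero exit code;
            # timeout raises TimeoutExpired and kills the child process.
            subprocess.run(cmd, check=True, timeout=timeout_s)
            return attempt
        except (subprocess.TimeoutExpired, subprocess.CalledProcessError) as exc:
            last_error = exc
            if attempt < attempts:
                # Linear backoff between attempts (hypothetical policy).
                time.sleep(backoff_s * attempt)
    raise RuntimeError(f"{cmd!r} failed after {attempts} attempts") from last_error


# Hypothetical usage (compose.yml is a placeholder path):
# run_with_retries(["docker-compose", "--file", "compose.yml", "pull"])
```

Note that the worst case here is roughly attempts times the timeout, so this has the same downside as simply raising the timeout: a job that is going to fail anyway takes longer to fail.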
Examples:
- https://s3.amazonaws.com/clickhouse-test-reports/56030/331f661322ee0b12ec41cec8cba36b9973a6aa5a/integration_tests__asan__analyzer__[1_6].html
- https://s3.amazonaws.com/clickhouse-test-reports/56030/331f661322ee0b12ec41cec8cba36b9973a6aa5a/integration_tests__asan__[6_6].html
- https://s3.amazonaws.com/clickhouse-test-reports/56030/331f661322ee0b12ec41cec8cba36b9973a6aa5a/integration_tests__asan__analyzer__[6_6].html
- https://s3.amazonaws.com/clickhouse-test-reports/56030/331f661322ee0b12ec41cec8cba36b9973a6aa5a/integration_tests_flaky_check__asan_.html
About this issue
- State: closed
- Created 8 months ago
- Comments: 20 (19 by maintainers)
Together with support, we narrowed the issue down to the OS. The syslog contains the following lines:
Based on the Linux manual and a few other pages, I am trying the following configuration to mitigate it:
References: https://dzone.com/articles/tcp-out-of-memory-consider-tuning-tcp-mem and https://www.kernel.org/doc/html/latest/networking/ip-sysctl.html
Update: I caught another case of resets and fixed it with net.ipv4.tcp_mem=4096 131072 16777216. Will apply it everywhere in a moment.
The query I'm monitoring
updated one
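For reference, the three net.ipv4.tcp_mem numbers from the update above are low, pressure, and high thresholds counted in memory pages, not bytes. The sketch below converts them to bytes and compares them with the "mem" field of the TCP line in /proc/net/sockstat, which is also counted in pages. The function names, the 4096-byte page size default, and the simplified three-state classification are assumptions made for illustration; the kernel's real behaviour is more nuanced (for example, it leaves pressure mode only once usage drops back below the low threshold).

```python
def tcp_mem_to_bytes(value, page_size=4096):
    """Convert a 'low pressure high' tcp_mem string (in pages) to bytes."""
    low, pressure, high = (int(part) for part in value.split())
    return {
        "low": low * page_size,
        "pressure": pressure * page_size,
        "high": high * page_size,
    }


def parse_sockstat_tcp_mem(sockstat_text):
    """Return the TCP 'mem' value (in pages) from /proc/net/sockstat text."""
    for line in sockstat_text.splitlines():
        if line.startswith("TCP:"):
            fields = line.split()
            # The line is a flat list of 'name value' pairs after 'TCP:'.
            return int(fields[fields.index("mem") + 1])
    raise ValueError("no TCP line found in sockstat text")


def tcp_mem_state(pages_in_use, value):
    """Classify current usage against the tcp_mem thresholds (simplified)."""
    _low, pressure, high = (int(part) for part in value.split())
    if pages_in_use >= high:
        return "over limit"
    if pages_in_use >= pressure:
        return "pressure"
    return "ok"


# Value taken from the update above; page size of 4096 bytes is assumed.
limits = tcp_mem_to_bytes("4096 131072 16777216")
```

On a Linux host, the input for `parse_sockstat_tcp_mem` can be read with `open("/proc/net/sockstat").read()`. With 4 KiB pages, the setting above corresponds to 16 MiB, 512 MiB, and 64 GiB.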