openshift-ansible: Installing cluster fails with "Control plane install failed."

Description

I’m trying a new installation from the master branch, and it fails while installing the master with:

Control plane install failed.

Version

Ansible: ansible 2.5.2
OpenShift version: openshift-ansible-3.10.0-0.47.0-65-g4796c1d
RPM: package openshift-ansible is not installed

Steps To Reproduce

  1. Follow all prerequisites
  2. git clone https://github.com/openshift/openshift-ansible
  3. cd openshift-ansible
  4. ansible-playbook playbooks/prerequisites.yml
  5. ansible-playbook playbooks/deploy_cluster.yml

Expected Results

OpenShift Origin 3.10 is installed successfully.

Observed Results

TASK [openshift_control_plane : fail] *********************************************************************************************************
Friday 18 May 2018  20:12:34 -0300 (0:00:00.046)       0:11:30.859 ************ 
skipping: [origin-m.hospitalaleman.com]

TASK [openshift_control_plane : Verify that the control plane is running] *********************************************************************************************************
Friday 18 May 2018  20:12:34 -0300 (0:00:00.059)       0:11:30.919 ************ 
FAILED - RETRYING: Verify that the control plane is running (60 retries left).
FAILED - RETRYING: Verify that the control plane is running (59 retries left).
FAILED - RETRYING: Verify that the control plane is running (58 retries left).
FAILED - RETRYING: Verify that the control plane is running (57 retries left).
FAILED - RETRYING: Verify that the control plane is running (56 retries left).
FAILED - RETRYING: Verify that the control plane is running (55 retries left).
......
.....
FAILED - RETRYING: Verify that the control plane is running (2 retries left).
FAILED - RETRYING: Verify that the control plane is running (1 retries left).
fatal: [origin-m.hospitalaleman.com]: FAILED! => {"attempts": 60, "changed": false, "cmd": ["curl", "-k", "https://origin-m.hospitalaleman.com:8443/healthz/ready"], "delta": "0:00:00.018986", "end": "2018-05-18 20:17:49.046176", "msg": "non-zero return code", "rc": 7, "start": "2018-05-18 20:17:49.027190", "stderr": "  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current\n                                 Dload  Upload   Total   Spent    Left  Speed\n\r  0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0curl: (7) Failed connect to origin-m.hospitalaleman.com:8443; Connection refused", "stderr_lines": ["  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current", "                                 Dload  Upload   Total   Spent    Left  Speed", "", "  0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0curl: (7) Failed connect to origin-m.hospitalaleman.com:8443; Connection refused"], "stdout": "", "stdout_lines": []}
...ignoring

TASK [openshift_control_plane : Check status in the kube-system namespace] *********************************************************************************************************
Friday 18 May 2018  20:17:49 -0300 (0:05:14.345)       0:16:45.265 ************
fatal: [origin-m.hospitalaleman.com]: FAILED! => {"changed": true, "cmd": ["oc", "status", "--config=/etc/origin/master/admin.kubeconfig", "-n", "kube-system"], "delta": "0:00:00.150817", "end": "2018-05-18 20:17:49.472223", "msg": "non-zero return code", "rc": 1, "start": "2018-05-18 20:17:49.321406", "stderr": "The connection to the server origin-m.hospitalaleman.com:8443 was refused - did you specify the right host or port?
....
...
TASK [openshift_control_plane : Report control plane errors] ********************************************************************************************************************************************************************************************************
Friday 18 May 2018  20:17:51 -0300 (0:00:00.130)       0:16:47.737 ************ 
fatal: [origin-m.hospitalaleman.com]: FAILED! => {"changed": false, "msg": "Control plane install failed."}

NO MORE HOSTS LEFT *********************************************************************************************************

PLAY RECAP *********************************************************************************************************
localhost                  : ok=14   changed=0    unreachable=0    failed=0   
origin-1.hospitalaleman.com : ok=99   changed=33   unreachable=0    failed=0   
origin-m.hospitalaleman.com : ok=295  changed=130  unreachable=0    failed=1   


INSTALLER STATUS *********************************************************************************************************
Initialization    : Complete (0:00:13)
Health Check      : Complete (0:00:25)
Node Preparation  : Complete (0:08:54)
etcd Install      : Complete (0:00:31)
Master Install    : In Progress (0:06:45)
	This phase can be restarted by running: playbooks/openshift-master/config.yml
Friday 18 May 2018  20:17:51 -0300 (0:00:00.077)       0:16:47.815 ************ 
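
The failing task above retries "curl -k https://origin-m.hospitalaleman.com:8443/healthz/ready" 60 times and gets "Connection refused" (curl exit code 7) every time, which usually means nothing is listening on port 8443 on the master. To repeat that probe by hand while debugging, a minimal Python sketch might look like the following. The function names, the 5-second delay, and treating an HTTP 200 response as "ready" are assumptions made for illustration; they are not taken from the openshift_control_plane role.

```python
import http.client
import ssl
import time
import urllib.error
import urllib.request


def control_plane_ready(url, timeout=5.0):
    """Return True if a GET on url answers with HTTP 200, else False."""
    # Rough equivalent of `curl -k`: skip certificate verification, since a
    # fresh master typically serves a certificate the client does not trust.
    ctx = ssl.create_default_context()
    ctx.check_hostname = False
    ctx.verify_mode = ssl.CERT_NONE
    try:
        with urllib.request.urlopen(url, timeout=timeout, context=ctx) as resp:
            return resp.status == 200
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        # "Connection refused" (curl rc 7 in the log above) ends up here,
        # as do timeouts, TLS errors and non-2xx HTTP responses.
        return False


def wait_for_control_plane(url, retries=60, delay=5.0):
    """Poll url up to `retries` times, sleeping `delay` seconds between tries.

    retries=60 mirrors the log above; delay=5.0 is an assumed value.
    """
    for attempt in range(1, retries + 1):
        if control_plane_ready(url):
            return True
        print(f"FAILED - RETRYING ({retries - attempt} retries left)")
        time.sleep(delay)
    return False
```

Calling wait_for_control_plane("https://origin-m.hospitalaleman.com:8443/healthz/ready") on the master reproduces the check. If it keeps returning False with the connection refused, the API server process is most likely not running at all, so the master's own logs are the next place to look rather than the network path.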

Additional Information

OS: CentOS Linux release 7.5.1804 (Core) 

About this issue

  • State: closed
  • Created 6 years ago
  • Comments: 17 (2 by maintainers)

Most upvoted comments

@DanyC97 Thank you for the info! In the meantime we have moved to 3.10 and are no longer seeing this issue.