hacluster charm fails to start corosync and cluster
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
hacluster (Juju Charms Collection) | Invalid | Undecided | David Ames | | |
Bug Description
It seems the most recent commit to the hacluster charm (41dc7b3fad59ea
Corosync on one or more nodes may not start properly. The symptom is a juju hook left executing forever.
root 17499 0.0 0.0 19700 3276 ? Ss 20:17 0:00 bash /var/lib/
root 17510 0.0 0.0 1686172 52192 ? Sl 20:17 0:01 \_ /var/lib/ /var/lib/juju
root 31870 1.1 0.0 123400 54460 ? S 20:21 1:03 \_ /usr/bin/python /var/lib/
root 223353 0.0 0.0 4508 712 ? S 21:58 0:00 \_ sh -c { crm node list; } 2>&1
root 223354 0.0 0.0 102056 19268 ? R 21:58 0:00 \_ /usr/bin/python /usr/sbin/crm node list
ubuntu@
sudo: unable to resolve host juju-machine-
ERROR: status: crm_mon (rc=107): Connection to cluster failed: Transport endpoint is not connected
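In the process listing, the `crm node list` child started at 21:58 while the hook itself started at 20:21, which suggests the hook keeps re-running the command without ever giving up. A minimal sketch of a bounded polling loop that would fail the hook instead of leaving it executing indefinitely follows. The helper name `wait_until` and its parameters are hypothetical and are not taken from the charm's code.

```python
import time


def wait_until(check, timeout_s, interval_s=1.0,
               clock=time.monotonic, sleep=time.sleep):
    """Poll check() until it returns True or timeout_s has elapsed.

    Returns True if check() succeeded, False if the deadline passed first.
    clock and sleep are injectable so the loop can be tested without waiting.
    """
    deadline = clock() + timeout_s
    while True:
        if check():
            return True
        if clock() >= deadline:
            return False
        sleep(interval_s)
```

A caller could pass a check such as `lambda: subprocess.call(["crm", "node", "list"]) == 0` and raise an error when `wait_until` returns False, so the failure shows up in the hook status rather than as a hung hook.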
Corosync is failing to start with a timeout:
Jun 29 20:25:02 juju-machine-
withdrawing server sockets
Jun 29 20:25:02 juju-machine-
withdrawing server sockets
Jun 29 20:25:02 juju-machine-
server sockets
Jun 29 20:25:02 juju-machine-
server sockets
Jun 29 20:25:02 juju-machine-
server sockets
Jun 29 20:25:02 juju-machine-
Retransmit List: 2 3 4 7
Jun 29 20:25:02 juju-machine-
List: 2 3 4 7
Jun 29 20:25:02 juju-machine-
Cluster Engine.
Jun 29 20:25:02 juju-machine-
entered failed state.
Jun 29 20:25:02 juju-machine-
result 'timeout'.
Jun 29 22:17:18 juju-machine-
affects: | charms → hacluster (Juju Charms Collection) |
Changed in hacluster (Juju Charms Collection): | |
assignee: | nobody → David Ames (thedac) |
status: | New → Triaged |
milestone: | none → 16.07 |
This turned out to be caused by a missing cluster_count setting. Setting the bug to Invalid.
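The resolution points at configuration rather than a code defect. As an illustration only, here is a sketch of the kind of gate such a setting could drive. It assumes `cluster_count` means the number of units expected before the cluster is bootstrapped. The function `ready_to_bootstrap` and its arguments are hypothetical and are not the charm's implementation.

```python
def ready_to_bootstrap(cluster_count, known_units):
    """Return (ready, reason) for a hypothetical bootstrap gate.

    cluster_count: expected number of units, or None if the option is unset.
    known_units: names of the units currently visible to this node.
    """
    if cluster_count is None:
        return False, "cluster_count is not set"
    if len(known_units) < cluster_count:
        return False, "waiting for {} of {} units".format(
            len(known_units), cluster_count)
    return True, "ok"
```

Reporting the reason alongside the boolean makes a misconfiguration visible early, instead of it surfacing later as a corosync start timeout.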