rabbitmq bundle failed or stopped after fresh install
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
tripleo | New | Undecided | Unassigned |
Bug Description
Description
===========
After a fresh installation of TripleO Stein/Stable on 5 nodes (3 HA Controllers and 2 Computes),
the rabbitmq bundle and some other resources are reported as failed in Pacemaker.
Steps to reproduce
==================
1. Install the undercloud.
2. Install the overcloud with this command:
openstack overcloud deploy \
--control-flavor control \
--compute-flavor compute \
--templates ~/openstack-
-r /home/stack/
-e /home/stack/
-e environment.yaml \
-e ~/openstack-
-e ~/openstack-
-e ~/openstack-
-e ~/openstack-
--timeout 360 \
--ntp-server pool.ntp.org
I got the same result without network isolation or a custom network environment, using completely default settings.
Expected result
===============
Fresh healthy HA OpenStack.
Actual result
=============
pcs status output is as follows:
Full list of resources:
Docker container set: rabbitmq-bundle [192.168.
rabbitmq-
rabbitmq-
rabbitmq-
Docker container set: galera-bundle [192.168.
galera-bundle-0 (ocf::heartbeat
galera-bundle-1 (ocf::heartbeat
galera-bundle-2 (ocf::heartbeat
Docker container set: redis-bundle [192.168.
redis-bundle-0 (ocf::heartbeat
redis-bundle-1 (ocf::heartbeat
redis-bundle-2 (ocf::heartbeat
ip-192.168.24.10 (ocf::heartbeat
ip-X.X.X.X (ocf::heartbeat
ip-172.16.2.175 (ocf::heartbeat
ip-172.16.2.41 (ocf::heartbeat
ip-172.16.1.166 (ocf::heartbeat
ip-172.16.3.10 (ocf::heartbeat
Docker container set: haproxy-bundle [192.168.
haproxy-
haproxy-
haproxy-
Docker container set: ovn-dbs-bundle [192.168.
ovn-dbs-bundle-0 (ocf::ovn:
ovn-dbs-bundle-1 (ocf::ovn:
ovn-dbs-bundle-2 (ocf::ovn:
Docker container: openstack-
openstack-
Failed Resource Actions:
* ip-172.
last-
* ip-172.
last-
* ip-172.
last-
* ip-172.
last-
* ip-172.
last-
* ip-172.
last-
* rabbitmq_start_0 on rabbitmq-bundle-1 'unknown error' (1): call=2121, status=Timed Out, exitreason='',
last-
* rabbitmq_start_0 on rabbitmq-bundle-2 'unknown error' (1): call=1979, status=Timed Out, exitreason='',
last-
* rabbitmq_
last-
* ovndb_servers_
last-
Daemon Status:
corosync: active/enabled
pacemaker: active/enabled
pcsd: active/enabled
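To pull only the timed-out entries out of output like the above, a filter along these lines may help. It is a minimal sketch: `timed_out_actions` is a made-up helper name, not a pcs command, and it assumes the `Failed Resource Actions:` and `Daemon Status:` section headers appear as shown in this report.

```shell
# Hypothetical helper (not part of pcs): reads `pcs status` text on stdin and
# prints the entries under "Failed Resource Actions:" that report a timeout.
# Assumes the section is followed by a "Daemon Status:" header, as in this report.
timed_out_actions() {
    awk '
        /Failed Resource Actions:/ { in_failed = 1; next }  # section starts
        /Daemon Status:/           { in_failed = 0 }        # section ends
        in_failed && /status=Timed Out/                     # print matching entries
    '
}

# Usage on a controller node:
#   pcs status | timed_out_actions
```

Run against the output above, this should list the `rabbitmq_start_0` entries for rabbitmq-bundle-1 and rabbitmq-bundle-2, plus any other entry whose status is `Timed Out`.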
The cluster seems to be completely unhealthy. Even running these commands doesn't help:
pcs resource restart rabbitmq-bundle
pcs resource cleanup rabbitmq-bundle
Neither does restarting the whole cluster, or restarting all nodes after deleting the RabbitMQ mnesia directory.
All requests to the overcloud are extremely slow; Horizon takes one minute for each refresh.
Adding additional services such as Octavia causes the overcloud installation to fail with a 504 Gateway Timeout.
Environment
===========
TripleO OpenStack Stable/Stein
Logs & Configs
==============
I can provide any required log or config.