RabbitMQ cannot stop on controllers during re-assembling a cluster
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
New
|
High
|
Fuel Library (Deprecated) |
Bug Description
Steps to reproduce:
1) Deploy HA cluster (Ubuntu, 3 controllers, 1 compute)
2) Shutdown primary controller (slave-
3) Run OSTF check
Expected result: OSTF Functional tests are passed, several tests that check count of services are failed.
Actual result: OSTF Functional tests that perform actions with an instance are failed, RabbitMQ replication test failed:
"Failed to connect to 5673 port on host 10.109.20.5 Please refer to OpenStack logs for more details."
On the node-2 and node-3 there are several processes that are trying to shutdown 'rabbitmq_server' every 6 minutes:
http://
Resources 'p_rabbitmq-server' become unmanaged on node-2 and node-3:
-------
Master/Slave Set: master_
p_
p_
...
Failed actions:
p_rabbitmq-
, queued=60003ms, exec=0ms
): unknown error
p_rabbitmq-
, queued=60003ms, exec=0ms
): unknown error
-------
Changed in fuel: | |
importance: | Undecided → High |
Update:
After manual killing 'beam' process with rabbitmq_server on one node (node-2), the same process on node-3 also was finished , and then pacemaker assemble rabbitmq cluster from node-2 and node-3 in several minutes.