Key repository setup fails after simultaneously removing several ceph-mon units
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Ceph Monitor Charm | New | Undecided | Unassigned |
OpenStack Keystone Charm | Invalid | Undecided | Unassigned |
Bug Description
Ceph-mon will reach a blocked state with a status of "Unit not clustered (no quorum)" if several clustered ceph-mon units are simultaneously removed while leaving enough units for quorum. This seems to be caused by the key repository setup failing in keystone.
This problem involves bad paths to keyrings on ceph-mon units. The expected keyring path is /var/lib/
WARNING osd-relation-
WARNING osd-relation-
WARNING osd-relation-
This bug was observed in a healthy test deployment that initially included keystone and three ceph-mon units, among other applications. To reproduce it, add four ceph-mon units and, once the deployment has fully settled, remove them simultaneously. The remaining ceph-mon units then lose quorum and enter a blocked state.
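As a rough aid for reasoning about the quorum loss, the arithmetic behind the reproduction can be sketched as follows. Ceph monitors need a strict majority of the monitors listed in the monmap to form quorum. Whether the removed units were still listed in the monmap when quorum was lost is an assumption, not something this report confirms, and the helper names `majority` and `has_quorum` are hypothetical.

```python
def majority(monmap_size: int) -> int:
    """Smallest number of monitors that forms a strict majority."""
    return monmap_size // 2 + 1


def has_quorum(alive: int, monmap_size: int) -> bool:
    """True if the live monitors are a strict majority of the monmap."""
    return alive >= majority(monmap_size)


# Scenario from the report: 3 initial units + 4 added = 7, then 4 removed at once.
initial, added = 3, 4
total = initial + added
remaining = total - added

# ASSUMPTION: the removed monitors are still listed in the monmap.
# 3 live monitors out of 7 is short of the majority of 4.
print(has_quorum(remaining, total))      # False

# If the monmap had already shrunk to the 3 survivors, 3 of 3 is enough.
print(has_quorum(remaining, remaining))  # True
```

Under that assumption, three surviving units are "enough for quorum" only relative to a three-monitor monmap, which may be why removing the units one at a time behaves differently from removing them together.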
On keystone, the following error is found:
INFO juju-log Checking no pids for apache2 exist
INFO juju-log Installing [] with options: ['--option=
ERROR juju-log Key repository setup failed, will retry in config-changed hook: Command '['sudo', '-u', 'keystone', 'keystone-manage', 'credential_
WARNING config-changed ERROR no relation id specified
keystone version: 18.0.0
ceph-mon version: 15.2.8
Juju version: 2.8.10-focal-amd64
OpenStack version: openstack 5.2.0
This does not affect the keystone charm.