Activity log for bug #1971451

Date Who What changed Old value New value Message
2022-05-03 15:35:57 Bas de Bruijne bug added bug
2022-05-03 15:39:29 Bas de Bruijne description In testrun: https://solutions.qa.canonical.com/testruns/testRun/a9e7b27f-bdb1-426a-b6d4-1f71a076d18b The ncc units are stuck waiting: ``` nova-cloud-controller/0 waiting idle 0/lxd/7 10.246.167.175 8774/tcp,8775/tcp Incomplete relations: messaging filebeat/51 active idle 10.246.167.175 Filebeat ready. hacluster-nova-cloud-controller/2 active idle 10.246.167.175 Unit is ready and clustered landscape-client/51 maintenance idle 10.246.167.175 Need computer-title and juju-info to proceed logrotated/45 active idle 10.246.167.175 Unit is ready. nova-cloud-controller-mysql-router/2 active idle 10.246.167.175 Unit is ready nrpe/51 active idle 10.246.167.175 icmp,5666/tcp Ready public-policy-routing/25 active idle 10.246.167.175 Unit is ready telegraf/51 active idle 10.246.167.175 9103/tcp Monitoring nova-cloud-controller/0 (source version/commit cc7fa21) ``` In the logs we see: ``` var/log/juju/unit-nova-cloud-controller-1.log:2022-05-03 00:42:17 DEBUG unit.nova-cloud-controller/1.juju-log server.go:327 Skipping 10.246.169.105 password not sent which indicates unit is not ready. var/log/juju/unit-nova-cloud-controller-1.log-2022-05-03 00:42:17 DEBUG unit.nova-cloud-controller/1.juju-log server.go:327 Skipping 10.246.169.98 password not sent which indicates unit is not ready. var/log/juju/unit-nova-cloud-controller-1.log-2022-05-03 00:42:17 DEBUG unit.nova-cloud-controller/1.juju-log server.go:327 Skipping 10.246.169.72 password not sent which indicates unit is not ready. ``` However, the rabbitmq units themselves show that they are happy. We know that the relations are getting rendered because of the juju-show-unit output that is in the crashdump. Pausing and resume the ncc units does not help, suggesting that its not a race condition. Link to crashdumps etc: https://oil-jenkins.canonical.com/artifacts/a9e7b27f-bdb1-426a-b6d4-1f71a076d18b/index.html In testrun: https://solutions.qa.canonical.com/testruns/testRun/a9e7b27f-bdb1-426a-b6d4-1f71a076d18b The ncc units are stuck waiting: ``` nova-cloud-controller/0 waiting idle 0/lxd/7 10.246.167.175 8774/tcp,8775/tcp Incomplete relations: messaging   filebeat/51 active idle 10.246.167.175 Filebeat ready.   hacluster-nova-cloud-controller/2 active idle 10.246.167.175 Unit is ready and clustered   landscape-client/51 maintenance idle 10.246.167.175 Need computer-title and juju-info to proceed   logrotated/45 active idle 10.246.167.175 Unit is ready.   nova-cloud-controller-mysql-router/2 active idle 10.246.167.175 Unit is ready   nrpe/51 active idle 10.246.167.175 icmp,5666/tcp Ready   public-policy-routing/25 active idle 10.246.167.175 Unit is ready   telegraf/51 active idle 10.246.167.175 9103/tcp Monitoring nova-cloud-controller/0 (source version/commit cc7fa21) ``` In the logs we see: ``` var/log/juju/unit-nova-cloud-controller-1.log:2022-05-03 00:42:17 DEBUG unit.nova-cloud-controller/1.juju-log server.go:327 Skipping 10.246.169.105 password not sent which indicates unit is not ready. var/log/juju/unit-nova-cloud-controller-1.log-2022-05-03 00:42:17 DEBUG unit.nova-cloud-controller/1.juju-log server.go:327 Skipping 10.246.169.98 password not sent which indicates unit is not ready. var/log/juju/unit-nova-cloud-controller-1.log-2022-05-03 00:42:17 DEBUG unit.nova-cloud-controller/1.juju-log server.go:327 Skipping 10.246.169.72 password not sent which indicates unit is not ready. ``` However, the rabbitmq units themselves show that they are happy. We know that the relations are getting rendered because of the juju-show-unit output that is in the crashdump. Pausing and resume the ncc units does not help, suggesting that its not a race condition. Link to crashdumps etc: https://oil-jenkins.canonical.com/artifacts/a9e7b27f-bdb1-426a-b6d4-1f71a076d18b/index.html List of occurrences of this bug: https://solutions.qa.canonical.com/bugs/bugs/bug/1971451
2022-06-16 14:44:28 Bas de Bruijne summary [yoga-edge] nova-cloud-controller units stuck waiting: Incomplete relations: messaging [yoga] nova-cloud-controller units stuck waiting: Incomplete relations: messaging
2022-06-16 15:16:07 Bas de Bruijne tags cdo-qa foundations-engine
2022-06-16 20:59:35 Billy Olsen bug task added charm-rabbitmq-server
2022-06-16 20:59:43 Billy Olsen charm-nova-cloud-controller: status New Invalid
2022-06-16 22:27:57 OpenStack Infra charm-rabbitmq-server: status New In Progress
2022-07-04 14:08:03 OpenStack Infra charm-rabbitmq-server: status In Progress Fix Committed
2022-07-22 19:47:26 Pedro Guimarães bug added subscriber Canonical Field Medium
2022-07-24 16:57:13 OpenStack Infra tags cdo-qa foundations-engine cdo-qa foundations-engine in-stable-jammy
2022-11-11 19:49:14 Jeffrey Chang bug added subscriber Jeffrey Chang