Activity log for bug #1423116

Date Who What changed Old value New value Message
2015-02-18 11:09:10 Leontiy Istomin bug added bug
2015-02-18 11:12:07 Bogdan Dobrelya mos: milestone 6.0.2
2015-02-18 11:12:15 Bogdan Dobrelya mos: milestone 6.0.2 6.0.1
2015-02-18 11:12:25 Bogdan Dobrelya mos: importance Undecided Critical
2015-02-18 11:12:35 Bogdan Dobrelya mos: status New In Progress
2015-02-18 11:30:28 Leontiy Istomin description [root@fuel ~]# fuel --fuel-version api: '1.0' astute_sha: f7cda2171b0b677dfaeb59693d980a2d3ee4c3e0 auth_required: true build_id: 2015-02-07_20-50-01 build_number: '76' feature_groups: - mirantis fuellib_sha: 64f3ebe9fcbd18bf6c80a948e06061783a090347 fuelmain_sha: c799e3a6d88289e58db764a6be7910aab7da3149 nailgun_sha: 2ef819732a3ee7acf7b610e7d1c1a6da0434c1a0 ostf_sha: 3b57985d4d2155510894a1f6d03b478b201f7780 production: docker release: 6.0.1 release_versions: 2014.2-6.0.1: VERSION: api: '1.0' astute_sha: f7cda2171b0b677dfaeb59693d980a2d3ee4c3e0 build_id: 2015-02-07_20-50-01 build_number: '76' feature_groups: - mirantis fuellib_sha: 64f3ebe9fcbd18bf6c80a948e06061783a090347 fuelmain_sha: c799e3a6d88289e58db764a6be7910aab7da3149 nailgun_sha: 2ef819732a3ee7acf7b610e7d1c1a6da0434c1a0 ostf_sha: 3b57985d4d2155510894a1f6d03b478b201f7780 production: docker release: 6.0.1 Baremetal,Ubuntu, HA, Neutron-gre,Ceilometer,Ceph-all, Debug, 6.0.1_76 Controllers:3 Computes:96 Deployment has been passed successfully, but during full rally test primary controller node has been marked as offline. Also this node is unreachable via ssh. [root@fuel ~]# ssh node-19 Warning: Permanently added 'node-19' (RSA) to the list of known hosts. Write failed: Broken pipe But at the moment I have one opened ssh session which gives the able to execute some commands. here is output of top command: http://paste.openstack.org/show/176687/ root@node-19:~# free -m total used free shared buffers cached Mem: 32142 31768 373 0 211 11263 -/+ buffers/cache: 20292 11849 Swap: 15624 12 15612 "rabbitmqctl cluster_status" and "rabbitmqctl list_queues" commands just hang on this node from other controller node: root@node-52:~# rabbitmqctl cluster_status Cluster status of node 'rabbit@node-52' ... [{nodes,[{disc,['rabbit@node-19','rabbit@node-52','rabbit@node-65']}]}, {running_nodes,['rabbit@node-19','rabbit@node-65','rabbit@node-52']}, {cluster_name,<<"rabbit@node-19.domain.tld">>}, {partitions,[]}] ...done. root@node-52:~# rabbitmqctl list_queues | grep -v 0$ Listing queues ... dhcp_agent.node-19 96 notifications.error 415 reply_0c7bc35f0e114b119b959160645ca04a 1 ...done. root@node-19:~# dmesg | grep -i error [ 9.798790] ACPI Error: [\_SB_.PRAD] [ 10.883460] ACPI Error: Method parse/execution failed [\_GPE._L24] (Node ffff880853d9d3e8), AE_NOT_FOUND (20131115/psparse-536) [ 16.284591] ioapic: probe of 0000:00:05.4 failed with error -22 [ 17.779631] ERST: Error Record Serialization Table (ERST) support is initialized. [ 31.029678] EXT4-fs (sda3): re-mounted. Opts: errors=remount-ro crm status output is here http://paste.openstack.org/show/176723/ The last line in rabbitmq log is: =INFO REPORT==== 18-Feb-2015::10:11:18 === accepting AMQP connection <0.9669.490> (192.168.0.54:41674 -> 192.168.0.21:5673) snapshot will be here asap [root@fuel ~]# fuel --fuel-version api: '1.0' astute_sha: f7cda2171b0b677dfaeb59693d980a2d3ee4c3e0 auth_required: true build_id: 2015-02-07_20-50-01 build_number: '76' feature_groups: - mirantis fuellib_sha: 64f3ebe9fcbd18bf6c80a948e06061783a090347 fuelmain_sha: c799e3a6d88289e58db764a6be7910aab7da3149 nailgun_sha: 2ef819732a3ee7acf7b610e7d1c1a6da0434c1a0 ostf_sha: 3b57985d4d2155510894a1f6d03b478b201f7780 production: docker release: 6.0.1 release_versions:   2014.2-6.0.1:     VERSION:       api: '1.0'       astute_sha: f7cda2171b0b677dfaeb59693d980a2d3ee4c3e0       build_id: 2015-02-07_20-50-01       build_number: '76'       feature_groups:       - mirantis       fuellib_sha: 64f3ebe9fcbd18bf6c80a948e06061783a090347       fuelmain_sha: c799e3a6d88289e58db764a6be7910aab7da3149       nailgun_sha: 2ef819732a3ee7acf7b610e7d1c1a6da0434c1a0       ostf_sha: 3b57985d4d2155510894a1f6d03b478b201f7780       production: docker       release: 6.0.1 Baremetal,Ubuntu, HA, Neutron-gre,Ceilometer,Ceph-all, Debug, 6.0.1_76 Controllers:3 Computes:96 I have changed /usr/bin/keystone-all file on each controller node. I have added one line: 147 config.setup_logging() this change regard to https://bugs.launchpad.net/mos/+bug/1413341 Deployment has been passed successfully, but during full rally test primary controller node has been marked as offline. Also this node is unreachable via ssh. [root@fuel ~]# ssh node-19 Warning: Permanently added 'node-19' (RSA) to the list of known hosts. Write failed: Broken pipe But at the moment I have one opened ssh session which gives the able to execute some commands. here is output of top command: http://paste.openstack.org/show/176687/ root@node-19:~# free -m              total used free shared buffers cached Mem: 32142 31768 373 0 211 11263 -/+ buffers/cache: 20292 11849 Swap: 15624 12 15612 "rabbitmqctl cluster_status" and "rabbitmqctl list_queues" commands just hang on this node from other controller node: root@node-52:~# rabbitmqctl cluster_status Cluster status of node 'rabbit@node-52' ... [{nodes,[{disc,['rabbit@node-19','rabbit@node-52','rabbit@node-65']}]},  {running_nodes,['rabbit@node-19','rabbit@node-65','rabbit@node-52']},  {cluster_name,<<"rabbit@node-19.domain.tld">>},  {partitions,[]}] ...done. root@node-52:~# rabbitmqctl list_queues | grep -v 0$ Listing queues ... dhcp_agent.node-19 96 notifications.error 415 reply_0c7bc35f0e114b119b959160645ca04a 1 ...done. root@node-19:~# dmesg | grep -i error [ 9.798790] ACPI Error: [\_SB_.PRAD] [ 10.883460] ACPI Error: Method parse/execution failed [\_GPE._L24] (Node ffff880853d9d3e8), AE_NOT_FOUND (20131115/psparse-536) [ 16.284591] ioapic: probe of 0000:00:05.4 failed with error -22 [ 17.779631] ERST: Error Record Serialization Table (ERST) support is initialized. [ 31.029678] EXT4-fs (sda3): re-mounted. Opts: errors=remount-ro crm status output is here http://paste.openstack.org/show/176723/ The last line in rabbitmq log is: =INFO REPORT==== 18-Feb-2015::10:11:18 === accepting AMQP connection <0.9669.490> (192.168.0.54:41674 -> 192.168.0.21:5673) snapshot will be here asap
2015-02-18 13:37:38 Leontiy Istomin attachment added node-19_ipmi.png https://bugs.launchpad.net/mos/+bug/1423116/+attachment/4321771/+files/node-19_ipmi.png
2015-02-18 14:06:56 Bogdan Dobrelya bug added subscriber Vladimir Kuklin
2015-02-18 14:07:49 Bogdan Dobrelya attachment added status.strace.tgz https://bugs.launchpad.net/mos/+bug/1423116/+attachment/4321794/+files/status.strace.tgz
2015-02-18 14:50:52 Bogdan Dobrelya nominated for series mos/6.0.x
2015-02-18 14:50:52 Bogdan Dobrelya bug task added mos/6.0.x
2015-02-18 14:51:06 Bogdan Dobrelya nominated for series mos/5.0.x
2015-02-18 14:51:06 Bogdan Dobrelya bug task added mos/5.0.x
2015-02-18 14:51:06 Bogdan Dobrelya nominated for series mos/5.1.x
2015-02-18 14:51:06 Bogdan Dobrelya bug task added mos/5.1.x
2015-02-18 14:51:06 Bogdan Dobrelya nominated for series mos/6.1.x
2015-02-18 14:51:06 Bogdan Dobrelya bug task added mos/6.1.x
2015-02-18 14:51:12 Bogdan Dobrelya mos/6.0.x: importance Undecided Critical
2015-02-18 14:51:16 Bogdan Dobrelya mos/5.1.x: importance Undecided Critical
2015-02-18 14:51:19 Bogdan Dobrelya mos/5.0.x: importance Undecided Critical
2015-02-18 14:51:23 Bogdan Dobrelya mos/6.0.x: status New Triaged
2015-02-18 14:51:26 Bogdan Dobrelya mos/5.1.x: status New Triaged
2015-02-18 14:51:29 Bogdan Dobrelya mos/5.0.x: status New Triaged
2015-02-18 14:51:48 Bogdan Dobrelya mos/6.0.x: milestone 6.0.1
2015-02-18 14:51:58 Bogdan Dobrelya mos/5.1.x: milestone 5.1.1
2015-02-18 14:52:19 Bogdan Dobrelya mos/5.1.x: milestone 5.1.1 5.1.2
2015-02-18 14:52:26 Bogdan Dobrelya mos/5.0.x: milestone 5.0.3
2015-02-18 14:55:34 Bogdan Dobrelya summary primary controller has been marked as offline by fuel. Pacemaker RA for RabbitMQ cannot detect when its app hangs and keeps it unresponsive
2015-02-18 14:58:46 Bogdan Dobrelya mos/6.0.x: assignee Fuel Library Team (fuel-library)
2015-02-18 14:58:51 Bogdan Dobrelya mos/5.1.x: assignee Fuel Library Team (fuel-library)
2015-02-18 14:58:57 Bogdan Dobrelya mos/5.0.x: assignee Fuel Library Team (fuel-library)
2015-02-18 15:51:27 Vladimir Kuklin mos/5.0.x: status Triaged Invalid
2015-02-18 15:51:33 Vladimir Kuklin mos/5.1.x: status Triaged Won't Fix
2015-02-18 21:14:22 Leontiy Istomin attachment added crm_status https://bugs.launchpad.net/mos/+bug/1423116/+attachment/4322130/+files/crm_status.txt
2015-02-18 21:14:43 Leontiy Istomin attachment added dmesg https://bugs.launchpad.net/mos/+bug/1423116/+attachment/4322131/+files/dmesg.txt
2015-02-18 21:15:00 Leontiy Istomin attachment added free -m https://bugs.launchpad.net/mos/+bug/1423116/+attachment/4322132/+files/free.txt
2015-02-18 21:15:20 Leontiy Istomin attachment added lsob -u rabbit https://bugs.launchpad.net/mos/+bug/1423116/+attachment/4322133/+files/lsof_rabbitmq.txt
2015-02-18 21:15:57 Leontiy Istomin attachment added ps aux https://bugs.launchpad.net/mos/+bug/1423116/+attachment/4322134/+files/ps.txt
2015-02-19 16:39:47 Dmitry Mescheryakov mos/6.1.x: milestone 6.0.1 6.1
2015-02-25 11:08:44 Bogdan Dobrelya mos/6.1.x: status In Progress Fix Committed
2015-02-25 11:08:46 Bogdan Dobrelya mos/6.0.x: status Triaged Fix Committed
2015-02-25 11:08:52 Bogdan Dobrelya mos/6.0.x: assignee Fuel Library Team (fuel-library) Bogdan Dobrelya (bogdando)
2015-05-27 16:07:16 Vitaly Sedelnik nominated for series mos/6.0-updates
2015-05-27 16:07:16 Vitaly Sedelnik bug task added mos/6.0-updates
2015-05-27 16:07:23 Vitaly Sedelnik mos/6.0-updates: status New Invalid
2015-05-27 16:07:27 Vitaly Sedelnik mos/6.0-updates: importance Undecided Critical
2015-05-27 16:07:39 Vitaly Sedelnik mos/6.0-updates: assignee MOS Sustaining (mos-sustaining)
2015-05-27 16:07:45 Vitaly Sedelnik mos/6.0-updates: milestone 6.0-updates
2015-08-05 09:29:39 Timur Nurlygayanov mos/6.1.x: status Fix Committed Incomplete
2015-08-05 09:29:41 Timur Nurlygayanov mos/6.0.x: status Fix Committed Incomplete
2015-08-05 13:55:48 Bogdan Dobrelya mos/6.0.x: status Incomplete Fix Committed
2015-08-05 13:55:52 Bogdan Dobrelya mos/6.1.x: status Incomplete Fix Committed