Pacemaker neutron agent scripts start/stop/migration will fail if management vip moved recently
| Affects | Status | Importance | Assigned to | Milestone | ||
|---|---|---|---|---|---|---|
| Fuel for OpenStack | ||||||
| 4.1.x |
Fix Committed
|
High
|
Fuel Library (Deprecated) | |||
Bug Description
{"build_id": "2014-03-
Steps to reproduce:
1 - Deploy Ubuntu HA (Cinder LVM backend, Swift glance backend, Neutron with GRE segmentation) 3 computes - 1 controller - 1 storage
2 - Log into first controller and run crm_resource -r vip__management_old --move --node node-3 (NOTE: replace node-3 where it is the nonprimary controller)
3 - Wait ~60s for keystone and other services to recover
4 - Run neutron agent-list
Results:
# neutron agent-list
+------
| id | agent_type | host | alive | admin_state_up |
+------
| 09699e60-
| 12236192-
| 2c0ec06d-
| ad4c9181-
| bd893993-
| f7451cfd-
# crm resource show | egrep 'l3|dhcp'
p_neutron-
p_neutron-l3-agent (ocf::mirantis:
From l3 agent logs:
p_neutron-
p_neutron-
p_neutron-
p_neutron-
p_neutron-
p_neutron-
(2013, "Lost connection to MySQL server at 'reading initial communication packet', system error: 0") None None (HTTP 500) ]
p_neutron-
p_neutron-
p_neutron-
p_neutron-
p_neutron-
We should tune OCF scripts and/or q-agent-cleanup.py to be more tolerant of keystone being unavailable for up to 2 minutes.
| tags: | added: in progress |
| tags: | added: backports-4.1.1 |
| tags: | added: ha |
| Changed in fuel: | |
| assignee: | Dmitry Borodaenko (dborodaenko) → Sergey Vasilenko (xenolog) |
| summary: |
- Neutron L3/DHCP agents fail when VIP fails over + Pacemaker neutron agent scripts start/stop/migration will fail if + management vip moved recently |
| no longer affects: | fuel/5.0.x |
| no longer affects: | fuel |

Fix proposed to branch: master /review. openstack. org/77895
Review: https:/