Contrail analytics response time varies based on the number of VN/VMI when one of the control node fails
Affects | Status | Importance | Assigned to | Milestone | ||
---|---|---|---|---|---|---|
Juniper Openstack | Status tracked in Trunk | |||||
R3.1 |
Fix Committed
|
High
|
Zhiqiang Cui | |||
R3.2 |
Fix Committed
|
High
|
Zhiqiang Cui | |||
R4.0 |
Fix Committed
|
High
|
Zhiqiang Cui | |||
R4.1 |
Fix Committed
|
High
|
Zhiqiang Cui | |||
R5.0 |
Fix Committed
|
High
|
Zhiqiang Cui | |||
Trunk |
Fix Committed
|
High
|
Zhiqiang Cui |
Bug Description
Customer is testing analytics response time when one of the control node fails. Response time varies based on the number of VN and VMI’s. Greater the number VN’s & VMI it takes longer for the response.
Customer setup is as below
3 Control, config
3 collector
3 DB
1 openstack
6 compute nodes
2 TSN nodes
/etc/contrail/
Customer has provided scripts to create VN & VMI and to query the analytics. They shutdown one of the control node and note down the time. They see a large difference in correct response for the analytics queries based on the number of VN’s and VMI
VN VMI Response time
303 600 5 Sec
1500 3000 50 secs
3000 6000 approx 2 min With one control node shutdown
The above delta time doubles when two control nodes are shutdown.
Is this intended behavior?
Why is this difference noticed in clustered scenario when collector nodes stop responding (to replicate the nodes are shutdown). The DB nodes are all up and running when performing this test.
Can the response time be reduced & consistent irrespective number of interfaces.
I could replicate the issue in lab up to 1500 VN and 3000 VMI. Due to resource constraints couldn’t scale this higher.
When querying fro VMI we were getting Http 200 K as response but nothing pertaining to interface or network (output of script)
Valid response
10.204.74.242:8081 default-
Invalid response
10.204.74.242:8081 default-
From LogsFrom contrail-
09/05/2017 11:46:02 AM [contrail-
LeaderNotAvaila
09/05/2017 11:46:07 AM [contrail-
LeaderNotAvaila
09/05/2017 11:46:07 AM [contrail-
LeaderNotAvaila
09/05/2017 11:51:00 AM [contrail-
ConnectionError: Error connecting to 192.168.0.124:6379. timed out.
09/05/2017 11:51:00 AM [contrail-
ConnectionError: Error connecting to 192.168.0.124:6379. timed out.
09/05/2017 11:51:00 AM [contrail-
((u'192.168.0.124', 6379, 1149),) : traceback Traceback (most recent call last):
09/05/2017 11:51:04 AM [contrail-
09/05/2017 11:51:40 AM [contrail-
information type: | Proprietary → Private |
information type: | Private → Public |
Changed in juniperopenstack: | |
assignee: | nobody → Arvind (arvindv) |
description: | updated |
Changed in juniperopenstack: | |
importance: | Undecided → Critical |
tags: | added: nttc |
tags: | added: 2017-0905-0258 jtac |
tags: | added: config |
tags: |
added: analytics removed: config |
Hi,
Any Update?
Best Regards,
Vijay Kumar