Some services in Nagios change status from OK to UNKNOWN and back

Bug #1616860 reported by Vitaly Gusev
10
This bug affects 2 people
Affects Status Importance Assigned to Milestone
StackLight
Fix Released
High
Simon Pasquier

Bug Description

MOS 9 and plugins 1.0
Steps to reproduce:
1. Deploy an environment with 3 controllers, 1 compute+cinder and 1 plugin node
(Run deploy_toolchain test with ha_controller_nodes)
2. Once deployment finished go to Nagios UI

Expected result:
All services have OK status

Actual result:
Services with hosts 00-global-clusters-env1 and 00-node-clusters-env1 are in UNKNOWN status. After several minutes they change status to OK and after several minutes again move in UNKNOWN status.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to fuel-plugin-lma-collector (master)

Fix proposed to branch: master
Review: https://review.openstack.org/360609

Changed in lma-toolchain:
assignee: LMA-Toolchain Fuel Plugins (mos-lma-toolchain) → Simon Pasquier (simon-pasquier)
status: New → In Progress
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to fuel-plugin-lma-collector (master)

Reviewed: https://review.openstack.org/360609
Committed: https://git.openstack.org/cgit/openstack/fuel-plugin-lma-collector/commit/?id=eb9f36fa633cd5b0a4e8af9f56aed065276c974e
Submitter: Jenkins
Branch: master

commit eb9f36fa633cd5b0a4e8af9f56aed065276c974e
Author: Simon Pasquier <email address hidden>
Date: Thu Aug 25 16:45:53 2016 +0200

    Fix the GSE filter wrt Pacemaker metrics

    With the recent refactoring [1] of the Pacemaker collectd plugin, the
    GSE filter may receive Pacemaker metrics from the other nodes of the
    cluster. The Heka filter needs to be updated to discard these messages
    otherwise the GSE filter flaps between active and inactive state.

    [1] I8b5b987704f69c6a60b13e8ea982f27924f488d1

    Change-Id: I6047da6ec5d28f22d309f1858bfbf5d3558cfcb4
    Closes-Bug: #1616860

Changed in lma-toolchain:
status: In Progress → Fix Committed
Vitaly Gusev (vgusev)
Changed in lma-toolchain:
status: Fix Committed → Won't Fix
status: Won't Fix → Fix Released
tags: added: plugin stacklight
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.