Monitor Network Health

As with any network, one of the challenges is keeping track of all of the moving parts. With the NetQ GUI, you can view the overall health of your network at a glance and then delve deeper for periodic checks or as conditions arise that require attention. For a general understanding of how well your network is operating, the Network Health card workflow is the best place to start as it contains the highest view and performance rollups.

Network Health Card Workflow Summary

The small Network Health card displays:

  • distribution of overall health

  • current overall score

  • overall health trend

images/download/attachments/8365529/image2019-1-28_12_25_4.png

The medium Network Health card displays the distribution, score, and trend of the:

  • overall network performance

  • network services performance

  • interfaces performance

  • system performance

  • key traces (not in 2.0?)

The large Network Health card contains two tabs.

  • Network Services tab which displays:

    • distribution, score, and trend of the BGP service

    • distribution, score, and trend of the EVPN service

    • distribution, score, and trend of the CLAG service

    • distribution, score, and trend of the LLDP service

    • most recent issues

    • devices by most issues

  • Interfaces, System, and Key Trace Health tab which displays:

    • distribution, score, and trend of interface down time

    • distribution, score, and trend of interface flapping

    • distribution, score, and trend of link utilization

    • distribution, score, and trend of packet drops

    • distribution, score, and trend of CRC errors

    • distribution, score, and trend of NetQ Agent performance

    • distribution, score, and trend of CPU utilization

    • distribution, score, and trend of License state

    • distribution, score, and trend of Memory Usage

    • distribution, score, and trend of NTP service performance

    • distribution, score, and trend of Ping command performance

    • distribution, score, and trend of PSU state

    • distribution, score, and trend of Sensor states

    • most recent issues

    • devices by most issues

View Network Health Summary

Overall network health is based on successful validation results. The summary includes the percentage of successful results, a trend indicator, a performance indicator, and a distribution of the validation results. The trend indicator is based on the count of successful validation results that have occurred in the given period as compared to the count in the last two time periods:

  • Upward facing arrow: successful validation count is higher that the last two time periods, an increasing trend

  • Downward facing arrow: successful validation count is lower than the last two time periods, a decreasing trend

  • No arrow: count is unchanged, trend is steady.

The performance indicator is based on a set of pre-defined thresholds, where:

  • Low: successful validation count is x or less

  • Med: successful validation count is between x and y, inclusive

  • High: successful validation count is y or more

To view a summary of your network health, open the small Network Health card.

View Key Metrics of Network Health

Overall network health is a calculated average of several key health metrics: Network Services health and a combination of Interfaces, System and Trace/Device? Health.

To view these key metrics, open the medium Network Health card. Each metric is shown with the the percentage of successful validations, a trend indicator, and a distribution of the validation results.

View Network Services Health

The network services health is a calculated average of the individual network protocol and services health metrics. In all cases, validation is performed on NTP, LLDP, interfaces, licenses, and NetQ Agents. If you are running BGP, CLAG, or EVPN, the calculation includes these as well. You can view the overall health of network services from the medium Network Health card and information about individual services from the large Network Health card.

To view information about each network protocol or service:

  1. Open the large Network Health card.

  2. Hover over the card and click images/lh4.googleusercontent.com/zs4_0N3HzXHCvBSVQ2Y2FPP6WMWZaG0d_7g8af_y2NQ-gABd4AMUkDpzO-MaTN_bG99zjZ56CUWu1mr2ThICEZ8pcjYCyE9MyKxdr-l6DeKCLqYaY-ts_yW9UDzt4LyeYAYVzY7b .

The health of each protocol or service is represented on the left side of the card by a distribution of the validation results, a trend indicator, and a percentage of successful results. The right side of the card provides a listing of devices running the services.

View Devices with the Most Issues

It is useful to know which devices are experiencing the most issues with their network services as this can help focus troubleshooting efforts toward selected devices versus the protocol or service. To view devices with the most issues, open the large Network Health card. Select Devices with Most Issues from the dropdown above the table on the right. Devices with the highest number of issues are listed at the top. Scroll down to view those with fewer issues. To further investigate the critical devices, open their Switch cards.

View ????

other choices in the dropdown?

View Interfaces, System and Key Trace Health


View All Events

The Network Health card workflow enables you to view all of the network protocol and services alarms in the designated time period.

To view all alarms:

  1. Open the full screen Network Health card.

  2. Click All Events tab in the navigation panel.

  3. Sort alarm data by (name= date/timestamp? ) column to view alarms in most recent to least recent.

Where to go next depends on what data you see, but a few options include:

  • Sort or filter alarm data further. Refer to gui overview section .

  • Export the data for use in another analytics tool, by clicking Export and providing a name for the data file.