Detecting issues impacting multiple devices
Detect issues impacting multiple devices to allow application and network L2+ teams to proactively respond to global issues in their specific areas. Notify relevant application owners about issues impacting their applications. Using the following use cases, evaluate:
The number of impacted devices or users, for example, the number of devices with specific application crashes.
Frequent issues across devices, for example, the number of specific application crashes across all devices.
Both approaches are vital and often complement each other. Use either approach when configuring monitor trigger conditions to avoid triggering alerts and sending notifications when issues are not relevant to the recipient. For example, the system triggers an alert when the number of specific application crashes across all devices exceeds 20 and affects more than 5 devices. The system then notifies the application owner.
The following sections describe two use cases in detail.
Monitoring the number of devices or users with issues
Detect the number of devices or users with an issue to proactively monitor issues impacting multiple devices.
Create an NQL query that returns a summarized number of devices. Optionally, you can use the by
keyword to group your results. The system triggers an alert per group.
Notifications
The system sends notifications for all devices at once, or if the query includes the by
clause, for each group separately.
Alerts overview dashboard
In the Alerts overview dashboard, the alert is displayed in a single line without the context-related label. If grouping has been added, the alert is displayed for each group in a separate line with context about the grouping.
Monitoring frequent issues across devices
Detect an issue across multiple devices which is reflected in an aggregated metric value.
Create an NQL query that returns a summarized metric value. Optionally, group your results using the by
keyword. The system triggers an alert per group.
Notifications
The system sends notifications for a single metric, or if the query includes the by
clause, for each group separately.
Alerts overview dashboard
In the Alerts overview dashboard, the alert is displayed in a single line without the context-related label. If grouping has been added, the alert is displayed for each group in a separate line with context about the grouping.
Refer to the NQL examples below and the NQL data model documentation for more information about NQL.
NQL Examples
Below is a list of NQL query examples to help you create and edit monitors. Review the queries and pick the one most similar to the monitor you are creating or editing. Copy the query and adjust it to your use case, including the thresholds that have been provided as an example.
Detect specific web errors for an application.
This NQL query returns the aggregated number of errors and devices with errors for a specific application and triggers the alert per specific error code separately:
Detect applications with a high web error ratio.
Select other thresholds to make sure there is enough usage volume and that there are enough issues to avoid false positives.
Detect a high number of crashes for binaries.
Detect a high number of devices with long boot time with Geolocation by country.
The long boot time is defined as time_until_desktop_is_visible>= 60s
Virtualization alert for when the average CPU queue length per desktop pool is > =3
Last updated