Alerts & Monitoring

You can create several alerts related to CPU, Disk, Network, YARN Applications, etc. Acceldata also ships with a set of stock alerts . A unique feature of these alerts is the granular level to which they can be created.

For example, you can create an alert for all YARN MapReduce jobs, which monitors the number of mappers, or the number of reducers. If there is an abnormality, the alert raises an alarm. Another example is when a Spark job takes a second more than a specified threshold to execute, then an alert is raised.

Creating an Alert

Alerts can be created in multiple categories, which encompass the following areas:

  • Infrastructure metrics
  • Hadoop Services
  • YARN Applications
  • Ingestion Alerts with Kafka Support
  • Hive and Spark Specific Applications

Important Definitions

TerminologyDefinition
CategoryThe category of an alert is the module for which it is created, either system alerts such as CPU and network or then integrated applications, such as Spark, Hive, etc.
ExpressionsExpressions define alert conditions, filter conditions, groups, and alert thresholds that need to be specified by the user.
  • Execution Interval - This is the delay between two consecutive evaluations of an alert. The initial delay is also equal to execution interval. This is specified in seconds.
  • Threshold - A threshold is either the number of times the alert has occurred or number of seconds the alert has lasted.
  • Incident - When the alert breaches the threshold continuously, then an incident is created for the alert.
NotificationsThe method of informing the user. You can configure notifications to be sent via Slack, or as an email.
Auto ActionsPre-defined workflows and activities that can be executed based on an alert or an incident
SummaryThe description of the alert including the name of the alert, severity type, and description

To create an alert, do the following:

  1. Click Create Alert button. The Create Alert wizard is displayed.

  2. Select a Category. Based on the category selected a list of metrics are displayed in Expressions.

  3. Specify values for the given metric fields.

    note

    Click from above the alerts panel to view the documentation on alert metrics.

    In case you are creating and endpoint check, do the following.

    note

    Endpoint Check: The Endpoint Check category lets you set up alerts to check if a selected service or URL is up and running.

    1. Under Expression, choose or fill the following options or fields.
    • (Optional) Type Custom Endpoint Url: Click this checkbox and type the URL in the Metric textbox.
    • (Optional) Show Response Body: Click this checkbox to display the response body of the alert when raised.
    • Metric: If Type Custom Endpoint Url is checked, type the endpoint URL(s) separated by comma in the Metric box. If Type Custom Endpoint Url is not checked, choose the endpoint from the dropdown.
    • Define Alert Threshold: You can define the threshold of the endpoint alert in the following scenarios.
      • Execute the condition at every xx sec: You can enter the time interval (in seconds) to set frequency of condition of alerts.
      • Raise the alert if the condition matches for at least xx times/seconds: You can enter the value for number of times the condition matches. You can also enter the number of seconds for the same condition.
  4. Specify one or more methods to get notifications on the metric selected.

    Notification TypeDescription
    EmailSpecify a email address to receive alerts or notifications. You can CC and BCC the notifications to others as well. Adding an email address to CC and BCC is optional.
    Slack
    • Click the checkbox to use Default Slack Configuration URL.
      NOTE
      The default slack configuration URL is provided by you during installation of Pulse.
    • Specify the Slack Webhook URL to get notifications on a Slack channel.
    • You can specify multiple Slack Webhook URLs for different groups or channels by adding commas to separate them.
    HangoutsClick the Webhook notification required checkbox to receive notifications and alerts on Hangouts.
    WebhooksClick the checkbox to use Default Slack Configurations. Else, specify the webhook URL and relevant tags.
    LogClick the checkbox to use the log file and its configurations provided during installation of Pulse to log notifications. Specify any relevant tags.
  5. Click Next. The Auto Actions menu is displayed.

  6. Select an action from the Auto Action drop-down list and click Next.

  7. Specify details for the following properties in the Summary:

    Property NameDescription
    NameSpecify a name for the alert.
    Severity TypeBased on the severity of the alert, select one type of severity from the Severity type drop-down list.
    DescriptionSpecify a description to describe the alert.
    Help text to include with IncidentsSpecify any tips or notes for the user that will be helpful to work on the alert
    EnabledClick the checkbox to enable the summary.
  8. Click Save.

Viewing Alerts

On creation of an alert, it is displayed in the Alerts panel of the Alerts page. You can filter alerts based on Alert Type and Severity level.

Click the Show Disabled Alert checkbox to include disabled alerts in the Alerts panel.

Click to refresh the alerts panel.

Click Rows and select a number from the drop-down list to display the selected number of rows with alerts in the Alerts panel.

Editing Alerts

Click the name of an alert you want to edit. The Edit Alert wizard is displayed. Make your changes and click save.

Deleting Alerts

From the alerts panel list, click from the end of a alert row and select Delete to delete the alert.

Cloning Existing Alerts

From the alerts list, click for the alert you want to clone and select Clone . The Clone Alert wizard is displayed. Make your changes to the properties and save the alert.

Exporting Alerts

  1. Click to export alerts. The Import/Export wizard is displayed.
  2. Click Export Alerts . All the alerts are downloaded as a .JSON file.

Importing Alerts

  1. Click to import alerts. The Import/Export wizard is displayed.
  2. Drag a .JSON file or click the dashed region to upload a .JSON file. The alerts are imported into Acceldata.