IBM DataPower Operations Dashboard v1.0.20.x

List of Built-in Alerts

Alerts

Name | Type | Description
About to Expire Certificates Alert | Gateway | Alert for certificates that are about to expire. The alert checks for syslog messages with code 0x806000e2 that were written in the last 24 hours.
Already Expired Certificates Alert | Gateway | Alert for certificates that have already expired. The alert checks for syslog messages with code 0x806000e1 that were written in the last 24 hours.
API Error Message Count Alert | API-C | Alert when more than X (defaults to 10) API transactions ended with a specific error message. The searched message can be changed via the alert parameters; a substring of the message is sufficient.
API Latency Above 100ms Alert | API-C | Alert if more than 5 API Connect API calls finished with a latency of over 100ms. Change 5 to any other number using the field "Error Threshold". To change "100" to any other latency, edit the JSON; duplicate the alert first, as the JSON of system predefined alerts cannot be edited.
APIs That Ended in Error Code Range | API-C | Alert on any API transaction that ended with status code 500. You may change the range of the status code by editing the parameters JSON (for example, alert on statusCode between 300 and 600).
APIs That Ended in Error Code Range - Count | API-C | Alert when the count of API transactions that ended with status code 500 is more than 0.
Rate Limit Alert | API-C | Alert when the Rate Limit utilization (for the same API name, Consumer Application and Plan) in the last 5 minutes is more than 80%.
Domain Restarts Alert | Gateway | Alert on domain restarts.
Message Codes Frequency Alert | Gateway | Alert when the frequency of message codes exceeds the threshold value.
Number of Probes Alert | Gateway | Alert if more than 1000 transactions with probes were run in the last 10 minutes.
Objects Down Alert | Gateway | Alert on all the DataPower objects that are enabled in configuration but in a down state (similar to the data shown in the Failed Objects page).
Syslog Errors MessageCode Alert | Gateway | Alert when a specific syslog message is written (only messages with severity = error). You will need to edit this alert and enter the message codes to alert on instead of the supplied sample message code (0x81000098).
Transaction Errors Alert | Gateway | Alert when 5 or more transactions with errors ran in the last 30 minutes. Please note: when duplicating this alert, the new alert name must start with "Transaction_Errors" (e.g. "Transaction_Errors_2").
Trans. Over 30 Secs by Service Name Alert | Gateway | Alert when more than X (defaults to 10) successful transactions took more than 30 seconds to finish (the number of seconds can be changed via the alert parameters).
Unavailable Devices Alert | Gateway | Alert when a device becomes unavailable (cannot be sampled).
Unused Services | Gateway | Alert when the total number of transactions of a service equals zero. You will need to edit this alert and enter the list of services that should be included in the alert instead of the supplied sample service names (Service.Name.1,Service.Name.2).
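
Several of the alerts listed above follow the same pattern: count the transactions that match a condition and fire when the count passes a threshold. The following is a minimal Python sketch of that logic for the API latency and status code range alerts. It is an illustration only and does not reflect DPOD's internal implementation or its alert JSON format. The ApiCall record, the function names and the parameter names are hypothetical, and treating the status code range as inclusive on both ends is an assumption.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class ApiCall:
    """Hypothetical record for one API transaction (not DPOD's data model)."""
    api_name: str
    latency_ms: float
    status_code: int


def latency_alert_fires(calls: Iterable[ApiCall],
                        latency_ms: float = 100.0,
                        error_threshold: int = 5) -> bool:
    """Fire when MORE than `error_threshold` calls took over `latency_ms`."""
    slow = sum(1 for c in calls if c.latency_ms > latency_ms)
    return slow > error_threshold


def calls_in_status_range(calls: Iterable[ApiCall],
                          low: int = 500,
                          high: int = 500) -> List[ApiCall]:
    """Return calls whose status code is in [low, high].

    Assumption: both ends of the range are inclusive.
    """
    return [c for c in calls if low <= c.status_code <= high]
```

With the defaults, six slow calls fire the latency alert while five do not, which mirrors the "more than 5" wording in the alert description.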

System Health Metrics

See System Health for more details.

Name | Description
Devices CPU Metric (1) | Alert when the max device CPU during the last 5 minutes was over 80%
Devices Fan Metric (1) | Alert when the device fan health is less than 100%
Devices Load Metric (1) | Alert when the max device load in the last 5 minutes was more than 80%
Devices Memory Metric (1) | Alert when the max used memory of the device in the last 5 minutes was over 70%
Devices Space Encrypted Metric (1) | Alert if the free space of the encrypted file system is less than 15%
Devices Space Internal Metric (1) | Alert if the free space of the internal file system is less than 15%
Devices Space Temp Metric (1) | Alert if the free space of the temporary file system is less than 15%
Devices Temperature Metric (1) | Alert when the device temperature health is less than 100%
Devices Voltage Metric (1) | Alert if the device's voltage health is less than 100%
MQ Connections Metric | Alert if more than 80% of the MQ connections are in use
System Errors Metric | Alert if more than 10 critical system errors were written to syslog in the last 5 minutes



(1) The following alerts and metrics are valid only for devices with the option "Device Resources Monitoring" enabled. You may edit the device setting from [Manage → Devices → Monitored Devices]. 
See also Adding Monitored Gateways.
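
The device resource metrics described above compare the maximum value sampled over a recent time window against a fixed percentage. The sketch below shows that evaluation for the CPU metric (maximum over the last 5 minutes, threshold 80%). It is illustrative only: the sample format (a list of timestamp and value pairs) and the function names are hypothetical and are not taken from DPOD.

```python
from datetime import datetime, timedelta
from typing import List, Optional, Tuple

Sample = Tuple[datetime, float]  # (sample time, value in percent); hypothetical format


def max_in_window(samples: List[Sample],
                  now: datetime,
                  window: timedelta = timedelta(minutes=5)) -> Optional[float]:
    """Return the maximum value sampled within `window` before `now`, or None."""
    recent = [value for ts, value in samples if now - window <= ts <= now]
    return max(recent) if recent else None


def cpu_metric_fires(samples: List[Sample],
                     now: datetime,
                     threshold_pct: float = 80.0) -> bool:
    """Fire when the peak CPU in the last 5 minutes was over `threshold_pct`."""
    peak = max_in_window(samples, now)
    return peak is not None and peak > threshold_pct
```

Samples older than the window are ignored, so a spike from ten minutes ago does not fire the metric on its own.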



Copyright © 2015 MonTier Software (2015) Ltd.