IBM DataPower Operations Dashboard v1.0.17.0
A newer version of this product documentation is available.
You are viewing an older version. View latest at IBM DPOD Documentation.
List of Built in Alerts
Alerts
Name | Type | Description |
---|---|---|
About to Expire Certificates Alert | Gateway | Alert for certificates that are about to expire. The alert will check for syslog message with code 0x806000e2 that were written in the last 24 hours |
Already Expired Certificates Alert | Gateway | Alert for certificates that already expired - the alert will check for syslog messages with code 0x806000e1 that were written in the last 24 hours |
API Error Message Count Alert | API-C | Alert when more than X (defaults to 10) API transactions ended with a specific error message, the searched message can be changed via the alert parameters, it's okay to use only a substring of the message. |
API Latency Above 100ms Alert | API-C | Alert if more than 5 API Connect API calls finished with latency of over 100ms Change 5 to any other number using the field "Error Threshold" To change "100" to any other latency, you will need to edit the JSON - duplicate the alert first, as system predefined alerts' JSON cannot be edited |
APIs That Ended in Error Code Range | API-C | Alert on any API transaction that ended with status code 500. You may change the range of the status code by editing the parameters JSON (for example, alert on statusCode between 300 and 600) |
APIs That Ended in Error Code Range - Count | API-C | Alert when count of API transactions that ended with status code 500 is more than 0 |
Rate Limit Alert | API-C | Alert when the Rate Limit utilization (for the same API name, Consumer Application and Plan) in the last 5 minutes is more than 80% |
Domain Restarts Alert | Gateway | Alert on domain restarts |
Message Codes Frequency Alert | Gateway | Alert when message codes frequency exceeds threshold value |
Number of Probes Alert | Gateway | Alert if more than 1000 transactions with probles were run in the last 10 minutes |
Objects Down Alert | Gateway | Alert on all the DataPower objects that are enabled in configuration but in a down state (similar to the data shown in the Failed Objects page) |
Syslog Errors MessageCode Alert | Gateway | Alert when a specific syslog message is written (only messages with severity = error) |
Transaction Errors Alert | Gateway | Alert when 5 or more transactions with errors ran in the last 30 minutes Please note: When duplicating this alert - the new alert name must start with "Transaction_Errors" (e.g. "Transaction_Errors_2") |
Trans. Over 30 Secs by Service Name Alert | Gateway | Alert when more than X (defaults to 10) successful transactions took more than 30 seconds to finish (the number of seconds can be changed via the alert parameters) |
Unavailable Devices Alert | Gateway | Alert when a device becomes unavailable (cannot be sampled) |
Unused Services | Gateway | Alert when service total transactions equals to zero. You will need to edit this alert and enter the list of services that should be included in the alert instead of the supplied sample service names (Service.Name.1,Service.Name.2) |
System Health Metrics
See System Health for more detailes
Name | Description |
---|---|
Devices CPU Metric 1 | Alert when the max device CPU during the last 5 minutes was over 80% |
Devices Fan Metric 1 | Alert when the device fan health is less than 100% |
Devices Load Metric 1 | Alert when the max device load in the last 5 minutes was more than 80% |
Devices Memory Metric 1 | Alert when the max used memory of the device in the last 5 minutes was over 70% |
Devices Space Encrypted Metric 1 | Alert if the free space of the encrypted file system is less than 15% |
Devices Space Internal Metric 1 | Alert if the free space of the internal file system is less than 15% |
Devices Space Temp Metric 1 | Alert if the free space of the temporary file system is less than 15% |
Devices Temperature Metric 1 | Alert when the device temperature health is less than 100% |
Devices Voltage Metric 1 | Alert if the device's voltage health is less than 100% |
MQ Connections Metric | Alert if MQ connections used over 80% |
System Errors Metric | Alert if more than 10 critical system errors were written to syslog in the last 5 minutes |
(1) The following alerts and metrics are valid only for devices with the option "Device Resources Monitoring" enabled. You may edit the device setting from [Manage → Devices → Monitored Devices].
See also Adding Monitored Gateways.