IBM DataPower Operations Dashboard v1.0.11.0

A newer version of this product documentation is available.


Alerts

| Name | Type | Description |
|---|---|---|
| About to Expire Certificates Alert | Gateway | Alert for certificates that are about to expire. The alert checks for syslog messages with code 0x806000e2 that were written in the last 24 hours. |
| Already Expired Certificates Alert | Gateway | Alert for certificates that have already expired. The alert checks for syslog messages with code 0x806000e1 that were written in the last 24 hours. |
| Domain Restarts Alert | Gateway | Alert on domain restarts. |
| Message Codes Frequency Alert | Gateway | Alert when the frequency of message codes exceeds the threshold value. |
| Number of Probes Alert | Gateway | Alert if more than 1000 transactions with probes were run in the last 10 minutes. |
| Objects Down Alert | Gateway | Alert on all the DataPower objects that are enabled in configuration but are in a down state (similar to the data shown in the Failed Objects page). |
| Syslog Errors MessageCode Alert | Gateway | Alert when a specific syslog message is written (only messages with severity = error). You will need to edit this alert and enter the message codes to alert on instead of the supplied sample message code (0x81000098). |
| Transaction Errors Alert | Gateway | Alert when 5 or more transactions with errors ran in the last 30 minutes. Note: when duplicating this alert, the new alert name must start with "Transaction_Errors" (e.g. "Transaction_Errors_2"). |
| API Latency Above 100ms Alert | API-C | Alert if more than 5 API Connect API calls finished with a latency of over 100ms. Change 5 to any other number using the field "Error Threshold". To change "100" to any other latency, you will need to edit the JSON; duplicate the alert first, because the JSON of system predefined alerts cannot be edited. |
| APIs That Ended in Error Code Range | API-C | Alert on any API transaction that ended with status code 500. You may change the range of the status code by editing the parameters JSON (for example, alert on statusCode between 300 and 600). |
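
Several of the alerts above follow the same pattern: count the matching events in a recent time window and compare the count to a threshold. The following Python sketch illustrates that pattern only; it is not DPOD's implementation. The `Event` record, the `count_in_window` helper, and the sample data are hypothetical, and the 30-minute window used for the latency alert is an assumption, since the table does not state one.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Callable, Iterable


@dataclass
class Event:
    """Hypothetical event record for illustration; not a DPOD data structure."""
    timestamp: datetime
    is_error: bool = False
    latency_ms: float = 0.0


def count_in_window(events: Iterable[Event], now: datetime,
                    window: timedelta,
                    predicate: Callable[[Event], bool]) -> int:
    """Count events inside [now - window, now] that satisfy the predicate."""
    start = now - window
    return sum(1 for e in events
               if start <= e.timestamp <= now and predicate(e))


# Made-up sample data: six error transactions and eight API calls.
now = datetime(2024, 1, 1, 12, 0)
events = [Event(now - timedelta(minutes=m), is_error=True)
          for m in (1, 5, 10, 20, 29, 45)]
events += [Event(now - timedelta(minutes=2), latency_ms=ms)
           for ms in (90, 120, 150, 101, 300, 250, 99, 400)]

# Transaction Errors Alert: 5 or more transactions with errors in 30 minutes.
error_count = count_in_window(events, now, timedelta(minutes=30),
                              lambda e: e.is_error)
transaction_errors_alert = error_count >= 5

# API Latency Above 100ms Alert: more than 5 calls with latency over 100 ms.
# The 30-minute window here is an assumption, not a documented value.
slow_count = count_in_window(events, now, timedelta(minutes=30),
                             lambda e: e.latency_ms > 100)
api_latency_alert = slow_count > 5
```

Note the difference between the two comparisons: "5 or more" maps to `>=`, while "more than 5" maps to `>`.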

System Health Metrics

See System Health for more details.

| Name | Description |
|---|---|
| Devices CPU Metric (1) | Alert when the max device CPU during the last 5 minutes was over 80% |
| Devices Fan Metric (1) | Alert when the device fan health is less than 100% |
| Devices Load Metric (1) | Alert when the max device load in the last 5 minutes was more than 80% |
| Devices Memory Metric (1) | Alert when the max used memory of the device in the last 5 minutes was over 70% |
| Devices Space Encrypted Metric (1) | Alert if the free space of the encrypted file system is less than 15% |
| Devices Space Internal Metric (1) | Alert if the free space of the internal file system is less than 15% |
| Devices Space Temp Metric (1) | Alert if the free space of the temporary file system is less than 15% |
| Devices Temperature Metric (1) | Alert when the device temperature health is less than 100% |
| Devices Voltage Metric (1) | Alert if the device's voltage health is less than 100% |
| System Errors Metric | Alert if more than 10 critical system errors were written to syslog in the last 5 minutes |



(1) The marked alerts and metrics are valid only for devices with the option "Device Resources Monitoring" enabled. You may edit the device setting from [Manage → Devices → Monitored Devices].
See also Adding Monitored Devices.
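
The System Health Metrics rules reduce to two kinds of checks: the maximum of a sampled value over a window exceeding a limit (CPU, load, memory), and a current value falling below a limit (free space, fan, temperature, and voltage health). The Python sketch below illustrates both under stated assumptions; the function names and sample readings are hypothetical and do not reflect how DPOD collects or evaluates these metrics.

```python
from typing import Sequence


def max_over_threshold(samples: Sequence[float], threshold: float) -> bool:
    """True when the highest sample in the window exceeds the threshold.

    An empty window never triggers an alert.
    """
    return bool(samples) and max(samples) > threshold


def below_threshold(value: float, threshold: float) -> bool:
    """True when the current value is strictly less than the threshold."""
    return value < threshold


# Made-up readings covering the last 5 minutes (percentages).
cpu_samples = [42.0, 77.5, 83.1, 60.2]
memory_samples = [55.0, 68.0, 69.9]

cpu_alert = max_over_threshold(cpu_samples, 80.0)        # max CPU over 80%
memory_alert = max_over_threshold(memory_samples, 70.0)  # max memory over 70%
space_alert = below_threshold(12.5, 15.0)                # free space under 15%
fan_alert = below_threshold(100.0, 100.0)                # fan health under 100%
```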


