IBM DataPower Operations Dashboard v1.0.11.0

A newer version of this product documentation is available.

You are viewing an older version. View latest at IBM DPOD Documentation.

Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 12 Next »

Alerts

NameDescription
API Latency Above 100ms AlertAlert if more than 5 API Connect API calls finished with latency of over 100ms
Change 5 to any other number using the field "Error Threshold"
To change "100" to any other latency, you will need to edit the JSON - duplicate the alert first, as system predefined alerts' JSON cannot be edited
About to Expire Certificates AlertAlert for certificates that are about to expire. The alert will check for syslog message with code 0x806000e2 that were written in the last 24 hours
Already Expired Certificates AlertAlert for certificates that already expired - the alert will check for syslog messages with code 0x806000e1 that were written in the last 24 hours
Domain Restarts AlertAlert on domain restarts
Number of Probes AlertAlert if more than 1000 transactions with probles were run in the last 10 minutes
Objects Down AlertAlert on all the DataPower objects that are enabled in configuration but in a down state (similar to the data shown in the Failed Objects page)
Syslog Errors MessageCode Alert

Alert when a specific syslog message is written (only messages with severity = error)
You will need to edit this alert and enter the message codes to alert on instead of the supplied sample message code (0x81000098)

Transaction Errors AlertAlert when 5 or more transactions with errors ran in the last 30 minutes
Please note: When duplicating this alert - the new alert name must start with "Transaction_Errors" (e.g. "Transaction_Errors_2") 

System Health Metrics

See System Health for more detailes

NameDescription
Devices CPU Metric 1Alert when the max device CPU during the last 5 minutes was over 80%
Devices Fan Metric 1Alert when the device fan health is less than 100%
Devices Load Metric 1Alert when the max device load in the last 5 minutes was more than 80%
Devices Memory Metric 1Alert when the max used memory of the device in the last 5 minutes was over 70%
Devices Space Encrypted Metric 1Alert if the free space of the encrypted file system is less than 15%
Devices Space Internal Metric 1Alert if the free space of the internal file system is less than 15%
Devices Space Temp Metric 1Alert if the free space of the temporary file system is less than 15%
Devices Temperature Metric 1Alert when the device temperature health is less than 100%
Devices Voltage Metric 1Alert if the device's voltage health is less than 100%
System Errors MetricAlert if more than 10 critical system errors were written to syslog in the last 5 minutes



(1) The following alerts and metrics are valid only for devices with the option "Device Resources Monitoring" enabled. You may edit the device setting from [Manage → Devices → Monitored Devices]. 
See also Adding Monitored Devices.



  • No labels