for BMC ATEN IPMI controllers of Supermicro servers https://www.supermicro.com/solutions/IPMI.cfm
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Default |
---|---|---|
{$TEMP_CRIT} | 60 |
|
{$TEMPCRITLOW} | 5 |
|
{$TEMP_WARN} | 50 |
|
{$SNMP.TIMEOUT} | 5m |
|
{$ICMPLOSSWARN} | 20 |
|
{$ICMPRESPONSETIME_WARN} | 0.15 |
Name | Description | Type | Key and additional info |
---|---|---|---|
Supermicro Aten: Uptime (network) | MIB: SNMPv2-MIB The time (in hundredths of a second) since the network management portion of the system was last re-initialized. |
SNMP agent | system.net.uptime[sysUpTime.0] Preprocessing
|
Supermicro Aten: Uptime (hardware) | MIB: HOST-RESOURCES-MIB The amount of time since this host was last initialized. Note that this is different from sysUpTime in the SNMPv2-MIB [RFC1907] because sysUpTime is the uptime of the network management portion of the system. |
SNMP agent | system.hw.uptime[hrSystemUptime.0] Preprocessing
|
Supermicro Aten: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other snmptrap items |
SNMP trap | snmptrap.fallback |
Supermicro Aten: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., `telephone closet, 3rd floor'). If the location is unknown, the value is the zero-length string. |
SNMP agent | system.location[sysLocation.0] Preprocessing
|
Supermicro Aten: System contact details | MIB: SNMPv2-MIB The textual identification of the contact person for this managed node, together with information on how to contact this person. If no contact information is known, the value is the zero-length string. |
SNMP agent | system.contact[sysContact.0] Preprocessing
|
Supermicro Aten: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the network management subsystem contained in the entity. This value is allocated within the SMI enterprises subtree (1.3.6.1.4.1) and provides an easy and unambiguous means for determining |
SNMP agent | system.objectid[sysObjectID.0] Preprocessing
|
Supermicro Aten: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node.By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is the zero-length string. |
SNMP agent | system.name Preprocessing
|
Supermicro Aten: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating-system, and networking software. |
SNMP agent | system.descr[sysDescr.0] Preprocessing
|
Supermicro Aten: SNMP agent availability | Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list. Possible value: 0 - not available 1 - available 2 - unknown |
Zabbix internal | zabbix[host,snmp,available] |
Supermicro Aten: ICMP ping | Simple check | icmpping | |
Supermicro Aten: ICMP loss | Simple check | icmppingloss | |
Supermicro Aten: ICMP response time | Simple check | icmppingsec |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Supermicro Aten: Host has been restarted | Uptime is less than 10 minutes. |
(last(/Supermicro Aten by SNMP/system.hw.uptime[hrSystemUptime.0])>0 and last(/Supermicro Aten by SNMP/system.hw.uptime[hrSystemUptime.0])<10m) or (last(/Supermicro Aten by SNMP/system.hw.uptime[hrSystemUptime.0])=0 and last(/Supermicro Aten by SNMP/system.net.uptime[sysUpTime.0])<10m) |Warning |
Manual close: Yes Depends on:
|
|
Supermicro Aten: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/Supermicro Aten by SNMP/system.name,#1)<>last(/Supermicro Aten by SNMP/system.name,#2) and length(last(/Supermicro Aten by SNMP/system.name))>0 |Info |
Manual close: Yes | |
Supermicro Aten: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/Supermicro Aten by SNMP/zabbix[host,snmp,available],{$SNMP.TIMEOUT})=0 |Warning |
Depends on:
|
|
Supermicro Aten: Unavailable by ICMP ping | Last three attempts returned timeout. Please check device connectivity. |
max(/Supermicro Aten by SNMP/icmpping,#3)=0 |High |
||
Supermicro Aten: High ICMP ping loss | min(/Supermicro Aten by SNMP/icmppingloss,5m)>{$ICMP_LOSS_WARN} and min(/Supermicro Aten by SNMP/icmppingloss,5m)<100 |Warning |
Depends on:
|
||
Supermicro Aten: High ICMP ping response time | avg(/Supermicro Aten by SNMP/icmppingsec,5m)>{$ICMP_RESPONSE_TIME_WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery | Scanning ATEN-IPMI-MIB::sensorTable with filter: not connected temp sensors (Value = 0) |
SNMP agent | tempDescr.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SENSOR_DESCR}: Temperature | MIB: ATEN-IPMI-MIB A textual string containing information about the interface. This string should include the name of the manufacturer, the product name and the version of the interface hardware/software. |
SNMP agent | sensor.temp.value[sensorReading.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SENSOR_DESCR}: Temperature is above warning threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Supermicro Aten by SNMP/sensor.temp.value[sensorReading.{#SNMPINDEX}],5m)>{$TEMP_WARN:"{#SENSOR_DESCR}"} |Warning |
Depends on:
|
|
{#SENSOR_DESCR}: Temperature is above critical threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Supermicro Aten by SNMP/sensor.temp.value[sensorReading.{#SNMPINDEX}],5m)>{$TEMP_CRIT:"{#SENSOR_DESCR}"} |High |
||
{#SENSOR_DESCR}: Temperature is too low | avg(/Supermicro Aten by SNMP/sensor.temp.value[sensorReading.{#SNMPINDEX}],5m)<{$TEMP_CRIT_LOW:"{#SENSOR_DESCR}"} |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN Discovery | Scanning ATEN-IPMI-MIB::sensorTable with filter: not connected FAN sensors (Value = 0) |
SNMP agent | fan.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SENSOR_DESCR}: Fan speed, % | MIB: ATEN-IPMI-MIB A textual string containing information about the interface. This string should include the name of the manufacturer, the product name and the version of the interface hardware/software. |
SNMP agent | sensor.fan.speed.percentage[sensorReading.{#SNMPINDEX}] |
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
Template for monitoring Intel SR1630 server system.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Type | Key and additional info |
---|---|---|---|
Intel SR1630 IPMI: Baseboard Temp | IPMI agent | baseboard_temp | |
Intel SR1630 IPMI: BB +1.05V PCH | IPMI agent | bb1.05vpch | |
Intel SR1630 IPMI: BB +1.1V P1 Vccp | IPMI agent | bb1.1vp1_vccp | |
Intel SR1630 IPMI: BB +1.5V P1 DDR3 | IPMI agent | bb1.5vp1_ddr3 | |
Intel SR1630 IPMI: BB +3.3V | IPMI agent | bb_3.3v | |
Intel SR1630 IPMI: BB +3.3V STBY | IPMI agent | bb3.3vstby | |
Intel SR1630 IPMI: BB +5.0V | IPMI agent | bb_5.0v | |
Intel SR1630 IPMI: Front Panel Temp | IPMI agent | frontpaneltemp | |
Intel SR1630 IPMI: Power | IPMI agent | power | |
Intel SR1630 IPMI: System Fan 2 | IPMI agent | systemfan2 | |
Intel SR1630 IPMI: System Fan 3 | IPMI agent | systemfan3 |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Intel SR1630 IPMI: Baseboard Temp Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/baseboard_temp)<5 or last(/Intel SR1630 IPMI/baseboard_temp)>90 |Disaster |
Depends on:
|
||
Intel SR1630 IPMI: Baseboard Temp Non-Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/baseboard_temp)<10 or last(/Intel SR1630 IPMI/baseboard_temp)>83 |High |
Depends on:
|
||
Intel SR1630 IPMI: BB +1.05V PCH Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/bb_1.05v_pch)<0.953 or last(/Intel SR1630 IPMI/bb_1.05v_pch)>1.149 |Disaster |
Depends on:
|
||
Intel SR1630 IPMI: BB +1.05V PCH Non-Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/bb_1.05v_pch)<0.985 or last(/Intel SR1630 IPMI/bb_1.05v_pch)>1.117 |High |
Depends on:
|
||
Intel SR1630 IPMI: BB +1.1V P1 Vccp Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/bb_1.1v_p1_vccp)<0.683 or last(/Intel SR1630 IPMI/bb_1.1v_p1_vccp)>1.543 |Disaster |
Depends on:
|
||
Intel SR1630 IPMI: BB +1.1V P1 Vccp Non-Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/bb_1.1v_p1_vccp)<0.708 or last(/Intel SR1630 IPMI/bb_1.1v_p1_vccp)>1.501 |High |
Depends on:
|
||
Intel SR1630 IPMI: BB +1.5V P1 DDR3 Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/bb_1.5v_p1_ddr3)<1.362 or last(/Intel SR1630 IPMI/bb_1.5v_p1_ddr3)>1.635 |Disaster |
Depends on:
|
||
Intel SR1630 IPMI: BB +1.5V P1 DDR3 Non-Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/bb_1.5v_p1_ddr3)<1.401 or last(/Intel SR1630 IPMI/bb_1.5v_p1_ddr3)>1.589 |High |
Depends on:
|
||
Intel SR1630 IPMI: BB +3.3V Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/bb_3.3v)<2.982 or last(/Intel SR1630 IPMI/bb_3.3v)>3.625 |Disaster |
Depends on:
|
||
Intel SR1630 IPMI: BB +3.3V Non-Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/bb_3.3v)<3.067 or last(/Intel SR1630 IPMI/bb_3.3v)>3.525 |High |
Depends on:
|
||
Intel SR1630 IPMI: BB +3.3V STBY Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/bb_3.3v_stby)<2.982 or last(/Intel SR1630 IPMI/bb_3.3v_stby)>3.625 |Disaster |
|||
Intel SR1630 IPMI: BB +3.3V STBY Non-Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/bb_3.3v_stby)<3.067 or last(/Intel SR1630 IPMI/bb_3.3v_stby)>3.525 |High |
Depends on:
|
||
Intel SR1630 IPMI: BB +5.0V Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/bb_5.0v)<4.471 or last(/Intel SR1630 IPMI/bb_5.0v)>5.538 |Disaster |
Depends on:
|
||
Intel SR1630 IPMI: BB +5.0V Non-Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/bb_5.0v)<4.630 or last(/Intel SR1630 IPMI/bb_5.0v)>5.380 |High |
Depends on:
|
||
Intel SR1630 IPMI: Front Panel Temp Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/front_panel_temp)<0 or last(/Intel SR1630 IPMI/front_panel_temp)>48 |Disaster |
Depends on:
|
||
Intel SR1630 IPMI: Front Panel Temp Non-Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/front_panel_temp)<5 or last(/Intel SR1630 IPMI/front_panel_temp)>44 |High |
Depends on:
|
||
Intel SR1630 IPMI: Power | last(/Intel SR1630 IPMI/power)=0 |Warning |
|||
Intel SR1630 IPMI: System Fan 2 Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/system_fan_2)<324 |Disaster |
Depends on:
|
||
Intel SR1630 IPMI: System Fan 2 Non-Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/system_fan_2)<378 |High |
Depends on:
|
||
Intel SR1630 IPMI: System Fan 3 Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/system_fan_3)<324 |Disaster |
Depends on:
|
||
Intel SR1630 IPMI: System Fan 3 Non-Critical [{ITEM.VALUE}] | last(/Intel SR1630 IPMI/system_fan_3)<378 |High |
Depends on:
|
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
Template for monitoring Intel SR1530 server system.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Type | Key and additional info |
---|---|---|---|
Intel SR1530 IPMI: BB +1.8V SM | IPMI agent | bb1.8vsm | |
Intel SR1530 IPMI: BB +3.3V | IPMI agent | bb_3.3v | |
Intel SR1530 IPMI: BB +3.3V STBY | IPMI agent | bb3.3vstby | |
Intel SR1530 IPMI: BB +5.0V | IPMI agent | bb_5.0v | |
Intel SR1530 IPMI: BB Ambient Temp | IPMI agent | bbambienttemp | |
Intel SR1530 IPMI: Power | IPMI agent | power | |
Intel SR1530 IPMI: Processor Vcc | IPMI agent | processor_vcc | |
Intel SR1530 IPMI: System Fan 3 | IPMI agent | systemfan3 |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Intel SR1530 IPMI: BB +1.8V SM Critical [{ITEM.VALUE}] | last(/Intel SR1530 IPMI/bb_1.8v_sm)<1.597 or last(/Intel SR1530 IPMI/bb_1.8v_sm)>2.019 |Disaster |
Depends on:
|
||
Intel SR1530 IPMI: BB +1.8V SM Non-Critical [{ITEM.VALUE}] | last(/Intel SR1530 IPMI/bb_1.8v_sm)<1.646 or last(/Intel SR1530 IPMI/bb_1.8v_sm)>1.960 |High |
Depends on:
|
||
Intel SR1530 IPMI: BB +3.3V Critical [{ITEM.VALUE}] | last(/Intel SR1530 IPMI/bb_3.3v)<2.876 or last(/Intel SR1530 IPMI/bb_3.3v)>3.729 |Disaster |
Depends on:
|
||
Intel SR1530 IPMI: BB +3.3V Non-Critical [{ITEM.VALUE}] | last(/Intel SR1530 IPMI/bb_3.3v)<2.970 or last(/Intel SR1530 IPMI/bb_3.3v)>3.618 |High |
Depends on:
|
||
Intel SR1530 IPMI: BB +3.3V STBY Critical [{ITEM.VALUE}] | last(/Intel SR1530 IPMI/bb_3.3v_stby)<2.876 or last(/Intel SR1530 IPMI/bb_3.3v_stby)>3.729 |Disaster |
|||
Intel SR1530 IPMI: BB +3.3V STBY Non-Critical [{ITEM.VALUE}] | last(/Intel SR1530 IPMI/bb_3.3v_stby)<2.970 or last(/Intel SR1530 IPMI/bb_3.3v_stby)>3.618 |High |
Depends on:
|
||
Intel SR1530 IPMI: BB +5.0V Critical [{ITEM.VALUE}] | last(/Intel SR1530 IPMI/bb_5.0v)<4.362 or last(/Intel SR1530 IPMI/bb_5.0v)>5.663 |Disaster |
Depends on:
|
||
Intel SR1530 IPMI: BB +5.0V Non-Critical [{ITEM.VALUE}] | last(/Intel SR1530 IPMI/bb_5.0v)<4.483 or last(/Intel SR1530 IPMI/bb_5.0v)>5.495 |High |
Depends on:
|
||
Intel SR1530 IPMI: BB Ambient Temp Critical [{ITEM.VALUE}] | last(/Intel SR1530 IPMI/bb_ambient_temp)<5 or last(/Intel SR1530 IPMI/bb_ambient_temp)>66 |Disaster |
|||
Intel SR1530 IPMI: BB Ambient Temp Non-Critical [{ITEM.VALUE}] | last(/Intel SR1530 IPMI/bb_ambient_temp)<10 or last(/Intel SR1530 IPMI/bb_ambient_temp)>61 |High |
Depends on:
|
||
Intel SR1530 IPMI: Power | last(/Intel SR1530 IPMI/power)=0 |Warning |
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
for IMM2 and IMM1 IBM serverX hardware
Description: Some IMMs (IMM1) do not return disks
Description: Some IMMs (IMM1) do not return fan status: fanHealthStatus
Description: IMM1 servers (M2, M3 generations) sysObjectID is NET-SNMP-MIB::netSnmpAgentOIDs.10
Description: IMM1 servers (M2, M3 generations) only Ambient temperature sensor available
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Default |
---|---|---|
{$TEMP_CRIT} | 60 |
|
{$TEMPCRITLOW} | 5 |
|
{$TEMP_WARN} | 50 |
|
{$HEALTHCRITSTATUS} | 2 |
|
{$HEALTHDISASTERSTATUS} | 0 |
|
{$HEALTHWARNSTATUS} | 4 |
|
{$TEMP_CRIT:"Ambient"} | 35 |
|
{$TEMP_WARN:"Ambient"} | 30 |
|
{$DISKOKSTATUS} | Normal |
|
{$PSUOKSTATUS} | Normal |
|
{$FANOKSTATUS} | Normal |
|
{$SNMP.TIMEOUT} | 5m |
|
{$ICMPLOSSWARN} | 20 |
|
{$ICMPRESPONSETIME_WARN} | 0.15 |
Name | Description | Type | Key and additional info |
---|---|---|---|
IBM IMM: Overall system health status | MIB: IMM-MIB Indicates status of system health for the system in which the IMM resides. Value of 'nonRecoverable' indicates a severe error has occurred and the system may not be functioning. A value of 'critical' indicates that an error has occurred but the system is currently functioning properly. A value of 'nonCritical' indicates that a condition has occurred that may change the state of the system in the future but currently the system is working properly. A value of 'normal' indicates that the system is operating normally. |
SNMP agent | system.status[systemHealthStat.0] |
IBM IMM: Hardware model name | MIB: IMM-MIB |
SNMP agent | system.hw.model Preprocessing
|
IBM IMM: Hardware serial number | MIB: IMM-MIB Machine serial number VPD information |
SNMP agent | system.hw.serialnumber Preprocessing
|
IBM IMM: Uptime (network) | MIB: SNMPv2-MIB The time (in hundredths of a second) since the network management portion of the system was last re-initialized. |
SNMP agent | system.net.uptime[sysUpTime.0] Preprocessing
|
IBM IMM: Uptime (hardware) | MIB: HOST-RESOURCES-MIB The amount of time since this host was last initialized. Note that this is different from sysUpTime in the SNMPv2-MIB [RFC1907] because sysUpTime is the uptime of the network management portion of the system. |
SNMP agent | system.hw.uptime[hrSystemUptime.0] Preprocessing
|
IBM IMM: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other snmptrap items |
SNMP trap | snmptrap.fallback |
IBM IMM: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., `telephone closet, 3rd floor'). If the location is unknown, the value is the zero-length string. |
SNMP agent | system.location[sysLocation.0] Preprocessing
|
IBM IMM: System contact details | MIB: SNMPv2-MIB The textual identification of the contact person for this managed node, together with information on how to contact this person. If no contact information is known, the value is the zero-length string. |
SNMP agent | system.contact[sysContact.0] Preprocessing
|
IBM IMM: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the network management subsystem contained in the entity. This value is allocated within the SMI enterprises subtree (1.3.6.1.4.1) and provides an easy and unambiguous means for determining |
SNMP agent | system.objectid[sysObjectID.0] Preprocessing
|
IBM IMM: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node.By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is the zero-length string. |
SNMP agent | system.name Preprocessing
|
IBM IMM: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating-system, and networking software. |
SNMP agent | system.descr[sysDescr.0] Preprocessing
|
IBM IMM: SNMP agent availability | Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list. Possible value: 0 - not available 1 - available 2 - unknown |
Zabbix internal | zabbix[host,snmp,available] |
IBM IMM: ICMP ping | Simple check | icmpping | |
IBM IMM: ICMP loss | Simple check | icmppingloss | |
IBM IMM: ICMP response time | Simple check | icmppingsec |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
IBM IMM: System is in unrecoverable state! | Please check the device for faults |
count(/IBM IMM by SNMP/system.status[systemHealthStat.0],#1,"eq","{$HEALTH_DISASTER_STATUS}")=1 |High |
||
IBM IMM: System status is in critical state | Please check the device for errors |
count(/IBM IMM by SNMP/system.status[systemHealthStat.0],#1,"eq","{$HEALTH_CRIT_STATUS}")=1 |High |
Depends on:
|
|
IBM IMM: System status is in warning state | Please check the device for warnings |
count(/IBM IMM by SNMP/system.status[systemHealthStat.0],#1,"eq","{$HEALTH_WARN_STATUS}")=1 |Warning |
Depends on:
|
|
IBM IMM: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/IBM IMM by SNMP/system.hw.serialnumber,#1)<>last(/IBM IMM by SNMP/system.hw.serialnumber,#2) and length(last(/IBM IMM by SNMP/system.hw.serialnumber))>0 |Info |
Manual close: Yes | |
IBM IMM: Host has been restarted | Uptime is less than 10 minutes. |
(last(/IBM IMM by SNMP/system.hw.uptime[hrSystemUptime.0])>0 and last(/IBM IMM by SNMP/system.hw.uptime[hrSystemUptime.0])<10m) or (last(/IBM IMM by SNMP/system.hw.uptime[hrSystemUptime.0])=0 and last(/IBM IMM by SNMP/system.net.uptime[sysUpTime.0])<10m) |Warning |
Manual close: Yes Depends on:
|
|
IBM IMM: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/IBM IMM by SNMP/system.name,#1)<>last(/IBM IMM by SNMP/system.name,#2) and length(last(/IBM IMM by SNMP/system.name))>0 |Info |
Manual close: Yes | |
IBM IMM: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/IBM IMM by SNMP/zabbix[host,snmp,available],{$SNMP.TIMEOUT})=0 |Warning |
Depends on:
|
|
IBM IMM: Unavailable by ICMP ping | Last three attempts returned timeout. Please check device connectivity. |
max(/IBM IMM by SNMP/icmpping,#3)=0 |High |
||
IBM IMM: High ICMP ping loss | min(/IBM IMM by SNMP/icmppingloss,5m)>{$ICMP_LOSS_WARN} and min(/IBM IMM by SNMP/icmppingloss,5m)<100 |Warning |
Depends on:
|
||
IBM IMM: High ICMP ping response time | avg(/IBM IMM by SNMP/icmppingsec,5m)>{$ICMP_RESPONSE_TIME_WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery | Scanning IMM-MIB::tempTable |
SNMP agent | tempDescr.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SNMPVALUE}: Temperature | MIB: IMM-MIB Temperature readings of testpoint: {#SNMPVALUE} |
SNMP agent | sensor.temp.value[tempReading.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SNMPVALUE}: Temperature is above warning threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/IBM IMM by SNMP/sensor.temp.value[tempReading.{#SNMPINDEX}],5m)>{$TEMP_WARN:"{#SNMPVALUE}"} |Warning |
Depends on:
|
|
{#SNMPVALUE}: Temperature is above critical threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/IBM IMM by SNMP/sensor.temp.value[tempReading.{#SNMPINDEX}],5m)>{$TEMP_CRIT:"{#SNMPVALUE}"} |High |
||
{#SNMPVALUE}: Temperature is too low | avg(/IBM IMM by SNMP/sensor.temp.value[tempReading.{#SNMPINDEX}],5m)<{$TEMP_CRIT_LOW:"{#SNMPVALUE}"} |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery Ambient | Scanning IMM-MIB::tempTable with Ambient filter |
SNMP agent | tempDescr.discovery.ambient |
Name | Description | Type | Key and additional info |
---|---|---|---|
Ambient: Temperature | MIB: IMM-MIB Temperature readings of testpoint: Ambient |
SNMP agent | sensor.temp.value[tempReading.Ambient.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Ambient: Temperature is above warning threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/IBM IMM by SNMP/sensor.temp.value[tempReading.Ambient.{#SNMPINDEX}],5m)>{$TEMP_WARN:"Ambient"} |Warning |
Depends on:
|
|
Ambient: Temperature is above critical threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/IBM IMM by SNMP/sensor.temp.value[tempReading.Ambient.{#SNMPINDEX}],5m)>{$TEMP_CRIT:"Ambient"} |High |
||
Ambient: Temperature is too low | avg(/IBM IMM by SNMP/sensor.temp.value[tempReading.Ambient.{#SNMPINDEX}],5m)<{$TEMP_CRIT_LOW:"Ambient"} |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery CPU | Scanning IMM-MIB::tempTable with CPU filter |
SNMP agent | tempDescr.discovery.cpu |
Name | Description | Type | Key and additional info |
---|---|---|---|
CPU: Temperature | MIB: IMM-MIB Temperature readings of testpoint: CPU |
SNMP agent | sensor.temp.value[tempReading.CPU.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
CPU: Temperature is above warning threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/IBM IMM by SNMP/sensor.temp.value[tempReading.CPU.{#SNMPINDEX}],5m)>{$TEMP_WARN:"CPU"} |Warning |
Depends on:
|
|
CPU: Temperature is above critical threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/IBM IMM by SNMP/sensor.temp.value[tempReading.CPU.{#SNMPINDEX}],5m)>{$TEMP_CRIT:"CPU"} |High |
||
CPU: Temperature is too low | avg(/IBM IMM by SNMP/sensor.temp.value[tempReading.CPU.{#SNMPINDEX}],5m)<{$TEMP_CRIT_LOW:"CPU"} |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU Discovery | IMM-MIB::powerFruName |
SNMP agent | psu.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#PSU_DESCR}: Power supply status | MIB: IMM-MIB A description of the power module status. |
SNMP agent | sensor.psu.status[powerHealthStatus.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#PSU_DESCR}: Power supply is not in normal state | Please check the power supply unit for errors |
count(/IBM IMM by SNMP/sensor.psu.status[powerHealthStatus.{#SNMPINDEX}],#1,"ne","{$PSU_OK_STATUS}")=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN Discovery | IMM-MIB::fanDescr |
SNMP agent | fan.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#FAN_DESCR}: Fan status | MIB: IMM-MIB A description of the fan component status. |
SNMP agent | sensor.fan.status[fanHealthStatus.{#SNMPINDEX}] |
{#FAN_DESCR}: Fan speed, % | MIB: IMM-MIB Fan speed expressed in percent(%) of maximum RPM. An octet string expressed as 'ddd% of maximum' where:d is a decimal digit or blank space for a leading zero. If the fan is determined not to be running or the fan speed cannot be determined, the string will indicate 'Offline'. |
SNMP agent | sensor.fan.speed.percentage[fanSpeed.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#FAN_DESCR}: Fan is not in normal state | Please check the fan unit |
count(/IBM IMM by SNMP/sensor.fan.status[fanHealthStatus.{#SNMPINDEX}],#1,"ne","{$FAN_OK_STATUS}")=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical Disk Discovery | SNMP agent | physicalDisk.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SNMPINDEX}: Physical disk status | MIB: IMM-MIB |
SNMP agent | system.hw.physicaldisk.status[diskHealthStatus.{#SNMPINDEX}] |
{#SNMPINDEX}: Physical disk part number | MIB: IMM-MIB disk module FRU name. |
SNMP agent | system.hw.physicaldisk.part_number[diskFruName.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SNMPINDEX}: Physical disk is not in OK state | Please check physical disk for warnings or errors |
count(/IBM IMM by SNMP/system.hw.physicaldisk.status[diskHealthStatus.{#SNMPINDEX}],#1,"ne","{$DISK_OK_STATUS}")=1 |Warning |
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
The template to monitor HPE Synergy by HTTP. It works without any external scripts and uses the script item.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Name | Description | Default |
---|---|---|
{$HPE.SYNERGY.API.PASSWORD} | Specify password for API. |
|
{$HPE.SYNERGY.API.USERNAME} | Specify user name for API. |
zabbix |
{$HPE.SYNERGY.DATA.TIMEOUT} | Response timeout for API. |
15s |
{$HPE.SYNERGY.API.SCHEME} | The API scheme (http/https). |
https |
{$HPE.SYNERGY.API.PORT} | The API port. |
443 |
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE Synergy: Get data | The JSON with the result from requests to API. |
Script | hpe.synergy.get.data |
HPE Synergy: Get errors | A list of errors from API requests. |
Dependent item | hpe.synergy.get.errors Preprocessing
|
HPE Synergy: Get enclosures data | A list of enclosures. |
Dependent item | hpe.synergy.get.enclosures Preprocessing
|
HPE Synergy: Get datacenters data | Data of the datacenters. |
Dependent item | hpe.synergy.get.datacenters Preprocessing
|
HPE Synergy: Get ethernet networks data | Data of the ethernet networks. |
Dependent item | hpe.synergy.get.ethernet_networks Preprocessing
|
HPE Synergy: Get fabrics data | Data of the fabrics. |
Dependent item | hpe.synergy.get.fabrics Preprocessing
|
HPE Synergy: Get FC networks data | Data of the FC networks. |
Dependent item | hpe.synergy.get.fc_networks Preprocessing
|
HPE Synergy: Get hypervisor managers data | Data of the hypervisor managers. |
Dependent item | hpe.synergy.get.hypervisor_managers Preprocessing
|
HPE Synergy: Get interconnects data | Data of the interconnects. |
Dependent item | hpe.synergy.get.interconnects Preprocessing
|
HPE Synergy: Get logical enclosures data | Data of the logical enclosures. |
Dependent item | hpe.synergy.get.logical_enclosures Preprocessing
|
HPE Synergy: Get racks data | Data of the racks. |
Dependent item | hpe.synergy.get.racks Preprocessing
|
HPE Synergy: Get server hardware data | Data of the server hardware. |
Dependent item | hpe.synergy.get.server_hardware Preprocessing
|
HPE Synergy: Get storage pools data | Data of the storage pools. |
Dependent item | hpe.synergy.get.storage_pools Preprocessing
|
HPE Synergy: Get storage systems data | Data of the storage systems. |
Dependent item | hpe.synergy.get.storage_systems Preprocessing
|
HPE Synergy: Get storage volumes data | Data of the storage volumes. |
Dependent item | hpe.synergy.get.storage_volumes Preprocessing
|
HPE Synergy: Get uplink sets data | Data of the uplink sets. |
Dependent item | hpe.synergy.get.uplink_sets Preprocessing
|
HPE Synergy: Service ping | Checks if the service is running and accepting the TCP connections. |
Simple check | net.tcp.service["{$HPE.SYNERGY.API.SCHEME}","{HOST.CONN}","{$HPE.SYNERGY.API.PORT}"] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HPE Synergy: There are errors in requests to API | Zabbix has received errors from API. |
length(last(/HPE Synergy by HTTP/hpe.synergy.get.errors))>0 |Average |
Depends on:
|
|
HPE Synergy: Service is unavailable | max(/HPE Synergy by HTTP/net.tcp.service["{$HPE.SYNERGY.API.SCHEME}","{HOST.CONN}","{$HPE.SYNERGY.API.PORT}"],5m)=0 |High |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Appliance bays discovery | A list of the appliance bays in the enclosure. |
Dependent item | hpe.synergy.appliances.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Appliance bay [{#ENCLOSURENAME}:{#BAYNUMBER}]: Get data | Data of the appliance bay [{#ENCLOSURENAME}:{#BAYNUMBER}]. |
Dependent item | hpe.synergy.appliance["{#BAYNUMBER}","{#ENCLOSURENAME}",data] Preprocessing
|
Appliance bay [{#ENCLOSURENAME}:{#BAYNUMBER}]: Model | The model name of the appliance. |
Dependent item | hpe.synergy.appliance["{#BAYNUMBER}","{#ENCLOSURENAME}",model] Preprocessing
|
Appliance bay [{#ENCLOSURENAME}:{#BAYNUMBER}]: Part number | The part number of the appliance. |
Dependent item | hpe.synergy.appliance["{#BAYNUMBER}","{#ENCLOSURENAME}",part_number] Preprocessing
|
Appliance bay [{#ENCLOSURENAME}:{#BAYNUMBER}]: Powered on | Yes, if the appliance is powered on; false, otherwise. |
Dependent item | hpe.synergy.appliance["{#BAYNUMBER}","{#ENCLOSURENAME}",powered_on] Preprocessing
|
Appliance bay [{#ENCLOSURENAME}:{#BAYNUMBER}]: Power state | The power state of the appliance bay. EFuse - the power state of the bay - it has been EFused. Reset - the power state of the bay - it has been reset. SoftReset - the power state of the bay - it has been softly reset. Unknown - the power state of the bay is unknown. |
Dependent item | hpe.synergy.appliance["{#BAYNUMBER}","{#ENCLOSURENAME}",baypowerstate] Preprocessing
|
Appliance bay [{#ENCLOSURENAME}:{#BAYNUMBER}]: Presence | Indicates whether an appliance is present in the bay: Absent - the device slot is empty; PresenceNoOp - the device slot is uninitialized; PresenceUnknown - the device presence is unknown; Present - the device slot has a device in it; Subsumed - the device slot is configured to be part of another device slot. Not applicable for the fan or power supply bays. |
Dependent item | hpe.synergy.appliance["{#BAYNUMBER}","{#ENCLOSURENAME}",presence] Preprocessing
|
Appliance bay [{#ENCLOSURENAME}:{#BAYNUMBER}]: Serial number | The serial number of the appliance. |
Dependent item | hpe.synergy.appliance["{#BAYNUMBER}","{#ENCLOSURENAME}",serial_number] Preprocessing
|
Appliance bay [{#ENCLOSURENAME}:{#BAYNUMBER}]: Spare part number | The spare part number of the appliance. |
Dependent item | hpe.synergy.appliance["{#BAYNUMBER}","{#ENCLOSURENAME}",sparepartnumber] Preprocessing
|
Appliance bay [{#ENCLOSURENAME}:{#BAYNUMBER}]: Status | The hardware status of the appliance: Critical - requires immediate attention; Disabled - the resource is currently not operational; OK - indicates normal/informational behavior; Unknown - the health status is not yet known or cannot be determined; Warning - requires attention soon. |
Dependent item | hpe.synergy.appliance["{#BAYNUMBER}","{#ENCLOSURENAME}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Appliance bay [{#ENCLOSURENAME}:{#BAYNUMBER}]: Has critical status | The appliance [{#ENCLOSURENAME}:{#BAYNUMBER}] status is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.appliance["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",status])=0 |High |
||
Appliance bay [{#ENCLOSURENAME}:{#BAYNUMBER}]: Has warning status | The appliance [{#ENCLOSURENAME}:{#BAYNUMBER}] status is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.appliance["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",status])=4 |Warning |
||
Appliance bay [{#ENCLOSURENAME}:{#BAYNUMBER}]: Is disabled | The appliance [{#ENCLOSURENAME}:{#BAYNUMBER}] is currently not operational |
last(/HPE Synergy by HTTP/hpe.synergy.appliance["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Cross bars discovery | SDX cross fabric module connects to all computing devices installed in the system enclosure and brings in the capability of hard partitioning. Crossbar details are relevant only for enclosures with type "SDX". |
Dependent item | hpe.synergy.crossbars.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Crossbar [{#ENCLOSURENAME}:{#BAYNUMBER}]: Get data | Crossbar [{#ENCLOSURENAME}:{#BAYNUMBER}] data |
Dependent item | hpe.synergy.crossbar["{#BAYNUMBER}","{#ENCLOSURENAME}",data] Preprocessing
|
Crossbar [{#ENCLOSURENAME}:{#BAYNUMBER}]: HW version | The hardware version. |
Dependent item | hpe.synergy.crossbar["{#BAYNUMBER}","{#ENCLOSURENAME}",hw_version] Preprocessing
|
Crossbar [{#ENCLOSURENAME}:{#BAYNUMBER}]: Part number | The part number provided by the manufacturer. |
Dependent item | hpe.synergy.crossbar["{#BAYNUMBER}","{#ENCLOSURENAME}",part_number] Preprocessing
|
Crossbar [{#ENCLOSURENAME}:{#BAYNUMBER}]: Presence | The presence in a bay: Absent - the device slot is empty; PresenceNoOp - the device slot is uninitialized; PresenceUnknown - the device presence is unknown; Present - the device slot has a device in it; Subsumed - the device slot is configured to be part of another device slot. Not applicable for the fan or power supply bays. |
Dependent item | hpe.synergy.crossbar["{#BAYNUMBER}","{#ENCLOSURENAME}",presence] Preprocessing
|
Crossbar [{#ENCLOSURENAME}:{#BAYNUMBER}]: Serial number | A serial number. |
Dependent item | hpe.synergy.crossbar["{#BAYNUMBER}","{#ENCLOSURENAME}",serial_number] Preprocessing
|
Crossbar [{#ENCLOSURENAME}:{#BAYNUMBER}]: Status | The overall health status of the crossbar: Critical - requires immediate attention; Disabled - the resource is currently not operational; OK - indicates normal/informational behavior; Unknown - the health status is not yet known or cannot be determined; Warning - requires attention soon. |
Dependent item | hpe.synergy.crossbar["{#BAYNUMBER}","{#ENCLOSURENAME}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Crossbar [{#ENCLOSURENAME}:{#BAYNUMBER}]: Is subsumed | The device slot is configured to be part of another device slot. |
last(/HPE Synergy by HTTP/hpe.synergy.crossbar["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",presence])=4 |Average |
||
Crossbar [{#ENCLOSURENAME}:{#BAYNUMBER}]: Has critical status | The crossbar [{#ENCLOSURENAME}:{#BAYNUMBER}] status is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.crossbar["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",status])=0 |High |
||
Crossbar [{#ENCLOSURENAME}:{#BAYNUMBER}]: Has warning status | The crossbar [{#ENCLOSURENAME}:{#BAYNUMBER}] status is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.crossbar["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",status])=4 |Warning |
||
Crossbar [{#ENCLOSURENAME}:{#BAYNUMBER}]: Is disabled | The crossbar [{#ENCLOSURENAME}:{#BAYNUMBER}] currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.crossbar["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Datacenters discovery | A list of the datacenters. |
Dependent item | hpe.synergy.datacenters.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Datacenter [{#NAME}]: Get data | Data of the datacenter [{#NAME}]. |
Dependent item | hpe.synergy.datacenter["{#NAME}",data] Preprocessing
|
Datacenter [{#NAME}]: State | The current state of the resource. The valid values include Adding, AddError, Configured, CredentialError, Refreshing, RefreshError, Removing, RemoveError, and Unmanaged. |
Dependent item | hpe.synergy.datacenter["{#NAME}",state] Preprocessing
|
Datacenter [{#NAME}]: Status | The overall health status of the resource. The following are the valid values for the status of the resource: OK - indicates normal/informational behavior; Disabled - indicates that a resource is not operational; Warning - requires attention soon; Critical - requires immediate attention; Unknown - should be avoided, but there may be rare occasions when the status is unknown. |
Dependent item | hpe.synergy.datacenter["{#NAME}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Datacenter [{#NAME}]: Add error | The adding of the datacenter [{#NAME}] has failed. |
last(/HPE Synergy by HTTP/hpe.synergy.datacenter["{#NAME}",state])=1 |Average |
||
Datacenter [{#NAME}]: Has credential error | The datacenter [{#NAME}] has a credential error. |
last(/HPE Synergy by HTTP/hpe.synergy.datacenter["{#NAME}",state])=3 |Average |
||
Datacenter [{#NAME}]: Has refresh error | The datacenter [{#NAME}] has a refresh error. |
last(/HPE Synergy by HTTP/hpe.synergy.datacenter["{#NAME}",state])=5 |Average |
||
Datacenter [{#NAME}]: Has remove error | The datacenter [{#NAME}] has a remove error. |
last(/HPE Synergy by HTTP/hpe.synergy.datacenter["{#NAME}",state])=7 |Average |
||
Datacenter [{#NAME}]: Has critical status | The datacenter [{#NAME}] status is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.datacenter["{#NAME}",status])=0 |High |
||
Datacenter [{#NAME}]: Has warning status | The datacenter [{#NAME}] status is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.datacenter["{#NAME}",status])=4 |Warning |
||
Datacenter [{#NAME}]: Is disabled | the datacenter [{#NAME}] currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.datacenter["{#NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Devices discovery | A list of device bays in the enclosure. |
Dependent item | hpe.synergy.devices.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Device [{#ENCLOSURENAME}:{#BAYNUMBER}]: Get data | Data of the device [{#ENCLOSURENAME}:{#BAYNUMBER}]. |
Dependent item | hpe.synergy.device["{#BAYNUMBER}","{#ENCLOSURENAME}",data] Preprocessing
|
Device [{#ENCLOSURENAME}:{#BAYNUMBER}]: Model | The model name of an unsupported device occupying the bay if available. |
Dependent item | hpe.synergy.device["{#BAYNUMBER}","{#ENCLOSURENAME}",model] Preprocessing
|
Device [{#ENCLOSURENAME}:{#BAYNUMBER}]: Power allocated | The power allocated for the enclosed blade. |
Dependent item | hpe.synergy.device["{#BAYNUMBER}","{#ENCLOSURENAME}",power_allocation] Preprocessing
|
Device [{#ENCLOSURENAME}:{#BAYNUMBER}]: Presence | Indicates whether a device is present: Absent - the device slot is empty; PresenceNoOp - the device slot is uninitialized; PresenceUnknown - the device presence is unknown; Present - the device slot has a device in it; Subsumed - the device slot is configured to be part of another device slot. Not applicable for the fan or power supply bays. |
Dependent item | hpe.synergy.device["{#BAYNUMBER}","{#ENCLOSURENAME}",presence] Preprocessing
|
Device [{#ENCLOSURENAME}:{#BAYNUMBER}]: Serial number | If available, the serial number of any device occupying the bay. |
Dependent item | hpe.synergy.device["{#BAYNUMBER}","{#ENCLOSURENAME}",serial_number] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Device [{#ENCLOSURENAME}:{#BAYNUMBER}]: Is subsumed | The device slot is configured to be part of another device slot. |
last(/HPE Synergy by HTTP/hpe.synergy.device["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",presence])=4 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Enclosures discovery | A list of enclosures resources. |
Dependent item | hpe.synergy.enclosures.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Enclosure [{#NAME}]: Get data | Data of the enclosure [{#NAME}]. |
Dependent item | hpe.synergy.enclosure["{#NAME}",data] Preprocessing
|
Enclosure [{#NAME}]: Appliance bays count | The number of the appliance bays in the enclosure. |
Dependent item | hpe.synergy.enclosure["{#NAME}",appliancebaycount] Preprocessing
|
Enclosure [{#NAME}]: Device bays count | The number of the device bays in the enclosure. |
Dependent item | hpe.synergy.enclosure["{#NAME}",devicebaycount] Preprocessing
|
Enclosure [{#NAME}]: Device bays power | The amount of power allocated for the blades in the enclosure. |
Dependent item | hpe.synergy.enclosure["{#NAME}",devicebaywatts] Preprocessing
|
Enclosure [{#NAME}]: Fan bays count | The number of the fan bays in the enclosure. |
Dependent item | hpe.synergy.enclosure["{#NAME}",fanbaycount] Preprocessing
|
Enclosure [{#NAME}]: Firmware baseline | The name of the current firmware baseline. |
Dependent item | hpe.synergy.enclosure["{#NAME}",fwbaselinename] Preprocessing
|
Enclosure [{#NAME}]: Interconnect bays count | The number of the interconnect bays in the enclosure. |
Dependent item | hpe.synergy.enclosure["{#NAME}",interconnectbaycount] Preprocessing
|
Enclosure [{#NAME}]: Interconnect bays power | The amount of power allocated for the interconnects in the enclosure. |
Dependent item | hpe.synergy.enclosure["{#NAME}",interconnectbaywatts] Preprocessing
|
Enclosure [{#NAME}]: Min power supplies | The minimum number of the power supplies needed. |
Dependent item | hpe.synergy.enclosure["{#NAME}",min_ps] Preprocessing
|
Enclosure [{#NAME}]: Min power supplies for redundant power feed | The minimum number of the power supplies needed to fulfill the redundant line feed power mode. |
Dependent item | hpe.synergy.enclosure["{#NAME}",minpsredundant] Preprocessing
|
Enclosure [{#NAME}]: Model | The enclosure model name, for example, "BladeSystem c7000 Enclosure G2. |
Dependent item | hpe.synergy.enclosure["{#NAME}",model] Preprocessing
|
Enclosure [{#NAME}]: Part number | The part number of the enclosure. |
Dependent item | hpe.synergy.enclosure["{#NAME}",part_number] Preprocessing
|
Enclosure [{#NAME}]: Power allocated for fans and management devices | The amount of the power allocated for the fans and management devices of the enclosure. |
Dependent item | hpe.synergy.enclosure["{#NAME}",fansmgmtpower] Preprocessing
|
Enclosure [{#NAME}]: Power capacity | The power capacity based on power mode. |
Dependent item | hpe.synergy.enclosure["{#NAME}",power_capacity] Preprocessing
|
Enclosure [{#NAME}]: Power supply bays count | The number of the power supply bays in the enclosure. |
Dependent item | hpe.synergy.enclosure["{#NAME}",psbaycount] Preprocessing
|
Enclosure [{#NAME}]: Serial number | The serial number of the enclosure. |
Dependent item | hpe.synergy.enclosure["{#NAME}",serial_number] Preprocessing
|
Enclosure [{#NAME}]: State | The current resource state of the enclosure: Adding - the enclosure is being added; Configured - the enclosure is configured and is a part of the logical enclosure. This is the usual state for an enclosure under full management; Configuring - a transient state while the enclosure is being configured for a logical enclosure; Interrupted - the previous operation on the enclosure did not complete. The operation should be re-attempted; Monitored - the enclosure is being monitored. It is not a part of the logical enclosure and only hardware-control operations are available; Pending - there are pending operations on the enclosure. Additional operations are denied; RemoveFailed - the previous operation to remove the enclosure did not succeed. The operation should be re-attempted; Removing - the enclosure is being removed; Unmanaged - the enclosure has been discovered, but has not yet been added for the management or monitoring; Unsupported - the enclosure model or version is not currently supported by HPE OneView. It cannot be configured or monitored. |
Dependent item | hpe.synergy.enclosure["{#NAME}",state] Preprocessing
|
Enclosure [{#NAME}]: State reason | Indicates the reason why the resource in its current state: Missing - the enclosure is no longer connected into the frame link topology; None - no reason is available, or none applies; NotAdded - the enclosure has not been added; NotOwner - the enclosure reports being managed by something other than this HPE OneView; OperationFailed - a prior operation was interrupted; Unowned - the enclosure reports are not being under the management; UnsupportedFirmware - the firmware version of the enclosure is not supported by this version of HPE OneView; UpdatingFirmware - a firmware update is in progress. |
Dependent item | hpe.synergy.enclosure["{#NAME}",state_reason] Preprocessing
|
Enclosure [{#NAME}]: Status | The overall health status of the enclosure. The enclosure status reflects the hardware health of the enclosure, all the bays, and the enclosure components (e.g. the enclosure mid-plane, fans, power supplies, Synergy Frame Link Modules, and Synergy Composers). It explicitly does not include the status of the other HPE OneView resources such as the blades (server hardware), the interconnects, and the drive enclosures. Critical - requires immediate attention. Disabled - the resource is currently not operational. OK - indicates normal/informational behavior. Unknown - the health status is not yet known or cannot be determined. Warning - requires attention soon. |
Dependent item | hpe.synergy.enclosure["{#NAME}",status] Preprocessing
|
Enclosure [{#NAME}]: Total allocated power | The total amount of the power allocated in the enclosure. |
Dependent item | hpe.synergy.enclosure["{#NAME}",powertotalallocated] Preprocessing
|
Enclosure [{#NAME}]: Total available power | The amount of the unallocated power in the enclosure. |
Dependent item | hpe.synergy.enclosure["{#NAME}",powertotalavailable] Preprocessing
|
Enclosure [{#NAME}]: Type | The type of the enclosure, for example, "C7000" or "SY12000" or "SDX". |
Dependent item | hpe.synergy.enclosure["{#NAME}",type] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Enclosure [{#NAME}]: Is interrupted | The previous operation on the enclosure did not complete. The operation should be re-attempted. |
last(/HPE Synergy by HTTP/hpe.synergy.enclosure["{#NAME}",state])=3 and last(/HPE Synergy by HTTP/hpe.synergy.enclosure["{#NAME}",state_reason])>-1 |Warning |
||
Enclosure [{#NAME}]: Is unsupported | The enclosure model or version is not currently supported by HPE OneView. It cannot be configured or monitored. |
last(/HPE Synergy by HTTP/hpe.synergy.enclosure["{#NAME}",state])=9 and last(/HPE Synergy by HTTP/hpe.synergy.enclosure["{#NAME}",state_reason])>-1 |Average |
||
Enclosure [{#NAME}]: Remove failed | The previous operation to remove the enclosure did not succeed. The operation should be re-attempted. |
last(/HPE Synergy by HTTP/hpe.synergy.enclosure["{#NAME}",state])=6 and last(/HPE Synergy by HTTP/hpe.synergy.enclosure["{#NAME}",state_reason])>-1 |Warning |
||
Enclosure [{#NAME}]: Is missing | The enclosure is no longer connected into the frame link topology. |
last(/HPE Synergy by HTTP/hpe.synergy.enclosure["{#NAME}",state_reason])=0 |Average |
||
Enclosure [{#NAME}]: Is unowned | The enclosure reports are not being under the management. |
last(/HPE Synergy by HTTP/hpe.synergy.enclosure["{#NAME}",state_reason])=5 |Average |
||
Enclosure [{#NAME}]: Has critical status | The status of the enclosure [{#NAME}] is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.enclosure["{#NAME}",status])=0 |High |
||
Enclosure [{#NAME}]: Has warning status | The status of the enclosure [{#NAME}] is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.enclosure["{#NAME}",status])=4 |Warning |
||
Enclosure [{#NAME}]: Is disabled | The enclosure [{#NAME}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.enclosure["{#NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Ethernet networks discovery | A list of the ethernet networks. |
Dependent item | hpe.synergy.ethernet.networks.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Ethernet network [{#NAME}]: Get data | Data of the ethernet network [{#NAME}]. |
Dependent item | hpe.synergy.ethernet.network["{#NAME}",data] Preprocessing
|
Ethernet network [{#NAME}]: State | The current state of the resource. |
Dependent item | hpe.synergy.ethernet.network["{#NAME}",state] Preprocessing
|
Ethernet network [{#NAME}]: Status | The overall health status of the resource. The following are the valid values for the status of the resource: OK - indicates normal/informational behavior; Disabled - indicates that the resource is not operational; Warning - requires attention soon; Critical - requires immediate attention; Unknown - should be avoided, but there may be rare occasions when the status is unknown. |
Dependent item | hpe.synergy.ethernet.network["{#NAME}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Ethernet network [{#NAME}]: Has critical status | The ethernet network [{#NAME}] status is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.ethernet.network["{#NAME}",status])=0 |High |
||
Ethernet network [{#NAME}]: Has warning status | The ethernet network [{#NAME}] status is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.ethernet.network["{#NAME}",status])=4 |Warning |
||
Ethernet network [{#NAME}]: Is disabled | The ethernet network [{#NAME}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.ethernet.network["{#NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Fabrics discovery | A list of the fabrics. |
Dependent item | hpe.synergy.fabrics.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Fabric [{#NAME}]: Get data | Data of the fabric [{#NAME}]. |
Dependent item | hpe.synergy.fabric["{#NAME}",data] Preprocessing
|
Fabric [{#NAME}]: State | The current state of the resource. |
Dependent item | hpe.synergy.fabric["{#NAME}",state] Preprocessing
|
Fabric [{#NAME}]: Status | The overall health status of the resource. The following are the valid values for the status of the resource: OK - indicates normal/informational behavior; Disabled - indicates that the resource is not operational; Warning - requires attention soon; Critical - requires immediate attention; Unknown - should be avoided, but there may be rare occasions when the status is unknown. |
Dependent item | hpe.synergy.fabric["{#NAME}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Fabric [{#NAME}]: Has critical status | The status of the fabric [{#NAME}] is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.fabric["{#NAME}",status])=0 |High |
||
Fabric [{#NAME}]: Has warning status | The status of the fabric [{#NAME}] is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.fabric["{#NAME}",status])=4 |Warning |
||
Fabric [{#NAME}]: Is disabled | The status of the fabric [{#NAME}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.fabric["{#NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Fans discovery | A list of the fan bays in the enclosure. |
Dependent item | hpe.synergy.fans.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Fan [{#ENCLOSURENAME}:{#BAYNUMBER}]: Get data | Data of the fan [{#ENCLOSURENAME}:{#BAYNUMBER}]. |
Dependent item | hpe.synergy.fan["{#BAYNUMBER}","{#ENCLOSURENAME}",data] Preprocessing
|
Fan [{#ENCLOSURENAME}:{#BAYNUMBER}]: Model | The common descriptive model of the fan. |
Dependent item | hpe.synergy.fan["{#BAYNUMBER}","{#ENCLOSURENAME}",model] Preprocessing
|
Fan [{#ENCLOSURENAME}:{#BAYNUMBER}]: Part number | The part number of the fan. |
Dependent item | hpe.synergy.fan["{#BAYNUMBER}","{#ENCLOSURENAME}",part_number] Preprocessing
|
Fan [{#ENCLOSURENAME}:{#BAYNUMBER}]: Presence | Indicates whether a fan is present: Absent - the device slot is empty; PresenceNoOp - the device slot is uninitialized; PresenceUnknown - the device presence is unknown; Present - the device slot has a device in it; Subsumed - the device slot is configured to be part of another device slot. Not applicable for the fan or power supply bays. |
Dependent item | hpe.synergy.fan["{#BAYNUMBER}","{#ENCLOSURENAME}",presence] Preprocessing
|
Fan [{#ENCLOSURENAME}:{#BAYNUMBER}]: Required | Indicates whether the enclosure configuration requires a fan to be present in the bay. |
Dependent item | hpe.synergy.fan["{#BAYNUMBER}","{#ENCLOSURENAME}",required] Preprocessing
|
Fan [{#ENCLOSURENAME}:{#BAYNUMBER}]: Serial number | The serial number of the fan. |
Dependent item | hpe.synergy.fan["{#BAYNUMBER}","{#ENCLOSURENAME}",serial_number] Preprocessing
|
Fan [{#ENCLOSURENAME}:{#BAYNUMBER}]: Spare part number | The spare part number to be used when ordering an additional or replacement fan of this type. |
Dependent item | hpe.synergy.fan["{#BAYNUMBER}","{#ENCLOSURENAME}",sparepartnumber] Preprocessing
|
Fan [{#ENCLOSURENAME}:{#BAYNUMBER}]: State | The current state of the fan: Degraded - a fan is degraded; Failed - a fan has failed; Misplaced - a fan is present, but not required in this bay, and the overall fan configuration is not compliant with the enclosure fan placement rules; Missing - a fan is required, but is not present; OK - a fan bay has no issues; Unknown - the state of a fan is unknown. |
Dependent item | hpe.synergy.fan["{#BAYNUMBER}","{#ENCLOSURENAME}",state] Preprocessing
|
Fan [{#ENCLOSURENAME}:{#BAYNUMBER}]: Status | The overall health status of the fan: Critical - requires immediate attention; Disabled - the resource is currently not operational; OK - indicates normal/informational behavior; Unknown - the health status is not yet known or cannot be determined; Warning - requires attention soon. |
Dependent item | hpe.synergy.fan["{#BAYNUMBER}","{#ENCLOSURENAME}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Fan [{#ENCLOSURENAME}:{#BAYNUMBER}]: Is degraded | The fan [{#ENCLOSURENAME}:{#BAYNUMBER}] is in degraded state. |
last(/HPE Synergy by HTTP/hpe.synergy.fan["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",state])=0 |Average |
||
Fan [{#ENCLOSURENAME}:{#BAYNUMBER}]: Is failed | The fan [{#ENCLOSURENAME}:{#BAYNUMBER}] is in failed state. |
last(/HPE Synergy by HTTP/hpe.synergy.fan["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",state])=1 |High |
||
Fan [{#ENCLOSURENAME}:{#BAYNUMBER}]: Is misplaced | The fan [{#ENCLOSURENAME}:{#BAYNUMBER}] is misplaced. |
last(/HPE Synergy by HTTP/hpe.synergy.fan["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",state])=2 |Warning |
||
Fan [{#ENCLOSURENAME}:{#BAYNUMBER}]: Is missing | The fan [{#ENCLOSURENAME}:{#BAYNUMBER}] is missing. |
last(/HPE Synergy by HTTP/hpe.synergy.fan["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",state])=3 |Average |
||
Fan [{#ENCLOSURENAME}:{#BAYNUMBER}]: Has critical status | The fan [{#ENCLOSURENAME}:{#BAYNUMBER}] status is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.fan["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",status])=0 |High |
||
Fan [{#ENCLOSURENAME}:{#BAYNUMBER}]: Has warning status | The fan [{#ENCLOSURENAME}:{#BAYNUMBER}] status is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.fan["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",status])=4 |Warning |
||
Fan [{#ENCLOSURENAME}:{#BAYNUMBER}]: Is disabled | The fan [{#ENCLOSURENAME}:{#BAYNUMBER}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.fan["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
FC networks discovery | A list of the FC networks. |
Dependent item | hpe.synergy.fc.networks.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
FC network [{#NAME}]: Get data | Data of the FC network [{#NAME}]. |
Dependent item | hpe.synergy.fc.network["{#NAME}",data] Preprocessing
|
FC network [{#NAME}]: State | The current state of the resource. |
Dependent item | hpe.synergy.fc.network["{#NAME}",state] Preprocessing
|
FC network [{#NAME}]: Status | The overall health status of the resource. The following are the valid values for the status of the resource: OK - indicates normal/informational behavior; Disabled - indicates that the resource is not operational; Warning - requires attention soon; Critical - requires immediate attention; Unknown - should be avoided, but there may be rare occasions when the status is unknown. |
Dependent item | hpe.synergy.fc.network["{#NAME}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
FC network [{#NAME}]: Has critical status | The FC network [{#NAME}] status is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.fc.network["{#NAME}",status])=0 |High |
||
FC network [{#NAME}]: Has warning status | The FC network [{#NAME}] status is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.fc.network["{#NAME}",status])=4 |Warning |
||
FC network [{#NAME}]: Is disabled | The FC network [{#NAME}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.fc.network["{#NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Hypervisor managers discovery | A list of the hypervisor managers. |
Dependent item | hpe.synergy.hypervisor.managers.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Hypervisor manager [{#NAME}]: Get data | Data of the hypervisor manager [{#NAME}]. |
Dependent item | hpe.synergy.hypervisor_manager["{#NAME}",data] Preprocessing
|
Hypervisor manager [{#NAME}]: State | The current state of the resource. The valid values include Connected, Disconnected, Configuring and Error. |
Dependent item | hpe.synergy.hypervisor_manager["{#NAME}",state] Preprocessing
|
Hypervisor manager [{#NAME}]: State reason | Indicates the reason why the resource is in its current state. |
Dependent item | hpe.synergy.hypervisormanager["{#NAME}",statereason] Preprocessing
|
Hypervisor manager [{#NAME}]: Status | The current status of this resource: Critical - requires immediate attention; Disabled - the resource is currently not operational; OK - indicates normal/informational behavior; Unknown - the health status is not yet known or cannot be determined; Warning - requires attention soon. |
Dependent item | hpe.synergy.hypervisor_manager["{#NAME}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Hypervisor manager [{#NAME}]: Is in error state | The hypervisor manager [{#NAME}] has an error. |
last(/HPE Synergy by HTTP/hpe.synergy.hypervisor_manager["{#NAME}",state])=3 and length(last(/HPE Synergy by HTTP/hpe.synergy.hypervisor_manager["{#NAME}",state_reason]))>0 |High |
||
Hypervisor manager [{#NAME}]: Has critical status | The hypervisor manager [{#NAME}] status is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.hypervisor_manager["{#NAME}",status])=0 |High |
||
Hypervisor manager [{#NAME}]: Has warning status | The hypervisor manager [{#NAME}] status is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.hypervisor_manager["{#NAME}",status])=4 |Warning |
||
Hypervisor manager [{#NAME}]: Is disabled | The hypervisor manager [{#NAME}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.hypervisor_manager["{#NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Interconnects discovery | Interconnects are centrally managed by their containing logical interconnect. The interconnect provides a physical view of a detailed downlink and uplink port state and configuration, including the current link state, speed, port role (uplink, downlink, or stacking), current pluggable media, power state, and immediate connected neighbor. |
Dependent item | hpe.synergy.interconnects.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Interconnect [{#NAME}]: Get data | Data of the interconnect [{#NAME}]. |
Dependent item | hpe.synergy.interconnect["{#NAME}",data] Preprocessing
|
Interconnect [{#NAME}]: Hardware health | The health status of the interconnect hardware. |
Dependent item | hpe.synergy.interconnect["{#NAME}",hw.health] Preprocessing
|
Interconnect [{#NAME}]: Model | The interconnect model. |
Dependent item | hpe.synergy.interconnect["{#NAME}",model] Preprocessing
|
Interconnect [{#NAME}]: Part number | The part number of the interconnect. |
Dependent item | hpe.synergy.interconnect["{#NAME}",part_number] Preprocessing
|
Interconnect [{#NAME}]: Port count | The number of ports on the interconnect. |
Dependent item | hpe.synergy.interconnect["{#NAME}",port_count] Preprocessing
|
Interconnect [{#NAME}]: Serial number | The serial number of the interconnect. |
Dependent item | hpe.synergy.interconnect["{#NAME}",serial_number] Preprocessing
|
Interconnect [{#NAME}]: Spare part number | The spare part number of the interconnect. |
Dependent item | hpe.synergy.interconnect["{#NAME}",sparepartnumber] Preprocessing
|
Interconnect [{#NAME}]: State | The current state of the resource. |
Dependent item | hpe.synergy.interconnect["{#NAME}",state] Preprocessing
|
Interconnect [{#NAME}]: Status | The overall health status of the resource. The following are the valid values for the status of the resource: OK - indicates normal/informational behavior; Disabled - indicates that the resource is not operational; Warning - requires attention soon; Critical - requires immediate attention; Unknown - should be avoided, but there may be rare occasions when the status is unknown. |
Dependent item | hpe.synergy.interconnect["{#NAME}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Interconnect [{#NAME}]: Has critical status | The interconnect [{#NAME}] status is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.interconnect["{#NAME}",status])=0 |High |
||
Interconnect [{#NAME}]: Has warning status | The interconnect [{#NAME}] status is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.interconnect["{#NAME}",status])=4 |Warning |
||
Interconnect [{#NAME}]: Is disabled | The interconnect [{#NAME}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.interconnect["{#NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Logical enclosures discovery | A list of the logical enclosures. |
Dependent item | hpe.synergy.logical_enclosures.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Logical enclosure [{#NAME}]: Get data | Data of the logical enclosure [{#NAME}]. |
Dependent item | hpe.synergy.logical_enclosure["{#NAME}",data] Preprocessing
|
Logical enclosure [{#NAME}]: State | The current resource state of the logical enclosure: Consistent - this is the expected state of the logical enclosure. The logical enclosure configuration is consistent with the enclosure group, and the configuration of the hardware resources is consistent with the logical enclosure configuration; Creating - the logical enclosure is being created; DeleteFailed - the prior attempt to delete the logical enclosure failed. Retry the delete operation potentially with the force option. No other logical enclosure operations are allowed in this state; Deleting - the logical enclosure is being deleted; Inconsistent - the configuration of the logical enclosure differs from that of the enclosure group, or the configuration of the hardware resources is inconsistent with the logical enclosure configuration. Perform an Update from group, Reapply configuration, or Update firmware action as an appropriate to bring the configuration back into consistency; Updating - configuration changes are being applied to the hardware configuration. |
Dependent item | hpe.synergy.logical_enclosure["{#NAME}",state] Preprocessing
|
Logical enclosure [{#NAME}]: Status | The overall health status of the resource. The following are the valid values for the status of the resource: OK - indicates normal/informational behavior; Disabled - indicates that the resource is not operational; Warning - requires attention soon; Critical - requires immediate attention; Unknown - should be avoided, but there may be rare occasions when the status is unknown. |
Dependent item | hpe.synergy.logical_enclosure["{#NAME}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Logical enclosure [{#NAME}]: Delete failed | Indicator that the deletion of a logical enclosure failed. |
last(/HPE Synergy by HTTP/hpe.synergy.logical_enclosure["{#NAME}",state])=2 |Average |
||
Logical enclosure [{#NAME}]: Is inconsistent | The configuration of the logical enclosure differs from that of the enclosure group, or the configuration of the hardware resources is inconsistent with the logical enclosure configuration. Perform an Update from group, Reapply configuration, or Update firmware action as an appropriate to bring the configuration back into consistency. |
last(/HPE Synergy by HTTP/hpe.synergy.logical_enclosure["{#NAME}",state])=0 |Average |
||
Logical enclosure [{#NAME}]: Has critical status | The status of the logical enclosure [{#NAME}] is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.logical_enclosure["{#NAME}",status])=0 |High |
||
Logical enclosure [{#NAME}]: Has warning status | The status of the logical enclosure [{#NAME}] is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.logical_enclosure["{#NAME}",status])=4 |Warning |
||
Logical enclosure [{#NAME}]: Is disabled | The logical enclosure [{#NAME}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.logical_enclosure["{#NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
nPar discovery | Electrically isolated hardware partition (nPar). Partition details are relevant only for enclosures with type "SDX". |
Dependent item | hpe.synergy.npar.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Partition [{#ENCLOSURENAME}:{#PARTITIONID}]: Get data | Data of the partition [{#ENCLOSURENAME}:{#PARTITIONID}]. |
Dependent item | hpe.synergy.partition["{#PARTITIONID}","{#ENCLOSURENAME}",data] Preprocessing
|
Partition [{#ENCLOSURENAME}:{#PARTITIONID}]: Devices count | The number of blades in the partition. |
Dependent item | hpe.synergy.partition["{#PARTITIONID}","{#ENCLOSURENAME}",device_count] Preprocessing
|
Partition [{#ENCLOSURENAME}:{#PARTITIONID}]: Health | Indicates the health of the partition and the health of its owned resources (blades and IO bays) as reported by the firmware. If a problem is detected with one of the resources, the health of the partition is reported as Degraded. If all the resources in the partition are operating correctly, the health of the partition is reported as OK. NparDegrade - one or more resources in the partition are unhealthy. NparHealthInvalid - a partition health is invalid. NparHealthMax - a delimiter defined by the firmware. NparOk - all the resources in the partition are healthy. |
Dependent item | hpe.synergy.partition["{#PARTITIONID}","{#ENCLOSURENAME}",health] Preprocessing
|
Partition [{#ENCLOSURENAME}:{#PARTITIONID}]: Memory | The total memory of the partition. |
Dependent item | hpe.synergy.partition["{#PARTITIONID}","{#ENCLOSURENAME}",memory] Preprocessing
|
Partition [{#ENCLOSURENAME}:{#PARTITIONID}]: Processors Count | The number of processors in the partition. |
Dependent item | hpe.synergy.partition["{#PARTITIONID}","{#ENCLOSURENAME}",processor_count] Preprocessing
|
Partition [{#ENCLOSURENAME}:{#PARTITIONID}]: Status | Indicates whether the partition has been booted and also indicates its current power state: ParStatusActive - a partition is active when a 'poweron' operation is initiated on the partition and the firmware boot process is started; ParStatusInactive - a partition is in an inactive state after it has been created or shut down; ParStatusInvalid - a partition status is invalid; ParStatusManualRepair - a partition under manual repair; ParStatusMax - a delimiter defined by the OA firmware; ParStatusUndefined - partition status is undefined; ParStatusUnknown - a partition might report an Unknown state after an OA restart. This state is possible when the firmware is not able to identify the correct partition state due to the internal firmware errors at an OA startup. The state is persistent and can only be cleared by force powering off of the partition from the OA. A partition in this state will not accept any partition operation except parstatus and force poweroff. Any active OS instances continue to run unhindered even when the partition is in an unknown state. |
Dependent item | hpe.synergy.partition["{#PARTITIONID}","{#ENCLOSURENAME}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Partition [{#ENCLOSURENAME}:{#PARTITIONID}]: Health is invalid | The partition health is invalid. |
last(/HPE Synergy by HTTP/hpe.synergy.partition["{#PARTITION_ID}","{#ENCLOSURE_NAME}",health])=1 |Average |
||
Partition [{#ENCLOSURENAME}:{#PARTITIONID}]: Health is degraded | One or more resources in the partition are unhealthy. |
last(/HPE Synergy by HTTP/hpe.synergy.partition["{#PARTITION_ID}","{#ENCLOSURE_NAME}",health])=0 |High |
||
Partition [{#ENCLOSURENAME}:{#PARTITIONID}]: Is invalid | The partition status is invalid. |
last(/HPE Synergy by HTTP/hpe.synergy.partition["{#PARTITION_ID}","{#ENCLOSURE_NAME}",status])=2 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Power supplies discovery | List of power supply bays in the enclosure. |
Dependent item | hpe.synergy.ps.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Power Supply [{#ENCLOSURENAME}:{#BAYNUMBER}]: Get data | Data of the power Supply [{#ENCLOSURENAME}:{#BAYNUMBER}]. |
Dependent item | hpe.synergy.powersupply["{#BAYNUMBER}","{#ENCLOSURE_NAME}",data] Preprocessing
|
Power Supply [{#ENCLOSURENAME}:{#BAYNUMBER}]: Model | The common descriptive model of the power supply. |
Dependent item | hpe.synergy.powersupply["{#BAYNUMBER}","{#ENCLOSURE_NAME}",model] Preprocessing
|
Power Supply [{#ENCLOSURENAME}:{#BAYNUMBER}]: Output capacity | The output capacity of the power supply. |
Dependent item | hpe.synergy.powersupply["{#BAYNUMBER}","{#ENCLOSURENAME}",outputcapacity] Preprocessing
|
Power Supply [{#ENCLOSURENAME}:{#BAYNUMBER}]: Part number | The part number of the power supply. |
Dependent item | hpe.synergy.powersupply["{#BAYNUMBER}","{#ENCLOSURENAME}",partnumber] Preprocessing
|
Power Supply [{#ENCLOSURENAME}:{#BAYNUMBER}]: Presence | Indicates whether a power supply is present: Absent - the device slot is empty; PresenceNoOp - the device slot is uninitialized; PresenceUnknown - the device presence is unknown; Present - the device slot has a device in it; Subsumed - the device slot is configured to be part of another device slot. Not applicable for the fan or power supply bays. |
Dependent item | hpe.synergy.powersupply["{#BAYNUMBER}","{#ENCLOSURE_NAME}",presence] Preprocessing
|
Power Supply [{#ENCLOSURENAME}:{#BAYNUMBER}]: Serial number | The unique serial number of the power supply. |
Dependent item | hpe.synergy.powersupply["{#BAYNUMBER}","{#ENCLOSURENAME}",serialnumber] Preprocessing
|
Power Supply [{#ENCLOSURENAME}:{#BAYNUMBER}]: Spare part number | The spare part number to be used when ordering an additional or replacement power supply of this type. |
Dependent item | hpe.synergy.powersupply["{#BAYNUMBER}","{#ENCLOSURENAME}",sparepart_number] Preprocessing
|
Power Supply [{#ENCLOSURENAME}:{#BAYNUMBER}]: Status | The overall health status of the resource. The following are the valid values for the status of the resource: OK - indicates normal/informational behavior; Disabled - indicates that the resource is not operational; Warning - requires attention soon; Critical - requires immediate attention; Unknown - should be avoided, but there may be rare occasions when the status is unknown. |
Dependent item | hpe.synergy.powersupply["{#BAYNUMBER}","{#ENCLOSURE_NAME}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Power Supply [{#ENCLOSURENAME}:{#BAYNUMBER}]: Has critical status | The status of the power supply [{#ENCLOSURENAME}:{#BAYNUMBER}] is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.power_supply["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",status])=0 |High |
||
Power Supply [{#ENCLOSURENAME}:{#BAYNUMBER}]: Has warning status | The status of the power supply [{#ENCLOSURENAME}:{#BAYNUMBER}] is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.power_supply["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",status])=4 |Warning |
||
Power Supply [{#ENCLOSURENAME}:{#BAYNUMBER}]: Is disabled | The status of Power Supply [{#ENCLOSURENAME}:{#BAYNUMBER}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.power_supply["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Racks discovery | A list of the racks. |
Dependent item | hpe.synergy.racks.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Rack [{#NAME}]: Get data | Data of the rack [{#NAME}]. |
Dependent item | hpe.synergy.rack["{#NAME}",data] Preprocessing
|
Rack [{#NAME}]: State | The current state of the resource. the valid values include Adding, AddError, Configured, CredentialError, Refreshing, RefreshError, Removing, RemoveError, and Unmanaged. |
Dependent item | hpe.synergy.rack["{#NAME}",state] Preprocessing
|
Rack [{#NAME}]: Status | The overall health status of the resource. The following are the valid values for the status of the resource: OK - indicates normal/informational behavior; Disabled - indicates that the resource is not operational; Warning - requires attention soon; Critical - requires immediate attention; Unknown - should be avoided, but there may be rare occasions when the status is unknown. |
Dependent item | hpe.synergy.rack["{#NAME}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Rack [{#NAME}]: Add error | Adding the rack [{#NAME}] failed. |
last(/HPE Synergy by HTTP/hpe.synergy.rack["{#NAME}",state])=1 |Average |
||
Rack [{#NAME}]: Has credential error | The rack [{#NAME}] has credential error. |
last(/HPE Synergy by HTTP/hpe.synergy.rack["{#NAME}",state])=3 |Average |
||
Rack [{#NAME}]: Has refresh error | The rack [{#NAME}] has refresh error. |
last(/HPE Synergy by HTTP/hpe.synergy.rack["{#NAME}",state])=5 |Average |
||
Rack [{#NAME}]: Has remove error | The rack [{#NAME}] has remove error. |
last(/HPE Synergy by HTTP/hpe.synergy.rack["{#NAME}",state])=7 |Average |
||
Rack [{#NAME}]: Has critical status | The rack [{#NAME}] status is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.rack["{#NAME}",status])=0 |High |
||
Rack [{#NAME}]: Has warning status | The rack [{#NAME}] status is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.rack["{#NAME}",status])=4 |Warning |
||
Rack [{#NAME}]: Is disabled | The rack [{#NAME}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.rack["{#NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Server hardware discovery | The server hardware resource is a representation of a physical server. |
Dependent item | hpe.synergy.server_hardware.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Server [{#SERVER_NAME}:{#LOCATION}]: Get data | Data of the server [{#SERVER_NAME}:{#LOCATION}]. |
Dependent item | hpe.synergy.server_hardware["{#LOCATION}",data] Preprocessing
|
Server [{#SERVER_NAME}:{#LOCATION}]: Maintenance state | The maintenance flag of the Server Hardware - disruptive maintenance operations, such as firmware update, can cause many server hardware alerts to be generated in a short period of time. Example: network connectivity is lost or the server reset is detected. When this field is set, predefined alerts for this particular device are suppressed. This field is set only when firmware update is ongoing. The alerts are processed normally once firmware update operation completes. Possible values are Maintenance and Normal. |
Dependent item | hpe.synergy.serverhardware["{#LOCATION}",maintenancestate] Preprocessing
|
Server [{#SERVER_NAME}:{#LOCATION}]: Maintenance state reason | This field is set to Firmware update when the server is put under maintenance. |
Dependent item | hpe.synergy.serverhardware["{#LOCATION}",maintenancestate_reason] Preprocessing
|
Server [{#SERVER_NAME}:{#LOCATION}]: Memory | The amount of memory installed on this server hardware. |
Dependent item | hpe.synergy.server_hardware["{#LOCATION}",memory] Preprocessing
|
Server [{#SERVER_NAME}:{#LOCATION}]: Migration state | The state of an ongoing virtual connect manager (VCM) migration: Migrating - the enclosure is in the process of migrating from VCM; NotApplicable - the enclosure did not require or has already completed the migration; Unknown - the migration state is unknown. |
Dependent item | hpe.synergy.serverhardware["{#LOCATION}",migrationstate] Preprocessing
|
Server [{#SERVER_NAME}:{#LOCATION}]: Model | The model string of the full server hardware. |
Dependent item | hpe.synergy.server_hardware["{#LOCATION}",model] Preprocessing
|
Server [{#SERVER_NAME}:{#LOCATION}]: Part number | The part number for this server hardware. |
Dependent item | hpe.synergy.serverhardware["{#LOCATION}",partnumber] Preprocessing
|
Server [{#SERVER_NAME}:{#LOCATION}]: Power state | The current power state of the server hardware. The values are Unknown, On, Off, PoweringOn, PoweringOff or Resetting. |
Dependent item | hpe.synergy.serverhardware["{#LOCATION}",powerstate] Preprocessing
|
Server [{#SERVER_NAME}:{#LOCATION}]: Processor cores count | The number of cores available per processor. |
Dependent item | hpe.synergy.serverhardware.processor["{#LOCATION}",corescount] Preprocessing
|
Server [{#SERVER_NAME}:{#LOCATION}]: Processors count | The number of processors installed on this server hardware. |
Dependent item | hpe.synergy.server_hardware.processor["{#LOCATION}",count] Preprocessing
|
Server [{#SERVER_NAME}:{#LOCATION}]: Processor speed | The speed of the CPUs. |
Dependent item | hpe.synergy.server_hardware.processor["{#LOCATION}",speed] Preprocessing
|
Server [{#SERVER_NAME}:{#LOCATION}]: Processor type | The type of the CPU installed on this server hardware. |
Dependent item | hpe.synergy.server_hardware.processor["{#LOCATION}",type] Preprocessing
|
Server [{#SERVER_NAME}:{#LOCATION}]: Serial number | The serial number of the server hardware. |
Dependent item | hpe.synergy.serverhardware["{#LOCATION}",serialnumber] Preprocessing
|
Server [{#SERVER_NAME}:{#LOCATION}]: State | The current resource state of the server hardware. The allowable values are: Unknown - not initialized; Adding - a server is being added; NoProfileApplied - a server successfully added; Monitored - a server is being monitored; Unmanaged - a discovered and supported server; Removing - a server is being removed; RemoveFailed - an unsuccessful server removal; Removed - a server is successfully removed; ApplyingProfile - a server is successfully removed; ProfileApplied - a profile is successfully applied; RemovingProfile - a profile is being removed; ProfileError - an Unsuccessful profile is applied or removed; Unsupported - a server model or version is not currently supported by the appliance; UpdatingFirmware - a server firmware update is in progress. |
Dependent item | hpe.synergy.server_hardware["{#LOCATION}",state] Preprocessing
|
Server [{#SERVER_NAME}:{#LOCATION}]: State reason | The reason for the current resource state of the server hardware. This only applies if the state is Unmanaged, otherwise it is set to NotApplicable. The allowable values are: Unsupported - a server model or version is not currently supported by the appliance; UpdatingFirmware - a server firmware update is in progress; NotApplicable - when PhysicalServerState is anything besides Unmanaged; NotOwner - no claim on the server; Inventory - a server is added by the PDU; Unconfigured - the discovery data is incomplete or an iLO configuration has failed; UnsupportedFirmware - an iLO firmware version is below the minimum support level; Interrupted - when PhysicalServerState is a result of an operation that was terminated before completing; CommunicationError - an appliance cannot communicate with an iLO or an OA. |
Dependent item | hpe.synergy.serverhardware["{#LOCATION}",statereason] Preprocessing
|
Server [{#SERVER_NAME}:{#LOCATION}]: Status | The overall health status of the resource. The following are the valid values for the status of the resource: OK - indicates normal/informational behavior; Disabled - indicates that the resource is not operational; Warning - requires attention soon; Critical - requires immediate attention; Unknown - should be avoided, but there may be rare occasions when the status is unknown. |
Dependent item | hpe.synergy.server_hardware["{#LOCATION}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Server [{#SERVER_NAME}:{#LOCATION}]: Is in maintenance mode | The disruptive maintenance operations like firmware update can cause many server hardware alerts to be generated in a short period of time. Example: Network connectivity is lost or the server reset is detected. |
last(/HPE Synergy by HTTP/hpe.synergy.server_hardware["{#LOCATION}",maintenance_state])=0 and length(last(/HPE Synergy by HTTP/hpe.synergy.server_hardware["{#LOCATION}",maintenance_state_reason]))>0 |Info |
||
Server [{#SERVER_NAME}:{#LOCATION}]: Has profile error | The unsuccessful profile application or removal. |
last(/HPE Synergy by HTTP/hpe.synergy.server_hardware["{#LOCATION}",state])=11 |Average |
||
Server [{#SERVER_NAME}:{#LOCATION}]: Is not initialized | The server is not initialized. |
last(/HPE Synergy by HTTP/hpe.synergy.server_hardware["{#LOCATION}",state])=0 |Warning |
||
Server [{#SERVER_NAME}:{#LOCATION}]: Is unsupported | The server model or version is not currently supported by the appliance. |
last(/HPE Synergy by HTTP/hpe.synergy.server_hardware["{#LOCATION}",state])=12 |Average |
||
Server [{#SERVER_NAME}:{#LOCATION}]: Remove failed | The previous operation to remove the server hardware did not succeed. The operation should be re-attempted. |
last(/HPE Synergy by HTTP/hpe.synergy.server_hardware["{#LOCATION}",state])=6 |Average |
||
Server [{#SERVER_NAME}:{#LOCATION}]: Is unmanaged | Discovered a supported server. |
last(/HPE Synergy by HTTP/hpe.synergy.server_hardware["{#LOCATION}",state])=4 and length(last(/HPE Synergy by HTTP/hpe.synergy.server_hardware["{#LOCATION}",state_reason]))>0 |Average |
||
Server [{#SERVER_NAME}:{#LOCATION}]: Has critical status | The status of the server [{#SERVER_NAME}:{#LOCATION}] is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.server_hardware["{#LOCATION}",status])=0 |High |
||
Server [{#SERVER_NAME}:{#LOCATION}]: Has warning status | The status of the server [{#SERVER_NAME}:{#LOCATION}] is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.server_hardware["{#LOCATION}",status])=4 |Warning |
||
Server [{#SERVER_NAME}:{#LOCATION}]: Is disabled | The server [{#SERVER_NAME}:{#LOCATION}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.server_hardware["{#LOCATION}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Storage pools discovery | A list of the storage pools. |
Dependent item | hpe.synergy.storage_pools.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Storage pool [{#NAME}]: Get data | Data of the storage pool [{#NAME}]. |
Dependent item | hpe.synergy.storage.pools["{#NAME}",data] Preprocessing
|
Storage pool [{#NAME}]: Capacity allocated | The capacity allocated from the storage pool in bytes. |
Dependent item | hpe.synergy.storage.pools.capacity["{#NAME}",allocated] Preprocessing
|
Storage pool [{#NAME}]: Capacity free | The free capacity available from the storage pool in bytes. |
Dependent item | hpe.synergy.storage.pools.capacity["{#NAME}",free] Preprocessing
|
Storage pool [{#NAME}]: Capacity allocated to snapshots | The pool capacity allocated to the snapshots in bytes. |
Dependent item | hpe.synergy.storage.pools.capacity["{#NAME}",snapshot] Preprocessing
|
Storage pool [{#NAME}]: State | The current state of the resource: AddFailed - an attempt to add the resource failed; Adding - the resource is in the process of being added; Configured - the resource is configured; Connected - the appliance has connected to the resource; Copying - the resource is in the process of being copied; CreateFailed - an attempt to create the resource failed; Creating - the resource is in the process of being created; DeleteFailed - an attempt to delete the resource failed; Deleting - the resource is in the process of being deleted; Discovered - the resource has been discovered by the appliance, but it is not managed by the appliance; Managed - the resource is managed by the appliance; Normal - the resource is in a normal state; UpdateFailed - an attempt to update the resource failed; Updating - the resource is in the process of being updated. |
Dependent item | hpe.synergy.storage.pools["{#NAME}",state] Preprocessing
|
Storage pool [{#NAME}]: Status | The overall health status of the resource. The following are the valid values for the status of the resource: OK - indicates normal/informational behavior; Disabled - indicates that the resource is not operational; Warning - requires attention soon; Critical - requires immediate attention; Unknown - should be avoided, but there may be rare occasions when the status is unknown. |
Dependent item | hpe.synergy.storage.pools["{#NAME}",status] Preprocessing
|
Storage pool [{#NAME}]: Capacity total | The total capacity of the storage pool in bytes. |
Dependent item | hpe.synergy.storage.pools.capacity["{#NAME}",total] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Storage pool [{#NAME}]: Add error | Adding of the storage pool [{#NAME}] failed. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.pools["{#NAME}",state])=0 |Average |
||
Storage pool [{#NAME}]: Create failed | Creating of the storage pool [{#NAME}] failed. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.pools["{#NAME}",state])=5 |Average |
||
Storage pool [{#NAME}]: Delete failed | Deletion of the storage pool [{#NAME}] failed. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.pools["{#NAME}",state])=7 |Average |
||
Storage pool [{#NAME}]: Update failed | Updating of the storage pool [{#NAME}] failed. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.pools["{#NAME}",state])=12 |Average |
||
Storage pool [{#NAME}]: Has critical status | The status of the storage pool [{#NAME}] is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.pools["{#NAME}",status])=0 |High |
||
Storage pool [{#NAME}]: Has warning status | The status of the storage pool [{#NAME}] is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.pools["{#NAME}",status])=4 |Warning |
||
Storage pool [{#NAME}]: Is disabled | The storage pool [{#NAME}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.pools["{#NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Storage systems discovery | A list of the storage systems. |
Dependent item | hpe.synergy.storage_systems.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Storage system [{#NAME}]: Get data | Data of the storage system [{#NAME}]. |
Dependent item | hpe.synergy.storage.system["{#NAME}",data] Preprocessing
|
Storage system [{#NAME}]: Capacity allocated | The capacity allocated in bytes. |
Dependent item | hpe.synergy.storage.system.capacity["{#NAME}",allocated] Preprocessing
|
Storage system [{#NAME}]: Capacity free | The free capacity of the storage system in bytes. |
Dependent item | hpe.synergy.storage.system.capacity["{#NAME}",free] Preprocessing
|
Storage system [{#NAME}]: State | The current state of the resource: AddFailed - an attempt to add the resource failed; Adding - the resource is in the process of being added; Configured - the resource is configured; Connected - the appliance has connected to the resource; Copying - the resource is in the process of being copied; CreateFailed - an attempt to create the resource failed; Creating - the resource is in the process of being created; DeleteFailed - an attempt to delete the resource failed; Deleting - the resource is in the process of being deleted; Discovered - the resource has been discovered by the appliance, but it is not managed by the appliance; Managed - the resource is managed by the appliance; Normal - the resource is in a normal state; UpdateFailed - an attempt to update the resource failed; Updating - the resource is in the process of being updated. |
Dependent item | hpe.synergy.storage.system["{#NAME}",state] Preprocessing
|
Storage system [{#NAME}]: Status | The overall health status of the resource. The following are the valid values for the status of the resource: OK - indicates normal/informational behavior; Disabled - indicates that the resource is not operational; Warning - requires attention soon; Critical - requires immediate attention; Unknown - should be avoided, but there may be rare occasions when the status is unknown. |
Dependent item | hpe.synergy.storage.system["{#NAME}",status] Preprocessing
|
Storage system [{#NAME}]: Capacity total | The total capacity of the storage system in bytes. |
Dependent item | hpe.synergy.storage.system.capacity["{#NAME}",total] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Storage system [{#NAME}]: Add error | Adding the storage system [{#NAME}] failed. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.system["{#NAME}",state])=0 |Average |
||
Storage system [{#NAME}]: Create failed | Creating of the storage system [{#NAME}] failed. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.system["{#NAME}",state])=5 |Average |
||
Storage system [{#NAME}]: Delete failed | Deletion of the storage system [{#NAME}] failed. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.system["{#NAME}",state])=7 |Average |
||
Storage system [{#NAME}]: Update failed | Updating of the storage system [{#NAME}] failed. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.system["{#NAME}",state])=12 |Average |
||
Storage system [{#NAME}]: Has critical status | The status of the storage system [{#NAME}] is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.system["{#NAME}",status])=0 |High |
||
Storage system [{#NAME}]: Has warning status | The status of the storage system [{#NAME}] is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.system["{#NAME}",status])=4 |Warning |
||
Storage system [{#NAME}]: Is disabled | The storage system [{#NAME}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.system["{#NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Storage volumes discovery | A list of the storage volumes. |
Dependent item | hpe.synergy.storage_volumes.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Storage volume [{#NAME}]: Get data | Data of the storage volume [{#NAME}]. |
Dependent item | hpe.synergy.storage.volumes["{#NAME}",data] Preprocessing
|
Storage volume [{#NAME}]: Capacity allocated | The capacity allocated in bytes. |
Dependent item | hpe.synergy.storage.volumes.capacity["{#NAME}",allocated] Preprocessing
|
Storage volume [{#NAME}]: Capacity provisioned | The total provisioned capacity of the volume in bytes. |
Dependent item | hpe.synergy.storage.volumes.capacity["{#NAME}",provisioned] Preprocessing
|
Storage volume [{#NAME}]: State | The current state of the resource: AddFailed - an attempt to add the resource failed; Adding - the resource is in the process of being added; Configured - the resource is configured; Connected - the appliance has connected to the resource; Copying - the resource is in the process of being copied; CreateFailed - an attempt to create the resource failed; Creating - the resource is in the process of being created; DeleteFailed - an attempt to delete the resource failed; Deleting - the resource is in the process of being deleted; Discovered - the resource has been discovered by the appliance, but it is not managed by the appliance; Managed - the resource is managed by the appliance; Normal - the resource is in a normal state; UpdateFailed - an attempt to update the resource failed; Updating - the resource is in the process of being updated. |
Dependent item | hpe.synergy.storage.volumes["{#NAME}",state] Preprocessing
|
Storage volume [{#NAME}]: Status | The overall health status of the resource. The following are the valid values for the status of the resource: OK - indicates normal/informational behavior; Disabled - indicates that the resource is not operational; Warning - requires attention soon; Critical - requires immediate attention; Unknown - should be avoided, but there may be rare occasions when the status is unknown. |
Dependent item | hpe.synergy.storage.volumes["{#NAME}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Storage volume [{#NAME}]: Add error | Adding the storage volume [{#NAME}] failed. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.volumes["{#NAME}",state])=0 |Average |
||
Storage volume [{#NAME}]: Create failed | Creating of the storage volume [{#NAME}] failed. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.volumes["{#NAME}",state])=5 |Average |
||
Storage volume [{#NAME}]: Delete failed | Deletion of the storage volume [{#NAME}] failed. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.volumes["{#NAME}",state])=7 |Average |
||
Storage volume [{#NAME}]: Update failed | Updating of the storage volume [{#NAME}] failed. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.volumes["{#NAME}",state])=12 |Average |
||
Storage volume [{#NAME}]: Has critical status | The status of the storage volume [{#NAME}] is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.volumes["{#NAME}",status])=0 |High |
||
Storage volume [{#NAME}]: Has warning status | The status of the storage volume [{#NAME}] is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.volumes["{#NAME}",status])=4 |Warning |
||
Storage volume [{#NAME}]: Is disabled | The storage volume [{#NAME}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.storage.volumes["{#NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Managers discovery | A list of the Synergy Frame Link Module bays. |
Dependent item | hpe.synergy.framelinkmodules.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Get data | Data of the manager [{#ENCLOSURENAME}:{#BAYNUMBER}]. |
Dependent item | hpe.synergy.manager["{#BAYNUMBER}","{#ENCLOSURENAME}",data] Preprocessing
|
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Firmware version | The firmware version of the manager. |
Dependent item | hpe.synergy.manager["{#BAYNUMBER}","{#ENCLOSURENAME}",fw_version] Preprocessing
|
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Link port state | The state of the LINK port: Disabled - the port is disabled; Linked - the port is linked; Unlinked - the port is unlinked. |
Dependent item | hpe.synergy.manager["{#BAYNUMBER}","{#ENCLOSURENAME}",linkportstate] Preprocessing
|
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Link port status | The status of the LINK port: Critical - requires immediate attention; Disabled - the resource is currently not operational; OK - indicates normal/informational behavior; Unknown - the health status is not yet known or cannot be determined; Warning - requires attention soon. |
Dependent item | hpe.synergy.manager["{#BAYNUMBER}","{#ENCLOSURENAME}",linkportstatus] Preprocessing
|
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: MGMT port state | The state of the MGMT port: Active - the port is in active mode; Disabled - the port is in disabled mode; I3s - the port is configured for the deployment of an OS network traffic. Other - the port is in other mode; Standby - the port is in standby mode; Unknown - the mode of the port is not known. |
Dependent item | hpe.synergy.manager["{#BAYNUMBER}","{#ENCLOSURENAME}",mgmtportstate] Preprocessing
|
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: MGMT port status | The status of the MGMT port: Critical - requires immediate attention; Disabled - the resource is currently not operational; OK - indicates normal/informational behavior; Unknown - the health status is not yet known or cannot be determined; Warning - requires attention soon. |
Dependent item | hpe.synergy.manager["{#BAYNUMBER}","{#ENCLOSURENAME}",mgmtportstatus] Preprocessing
|
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Model | The model of the link module. |
Dependent item | hpe.synergy.manager["{#BAYNUMBER}","{#ENCLOSURENAME}",model] Preprocessing
|
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Part number | The part number of the link module. |
Dependent item | hpe.synergy.manager["{#BAYNUMBER}","{#ENCLOSURENAME}",part_number] Preprocessing
|
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Presence | Indicates whether a manager is present in the bay: Absent - the device slot is empty; PresenceNoOp - the device slot is uninitialized; PresenceUnknown - the device presence is unknown; Present - the device slot has a device in it; Subsumed - the device slot is configured to be part of another device slot. Not applicable for the fan or power supply bays. |
Dependent item | hpe.synergy.manager["{#BAYNUMBER}","{#ENCLOSURENAME}",presence] Preprocessing
|
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Serial number | The serial number of the link module. |
Dependent item | hpe.synergy.manager["{#BAYNUMBER}","{#ENCLOSURENAME}",serial_number] Preprocessing
|
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Spare part number | The spare part number of the link module. |
Dependent item | hpe.synergy.manager["{#BAYNUMBER}","{#ENCLOSURENAME}",sparepartnumber] Preprocessing
|
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Status | The health status of the link module: Critical - requires immediate attention; Disabled - the resource is currently not operational; OK - indicates normal/informational behavior; Unknown - the health status is not yet known or cannot be determined; Warning - requires attention soon. |
Dependent item | hpe.synergy.manager["{#BAYNUMBER}","{#ENCLOSURENAME}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Link port has critical status | The link port status of the manager [{#ENCLOSURENAME}:{#BAYNUMBER}] is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.manager["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",link_port_status])=0 |High |
||
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Link port has warning status | The link port status of the manager [{#ENCLOSURENAME}:{#BAYNUMBER}] is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.manager["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",link_port_status])=4 |Warning |
||
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Link port is disabled | The link port of the manager [{#ENCLOSURENAME}:{#BAYNUMBER}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.manager["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",link_port_status])=1 |Info |
||
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: MGMT port has critical status | The MGMT port status of the manager [{#ENCLOSURENAME}:{#BAYNUMBER}] is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.manager["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",mgmt_port_status])=0 |High |
||
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: MGMT port has warning status | The MGMT port status of the manager [{#ENCLOSURENAME}:{#BAYNUMBER}] is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.manager["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",mgmt_port_status])=4 |Warning |
||
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: MGMT port is disabled | The MGMT port of the manager [{#ENCLOSURENAME}:{#BAYNUMBER}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.manager["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",mgmt_port_status])=1 |Info |
||
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Is subsumed | The device slot is configured to be part of another device slot. |
last(/HPE Synergy by HTTP/hpe.synergy.manager["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",presence])=4 |Average |
||
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Has critical status | The status of the manager [{#ENCLOSURENAME}:{#BAYNUMBER}] is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.manager["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",status])=0 |High |
||
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Has warning status | The status of the manager [{#ENCLOSURENAME}:{#BAYNUMBER}] is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.manager["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",status])=4 |Warning |
||
Manager [{#ENCLOSURENAME}:{#BAYNUMBER}]: Is disabled | The manager [{#ENCLOSURENAME}:{#BAYNUMBER}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.manager["{#BAY_NUMBER}","{#ENCLOSURE_NAME}",status])=1 |Info |
Name | Description | Type | Key and additional info |
---|---|---|---|
Uplink sets discovery | A list of the uplink sets. |
Dependent item | hpe.synergy.uplink_sets.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Uplink set [{#NAME}]: Get data | Data of the uplink set [{#NAME}]. |
Dependent item | hpe.synergy.uplink_set["{#NAME}",data] Preprocessing
|
Uplink set [{#NAME}]: State | The current state of the resource. |
Dependent item | hpe.synergy.uplink_set["{#NAME}",state] Preprocessing
|
Uplink set [{#NAME}]: Status | The overall health status of the resource. The following are the valid values for the status of the resource: OK - indicates normal/informational behavior; Disabled - indicates that the resource is not operational; Warning - requires attention soon; Critical - requires immediate attention; Unknown - should be avoided, but there may be rare occasions when the status is unknown. |
Dependent item | hpe.synergy.uplink_set["{#NAME}",status] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Uplink set [{#NAME}]: Has critical status | The status of the uplink set [{#NAME}] is critical. Needs immediate attention. |
last(/HPE Synergy by HTTP/hpe.synergy.uplink_set["{#NAME}",status])=0 |High |
||
Uplink set [{#NAME}]: Has warning status | The status of the uplink set [{#NAME}] is warning. Needs attention soon. |
last(/HPE Synergy by HTTP/hpe.synergy.uplink_set["{#NAME}",status])=4 |Warning |
||
Uplink set [{#NAME}]: Is disabled | The uplink set [{#NAME}] is currently not operational. |
last(/HPE Synergy by HTTP/hpe.synergy.uplink_set["{#NAME}",status])=1 |Info |
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This template is designed for the effortless deployment of HPE iLO monitoring by Zabbix via iLO RESTful API and doesn't require any external scripts.
For more details about HPE Redfish services, refer to the official documentation
.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
zbx_monitor
). The user will only need to have the Login
privilege, which can be assigned manually or by assigning the ReadOnly
role to the user.{$ILO.URL}
macro in the format <scheme>://<host>[:port]/
(port is optional).{$ILO.USER}
macro.{$ILO.PASSWORD}
macro.Name | Description | Default |
---|---|---|
{$ILO.URL} | The iLO API endpoint in the format " |
|
{$ILO.USER} | The name of the user that is used for monitoring. |
|
{$ILO.PASSWORD} | The password of the user that is used for monitoring. |
|
{$ILO.HTTP_PROXY} | The HTTP proxy for script items (set if needed). If the macro is empty, then no proxy is used. |
|
{$ILO.INTERVAL} | The update interval for the script item that retrieves data from API. |
1m |
{$ILO.TIMEOUT} | The timeout threshold for the script item that retrieves data from API. |
15s |
{$ILO.COMPUTER_SYSTEM.DISCOVERY.HOSTNAME.MATCHES} | The computer system hostname regex filter to use in computer systems related metrics discovery for including. Can be used with the following context to include metrics of the particular entity: System, Storage, Controller, Drive, Volume. |
.+ |
{$ILO.COMPUTERSYSTEM.DISCOVERY.HOSTNAME.NOTMATCHES} | The computer system hostname regex filter to use in computer systems related metrics discovery for excluding. Can be used with the following context to exclude metrics of the particular entity: System, Storage, Controller, Drive, Volume. |
CHANGE_IF_NEEDED |
{$ILO.COMPUTER_SYSTEM.DISCOVERY.TYPE.MATCHES} | The computer system type regex filter to use in computer systems related metrics discovery for including. Can be used with the following context to include metrics of the particular entity: System, Storage, Controller, Drive, Volume. |
.+ |
{$ILO.COMPUTERSYSTEM.DISCOVERY.TYPE.NOTMATCHES} | The computer system type regex filter to use in computer systems related metrics discovery for excluding. Can be used with the following context to exclude metrics of the particular entity: System, Storage, Controller, Drive, Volume. |
CHANGE_IF_NEEDED |
{$ILO.SENSOR.DISCOVERY.NAME.MATCHES} | The sensor name regex filter to use in temperature sensors discovery for including. |
.+ |
{$ILO.SENSOR.DISCOVERY.NAME.NOT_MATCHES} | The sensor name regex filter to use in temperature sensors discovery for excluding. |
CHANGE_IF_NEEDED |
{$ILO.SENSOR.DISCOVERY.CONTEXT.MATCHES} | The sensor physical context regex filter to use in temperature sensors discovery for including. |
.+ |
{$ILO.SENSOR.DISCOVERY.CONTEXT.NOT_MATCHES} | The sensor physical context regex filter to use in temperature sensors discovery for excluding. |
CHANGE_IF_NEEDED |
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Get data | The JSON with the result of API requests. |
Script | hpe.ilo.get_data |
HPE iLO: Get data check | The data collection check. |
Dependent item | hpe.ilo.get_data.check Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HPE iLO: Failed to get data from API | Failed to get data from API. Check the debug log for more information. |
length(last(/HPE iLO by HTTP/hpe.ilo.get_data.check))>0 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Computer systems discovery | Discovers computer systems. |
Dependent item | hpe.ilo.computer_systems.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Computer system [{#SYSTEM_HOSTNAME}]: Get data | Get data about the computer system. |
Dependent item | hpe.ilo.computersystem.getdata[{#SYSTEM_ID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEM_HOSTNAME}]: System type | The type of the computer system. Possible values: 0 - "Physical", a computer system; 1 - "Virtual", a virtual machine instance running on this system; 2 - "OS", an operating system instance; 3 - "PhysicallyPartitioned", a hardware-based partition of a computer system; 4 - "VirtuallyPartitioned", a virtual or software-based partition of a computer system; 5 - "DPU", a virtual or software-based partition of a computer system; 10 - "Unknown", the computer system type is unknown. |
Dependent item | hpe.ilo.computersystem.type[{#SYSTEMID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEM_HOSTNAME}]: Model | The model name of the computer system. |
Dependent item | hpe.ilo.computersystem.model[{#SYSTEMID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEM_HOSTNAME}]: Serial number | The serial number of the computer system. |
Dependent item | hpe.ilo.computersystem.serialnumber[{#SYSTEM_ID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEM_HOSTNAME}]: BIOS current version | The current BIOS version of the computer system. |
Dependent item | hpe.ilo.computersystem.bios.currentversion[{#SYSTEM_ID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEM_HOSTNAME}]: Status | The overall health state from the view of this computer system. Possible values: 0 - "OK", the computer system is in normal condition; 1 - "Warning", the computer system is in condition that requires attention; 2 - "Critical", the computer system is in critical condition that requires immediate attention; 10 - "Unknown", the computer system is in unknown condition. |
Dependent item | hpe.ilo.computersystem.status[{#SYSTEMID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEM_HOSTNAME}]: CPU utilization, in % | Current CPU utilization of the computer system in percentage. |
Dependent item | hpe.ilo.computersystem.usage.cpuutil[{#SYSTEM_ID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEM_HOSTNAME}]: I/O bus utilization, in % | Current I/O bus utilization of the computer system in percentage. |
Dependent item | hpe.ilo.computersystem.usage.iobusutil[{#SYSTEMID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEM_HOSTNAME}]: Memory bus utilization, in % | Current memory bus utilization of the computer system in percentage. |
Dependent item | hpe.ilo.computersystem.usage.memorybusutil[{#SYSTEMID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HPE iLO: Computer system [{#SYSTEM_HOSTNAME}]: Computer system has been replaced | The computer system serial number has changed. Acknowledge to close the problem manually. |
change(/HPE iLO by HTTP/hpe.ilo.computer_system.serial_number[{#SYSTEM_ID}])=1 and length(last(/HPE iLO by HTTP/hpe.ilo.computer_system.serial_number[{#SYSTEM_ID}]))>0 |Info |
Manual close: Yes | |
HPE iLO: Computer system [{#SYSTEM_HOSTNAME}]: BIOS version has changed | The current version of BIOS has changed. Acknowledge to close the problem manually. |
change(/HPE iLO by HTTP/hpe.ilo.computer_system.bios.current_version[{#SYSTEM_ID}])=1 and length(last(/HPE iLO by HTTP/hpe.ilo.computer_system.bios.current_version[{#SYSTEM_ID}]))>0 |Info |
Manual close: Yes | |
HPE iLO: Computer system [{#SYSTEM_HOSTNAME}]: Computer system is in warning state | The computer system is in condition that requires attention. |
last(/HPE iLO by HTTP/hpe.ilo.computer_system.status[{#SYSTEM_ID}])=1 |Warning |
Depends on:
|
|
HPE iLO: Computer system [{#SYSTEM_HOSTNAME}]: Computer system is in critical state | The computer system is in critical condition that requires immediate attention. |
last(/HPE iLO by HTTP/hpe.ilo.computer_system.status[{#SYSTEM_ID}])=2 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Managers discovery | Discovers managers. |
Dependent item | hpe.ilo.managers.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Manager [{#MANAGER_ID}]: Get data | Get data about the manager. |
Dependent item | hpe.ilo.manager.getdata[{#MANAGERID}] Preprocessing
|
HPE iLO: Manager [{#MANAGER_ID}]: Manager type | The manager type. Possible values: 0 - "ManagementController", a controller used primarily to monitor or manage the operation of a device or system; 1 - "EnclosureManager", a controller which provides management functions for a chassis or group of devices or systems; 2 - "BMC", a controller which provides management functions for a single computer system; 10 - "Unknown", the manager type is unknown. |
Dependent item | hpe.ilo.manager.type[{#MANAGER_ID}] Preprocessing
|
HPE iLO: Manager [{#MANAGER_ID}]: Model | The model name of the manager. |
Dependent item | hpe.ilo.manager.model[{#MANAGER_ID}] Preprocessing
|
HPE iLO: Manager [{#MANAGER_ID}]: Current firmware version | The current firmware version of the manager. |
Dependent item | hpe.ilo.manager.firmware.currentversion[{#MANAGERID}] Preprocessing
|
HPE iLO: Manager [{#MANAGER_ID}]: Status | The health state of the manager. Possible values: 0 - "OK", the manager is in normal condition; 1 - "Warning", the manager is in condition that requires attention; 2 - "Critical", the manager is in critical condition that requires immediate attention; 10 - "Unknown", the manager is in unknown condition. |
Dependent item | hpe.ilo.manager.status[{#MANAGER_ID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HPE iLO: Manager [{#MANAGER_ID}]: Firmware version has changed | The current firmware version of the manager has changed. Acknowledge to close the problem manually. |
change(/HPE iLO by HTTP/hpe.ilo.manager.firmware.current_version[{#MANAGER_ID}])=1 and length(last(/HPE iLO by HTTP/hpe.ilo.manager.firmware.current_version[{#MANAGER_ID}]))>0 |Info |
Manual close: Yes | |
HPE iLO: Manager [{#MANAGER_ID}]: Manager is in warning state | The manager is in condition that requires attention. |
last(/HPE iLO by HTTP/hpe.ilo.manager.status[{#MANAGER_ID}])=1 |Warning |
Depends on:
|
|
HPE iLO: Manager [{#MANAGER_ID}]: Manager is in critical state | The manager is in critical condition that requires immediate attention. |
last(/HPE iLO by HTTP/hpe.ilo.manager.status[{#MANAGER_ID}])=2 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Storages discovery | Discovers computer system storages. |
Dependent item | hpe.ilo.storages.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Get data | Get data about the storage. |
Dependent item | hpe.ilo.storage.getdata[{#SYSTEMID}, {#STORAGE_ID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Status | The overall health state from the view of this storage. Possible values: 0 - "OK", the storage is in normal condition; 1 - "Warning", the storage is in condition that requires attention; 2 - "Critical", the storage is in critical condition that requires immediate attention; 10 - "Unknown", the storage is in unknown condition. |
Dependent item | hpe.ilo.storage.status[{#SYSTEMID}, {#STORAGEID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Storage is in warning state | The computer system is in condition that requires attention. |
last(/HPE iLO by HTTP/hpe.ilo.storage.status[{#SYSTEM_ID}, {#STORAGE_ID}])=1 |Warning |
Depends on:
|
|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Storage is in critical state | The computer system is in critical condition that requires immediate attention. |
last(/HPE iLO by HTTP/hpe.ilo.storage.status[{#SYSTEM_ID}, {#STORAGE_ID}])=2 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Controllers discovery | Discovers storage controllers. |
Dependent item | hpe.ilo.controllers.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Controller [{#CONTROLLER_ID}]: Get data | Get data about the controller. |
Dependent item | hpe.ilo.controller.getdata[{#SYSTEMID}, {#STORAGEID}, {#CONTROLLERID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Controller [{#CONTROLLER_ID}]: Model | The model name of the controller. |
Dependent item | hpe.ilo.controller.model[{#SYSTEMID}, {#STORAGEID}, {#CONTROLLER_ID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Controller [{#CONTROLLER_ID}]: Serial number | The serial number of the controller. |
Dependent item | hpe.ilo.controller.serialnumber[{#SYSTEMID}, {#STORAGEID}, {#CONTROLLERID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Controller [{#CONTROLLER_ID}]: Status | The health state of the controller. Possible values: 0 - "OK", the controller is in normal condition; 1 - "Warning", the controller is in condition that requires attention; 2 - "Critical", the controller is in critical condition that requires immediate attention; 10 - "Unknown", the controller is in unknown condition. |
Dependent item | hpe.ilo.controller.status[{#SYSTEMID}, {#STORAGEID}, {#CONTROLLER_ID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Controller [{#CONTROLLER_ID}]: Controller has been replaced | The controller serial number has changed. Acknowledge to close the problem manually. |
change(/HPE iLO by HTTP/hpe.ilo.controller.serial_number[{#SYSTEM_ID}, {#STORAGE_ID}, {#CONTROLLER_ID}])=1 and length(last(/HPE iLO by HTTP/hpe.ilo.controller.serial_number[{#SYSTEM_ID}, {#STORAGE_ID}, {#CONTROLLER_ID}]))>0 |Info |
Manual close: Yes | |
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Controller [{#CONTROLLER_ID}]: Controller is in warning state | The controller is in condition that requires attention. |
last(/HPE iLO by HTTP/hpe.ilo.controller.status[{#SYSTEM_ID}, {#STORAGE_ID}, {#CONTROLLER_ID}])=1 |Warning |
Depends on:
|
|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Controller [{#CONTROLLER_ID}]: Controller is in critical state | The controller is in critical condition that requires immediate attention. |
last(/HPE iLO by HTTP/hpe.ilo.controller.status[{#SYSTEM_ID}, {#STORAGE_ID}, {#CONTROLLER_ID}])=2 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Drives discovery | Discovers storage drives. |
Dependent item | hpe.ilo.drives.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Drive [{#DRIVE_ID}]: Get data | Get data about the drive. |
Dependent item | hpe.ilo.drive.getdata[{#SYSTEMID}, {#STORAGEID}, {#DRIVEID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Drive [{#DRIVE_ID}]: Media type | The media type of the drive. |
Dependent item | hpe.ilo.drive.mediatype[{#SYSTEMID}, {#STORAGEID}, {#DRIVEID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Drive [{#DRIVE_ID}]: Serial number | The serial number of the drive. |
Dependent item | hpe.ilo.drive.serialnumber[{#SYSTEMID}, {#STORAGEID}, {#DRIVEID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Drive [{#DRIVE_ID}]: Model | The model name of the drive. |
Dependent item | hpe.ilo.drive.model[{#SYSTEMID}, {#STORAGEID}, {#DRIVE_ID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Drive [{#DRIVE_ID}]: Capacity | The capacity of the drive. |
Dependent item | hpe.ilo.drive.capacity[{#SYSTEMID}, {#STORAGEID}, {#DRIVE_ID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Drive [{#DRIVE_ID}]: Predicted media life left, in % | The percentage of reads and writes that are predicted to still be available for the drive. |
Dependent item | hpe.ilo.drive.predictedlifeleft[{#SYSTEMID}, {#STORAGEID}, {#DRIVE_ID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Drive [{#DRIVE_ID}]: Status indicator | Status of drive. Possible values: 0 - "OK", the drive is ok; 1 - "Fail", the drive has failed; 2 - "Rebuild", the drive is being rebuilt; 3 - "PredictiveFailureAnalysis", the drive is still working but predicted to fail soon; 4 - "Hotspare", the drive is marked to be automatically rebuilt and used as a replacement for a failed drive; 5 - "InACriticalArray", the array that this drive is a part of is degraded; 6 - "InAFailedArray ", the array that this drive is a part of is failed; 10 - "Unknown", the drive status is unknown. |
Dependent item | hpe.ilo.drive.statusindicator[{#SYSTEMID}, {#STORAGEID}, {#DRIVEID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Drive [{#DRIVE_ID}]: Drive has been replaced | The drive serial number has changed. Acknowledge to close the problem manually. |
change(/HPE iLO by HTTP/hpe.ilo.drive.serial_number[{#SYSTEM_ID}, {#STORAGE_ID}, {#DRIVE_ID}])=1 and length(last(/HPE iLO by HTTP/hpe.ilo.drive.serial_number[{#SYSTEM_ID}, {#STORAGE_ID}, {#DRIVE_ID}]))>0 |Info |
Manual close: Yes | |
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Drive [{#DRIVE_ID}]: Drive has failed | The drive has failed. |
last(/HPE iLO by HTTP/hpe.ilo.drive.status_indicator[{#SYSTEM_ID}, {#STORAGE_ID}, {#DRIVE_ID}])=1 |High |
||
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Drive [{#DRIVE_ID}]: Drive is predicted to fail soon | The drive is still working but predicted to fail soon. |
last(/HPE iLO by HTTP/hpe.ilo.drive.status_indicator[{#SYSTEM_ID}, {#STORAGE_ID}, {#DRIVE_ID}])=3 |High |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Volumes discovery | Discovers storage volumes. |
Dependent item | hpe.ilo.volumes.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Volume [{#VOLUME_ID}]: Get data | Get data about the volume. |
Dependent item | hpe.ilo.volume.getdata[{#SYSTEMID}, {#STORAGEID}, {#VOLUMEID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Volume [{#VOLUME_ID}]: Capacity | The capacity of the volume. |
Dependent item | hpe.ilo.volume.capacity[{#SYSTEMID}, {#STORAGEID}, {#VOLUME_ID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Volume [{#VOLUME_ID}]: Status | The health state of the volume. Possible values: 0 - "OK", the volume is in normal condition; 1 - "Warning", the volume is in condition that requires attention; 2 - "Critical", the volume is in critical condition that requires immediate attention; 10 - "Unknown", the volume is in unknown condition. |
Dependent item | hpe.ilo.volume.status[{#SYSTEMID}, {#STORAGEID}, {#VOLUME_ID}] Preprocessing
|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Volume [{#VOLUME_ID}]: RAID level | The RAID level of the volume. |
Dependent item | hpe.ilo.volume.raidlevel[{#SYSTEMID}, {#STORAGEID}, {#VOLUMEID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Volume [{#VOLUME_ID}]: Volume is in warning state | The volume is in condition that requires attention. |
last(/HPE iLO by HTTP/hpe.ilo.volume.status[{#SYSTEM_ID}, {#STORAGE_ID}, {#VOLUME_ID}])=1 |Warning |
Depends on:
|
|
HPE iLO: Computer system [{#SYSTEMHOSTNAME}]: Storage [{#STORAGEID}]: Volume [{#VOLUME_ID}]: Volume is in critical state | The volume is in critical condition that requires immediate attention. |
last(/HPE iLO by HTTP/hpe.ilo.volume.status[{#SYSTEM_ID}, {#STORAGE_ID}, {#VOLUME_ID}])=2 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Fans discovery | Discovers chassis fans. |
Dependent item | hpe.ilo.fans.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Chassis [{#CHASSISID}]: Fan [{#FANNAME}]: Get data | Get data about the fan. |
Dependent item | hpe.ilo.fan.getdata[{#CHASSISID}, {#FAN_ID}] Preprocessing
|
HPE iLO: Chassis [{#CHASSISID}]: Fan [{#FANNAME}]: Status | The health state of the fan. Possible values: 0 - "OK", the fan is in normal condition; 1 - "Warning", the fan is in condition that requires attention; 2 - "Critical", the fan is in critical condition that requires immediate attention; 10 - "Unknown", the fan is in unknown condition. |
Dependent item | hpe.ilo.fan.status[{#CHASSISID}, {#FANID}] Preprocessing
|
HPE iLO: Chassis [{#CHASSISID}]: Fan [{#FANNAME}]: Speed, in % | The current speed of the fan. |
Dependent item | hpe.ilo.fan.speed[{#CHASSISID}, {#FANID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HPE iLO: Chassis [{#CHASSISID}]: Fan [{#FANNAME}]: Fan is in warning state | The fan is in condition that requires attention. |
last(/HPE iLO by HTTP/hpe.ilo.fan.status[{#CHASSIS_ID}, {#FAN_ID}])=1 |Warning |
Depends on:
|
|
HPE iLO: Chassis [{#CHASSISID}]: Fan [{#FANNAME}]: Fan is in critical state | The fan is in critical condition that requires immediate attention. |
last(/HPE iLO by HTTP/hpe.ilo.fan.status[{#CHASSIS_ID}, {#FAN_ID}])=2 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Temperature sensors discovery | Discovers chassis temperature sensors. |
Dependent item | hpe.ilo.sensors.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Chassis [{#CHASSISID}]: Sensor [{#SENSORNAME}]: Get data | Get data about the sensor. |
Dependent item | hpe.ilo.sensor.getdata[{#CHASSISID}, {#SENSOR_ID}] Preprocessing
|
HPE iLO: Chassis [{#CHASSISID}]: Sensor [{#SENSORNAME}]: Status | The health state of the sensor. Possible values: 0 - "OK", the sensor is in normal condition; 1 - "Warning", the sensor is in condition that requires attention; 2 - "Critical", the sensor is in critical condition that requires immediate attention; 10 - "Unknown", the sensor is in unknown condition. |
Dependent item | hpe.ilo.sensor.status[{#CHASSISID}, {#SENSORID}] Preprocessing
|
HPE iLO: Chassis [{#CHASSISID}]: Sensor [{#SENSORNAME}]: Temperature | The current temperature reading in Celsius degrees for the sensor. |
Dependent item | hpe.ilo.sensor.temperature[{#CHASSISID}, {#SENSORID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HPE iLO: Chassis [{#CHASSISID}]: Sensor [{#SENSORNAME}]: Sensor is in warning state | The sensor is in condition that requires attention. |
last(/HPE iLO by HTTP/hpe.ilo.sensor.status[{#CHASSIS_ID}, {#SENSOR_ID}])=1 |Warning |
Depends on:
|
|
HPE iLO: Chassis [{#CHASSISID}]: Sensor [{#SENSORNAME}]: Sensor is in critical state | The sensor is in critical condition that requires immediate attention. |
last(/HPE iLO by HTTP/hpe.ilo.sensor.status[{#CHASSIS_ID}, {#SENSOR_ID}])=2 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: PSU discovery | Discovers chassis power supply units (PSU). |
Dependent item | hpe.ilo.psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE iLO: Chassis [{#CHASSISID}]: PSU [{#PSUID}]: Get data | Get data about the PSU. |
Dependent item | hpe.ilo.psu.getdata[{#CHASSISID}, {#PSU_ID}] Preprocessing
|
HPE iLO: Chassis [{#CHASSISID}]: PSU [{#PSUID}]: Model | The model name of the PSU. |
Dependent item | hpe.ilo.psu.model[{#CHASSISID}, {#PSUID}] Preprocessing
|
HPE iLO: Chassis [{#CHASSISID}]: PSU [{#PSUID}]: Serial number | The serial number of the PSU. |
Dependent item | hpe.ilo.psu.serialnumber[{#CHASSISID}, {#PSU_ID}] Preprocessing
|
HPE iLO: Chassis [{#CHASSISID}]: PSU [{#PSUID}]: Status | The health state of the PSU. Possible values: 0 - "OK", the PSU is in normal condition; 1 - "Warning", the PSU is in condition that requires attention; 2 - "Critical", the PSU is in critical condition that requires immediate attention; 10 - "Unknown", the PSU is in unknown condition. |
Dependent item | hpe.ilo.psu.status[{#CHASSISID}, {#PSUID}] Preprocessing
|
HPE iLO: Chassis [{#CHASSISID}]: PSU [{#PSUID}]: Line input voltage | The line input voltage at which the PSU is operating. |
Dependent item | hpe.ilo.psu.lineinputvoltage[{#CHASSISID}, {#PSUID}] Preprocessing
|
HPE iLO: Chassis [{#CHASSISID}]: PSU [{#PSUID}]: Last power output | The average power output of the PSU. |
Dependent item | hpe.ilo.psu.lastpoweroutput[{#CHASSISID}, {#PSUID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HPE iLO: Chassis [{#CHASSISID}]: PSU [{#PSUID}]: PSU has been replaced | The PSU serial number has changed. Acknowledge to close the problem manually. |
change(/HPE iLO by HTTP/hpe.ilo.psu.serial_number[{#CHASSIS_ID}, {#PSU_ID}])=1 and length(last(/HPE iLO by HTTP/hpe.ilo.psu.serial_number[{#CHASSIS_ID}, {#PSU_ID}]))>0 |Info |
Manual close: Yes | |
HPE iLO: Chassis [{#CHASSISID}]: PSU [{#PSUID}]: PSU is in warning state | The PSU is in condition that requires attention. |
last(/HPE iLO by HTTP/hpe.ilo.psu.status[{#CHASSIS_ID}, {#PSU_ID}])=1 |Warning |
Depends on:
|
|
HPE iLO: Chassis [{#CHASSISID}]: PSU [{#PSUID}]: PSU is in critical state | The PSU is in critical condition that requires immediate attention. |
last(/HPE iLO by HTTP/hpe.ilo.psu.status[{#CHASSIS_ID}, {#PSU_ID}])=2 |High |
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
for HP iLO adapters that support SNMP get. Or via operating system, using SNMP HP subagent
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Default |
---|---|---|
{$HEALTHCRITSTATUS} | 4 |
|
{$HEALTHWARNSTATUS} | 3 |
|
{$PSUCRITSTATUS} | 4 |
|
{$PSUWARNSTATUS} | 3 |
|
{$FANCRITSTATUS} | 4 |
|
{$FANWARNSTATUS} | 3 |
|
{$DISKARRAYCRIT_STATUS} | 4 |
|
{$DISKARRAYWARN_STATUS} | 3 |
|
{$DISKARRAYCACHECRITSTATUS:"cacheModCriticalFailure"} | 8 |
|
{$DISKARRAYCACHEWARNSTATUS:"invalid"} | 2 |
|
{$DISKARRAYCACHEWARNSTATUS:"cacheModDegradedFailsafeSpeed"} | 7 |
|
{$DISKARRAYCACHEWARNSTATUS:"cacheReadCacheNotMapped"} | 9 |
|
{$DISKARRAYCACHEWARNSTATUS:"cacheModFlashMemNotAttached"} | 6 |
|
{$DISKARRAYCACHEOKSTATUS:"enabled"} | 3 |
|
{$DISKARRAYCACHEBATTERYCRIT_STATUS:"failed"} | 4 |
|
{$DISKARRAYCACHEBATTERYCRIT_STATUS:"capacitorFailed"} | 7 |
|
{$DISKARRAYCACHEBATTERYWARN_STATUS:"degraded"} | 5 |
|
{$DISKARRAYCACHEBATTERYWARN_STATUS:"notPresent"} | 6 |
|
{$VDISKCRITSTATUS} | 3 |
|
{$VDISKOKSTATUS} | 2 |
|
{$DISKWARNSTATUS} | 4 |
|
{$DISKFAILSTATUS} | 3 |
|
{$DISKSMARTFAIL_STATUS:"replaceDrive"} | 3 |
|
{$DISKSMARTFAIL_STATUS:"replaceDriveSSDWearOut"} | 4 |
|
{$SNMP.TIMEOUT} | 5m |
|
{$ICMPLOSSWARN} | 20 |
|
{$ICMPRESPONSETIME_WARN} | 0.15 |
Name | Description | Type | Key and additional info |
---|---|---|---|
HP iLO: System temperature status | MIB: CPQHLTH-MIB This value specifies the overall condition of the system's thermal environment. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. |
SNMP agent | sensor.temp.status[cpqHeThermalCondition.0] Preprocessing
|
HP iLO: Overall system health status | MIB: CPQHLTH-MIB The overall condition. This object represents the overall status of the server information represented by this MIB. |
SNMP agent | system.status[cpqHeMibCondition.0] Preprocessing
|
HP iLO: Hardware model name | MIB: CPQSINFO-MIB The machine product name.The name of the machine used in this system. |
SNMP agent | system.hw.model Preprocessing
|
HP iLO: Hardware serial number | MIB: CPQSINFO-MIB The serial number of the physical system unit. The string will be empty if the system does not report the serial number function. |
SNMP agent | system.hw.serialnumber Preprocessing
|
HP iLO: Uptime (network) | MIB: SNMPv2-MIB The time (in hundredths of a second) since the network management portion of the system was last re-initialized. |
SNMP agent | system.net.uptime[sysUpTime.0] Preprocessing
|
HP iLO: Uptime (hardware) | MIB: HOST-RESOURCES-MIB The amount of time since this host was last initialized. Note that this is different from sysUpTime in the SNMPv2-MIB [RFC1907] because sysUpTime is the uptime of the network management portion of the system. |
SNMP agent | system.hw.uptime[hrSystemUptime.0] Preprocessing
|
HP iLO: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other snmptrap items |
SNMP trap | snmptrap.fallback |
HP iLO: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., `telephone closet, 3rd floor'). If the location is unknown, the value is the zero-length string. |
SNMP agent | system.location[sysLocation.0] Preprocessing
|
HP iLO: System contact details | MIB: SNMPv2-MIB The textual identification of the contact person for this managed node, together with information on how to contact this person. If no contact information is known, the value is the zero-length string. |
SNMP agent | system.contact[sysContact.0] Preprocessing
|
HP iLO: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the network management subsystem contained in the entity. This value is allocated within the SMI enterprises subtree (1.3.6.1.4.1) and provides an easy and unambiguous means for determining |
SNMP agent | system.objectid[sysObjectID.0] Preprocessing
|
HP iLO: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node.By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is the zero-length string. |
SNMP agent | system.name Preprocessing
|
HP iLO: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating-system, and networking software. |
SNMP agent | system.descr[sysDescr.0] Preprocessing
|
HP iLO: SNMP agent availability | Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list. Possible value: 0 - not available 1 - available 2 - unknown |
Zabbix internal | zabbix[host,snmp,available] |
HP iLO: ICMP ping | Simple check | icmpping | |
HP iLO: ICMP loss | Simple check | icmppingloss | |
HP iLO: ICMP response time | Simple check | icmppingsec |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HP iLO: System status is in critical state | Please check the device for errors |
count(/HP iLO by SNMP/system.status[cpqHeMibCondition.0],#1,"eq","{$HEALTH_CRIT_STATUS}")=1 |High |
||
HP iLO: System status is in warning state | Please check the device for warnings |
count(/HP iLO by SNMP/system.status[cpqHeMibCondition.0],#1,"eq","{$HEALTH_WARN_STATUS}")=1 |Warning |
Depends on:
|
|
HP iLO: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/HP iLO by SNMP/system.hw.serialnumber,#1)<>last(/HP iLO by SNMP/system.hw.serialnumber,#2) and length(last(/HP iLO by SNMP/system.hw.serialnumber))>0 |Info |
Manual close: Yes | |
HP iLO: Host has been restarted | Uptime is less than 10 minutes. |
(last(/HP iLO by SNMP/system.hw.uptime[hrSystemUptime.0])>0 and last(/HP iLO by SNMP/system.hw.uptime[hrSystemUptime.0])<10m) or (last(/HP iLO by SNMP/system.hw.uptime[hrSystemUptime.0])=0 and last(/HP iLO by SNMP/system.net.uptime[sysUpTime.0])<10m) |Warning |
Manual close: Yes Depends on:
|
|
HP iLO: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/HP iLO by SNMP/system.name,#1)<>last(/HP iLO by SNMP/system.name,#2) and length(last(/HP iLO by SNMP/system.name))>0 |Info |
Manual close: Yes | |
HP iLO: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/HP iLO by SNMP/zabbix[host,snmp,available],{$SNMP.TIMEOUT})=0 |Warning |
Depends on:
|
|
HP iLO: Unavailable by ICMP ping | Last three attempts returned timeout. Please check device connectivity. |
max(/HP iLO by SNMP/icmpping,#3)=0 |High |
||
HP iLO: High ICMP ping loss | min(/HP iLO by SNMP/icmppingloss,5m)>{$ICMP_LOSS_WARN} and min(/HP iLO by SNMP/icmppingloss,5m)<100 |Warning |
Depends on:
|
||
HP iLO: High ICMP ping response time | avg(/HP iLO by SNMP/icmppingsec,5m)>{$ICMP_RESPONSE_TIME_WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable |
SNMP agent | tempDescr.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: {#SNMPINDEX} |
SNMP agent | sensor.temp.value[cpqHeTemperatureCelsius.{#SNMPINDEX}] Preprocessing
|
{#SNMPINDEX}: Temperature sensor location | MIB: CPQHLTH-MIB This specifies the location of the temperature sensor present in the system. |
SNMP agent | sensor.temp.locale[cpqHeTemperatureLocale.{#SNMPINDEX}] Preprocessing
|
{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 3 |Warning |
||
{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery Ambient | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with ambient(11) and 0.1 index filter |
SNMP agent | tempDescr.discovery.ambient Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Ambient: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: Ambient |
SNMP agent | sensor.temp.value[cpqHeTemperatureCelsius.Ambient.{#SNMPINDEX}] Preprocessing
|
Ambient: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Ambient: Temperature could not be determined | Temperature could not be determined. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
Ambient: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 3 |Warning |
||
Ambient: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery CPU | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with cpu(6) filter |
SNMP agent | tempDescr.discovery.cpu Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
CPU-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: CPU-{#SNMPINDEX} |
SNMP agent | sensor.temp.value[cpqHeTemperatureCelsius.CPU.{#SNMPINDEX}] Preprocessing
|
CPU-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
CPU-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
CPU-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 3 |Warning |
||
CPU-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery Memory | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with memory(7) filter |
SNMP agent | tempDescr.discovery.memory Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Memory-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: Memory-{#SNMPINDEX} |
SNMP agent | sensor.temp.value[cpqHeTemperatureCelsius.Memory.{#SNMPINDEX}] Preprocessing
|
Memory-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Memory-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
Memory-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 3 |Warning |
||
Memory-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery PSU | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with powerSupply(10) filter |
SNMP agent | tempDescr.discovery.psu Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: PSU-{#SNMPINDEX} |
SNMP agent | sensor.temp.value[cpqHeTemperatureCelsius.PSU.{#SNMPINDEX}] Preprocessing
|
PSU-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
PSU-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
PSU-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 3 |Warning |
||
PSU-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery I/O | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with ioBoard(5) filter |
SNMP agent | tempDescr.discovery.io Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
I/O-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: I/O-{#SNMPINDEX} |
SNMP agent | sensor.temp.value[cpqHeTemperatureCelsius."I/O.{#SNMPINDEX}"] Preprocessing
|
I/O-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
I/O-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 1 |Info |
Manual close: Yes | |
I/O-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 3 |Warning |
||
I/O-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery System | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with system(3) filter |
SNMP agent | tempDescr.discovery.system Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
System-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: System-{#SNMPINDEX} |
SNMP agent | sensor.temp.value[cpqHeTemperatureCelsius.System.{#SNMPINDEX}] Preprocessing
|
System-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
System-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
System-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 3 |Warning |
||
System-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HP iLO by SNMP/sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU Discovery | CPQHLTH-MIB::cpqHeFltTolPowerSupplyStatus |
SNMP agent | psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Chassis {#CHASSISNUM}, bay {#BAYNUM}: Power supply status | MIB: CPQHLTH-MIB The condition of the power supply. This value will be one of the following: other(1) The status could not be determined or not present. ok(2) The power supply is operating normally. degraded(3) A temperature sensor, fan or other power supply component is outside of normal operating range. failed(4) A power supply component detects a condition that could permanently damage the system. |
SNMP agent | sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Chassis {#CHASSISNUM}, bay {#BAYNUM}: Power supply is in critical state | Please check the power supply unit for errors |
count(/HP iLO by SNMP/sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}],#1,"eq","{$PSU_CRIT_STATUS}")=1 |Average |
||
Chassis {#CHASSISNUM}, bay {#BAYNUM}: Power supply is in warning state | Please check the power supply unit for errors |
count(/HP iLO by SNMP/sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}],#1,"eq","{$PSU_WARN_STATUS}")=1 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN Discovery | CPQHLTH-MIB::cpqHeFltTolFanCondition |
SNMP agent | fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Fan {#SNMPINDEX}: Fan status | MIB: CPQHLTH-MIB The condition of the fan. This value will be one of the following: other(1) Fan status detection is not supported by this system or driver. ok(2) The fan is operating properly. degraded(2) A redundant fan is not operating properly. failed(4) A non-redundant fan is not operating properly. |
SNMP agent | sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Fan {#SNMPINDEX}: Fan is in critical state | Please check the fan unit |
count(/HP iLO by SNMP/sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}],#1,"eq","{$FAN_CRIT_STATUS}")=1 |Average |
||
Fan {#SNMPINDEX}: Fan is in warning state | Please check the fan unit |
count(/HP iLO by SNMP/sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}],#1,"eq","{$FAN_WARN_STATUS}")=1 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical Disk Discovery | Scanning table of physical drive entries CPQIDA-MIB::cpqDaPhyDrvTable. |
SNMP agent | physicalDisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#DISK_LOCATION}: Physical disk status | MIB: CPQIDA-MIB Physical Drive Status. This shows the status of the physical drive. The following values are valid for the physical drive status: other (1) Indicates that the instrument agent does not recognize the drive. You may need to upgrade your instrument agent and/or driver software. ok (2) Indicates the drive is functioning properly. failed (3) Indicates that the drive is no longer operating and should be replaced. predictiveFailure(4) Indicates that the drive has a predictive failure error and should be replaced. |
SNMP agent | system.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk S.M.A.R.T. status | MIB: CPQIDA-MIB Physical Drive S.M.A.R.T Status.The following values are defined: other(1) The agent is unable to determine if the status of S.M.A.R.T predictive failure monitoring for this drive. ok(2) Indicates the drive is functioning properly. replaceDrive(3) Indicates that the drive has a S.M.A.R.T predictive failure error and should be replaced. |
SNMP agent | system.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk serial number | MIB: CPQIDA-MIB Physical Drive Serial Number. This is the serial number assigned to the physical drive. This value is based upon the serial number as returned by the SCSI inquiry command but may have been modified due to space limitations. This can be used for identification purposes. |
SNMP agent | system.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk model name | MIB: CPQIDA-MIB Physical Drive Model.This is a text description of the physical drive. The text that appears depends upon who manufactured the drive and the drive type. If a drive fails, note the model to identify the type of drive necessary for replacement. If a model number is not present, you may not have properly initialized the drive array to which the physical drive is attached for monitoring. |
SNMP agent | system.hw.physicaldisk.model[cpqDaPhyDrvModel.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk media type | MIB: CPQIDA-MIB Drive Array Physical Drive Media Type.The following values are defined: other(1) The instrument agent is unable to determine the physical drive's media type. rotatingPlatters(2) The physical drive media is composed of rotating platters. solidState(3) The physical drive media is composed of solid state electronics. |
SNMP agent | system.hw.physicaldisk.media_type[cpqDaPhyDrvMediaType.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Disk size | MIB: CPQIDA-MIB Physical Drive Size in MB. This is the size of the physical drive in megabytes. This value is calculated using the value 1,048,576 (2^20) as a megabyte. Drive manufacturers sometimes use the number 1,000,000 as a megabyte when giving drive capacities so this value may differ from the advertised size of a drive. This field is only applicable for controllers which support SCSI drives, and therefore is not supported by the IDA or IDA-2 controllers. The field will contain 0xFFFFFFFF if the drive capacity cannot be calculated or if the controller does not support SCSI drives. |
SNMP agent | system.hw.physicaldisk.size[cpqDaPhyDrvMediaType.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#DISK_LOCATION}: Physical disk failed | Please check physical disk for warnings or errors |
count(/HP iLO by SNMP/system.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}],#1,"eq","{$DISK_FAIL_STATUS}")=1 |High |
||
{#DISK_LOCATION}: Physical disk is in warning state | Please check physical disk for warnings or errors |
count(/HP iLO by SNMP/system.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}],#1,"eq","{$DISK_WARN_STATUS}")=1 |Warning |
Depends on:
|
|
{#DISK_LOCATION}: Physical disk S.M.A.R.T. failed | Disk probably requires replacement. |
count(/HP iLO by SNMP/system.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}],#1,"eq","{$DISK_SMART_FAIL_STATUS:\"replaceDrive\"}")=1 or count(/HP iLO by SNMP/system.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}],#1,"eq","{$DISK_SMART_FAIL_STATUS:\"replaceDriveSSDWearOut\"}")=1 |High |
Depends on:
|
|
{#DISK_LOCATION}: Disk has been replaced | Disk serial number has changed. Acknowledge to close the problem manually. |
last(/HP iLO by SNMP/system.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}],#1)<>last(/HP iLO by SNMP/system.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}],#2) and length(last(/HP iLO by SNMP/system.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual Disk Discovery | CPQIDA-MIB::cpqDaLogDrvTable |
SNMP agent | virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Disk {#SNMPINDEX}({#DISK_NAME}): Status | Logical Drive Status. |
SNMP agent | system.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}] Preprocessing
|
Disk {#SNMPINDEX}({#DISK_NAME}): Layout type | Logical Drive Fault Tolerance. This shows the fault tolerance mode of the logical drive. |
SNMP agent | system.hw.virtualdisk.layout[cpqDaLogDrvFaultTol.{#SNMPINDEX}] Preprocessing
|
Disk {#SNMPINDEX}({#DISK_NAME}): Disk size | Logical Drive Size. This is the size of the logical drive in megabytes. This value is calculated using the value 1,048,576 (2^20) as a megabyte. Drive manufacturers sometimes use the number 1,000,000 as a megabyte when giving drive capacities so this value may differ from the advertised size of a drive. |
SNMP agent | system.hw.virtualdisk.size[cpqDaLogDrvSize.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Disk {#SNMPINDEX}({#DISK_NAME}): Virtual disk failed | Please check virtual disk for warnings or errors |
count(/HP iLO by SNMP/system.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}],#1,"eq","{$VDISK_CRIT_STATUS}")=1 |High |
||
Disk {#SNMPINDEX}({#DISK_NAME}): Virtual disk is not in OK state | Please check virtual disk for warnings or errors |
count(/HP iLO by SNMP/system.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}],#1,"ne","{$VDISK_OK_STATUS}")=1 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array Controller Discovery | Scanning table of Array controllers: CPQIDA-MIB::cpqDaCntlrTable |
SNMP agent | array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#CNTLR_LOCATION}: Disk array controller status | MIB: CPQIDA-MIB This value represents the overall condition of this controller, and any associated logical drives,physical drives, and array accelerators. |
SNMP agent | system.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}] Preprocessing
|
{#CNTLR_LOCATION}: Disk array controller model | MIB: CPQIDA-MIB Array Controller Model. The type of controller card. |
SNMP agent | system.hw.diskarray.model[cpqDaCntlrModel.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#CNTLR_LOCATION}: Disk array controller is in critical state | Please check the device for faults |
count(/HP iLO by SNMP/system.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CRIT_STATUS}")=1 |High |
||
{#CNTLR_LOCATION}: Disk array controller is in warning state | Please check the device for faults |
count(/HP iLO by SNMP/system.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_WARN_STATUS}")=1 |Average |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array Controller Cache Discovery | Scanning table of Array controllers: CPQIDA-MIB::cpqDaAccelTable |
SNMP agent | array.cache.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
#{#CACHECNTRLINDEX}: Disk array cache controller status | MIB: CPQIDA-MIB Cache Module/Operations Status. This describes the status of the cache module and/or cache operations. Note that for some controller models, a cache module board that physically attaches to the controller or chipset may not be an available option. The status can be: Other (1) Indicates that the instrument agent does not recognize the status of the cache module. You may need to upgrade the instrument agent. Invalid (2) Indicates that a cache module board has not been installed in this system or is present but not configured. Enabled (3) Indicates that cache operations are currently configured and enabled for at least one logical drive. Temporarily Disabled (4) Indicates that cache operations have been temporarily disabled. View the cache module board error code object to determine why the write cache operations have been temporarily disabled. Permanently Disabled (5) Indicates that cache operations have been permanently disabled. View the cache module board error code object to determine why the write cache operations have been disabled. Cache Module Flash Memory Not Attached (6) Indicates that the flash memory component of the flash backed cache module is not attached. This status will be set when the flash memory is not attached and the Supercap is attached. This value is only used on flash backed cache modules that support removable flash memory. Cache Module Degraded Failsafe Speed (7) Indicates that the cache module board is currently degraded and operating at a failsafe speed. View variables cpqDaCacheMemoryDataWidth and cpqDaCacheMemoryTransferRate to obtain the cache module board`s current memory data width and memory transfer rate. Cache Module Critical Failure (8) Indicates that the cache module board has encountered a critical failure. The controller is currently operating in Zero Memory Raid mode. Read Cache Could Not Be Mapped (9) Indicates that the read cache memory in a split cache configuration could not be mapped by the operating system and as a result is not available. This status may be caused by virtual space limitations in certain operating systems and is only applicable to B-Series controllers. |
SNMP agent | system.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}] Preprocessing
|
#{#CACHECNTRLINDEX}: Disk array cache controller battery status | MIB: CPQIDA-MIB Cache Module Board Backup Power Status. This monitors the status of each backup power source on the board. The backup power source can only recharge when the system has power applied. The type of backup power source used is indicated by cpqDaAccelBackupPowerSource. The following values are valid: Other (1) Indicates that the instrument agent does not recognize backup power status. You may need to update your software. Ok (2) The backup power source is fully charged. Recharging (3) The array controller has one or more cache module backup power sources that are recharging. Cache module operations such as Battery/Flash Backed Write Cache, Expansion, Extension and Migration are temporarily suspended until the backup power source is fully charged. Cache module operations will automatically resume when charging is complete. Failed (4) The battery pack is below the sufficient voltage level and has not recharged in 36 hours. Your Cache Module board needs to be serviced. Degraded (5) The battery is still operating, however, one of the batteries in the pack has failed to recharge properly. Your Cache Module board should be serviced as soon as possible. NotPresent (6) A backup power source is not present on the cache module board. Some controllers do not have backup power sources. Capacitor Failed (7) The flash backed cache module capacitor is below the sufficient voltage level and has not recharged in 10 minutes. Your Cache Module board needs to be serviced. |
SNMP agent | system.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
#{#CACHECNTRLINDEX}: Disk array cache controller is in critical state! | Please check the device for faults |
count(/HP iLO by SNMP/system.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CACHE_CRIT_STATUS:\"cacheModCriticalFailure\"}")=1 |Average |
||
#{#CACHECNTRLINDEX}: Disk array cache controller is in warning state | Please check the device for faults |
count(/HP iLO by SNMP/system.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CACHE_WARN_STATUS:\"cacheModDegradedFailsafeSpeed\"}")=1 or count(/HP iLO by SNMP/system.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CACHE_WARN_STATUS:\"cacheReadCacheNotMapped\"}")=1 or count(/HP iLO by SNMP/system.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CACHE_WARN_STATUS:\"cacheModFlashMemNotAttached\"}")=1 |Warning |
Depends on:
|
|
#{#CACHECNTRLINDEX}: Disk array cache controller is not in optimal state | Please check the device for faults |
count(/HP iLO by SNMP/system.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}],#1,"ne","{$DISK_ARRAY_CACHE_OK_STATUS:\"enabled\"}")=1 and last(/HP iLO by SNMP/system.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])<>{$DISK_ARRAY_CACHE_WARN_STATUS:"invalid"} |Warning |
Depends on:
|
|
#{#CACHECNTRLINDEX}: Disk array cache controller battery is in critical state! | Please check the device for faults |
count(/HP iLO by SNMP/system.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CACHE_BATTERY_CRIT_STATUS:\"failed\"}")=1 or count(/HP iLO by SNMP/system.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CACHE_BATTERY_CRIT_STATUS:\"capacitorFailed\"}")=1 |Average |
||
#{#CACHECNTRLINDEX}: Disk array cache controller battery is in warning state | Please check the device for faults |
count(/HP iLO by SNMP/system.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CACHE_BATTERY_WARN_STATUS:\"degraded\"}")=1 |Warning |
Depends on:
|
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring HPE ProLiant DL380 servers with HP iLO version 4 and later via Zabbix SNMP agent that works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Default |
---|---|---|
{$HEALTH.STATUS.CRIT} | The critical status of the health for trigger expression. |
4 |
{$HEALTH.STATUS.WARN} | The warning status of the health for trigger expression. |
3 |
{$PSU.STATUS.CRIT} | The critical value of the PSU sensor for trigger expression. |
4 |
{$PSU.STATUS.WARN} | The warning value of the PSU sensor for trigger expression. |
3 |
{$FAN.STATUS.CRIT} | The critical value of the FAN sensor for trigger expression. |
4 |
{$FAN.STATUS.WARN} | The warning value of the FAN sensor for trigger expression. |
3 |
{$DISK.ARRAY.STATUS.CRIT} | The critical status of the disk array for trigger expression. |
4 |
{$DISK.ARRAY.STATUS.WARN} | The warning status of the disk array for trigger expression. |
3 |
{$DISK.ARRAY.CACHE.STATUS.CRIT:"cacheModCriticalFailure"} | The critical status of the disk array cache for trigger expression. |
8 |
{$DISK.ARRAY.CACHE.STATUS.WARN:"invalid"} | The warning status of the disk array cache for trigger expression. |
2 |
{$DISK.ARRAY.CACHE.STATUS.WARN:"cacheModDegradedFailsafeSpeed"} | The warning status of the disk array cache for trigger expression. |
7 |
{$DISK.ARRAY.CACHE.STATUS.WARN:"cacheReadCacheNotMapped"} | The warning status of the disk array cache for trigger expression. |
9 |
{$DISK.ARRAY.CACHE.STATUS.WARN:"cacheModFlashMemNotAttached"} | The warning status of the disk array cache for trigger expression. |
6 |
{$DISK.ARRAY.CACHE.STATUS.OK:"enabled"} | The normal status of the disk array cache for trigger expression. |
3 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT:"failed"} | The critical status of the disk array cache battery for trigger expression. |
4 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT:"capacitorFailed"} | The critical status of the disk array cache battery for trigger expression. |
7 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.WARN:"degraded"} | The warning status of the disk array cache battery for trigger expression. |
5 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.WARN:"notPresent"} | The warning status of the disk array cache battery for trigger expression. |
6 |
{$VDISK.STATUS.CRIT} | The critical status of the virtual disk for trigger expression. |
3 |
{$VDISK.STATUS.OK} | The normal status of the virtual disk for trigger expression. |
2 |
{$DISK.STATUS.WARN} | The warning status of the disk for trigger expression. |
4 |
{$DISK.STATUS.FAIL} | The critical status of the disk for trigger expression. |
3 |
{$DISK.SMART.STATUS.FAIL:"replaceDrive"} | The critical S.M.A.R.T status of the disk for trigger expression. |
3 |
{$DISK.SMART.STATUS.FAIL:"replaceDriveSSDWearOut"} | The critical S.M.A.R.T status of the disk for trigger expression. |
4 |
{$SNMP.TIMEOUT} | The time interval for SNMP agent availability trigger expression. |
5m |
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE ProLiant DL380: Overall system health status | MIB: CPQHLTH-MIB The overall condition. This object represents the overall status of the server information represented by this MIB. |
SNMP agent | hp.server.status[cpqHeMibCondition] Preprocessing
|
HPE ProLiant DL380: Hardware model name | MIB: CPQSINFO-MIB The machine product name. The name of the machine used in this system. |
SNMP agent | hp.server.hw.model[cpqSiProductName] Preprocessing
|
HPE ProLiant DL380: Hardware serial number | MIB: CPQSINFO-MIB The serial number of the physical system unit. The string will be empty if the system does not report the serial number function. |
SNMP agent | hp.server.hw.serialnumber[cpqSiSysSerialNum] Preprocessing
|
HPE ProLiant DL380: System temperature status | MIB: CPQHLTH-MIB This value specifies the overall condition of the system's thermal environment. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. |
SNMP agent | hp.server.sensor.temp.status[cpqHeThermalCondition] Preprocessing
|
HPE ProLiant DL380: Uptime (network) | MIB: SNMPv2-MIB The time (in hundredths of a second) since the network management portion of the system was last re-initialized. |
SNMP agent | hp.server.net.uptime[sysUpTime] Preprocessing
|
HPE ProLiant DL380: Uptime (hardware) | MIB: HOST-RESOURCES-MIB The amount of time since this host was last initialized. Note that this is different from sysUpTime in the SNMPv2-MIB [RFC1907] because sysUpTime is the uptime of the network management portion of the system. |
SNMP agent | hp.server.hw.uptime[hrSystemUptime] Preprocessing
|
HPE ProLiant DL380: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other snmptrap items. |
SNMP trap | snmptrap.fallback |
HPE ProLiant DL380: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., `telephone closet, 3rd floor'). If the location is unknown, the value is the zero-length string. |
SNMP agent | hp.server.location[sysLocation] Preprocessing
|
HPE ProLiant DL380: System contact details | MIB: SNMPv2-MIB The textual identification of the contact person for this managed node, together with information on how to contact this person. If no contact information is known, the value is the zero-length string. |
SNMP agent | hp.server.contact[sysContact] Preprocessing
|
HPE ProLiant DL380: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the network management subsystem contained in the entity. This value is allocated within the SMI enterprises subtree (1.3.6.1.4.1) and provides an easy and unambiguous means for determining |
SNMP agent | hp.server.objectid[sysObjectID] Preprocessing
|
HPE ProLiant DL380: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node.By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is the zero-length string. |
SNMP agent | hp.server.name[sysName] Preprocessing
|
HPE ProLiant DL380: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating-system, and networking software. |
SNMP agent | hp.server.descr[sysDescr] Preprocessing
|
HPE ProLiant DL380: SNMP agent availability | Zabbix internal | zabbix[host,snmp,available] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HPE ProLiant DL380: System status is in critical state | Please check the device for errors. |
last(/HPE ProLiant DL380 by SNMP/hp.server.status[cpqHeMibCondition])={$HEALTH.STATUS.CRIT} |High |
||
HPE ProLiant DL380: System status is in warning state | Please check the device for warnings. |
last(/HPE ProLiant DL380 by SNMP/hp.server.status[cpqHeMibCondition])={$HEALTH.STATUS.WARN} |Warning |
Depends on:
|
|
HPE ProLiant DL380: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/HPE ProLiant DL380 by SNMP/hp.server.hw.serialnumber[cpqSiSysSerialNum],#1)<>last(/HPE ProLiant DL380 by SNMP/hp.server.hw.serialnumber[cpqSiSysSerialNum],#2) and length(last(/HPE ProLiant DL380 by SNMP/hp.server.hw.serialnumber[cpqSiSysSerialNum]))>0 |Info |
Manual close: Yes | |
HPE ProLiant DL380: Host has been restarted | Uptime is less than 10 minutes. |
(last(/HPE ProLiant DL380 by SNMP/hp.server.hw.uptime[hrSystemUptime])>0 and last(/HPE ProLiant DL380 by SNMP/hp.server.hw.uptime[hrSystemUptime])<10m) or (last(/HPE ProLiant DL380 by SNMP/hp.server.hw.uptime[hrSystemUptime])=0 and last(/HPE ProLiant DL380 by SNMP/hp.server.net.uptime[sysUpTime])<10m) |Warning |
Manual close: Yes | |
HPE ProLiant DL380: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/HPE ProLiant DL380 by SNMP/hp.server.name[sysName],#1)<>last(/HPE ProLiant DL380 by SNMP/hp.server.name[sysName],#2) and length(last(/HPE ProLiant DL380 by SNMP/hp.server.name[sysName]))>0 |Info |
Manual close: Yes | |
HPE ProLiant DL380: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/HPE ProLiant DL380 by SNMP/zabbix[host,snmp,available],{$SNMP.TIMEOUT})=0 |Warning |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable |
SNMP agent | temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: {#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.{#SNMPINDEX}] Preprocessing
|
{#SNMPINDEX}: Temperature sensor location | MIB: CPQHLTH-MIB This specifies the location of the temperature sensor present in the system. |
SNMP agent | hp.server.sensor.temp.locale[cpqHeTemperatureLocale.{#SNMPINDEX}] Preprocessing
|
{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 3 |Warning |
||
{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature ambient discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with ambient(11) and 0.1 index filter |
SNMP agent | temp.ambient.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Ambient: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: Ambient |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.Ambient.{#SNMPINDEX}] Preprocessing
|
Ambient: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Ambient: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
Ambient: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 3 |Warning |
||
Ambient: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature CPU discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with cpu(6) filter |
SNMP agent | temp.cpu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
CPU-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: CPU-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.CPU.{#SNMPINDEX}] Preprocessing
|
CPU-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
CPU-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
CPU-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 3 |Warning |
||
CPU-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature memory discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with memory(7) filter |
SNMP agent | temp.memory.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Memory-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: Memory-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.Memory.{#SNMPINDEX}] Preprocessing
|
Memory-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Memory-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
Memory-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 3 |Warning |
||
Memory-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature PSU discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with powerSupply(10) filter |
SNMP agent | temp.psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: PSU-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.PSU.{#SNMPINDEX}] Preprocessing
|
PSU-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
PSU-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
PSU-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 3 |Warning |
||
PSU-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature I/O discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with ioBoard(5) filter |
SNMP agent | temp.io.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
I/O-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: I/O-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius."I/O.{#SNMPINDEX}"] Preprocessing
|
I/O-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
I/O-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 1 |Info |
Manual close: Yes | |
I/O-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 3 |Warning |
||
I/O-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature system discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with system(3) filter |
SNMP agent | temp.system.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
System-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: System-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.System.{#SNMPINDEX}] Preprocessing
|
System-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
System-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
System-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 3 |Warning |
||
System-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | CPQHLTH-MIB::cpqHeFltTolPowerSupplyStatus |
SNMP agent | psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Chassis {#CHASSISNUM}, bay {#BAYNUM}: Power supply status | MIB: CPQHLTH-MIB The condition of the power supply. This value will be one of the following: other(1) The status could not be determined or not present. ok(2) The power supply is operating normally. degraded(3) A temperature sensor, fan or other power supply component is outside of normal operating range. failed(4) A power supply component detects a condition that could permanently damage the system. |
SNMP agent | hp.server.sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Chassis {#CHASSISNUM}, bay {#BAYNUM}: Power supply is in critical state | Please check the power supply unit for errors. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}])={$PSU.STATUS.CRIT} |Average |
||
Chassis {#CHASSISNUM}, bay {#BAYNUM}: Power supply is in warning state | Please check the power supply unit for errors. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}])={$PSU.STATUS.WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN discovery | CPQHLTH-MIB::cpqHeFltTolFanCondition |
SNMP agent | fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Fan {#SNMPINDEX}: Fan status | MIB: CPQHLTH-MIB The condition of the fan. This value will be one of the following: other(1) Fan status detection is not supported by this system or driver. ok(2) The fan is operating properly. degraded(2) A redundant fan is not operating properly. failed(4) A non-redundant fan is not operating properly. |
SNMP agent | hp.server.sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Fan {#SNMPINDEX}: Fan is in critical state | Please check the fan unit. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}])={$FAN.STATUS.CRIT} |Average |
||
Fan {#SNMPINDEX}: Fan is in warning state | Please check the fan unit. |
last(/HPE ProLiant DL380 by SNMP/hp.server.sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}])={$FAN.STATUS.WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Scanning table of Array controllers: CPQIDA-MIB::cpqDaCntlrTable |
SNMP agent | array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#CNTLR_LOCATION}: Disk array controller status | MIB: CPQIDA-MIB This value represents the overall condition of this controller, and any associated logical drives, physical drives, and array accelerators. |
SNMP agent | hp.server.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}] Preprocessing
|
{#CNTLR_LOCATION}: Disk array controller model | MIB: CPQIDA-MIB Array Controller Model. The type of controller card. |
SNMP agent | hp.server.hw.diskarray.model[cpqDaCntlrModel.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#CNTLR_LOCATION}: Disk array controller is in critical state | Please check the device for faults. |
last(/HPE ProLiant DL380 by SNMP/hp.server.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}])={$DISK.ARRAY.STATUS.CRIT} |High |
||
{#CNTLR_LOCATION}: Disk array controller is in warning state | Please check the device for faults. |
last(/HPE ProLiant DL380 by SNMP/hp.server.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}])={$DISK.ARRAY.STATUS.WARN} |Average |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller cache discovery | Scanning table of Array controllers: CPQIDA-MIB::cpqDaAccelTable |
SNMP agent | array.cache.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
#{#CACHECNTRLINDEX}: Disk array cache controller status | MIB: CPQIDA-MIB Cache Module/Operations Status. This describes the status of the cache module and/or cache operations. Note that for some controller models, a cache module board that physically attaches to the controller or chipset may not be an available option. The status can be: Other (1) Indicates that the instrument agent does not recognize the status of the cache module. You may need to upgrade the instrument agent. Invalid (2) Indicates that a cache module board has not been installed in this system or is present but not configured. Enabled (3) Indicates that cache operations are currently configured and enabled for at least one logical drive. Temporarily Disabled (4) Indicates that cache operations have been temporarily disabled. View the cache module board error code object to determine why the write cache operations have been temporarily disabled. Permanently Disabled (5) Indicates that cache operations have been permanently disabled. View the cache module board error code object to determine why the write cache operations have been disabled. Cache Module Flash Memory Not Attached (6) Indicates that the flash memory component of the flash backed cache module is not attached. This status will be set when the flash memory is not attached and the Supercap is attached. This value is only used on flash backed cache modules that support removable flash memory. Cache Module Degraded Failsafe Speed (7) Indicates that the cache module board is currently degraded and operating at a failsafe speed. View variables cpqDaCacheMemoryDataWidth and cpqDaCacheMemoryTransferRate to obtain the cache module board`s current memory data width and memory transfer rate. Cache Module Critical Failure (8) Indicates that the cache module board has encountered a critical failure. The controller is currently operating in Zero Memory Raid mode. Read Cache Could Not Be Mapped (9) Indicates that the read cache memory in a split cache configuration could not be mapped by the operating system and as a result is not available. This status may be caused by virtual space limitations in certain operating systems and is only applicable to B-Series controllers. |
SNMP agent | hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}] Preprocessing
|
#{#CACHECNTRLINDEX}: Disk array cache controller battery status | MIB: CPQIDA-MIB Cache Module Board Backup Power Status. This monitors the status of each backup power source on the board. The backup power source can only recharge when the system has power applied. The type of backup power source used is indicated by cpqDaAccelBackupPowerSource. The following values are valid: Other (1) Indicates that the instrument agent does not recognize backup power status. You may need to update your software. Ok (2) The backup power source is fully charged. Recharging (3) The array controller has one or more cache module backup power sources that are recharging. Cache module operations such as Battery/Flash Backed Write Cache, Expansion, Extension and Migration are temporarily suspended until the backup power source is fully charged. Cache module operations will automatically resume when charging is complete. Failed (4) The battery pack is below the sufficient voltage level and has not recharged in 36 hours. Your Cache Module board needs to be serviced. Degraded (5) The battery is still operating, however, one of the batteries in the pack has failed to recharge properly. Your Cache Module board should be serviced as soon as possible. NotPresent (6) A backup power source is not present on the cache module board. Some controllers do not have backup power sources. Capacitor Failed (7) The flash backed cache module capacitor is below the sufficient voltage level and has not recharged in 10 minutes. Your Cache Module board needs to be serviced. |
SNMP agent | hp.server.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
#{#CACHECNTRLINDEX}: Disk array cache controller is in critical state! | Please check the device for faults. |
last(/HPE ProLiant DL380 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.STATUS.CRIT:"cacheModCriticalFailure"} |Average |
||
#{#CACHECNTRLINDEX}: Disk array cache controller is in warning state | Please check the device for faults. |
last(/HPE ProLiant DL380 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.STATUS.WARN:"cacheModDegradedFailsafeSpeed"} or last(/HPE ProLiant DL380 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.STATUS.WARN:"cacheReadCacheNotMapped"} or last(/HPE ProLiant DL380 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.STATUS.WARN:"cacheModFlashMemNotAttached"} |Warning |
Depends on:
|
|
#{#CACHECNTRLINDEX}: Disk array cache controller is not in optimal state | Please check the device for faults. |
last(/HPE ProLiant DL380 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])<>{$DISK.ARRAY.CACHE.STATUS.OK:"enabled"} and last(/HPE ProLiant DL380 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])<>{$DISK.ARRAY.CACHE.STATUS.WARN:"invalid"} |Warning |
Depends on:
|
|
#{#CACHECNTRLINDEX}: Disk array cache controller battery is in critical state | Please check the device for faults. |
last(/HPE ProLiant DL380 by SNMP/hp.server.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT:"failed"} or last(/HPE ProLiant DL380 by SNMP/hp.server.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT:"capacitorFailed"} |Average |
||
#{#CACHECNTRLINDEX}: Disk array cache controller battery is in warning state | Please check the device for faults. |
last(/HPE ProLiant DL380 by SNMP/hp.server.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.BATTERY.STATUS.WARN:"degraded"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Scanning table of physical drive entries CPQIDA-MIB::cpqDaPhyDrvTable. |
SNMP agent | physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#DISK_LOCATION}: Physical disk status | MIB: CPQIDA-MIB Physical Drive Status. This shows the status of the physical drive. The following values are valid for the physical drive status: other (1) Indicates that the instrument agent does not recognize the drive. You may need to upgrade your instrument agent and/or driver software. ok (2) Indicates the drive is functioning properly. failed (3) Indicates that the drive is no longer operating and should be replaced. predictiveFailure(4) Indicates that the drive has a predictive failure error and should be replaced. |
SNMP agent | hp.server.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk S.M.A.R.T. status | MIB: CPQIDA-MIB Physical Drive S.M.A.R.T Status. The following values are defined: other(1) The agent is unable to determine if the status of S.M.A.R.T predictive failure monitoring for this drive. ok(2) Indicates the drive is functioning properly. replaceDrive(3) Indicates that the drive has a S.M.A.R.T predictive failure error and should be replaced. |
SNMP agent | hp.server.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk serial number | MIB: CPQIDA-MIB Physical Drive Serial Number. This is the serial number assigned to the physical drive. This value is based upon the serial number as returned by the SCSI inquiry command but may have been modified due to space limitations. This can be used for identification purposes. |
SNMP agent | hp.server.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk model name | MIB: CPQIDA-MIB Physical Drive Model. This is a text description of the physical drive. The text that appears depends upon who manufactured the drive and the drive type. If a drive fails, note the model to identify the type of drive necessary for replacement. If a model number is not present, you may not have properly initialized the drive array to which the physical drive is attached for monitoring. |
SNMP agent | hp.server.hw.physicaldisk.model[cpqDaPhyDrvModel.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk media type | MIB: CPQIDA-MIB Drive Array Physical Drive Media Type. The following values are defined: other(1) The instrument agent is unable to determine the physical drive's media type. rotatingPlatters(2) The physical drive media is composed of rotating platters. solidState(3) The physical drive media is composed of solid state electronics. |
SNMP agent | hp.server.hw.physicaldisk.media_type[cpqDaPhyDrvMediaType.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Disk size | MIB: CPQIDA-MIB Physical Drive Size in MB. This is the size of the physical drive in megabytes. This value is calculated using the value 1,048,576 (2^20) as a megabyte. Drive manufacturers sometimes use the number 1,000,000 as a megabyte when giving drive capacities so this value may differ from the advertised size of a drive. This field is only applicable for controllers which support SCSI drives, and therefore is not supported by the IDA or IDA-2 controllers. The field will contain 0xFFFFFFFF if the drive capacity cannot be calculated or if the controller does not support SCSI drives. |
SNMP agent | hp.server.hw.physicaldisk.size[cpqDaPhyDrvMediaType.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#DISK_LOCATION}: Physical disk failed | Please check physical disk for warnings or errors. |
last(/HPE ProLiant DL380 by SNMP/hp.server.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}])={$DISK.STATUS.FAIL} |High |
||
{#DISK_LOCATION}: Physical disk is in warning state | Please check physical disk for warnings or errors. |
last(/HPE ProLiant DL380 by SNMP/hp.server.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}])={$DISK.STATUS.WARN} |Warning |
Depends on:
|
|
{#DISK_LOCATION}: Physical disk S.M.A.R.T. failed | Disk probably requires replacement. |
last(/HPE ProLiant DL380 by SNMP/hp.server.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}])={$DISK.SMART.STATUS.FAIL:"replaceDrive"} or last(/HPE ProLiant DL380 by SNMP/hp.server.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}])={$DISK.SMART.STATUS.FAIL:"replaceDriveSSDWearOut"} |High |
Depends on:
|
|
{#DISK_LOCATION}: Disk has been replaced | Disk serial number has changed. Acknowledge to close the problem manually. |
last(/HPE ProLiant DL380 by SNMP/hp.server.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}],#1)<>last(/HPE ProLiant DL380 by SNMP/hp.server.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}],#2) and length(last(/HPE ProLiant DL380 by SNMP/hp.server.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | CPQIDA-MIB::cpqDaLogDrvTable |
SNMP agent | virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Disk {#SNMPINDEX}({#DISK_NAME}): Status | Logical Drive Status. |
SNMP agent | hp.server.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}] Preprocessing
|
Disk {#SNMPINDEX}({#DISK_NAME}): Layout type | Logical Drive Fault Tolerance. This shows the fault tolerance mode of the logical drive. |
SNMP agent | hp.server.hw.virtualdisk.layout[cpqDaLogDrvFaultTol.{#SNMPINDEX}] Preprocessing
|
Disk {#SNMPINDEX}({#DISK_NAME}): Disk size | Logical Drive Size. This is the size of the logical drive in megabytes. This value is calculated using the value 1,048,576 (2^20) as a megabyte. Drive manufacturers sometimes use the number 1,000,000 as a megabyte when giving drive capacities so this value may differ from the advertised size of a drive. |
SNMP agent | hp.server.hw.virtualdisk.size[cpqDaLogDrvSize.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Disk {#SNMPINDEX}({#DISK_NAME}): Virtual disk failed | Please check virtual disk for warnings or errors. |
last(/HPE ProLiant DL380 by SNMP/hp.server.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}])={$VDISK.STATUS.CRIT} |High |
||
Disk {#SNMPINDEX}({#DISK_NAME}): Virtual disk is not in OK state | Please check virtual disk for warnings or errors. |
last(/HPE ProLiant DL380 by SNMP/hp.server.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}])<>{$VDISK.STATUS.OK} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | CPQIDA-MIB::cpqNicIfPhysAdapterTable |
SNMP agent | net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#ADAPTERNAME} port {#ADAPTERINDEX}: Status | MIB: CPQNIC-MIB The physical adapter status. The following values are valid: unknown(1) The instrument agent was not able to determine the status of the adapter. The instrument agent may need to be upgraded. ok(2) The physical adapter is operating properly. generalFailure(3) The physical adapter has failed. linkFailure(4) The physical adapter has lost link. Check the cable connections to this adapter. |
SNMP agent | hp.server.net.if.status[cpqNicIfPhysAdapterStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#ADAPTERNAME} port {#ADAPTERINDEX}: Adapter has failed | Please check the physical adapter. |
last(/HPE ProLiant DL380 by SNMP/hp.server.net.if.status[cpqNicIfPhysAdapterStatus.{#SNMPINDEX}])=3 |High |
||
{#ADAPTERNAME} port {#ADAPTERINDEX}: Adapter has lost link | Please check the cable connections to this adapter. |
last(/HPE ProLiant DL380 by SNMP/hp.server.net.if.status[cpqNicIfPhysAdapterStatus.{#SNMPINDEX}])=4 |Average |
Depends on:
|
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring HPE ProLiant DL360 servers with HP iLO version 4 and later via Zabbix SNMP agent that works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Default |
---|---|---|
{$HEALTH.STATUS.CRIT} | The critical status of the health for trigger expression. |
4 |
{$HEALTH.STATUS.WARN} | The warning status of the health for trigger expression. |
3 |
{$PSU.STATUS.CRIT} | The critical value of the PSU sensor for trigger expression. |
4 |
{$PSU.STATUS.WARN} | The warning value of the PSU sensor for trigger expression. |
3 |
{$FAN.STATUS.CRIT} | The critical value of the FAN sensor for trigger expression. |
4 |
{$FAN.STATUS.WARN} | The warning value of the FAN sensor for trigger expression. |
3 |
{$DISK.ARRAY.STATUS.CRIT} | The critical status of the disk array for trigger expression. |
4 |
{$DISK.ARRAY.STATUS.WARN} | The warning status of the disk array for trigger expression. |
3 |
{$DISK.ARRAY.CACHE.STATUS.CRIT:"cacheModCriticalFailure"} | The critical status of the disk array cache for trigger expression. |
8 |
{$DISK.ARRAY.CACHE.STATUS.WARN:"invalid"} | The warning status of the disk array cache for trigger expression. |
2 |
{$DISK.ARRAY.CACHE.STATUS.WARN:"cacheModDegradedFailsafeSpeed"} | The warning status of the disk array cache for trigger expression. |
7 |
{$DISK.ARRAY.CACHE.STATUS.WARN:"cacheReadCacheNotMapped"} | The warning status of the disk array cache for trigger expression. |
9 |
{$DISK.ARRAY.CACHE.STATUS.WARN:"cacheModFlashMemNotAttached"} | The warning status of the disk array cache for trigger expression. |
6 |
{$DISK.ARRAY.CACHE.STATUS.OK:"enabled"} | The normal status of the disk array cache for trigger expression. |
3 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT:"failed"} | The critical status of the disk array cache battery for trigger expression. |
4 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT:"capacitorFailed"} | The critical status of the disk array cache battery for trigger expression. |
7 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.WARN:"degraded"} | The warning status of the disk array cache battery for trigger expression. |
5 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.WARN:"notPresent"} | The warning status of the disk array cache battery for trigger expression. |
6 |
{$VDISK.STATUS.CRIT} | The critical status of the virtual disk for trigger expression. |
3 |
{$VDISK.STATUS.OK} | The normal status of the virtual disk for trigger expression. |
2 |
{$DISK.STATUS.WARN} | The warning status of the disk for trigger expression. |
4 |
{$DISK.STATUS.FAIL} | The critical status of the disk for trigger expression. |
3 |
{$DISK.SMART.STATUS.FAIL:"replaceDrive"} | The critical S.M.A.R.T status of the disk for trigger expression. |
3 |
{$DISK.SMART.STATUS.FAIL:"replaceDriveSSDWearOut"} | The critical S.M.A.R.T status of the disk for trigger expression. |
4 |
{$SNMP.TIMEOUT} | The time interval for SNMP agent availability trigger expression. |
5m |
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE ProLiant DL360: Overall system health status | MIB: CPQHLTH-MIB The overall condition. This object represents the overall status of the server information represented by this MIB. |
SNMP agent | hp.server.status[cpqHeMibCondition] Preprocessing
|
HPE ProLiant DL360: Hardware model name | MIB: CPQSINFO-MIB The machine product name. The name of the machine used in this system. |
SNMP agent | hp.server.hw.model[cpqSiProductName] Preprocessing
|
HPE ProLiant DL360: Hardware serial number | MIB: CPQSINFO-MIB The serial number of the physical system unit. The string will be empty if the system does not report the serial number function. |
SNMP agent | hp.server.hw.serialnumber[cpqSiSysSerialNum] Preprocessing
|
HPE ProLiant DL360: System temperature status | MIB: CPQHLTH-MIB This value specifies the overall condition of the system's thermal environment. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. |
SNMP agent | hp.server.sensor.temp.status[cpqHeThermalCondition] Preprocessing
|
HPE ProLiant DL360: Uptime (network) | MIB: SNMPv2-MIB The time (in hundredths of a second) since the network management portion of the system was last re-initialized. |
SNMP agent | hp.server.net.uptime[sysUpTime] Preprocessing
|
HPE ProLiant DL360: Uptime (hardware) | MIB: HOST-RESOURCES-MIB The amount of time since this host was last initialized. Note that this is different from sysUpTime in the SNMPv2-MIB [RFC1907] because sysUpTime is the uptime of the network management portion of the system. |
SNMP agent | hp.server.hw.uptime[hrSystemUptime] Preprocessing
|
HPE ProLiant DL360: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other snmptrap items. |
SNMP trap | snmptrap.fallback |
HPE ProLiant DL360: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., `telephone closet, 3rd floor'). If the location is unknown, the value is the zero-length string. |
SNMP agent | hp.server.location[sysLocation] Preprocessing
|
HPE ProLiant DL360: System contact details | MIB: SNMPv2-MIB The textual identification of the contact person for this managed node, together with information on how to contact this person. If no contact information is known, the value is the zero-length string. |
SNMP agent | hp.server.contact[sysContact] Preprocessing
|
HPE ProLiant DL360: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the network management subsystem contained in the entity. This value is allocated within the SMI enterprises subtree (1.3.6.1.4.1) and provides an easy and unambiguous means for determining |
SNMP agent | hp.server.objectid[sysObjectID] Preprocessing
|
HPE ProLiant DL360: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node.By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is the zero-length string. |
SNMP agent | hp.server.name[sysName] Preprocessing
|
HPE ProLiant DL360: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating-system, and networking software. |
SNMP agent | hp.server.descr[sysDescr] Preprocessing
|
HPE ProLiant DL360: SNMP agent availability | Zabbix internal | zabbix[host,snmp,available] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HPE ProLiant DL360: System status is in critical state | Please check the device for errors. |
last(/HPE ProLiant DL360 by SNMP/hp.server.status[cpqHeMibCondition])={$HEALTH.STATUS.CRIT} |High |
||
HPE ProLiant DL360: System status is in warning state | Please check the device for warnings. |
last(/HPE ProLiant DL360 by SNMP/hp.server.status[cpqHeMibCondition])={$HEALTH.STATUS.WARN} |Warning |
Depends on:
|
|
HPE ProLiant DL360: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/HPE ProLiant DL360 by SNMP/hp.server.hw.serialnumber[cpqSiSysSerialNum],#1)<>last(/HPE ProLiant DL360 by SNMP/hp.server.hw.serialnumber[cpqSiSysSerialNum],#2) and length(last(/HPE ProLiant DL360 by SNMP/hp.server.hw.serialnumber[cpqSiSysSerialNum]))>0 |Info |
Manual close: Yes | |
HPE ProLiant DL360: Host has been restarted | Uptime is less than 10 minutes. |
(last(/HPE ProLiant DL360 by SNMP/hp.server.hw.uptime[hrSystemUptime])>0 and last(/HPE ProLiant DL360 by SNMP/hp.server.hw.uptime[hrSystemUptime])<10m) or (last(/HPE ProLiant DL360 by SNMP/hp.server.hw.uptime[hrSystemUptime])=0 and last(/HPE ProLiant DL360 by SNMP/hp.server.net.uptime[sysUpTime])<10m) |Warning |
Manual close: Yes | |
HPE ProLiant DL360: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/HPE ProLiant DL360 by SNMP/hp.server.name[sysName],#1)<>last(/HPE ProLiant DL360 by SNMP/hp.server.name[sysName],#2) and length(last(/HPE ProLiant DL360 by SNMP/hp.server.name[sysName]))>0 |Info |
Manual close: Yes | |
HPE ProLiant DL360: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/HPE ProLiant DL360 by SNMP/zabbix[host,snmp,available],{$SNMP.TIMEOUT})=0 |Warning |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable |
SNMP agent | temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: {#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.{#SNMPINDEX}] Preprocessing
|
{#SNMPINDEX}: Temperature sensor location | MIB: CPQHLTH-MIB This specifies the location of the temperature sensor present in the system. |
SNMP agent | hp.server.sensor.temp.locale[cpqHeTemperatureLocale.{#SNMPINDEX}] Preprocessing
|
{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 3 |Warning |
||
{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature ambient discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with ambient(11) and 0.1 index filter |
SNMP agent | temp.ambient.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Ambient: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: Ambient |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.Ambient.{#SNMPINDEX}] Preprocessing
|
Ambient: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Ambient: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
Ambient: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 3 |Warning |
||
Ambient: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature CPU discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with cpu(6) filter |
SNMP agent | temp.cpu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
CPU-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: CPU-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.CPU.{#SNMPINDEX}] Preprocessing
|
CPU-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
CPU-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
CPU-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 3 |Warning |
||
CPU-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature memory discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with memory(7) filter |
SNMP agent | temp.memory.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Memory-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: Memory-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.Memory.{#SNMPINDEX}] Preprocessing
|
Memory-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Memory-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
Memory-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 3 |Warning |
||
Memory-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature PSU discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with powerSupply(10) filter |
SNMP agent | temp.psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: PSU-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.PSU.{#SNMPINDEX}] Preprocessing
|
PSU-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
PSU-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
PSU-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 3 |Warning |
||
PSU-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature I/O discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with ioBoard(5) filter |
SNMP agent | temp.io.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
I/O-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: I/O-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius."I/O.{#SNMPINDEX}"] Preprocessing
|
I/O-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
I/O-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 1 |Info |
Manual close: Yes | |
I/O-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 3 |Warning |
||
I/O-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature system discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with system(3) filter |
SNMP agent | temp.system.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
System-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: System-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.System.{#SNMPINDEX}] Preprocessing
|
System-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
System-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
System-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 3 |Warning |
||
System-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | CPQHLTH-MIB::cpqHeFltTolPowerSupplyStatus |
SNMP agent | psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Chassis {#CHASSISNUM}, bay {#BAYNUM}: Power supply status | MIB: CPQHLTH-MIB The condition of the power supply. This value will be one of the following: other(1) The status could not be determined or not present. ok(2) The power supply is operating normally. degraded(3) A temperature sensor, fan or other power supply component is outside of normal operating range. failed(4) A power supply component detects a condition that could permanently damage the system. |
SNMP agent | hp.server.sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Chassis {#CHASSISNUM}, bay {#BAYNUM}: Power supply is in critical state | Please check the power supply unit for errors. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}])={$PSU.STATUS.CRIT} |Average |
||
Chassis {#CHASSISNUM}, bay {#BAYNUM}: Power supply is in warning state | Please check the power supply unit for errors. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}])={$PSU.STATUS.WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN discovery | CPQHLTH-MIB::cpqHeFltTolFanCondition |
SNMP agent | fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Fan {#SNMPINDEX}: Fan status | MIB: CPQHLTH-MIB The condition of the fan. This value will be one of the following: other(1) Fan status detection is not supported by this system or driver. ok(2) The fan is operating properly. degraded(2) A redundant fan is not operating properly. failed(4) A non-redundant fan is not operating properly. |
SNMP agent | hp.server.sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Fan {#SNMPINDEX}: Fan is in critical state | Please check the fan unit. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}])={$FAN.STATUS.CRIT} |Average |
||
Fan {#SNMPINDEX}: Fan is in warning state | Please check the fan unit. |
last(/HPE ProLiant DL360 by SNMP/hp.server.sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}])={$FAN.STATUS.WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Scanning table of Array controllers: CPQIDA-MIB::cpqDaCntlrTable |
SNMP agent | array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#CNTLR_LOCATION}: Disk array controller status | MIB: CPQIDA-MIB This value represents the overall condition of this controller, and any associated logical drives, physical drives, and array accelerators. |
SNMP agent | hp.server.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}] Preprocessing
|
{#CNTLR_LOCATION}: Disk array controller model | MIB: CPQIDA-MIB Array Controller Model. The type of controller card. |
SNMP agent | hp.server.hw.diskarray.model[cpqDaCntlrModel.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#CNTLR_LOCATION}: Disk array controller is in critical state | Please check the device for faults. |
last(/HPE ProLiant DL360 by SNMP/hp.server.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}])={$DISK.ARRAY.STATUS.CRIT} |High |
||
{#CNTLR_LOCATION}: Disk array controller is in warning state | Please check the device for faults. |
last(/HPE ProLiant DL360 by SNMP/hp.server.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}])={$DISK.ARRAY.STATUS.WARN} |Average |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller cache discovery | Scanning table of Array controllers: CPQIDA-MIB::cpqDaAccelTable |
SNMP agent | array.cache.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
#{#CACHECNTRLINDEX}: Disk array cache controller status | MIB: CPQIDA-MIB Cache Module/Operations Status. This describes the status of the cache module and/or cache operations. Note that for some controller models, a cache module board that physically attaches to the controller or chipset may not be an available option. The status can be: Other (1) Indicates that the instrument agent does not recognize the status of the cache module. You may need to upgrade the instrument agent. Invalid (2) Indicates that a cache module board has not been installed in this system or is present but not configured. Enabled (3) Indicates that cache operations are currently configured and enabled for at least one logical drive. Temporarily Disabled (4) Indicates that cache operations have been temporarily disabled. View the cache module board error code object to determine why the write cache operations have been temporarily disabled. Permanently Disabled (5) Indicates that cache operations have been permanently disabled. View the cache module board error code object to determine why the write cache operations have been disabled. Cache Module Flash Memory Not Attached (6) Indicates that the flash memory component of the flash backed cache module is not attached. This status will be set when the flash memory is not attached and the Supercap is attached. This value is only used on flash backed cache modules that support removable flash memory. Cache Module Degraded Failsafe Speed (7) Indicates that the cache module board is currently degraded and operating at a failsafe speed. View variables cpqDaCacheMemoryDataWidth and cpqDaCacheMemoryTransferRate to obtain the cache module board`s current memory data width and memory transfer rate. Cache Module Critical Failure (8) Indicates that the cache module board has encountered a critical failure. The controller is currently operating in Zero Memory Raid mode. Read Cache Could Not Be Mapped (9) Indicates that the read cache memory in a split cache configuration could not be mapped by the operating system and as a result is not available. This status may be caused by virtual space limitations in certain operating systems and is only applicable to B-Series controllers. |
SNMP agent | hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}] Preprocessing
|
#{#CACHECNTRLINDEX}: Disk array cache controller battery status | MIB: CPQIDA-MIB Cache Module Board Backup Power Status. This monitors the status of each backup power source on the board. The backup power source can only recharge when the system has power applied. The type of backup power source used is indicated by cpqDaAccelBackupPowerSource. The following values are valid: Other (1) Indicates that the instrument agent does not recognize backup power status. You may need to update your software. Ok (2) The backup power source is fully charged. Recharging (3) The array controller has one or more cache module backup power sources that are recharging. Cache module operations such as Battery/Flash Backed Write Cache, Expansion, Extension and Migration are temporarily suspended until the backup power source is fully charged. Cache module operations will automatically resume when charging is complete. Failed (4) The battery pack is below the sufficient voltage level and has not recharged in 36 hours. Your Cache Module board needs to be serviced. Degraded (5) The battery is still operating, however, one of the batteries in the pack has failed to recharge properly. Your Cache Module board should be serviced as soon as possible. NotPresent (6) A backup power source is not present on the cache module board. Some controllers do not have backup power sources. Capacitor Failed (7) The flash backed cache module capacitor is below the sufficient voltage level and has not recharged in 10 minutes. Your Cache Module board needs to be serviced. |
SNMP agent | hp.server.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
#{#CACHECNTRLINDEX}: Disk array cache controller is in critical state! | Please check the device for faults. |
last(/HPE ProLiant DL360 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.STATUS.CRIT:"cacheModCriticalFailure"} |Average |
||
#{#CACHECNTRLINDEX}: Disk array cache controller is in warning state | Please check the device for faults. |
last(/HPE ProLiant DL360 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.STATUS.WARN:"cacheModDegradedFailsafeSpeed"} or last(/HPE ProLiant DL360 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.STATUS.WARN:"cacheReadCacheNotMapped"} or last(/HPE ProLiant DL360 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.STATUS.WARN:"cacheModFlashMemNotAttached"} |Warning |
Depends on:
|
|
#{#CACHECNTRLINDEX}: Disk array cache controller is not in optimal state | Please check the device for faults. |
last(/HPE ProLiant DL360 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])<>{$DISK.ARRAY.CACHE.STATUS.OK:"enabled"} and last(/HPE ProLiant DL360 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])<>{$DISK.ARRAY.CACHE.STATUS.WARN:"invalid"} |Warning |
Depends on:
|
|
#{#CACHECNTRLINDEX}: Disk array cache controller battery is in critical state | Please check the device for faults. |
last(/HPE ProLiant DL360 by SNMP/hp.server.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT:"failed"} or last(/HPE ProLiant DL360 by SNMP/hp.server.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT:"capacitorFailed"} |Average |
||
#{#CACHECNTRLINDEX}: Disk array cache controller battery is in warning state | Please check the device for faults. |
last(/HPE ProLiant DL360 by SNMP/hp.server.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.BATTERY.STATUS.WARN:"degraded"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Scanning table of physical drive entries CPQIDA-MIB::cpqDaPhyDrvTable. |
SNMP agent | physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#DISK_LOCATION}: Physical disk status | MIB: CPQIDA-MIB Physical Drive Status. This shows the status of the physical drive. The following values are valid for the physical drive status: other (1) Indicates that the instrument agent does not recognize the drive. You may need to upgrade your instrument agent and/or driver software. ok (2) Indicates the drive is functioning properly. failed (3) Indicates that the drive is no longer operating and should be replaced. predictiveFailure(4) Indicates that the drive has a predictive failure error and should be replaced. |
SNMP agent | hp.server.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk S.M.A.R.T. status | MIB: CPQIDA-MIB Physical Drive S.M.A.R.T Status. The following values are defined: other(1) The agent is unable to determine if the status of S.M.A.R.T predictive failure monitoring for this drive. ok(2) Indicates the drive is functioning properly. replaceDrive(3) Indicates that the drive has a S.M.A.R.T predictive failure error and should be replaced. |
SNMP agent | hp.server.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk serial number | MIB: CPQIDA-MIB Physical Drive Serial Number. This is the serial number assigned to the physical drive. This value is based upon the serial number as returned by the SCSI inquiry command but may have been modified due to space limitations. This can be used for identification purposes. |
SNMP agent | hp.server.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk model name | MIB: CPQIDA-MIB Physical Drive Model. This is a text description of the physical drive. The text that appears depends upon who manufactured the drive and the drive type. If a drive fails, note the model to identify the type of drive necessary for replacement. If a model number is not present, you may not have properly initialized the drive array to which the physical drive is attached for monitoring. |
SNMP agent | hp.server.hw.physicaldisk.model[cpqDaPhyDrvModel.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk media type | MIB: CPQIDA-MIB Drive Array Physical Drive Media Type. The following values are defined: other(1) The instrument agent is unable to determine the physical drive's media type. rotatingPlatters(2) The physical drive media is composed of rotating platters. solidState(3) The physical drive media is composed of solid state electronics. |
SNMP agent | hp.server.hw.physicaldisk.media_type[cpqDaPhyDrvMediaType.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Disk size | MIB: CPQIDA-MIB Physical Drive Size in MB. This is the size of the physical drive in megabytes. This value is calculated using the value 1,048,576 (2^20) as a megabyte. Drive manufacturers sometimes use the number 1,000,000 as a megabyte when giving drive capacities so this value may differ from the advertised size of a drive. This field is only applicable for controllers which support SCSI drives, and therefore is not supported by the IDA or IDA-2 controllers. The field will contain 0xFFFFFFFF if the drive capacity cannot be calculated or if the controller does not support SCSI drives. |
SNMP agent | hp.server.hw.physicaldisk.size[cpqDaPhyDrvMediaType.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#DISK_LOCATION}: Physical disk failed | Please check physical disk for warnings or errors. |
last(/HPE ProLiant DL360 by SNMP/hp.server.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}])={$DISK.STATUS.FAIL} |High |
||
{#DISK_LOCATION}: Physical disk is in warning state | Please check physical disk for warnings or errors. |
last(/HPE ProLiant DL360 by SNMP/hp.server.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}])={$DISK.STATUS.WARN} |Warning |
Depends on:
|
|
{#DISK_LOCATION}: Physical disk S.M.A.R.T. failed | Disk probably requires replacement. |
last(/HPE ProLiant DL360 by SNMP/hp.server.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}])={$DISK.SMART.STATUS.FAIL:"replaceDrive"} or last(/HPE ProLiant DL360 by SNMP/hp.server.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}])={$DISK.SMART.STATUS.FAIL:"replaceDriveSSDWearOut"} |High |
Depends on:
|
|
{#DISK_LOCATION}: Disk has been replaced | Disk serial number has changed. Acknowledge to close the problem manually. |
last(/HPE ProLiant DL360 by SNMP/hp.server.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}],#1)<>last(/HPE ProLiant DL360 by SNMP/hp.server.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}],#2) and length(last(/HPE ProLiant DL360 by SNMP/hp.server.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | CPQIDA-MIB::cpqDaLogDrvTable |
SNMP agent | virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Disk {#SNMPINDEX}({#DISK_NAME}): Status | Logical Drive Status. |
SNMP agent | hp.server.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}] Preprocessing
|
Disk {#SNMPINDEX}({#DISK_NAME}): Layout type | Logical Drive Fault Tolerance. This shows the fault tolerance mode of the logical drive. |
SNMP agent | hp.server.hw.virtualdisk.layout[cpqDaLogDrvFaultTol.{#SNMPINDEX}] Preprocessing
|
Disk {#SNMPINDEX}({#DISK_NAME}): Disk size | Logical Drive Size. This is the size of the logical drive in megabytes. This value is calculated using the value 1,048,576 (2^20) as a megabyte. Drive manufacturers sometimes use the number 1,000,000 as a megabyte when giving drive capacities so this value may differ from the advertised size of a drive. |
SNMP agent | hp.server.hw.virtualdisk.size[cpqDaLogDrvSize.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Disk {#SNMPINDEX}({#DISK_NAME}): Virtual disk failed | Please check virtual disk for warnings or errors. |
last(/HPE ProLiant DL360 by SNMP/hp.server.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}])={$VDISK.STATUS.CRIT} |High |
||
Disk {#SNMPINDEX}({#DISK_NAME}): Virtual disk is not in OK state | Please check virtual disk for warnings or errors. |
last(/HPE ProLiant DL360 by SNMP/hp.server.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}])<>{$VDISK.STATUS.OK} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | CPQIDA-MIB::cpqNicIfPhysAdapterTable |
SNMP agent | net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#ADAPTERNAME} port {#ADAPTERINDEX}: Status | MIB: CPQNIC-MIB The physical adapter status. The following values are valid: unknown(1) The instrument agent was not able to determine the status of the adapter. The instrument agent may need to be upgraded. ok(2) The physical adapter is operating properly. generalFailure(3) The physical adapter has failed. linkFailure(4) The physical adapter has lost link. Check the cable connections to this adapter. |
SNMP agent | hp.server.net.if.status[cpqNicIfPhysAdapterStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#ADAPTERNAME} port {#ADAPTERINDEX}: Adapter has failed | Please check the physical adapter. |
last(/HPE ProLiant DL360 by SNMP/hp.server.net.if.status[cpqNicIfPhysAdapterStatus.{#SNMPINDEX}])=3 |High |
||
{#ADAPTERNAME} port {#ADAPTERINDEX}: Adapter has lost link | Please check the cable connections to this adapter. |
last(/HPE ProLiant DL360 by SNMP/hp.server.net.if.status[cpqNicIfPhysAdapterStatus.{#SNMPINDEX}])=4 |Average |
Depends on:
|
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring HPE ProLiant BL920 servers with HP iLO version 4 and later via Zabbix SNMP agent that works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Default |
---|---|---|
{$HEALTH.STATUS.CRIT} | The critical status of the health for trigger expression. |
4 |
{$HEALTH.STATUS.WARN} | The warning status of the health for trigger expression. |
3 |
{$PSU.STATUS.CRIT} | The critical value of the PSU sensor for trigger expression. |
4 |
{$PSU.STATUS.WARN} | The warning value of the PSU sensor for trigger expression. |
3 |
{$FAN.STATUS.CRIT} | The critical value of the FAN sensor for trigger expression. |
4 |
{$FAN.STATUS.WARN} | The warning value of the FAN sensor for trigger expression. |
3 |
{$DISK.ARRAY.STATUS.CRIT} | The critical status of the disk array for trigger expression. |
4 |
{$DISK.ARRAY.STATUS.WARN} | The warning status of the disk array for trigger expression. |
3 |
{$DISK.ARRAY.CACHE.STATUS.CRIT:"cacheModCriticalFailure"} | The critical status of the disk array cache for trigger expression. |
8 |
{$DISK.ARRAY.CACHE.STATUS.WARN:"invalid"} | The warning status of the disk array cache for trigger expression. |
2 |
{$DISK.ARRAY.CACHE.STATUS.WARN:"cacheModDegradedFailsafeSpeed"} | The warning status of the disk array cache for trigger expression. |
7 |
{$DISK.ARRAY.CACHE.STATUS.WARN:"cacheReadCacheNotMapped"} | The warning status of the disk array cache for trigger expression. |
9 |
{$DISK.ARRAY.CACHE.STATUS.WARN:"cacheModFlashMemNotAttached"} | The warning status of the disk array cache for trigger expression. |
6 |
{$DISK.ARRAY.CACHE.STATUS.OK:"enabled"} | The normal status of the disk array cache for trigger expression. |
3 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT:"failed"} | The critical status of the disk array cache battery for trigger expression. |
4 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT:"capacitorFailed"} | The critical status of the disk array cache battery for trigger expression. |
7 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.WARN:"degraded"} | The warning status of the disk array cache battery for trigger expression. |
5 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.WARN:"notPresent"} | The warning status of the disk array cache battery for trigger expression. |
6 |
{$VDISK.STATUS.CRIT} | The critical status of the virtual disk for trigger expression. |
3 |
{$VDISK.STATUS.OK} | The normal status of the virtual disk for trigger expression. |
2 |
{$DISK.STATUS.WARN} | The warning status of the disk for trigger expression. |
4 |
{$DISK.STATUS.FAIL} | The critical status of the disk for trigger expression. |
3 |
{$DISK.SMART.STATUS.FAIL:"replaceDrive"} | The critical S.M.A.R.T status of the disk for trigger expression. |
3 |
{$DISK.SMART.STATUS.FAIL:"replaceDriveSSDWearOut"} | The critical S.M.A.R.T status of the disk for trigger expression. |
4 |
{$SNMP.TIMEOUT} | The time interval for SNMP agent availability trigger expression. |
5m |
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE ProLiant BL920: Overall system health status | MIB: CPQHLTH-MIB The overall condition. This object represents the overall status of the server information represented by this MIB. |
SNMP agent | hp.server.status[cpqHeMibCondition] Preprocessing
|
HPE ProLiant BL920: Hardware model name | MIB: CPQSINFO-MIB The machine product name. The name of the machine used in this system. |
SNMP agent | hp.server.hw.model[cpqSiProductName] Preprocessing
|
HPE ProLiant BL920: Hardware serial number | MIB: CPQSINFO-MIB The serial number of the physical system unit. The string will be empty if the system does not report the serial number function. |
SNMP agent | hp.server.hw.serialnumber[cpqSiSysSerialNum] Preprocessing
|
HPE ProLiant BL920: System temperature status | MIB: CPQHLTH-MIB This value specifies the overall condition of the system's thermal environment. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. |
SNMP agent | hp.server.sensor.temp.status[cpqHeThermalCondition] Preprocessing
|
HPE ProLiant BL920: Uptime (network) | MIB: SNMPv2-MIB The time (in hundredths of a second) since the network management portion of the system was last re-initialized. |
SNMP agent | hp.server.net.uptime[sysUpTime] Preprocessing
|
HPE ProLiant BL920: Uptime (hardware) | MIB: HOST-RESOURCES-MIB The amount of time since this host was last initialized. Note that this is different from sysUpTime in the SNMPv2-MIB [RFC1907] because sysUpTime is the uptime of the network management portion of the system. |
SNMP agent | hp.server.hw.uptime[hrSystemUptime] Preprocessing
|
HPE ProLiant BL920: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other snmptrap items. |
SNMP trap | snmptrap.fallback |
HPE ProLiant BL920: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., `telephone closet, 3rd floor'). If the location is unknown, the value is the zero-length string. |
SNMP agent | hp.server.location[sysLocation] Preprocessing
|
HPE ProLiant BL920: System contact details | MIB: SNMPv2-MIB The textual identification of the contact person for this managed node, together with information on how to contact this person. If no contact information is known, the value is the zero-length string. |
SNMP agent | hp.server.contact[sysContact] Preprocessing
|
HPE ProLiant BL920: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the network management subsystem contained in the entity. This value is allocated within the SMI enterprises subtree (1.3.6.1.4.1) and provides an easy and unambiguous means for determining |
SNMP agent | hp.server.objectid[sysObjectID] Preprocessing
|
HPE ProLiant BL920: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node.By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is the zero-length string. |
SNMP agent | hp.server.name[sysName] Preprocessing
|
HPE ProLiant BL920: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating-system, and networking software. |
SNMP agent | hp.server.descr[sysDescr] Preprocessing
|
HPE ProLiant BL920: SNMP agent availability | Zabbix internal | zabbix[host,snmp,available] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HPE ProLiant BL920: System status is in critical state | Please check the device for errors. |
last(/HPE ProLiant BL920 by SNMP/hp.server.status[cpqHeMibCondition])={$HEALTH.STATUS.CRIT} |High |
||
HPE ProLiant BL920: System status is in warning state | Please check the device for warnings. |
last(/HPE ProLiant BL920 by SNMP/hp.server.status[cpqHeMibCondition])={$HEALTH.STATUS.WARN} |Warning |
Depends on:
|
|
HPE ProLiant BL920: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/HPE ProLiant BL920 by SNMP/hp.server.hw.serialnumber[cpqSiSysSerialNum],#1)<>last(/HPE ProLiant BL920 by SNMP/hp.server.hw.serialnumber[cpqSiSysSerialNum],#2) and length(last(/HPE ProLiant BL920 by SNMP/hp.server.hw.serialnumber[cpqSiSysSerialNum]))>0 |Info |
Manual close: Yes | |
HPE ProLiant BL920: Host has been restarted | Uptime is less than 10 minutes. |
(last(/HPE ProLiant BL920 by SNMP/hp.server.hw.uptime[hrSystemUptime])>0 and last(/HPE ProLiant BL920 by SNMP/hp.server.hw.uptime[hrSystemUptime])<10m) or (last(/HPE ProLiant BL920 by SNMP/hp.server.hw.uptime[hrSystemUptime])=0 and last(/HPE ProLiant BL920 by SNMP/hp.server.net.uptime[sysUpTime])<10m) |Warning |
Manual close: Yes | |
HPE ProLiant BL920: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/HPE ProLiant BL920 by SNMP/hp.server.name[sysName],#1)<>last(/HPE ProLiant BL920 by SNMP/hp.server.name[sysName],#2) and length(last(/HPE ProLiant BL920 by SNMP/hp.server.name[sysName]))>0 |Info |
Manual close: Yes | |
HPE ProLiant BL920: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/HPE ProLiant BL920 by SNMP/zabbix[host,snmp,available],{$SNMP.TIMEOUT})=0 |Warning |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable |
SNMP agent | temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: {#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.{#SNMPINDEX}] Preprocessing
|
{#SNMPINDEX}: Temperature sensor location | MIB: CPQHLTH-MIB This specifies the location of the temperature sensor present in the system. |
SNMP agent | hp.server.sensor.temp.locale[cpqHeTemperatureLocale.{#SNMPINDEX}] Preprocessing
|
{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 3 |Warning |
||
{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature ambient discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with ambient(11) and 0.1 index filter |
SNMP agent | temp.ambient.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Ambient: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: Ambient |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.Ambient.{#SNMPINDEX}] Preprocessing
|
Ambient: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Ambient: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
Ambient: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 3 |Warning |
||
Ambient: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature CPU discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with cpu(6) filter |
SNMP agent | temp.cpu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
CPU-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: CPU-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.CPU.{#SNMPINDEX}] Preprocessing
|
CPU-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
CPU-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
CPU-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 3 |Warning |
||
CPU-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature memory discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with memory(7) filter |
SNMP agent | temp.memory.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Memory-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: Memory-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.Memory.{#SNMPINDEX}] Preprocessing
|
Memory-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Memory-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
Memory-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 3 |Warning |
||
Memory-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature PSU discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with powerSupply(10) filter |
SNMP agent | temp.psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: PSU-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.PSU.{#SNMPINDEX}] Preprocessing
|
PSU-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
PSU-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
PSU-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 3 |Warning |
||
PSU-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature I/O discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with ioBoard(5) filter |
SNMP agent | temp.io.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
I/O-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: I/O-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius."I/O.{#SNMPINDEX}"] Preprocessing
|
I/O-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
I/O-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 1 |Info |
Manual close: Yes | |
I/O-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 3 |Warning |
||
I/O-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature system discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with system(3) filter |
SNMP agent | temp.system.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
System-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: System-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.System.{#SNMPINDEX}] Preprocessing
|
System-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
System-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
System-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 3 |Warning |
||
System-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | CPQHLTH-MIB::cpqHeFltTolPowerSupplyStatus |
SNMP agent | psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Chassis {#CHASSISNUM}, bay {#BAYNUM}: Power supply status | MIB: CPQHLTH-MIB The condition of the power supply. This value will be one of the following: other(1) The status could not be determined or not present. ok(2) The power supply is operating normally. degraded(3) A temperature sensor, fan or other power supply component is outside of normal operating range. failed(4) A power supply component detects a condition that could permanently damage the system. |
SNMP agent | hp.server.sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Chassis {#CHASSISNUM}, bay {#BAYNUM}: Power supply is in critical state | Please check the power supply unit for errors. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}])={$PSU.STATUS.CRIT} |Average |
||
Chassis {#CHASSISNUM}, bay {#BAYNUM}: Power supply is in warning state | Please check the power supply unit for errors. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}])={$PSU.STATUS.WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN discovery | CPQHLTH-MIB::cpqHeFltTolFanCondition |
SNMP agent | fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Fan {#SNMPINDEX}: Fan status | MIB: CPQHLTH-MIB The condition of the fan. This value will be one of the following: other(1) Fan status detection is not supported by this system or driver. ok(2) The fan is operating properly. degraded(2) A redundant fan is not operating properly. failed(4) A non-redundant fan is not operating properly. |
SNMP agent | hp.server.sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Fan {#SNMPINDEX}: Fan is in critical state | Please check the fan unit. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}])={$FAN.STATUS.CRIT} |Average |
||
Fan {#SNMPINDEX}: Fan is in warning state | Please check the fan unit. |
last(/HPE ProLiant BL920 by SNMP/hp.server.sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}])={$FAN.STATUS.WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Scanning table of Array controllers: CPQIDA-MIB::cpqDaCntlrTable |
SNMP agent | array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#CNTLR_LOCATION}: Disk array controller status | MIB: CPQIDA-MIB This value represents the overall condition of this controller, and any associated logical drives, physical drives, and array accelerators. |
SNMP agent | hp.server.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}] Preprocessing
|
{#CNTLR_LOCATION}: Disk array controller model | MIB: CPQIDA-MIB Array Controller Model. The type of controller card. |
SNMP agent | hp.server.hw.diskarray.model[cpqDaCntlrModel.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#CNTLR_LOCATION}: Disk array controller is in critical state | Please check the device for faults. |
last(/HPE ProLiant BL920 by SNMP/hp.server.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}])={$DISK.ARRAY.STATUS.CRIT} |High |
||
{#CNTLR_LOCATION}: Disk array controller is in warning state | Please check the device for faults. |
last(/HPE ProLiant BL920 by SNMP/hp.server.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}])={$DISK.ARRAY.STATUS.WARN} |Average |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller cache discovery | Scanning table of Array controllers: CPQIDA-MIB::cpqDaAccelTable |
SNMP agent | array.cache.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
#{#CACHECNTRLINDEX}: Disk array cache controller status | MIB: CPQIDA-MIB Cache Module/Operations Status. This describes the status of the cache module and/or cache operations. Note that for some controller models, a cache module board that physically attaches to the controller or chipset may not be an available option. The status can be: Other (1) Indicates that the instrument agent does not recognize the status of the cache module. You may need to upgrade the instrument agent. Invalid (2) Indicates that a cache module board has not been installed in this system or is present but not configured. Enabled (3) Indicates that cache operations are currently configured and enabled for at least one logical drive. Temporarily Disabled (4) Indicates that cache operations have been temporarily disabled. View the cache module board error code object to determine why the write cache operations have been temporarily disabled. Permanently Disabled (5) Indicates that cache operations have been permanently disabled. View the cache module board error code object to determine why the write cache operations have been disabled. Cache Module Flash Memory Not Attached (6) Indicates that the flash memory component of the flash backed cache module is not attached. This status will be set when the flash memory is not attached and the Supercap is attached. This value is only used on flash backed cache modules that support removable flash memory. Cache Module Degraded Failsafe Speed (7) Indicates that the cache module board is currently degraded and operating at a failsafe speed. View variables cpqDaCacheMemoryDataWidth and cpqDaCacheMemoryTransferRate to obtain the cache module board`s current memory data width and memory transfer rate. Cache Module Critical Failure (8) Indicates that the cache module board has encountered a critical failure. The controller is currently operating in Zero Memory Raid mode. Read Cache Could Not Be Mapped (9) Indicates that the read cache memory in a split cache configuration could not be mapped by the operating system and as a result is not available. This status may be caused by virtual space limitations in certain operating systems and is only applicable to B-Series controllers. |
SNMP agent | hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}] Preprocessing
|
#{#CACHECNTRLINDEX}: Disk array cache controller battery status | MIB: CPQIDA-MIB Cache Module Board Backup Power Status. This monitors the status of each backup power source on the board. The backup power source can only recharge when the system has power applied. The type of backup power source used is indicated by cpqDaAccelBackupPowerSource. The following values are valid: Other (1) Indicates that the instrument agent does not recognize backup power status. You may need to update your software. Ok (2) The backup power source is fully charged. Recharging (3) The array controller has one or more cache module backup power sources that are recharging. Cache module operations such as Battery/Flash Backed Write Cache, Expansion, Extension and Migration are temporarily suspended until the backup power source is fully charged. Cache module operations will automatically resume when charging is complete. Failed (4) The battery pack is below the sufficient voltage level and has not recharged in 36 hours. Your Cache Module board needs to be serviced. Degraded (5) The battery is still operating, however, one of the batteries in the pack has failed to recharge properly. Your Cache Module board should be serviced as soon as possible. NotPresent (6) A backup power source is not present on the cache module board. Some controllers do not have backup power sources. Capacitor Failed (7) The flash backed cache module capacitor is below the sufficient voltage level and has not recharged in 10 minutes. Your Cache Module board needs to be serviced. |
SNMP agent | hp.server.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
#{#CACHECNTRLINDEX}: Disk array cache controller is in critical state! | Please check the device for faults. |
last(/HPE ProLiant BL920 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.STATUS.CRIT:"cacheModCriticalFailure"} |Average |
||
#{#CACHECNTRLINDEX}: Disk array cache controller is in warning state | Please check the device for faults. |
last(/HPE ProLiant BL920 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.STATUS.WARN:"cacheModDegradedFailsafeSpeed"} or last(/HPE ProLiant BL920 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.STATUS.WARN:"cacheReadCacheNotMapped"} or last(/HPE ProLiant BL920 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.STATUS.WARN:"cacheModFlashMemNotAttached"} |Warning |
Depends on:
|
|
#{#CACHECNTRLINDEX}: Disk array cache controller is not in optimal state | Please check the device for faults. |
last(/HPE ProLiant BL920 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])<>{$DISK.ARRAY.CACHE.STATUS.OK:"enabled"} and last(/HPE ProLiant BL920 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])<>{$DISK.ARRAY.CACHE.STATUS.WARN:"invalid"} |Warning |
Depends on:
|
|
#{#CACHECNTRLINDEX}: Disk array cache controller battery is in critical state | Please check the device for faults. |
last(/HPE ProLiant BL920 by SNMP/hp.server.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT:"failed"} or last(/HPE ProLiant BL920 by SNMP/hp.server.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT:"capacitorFailed"} |Average |
||
#{#CACHECNTRLINDEX}: Disk array cache controller battery is in warning state | Please check the device for faults. |
last(/HPE ProLiant BL920 by SNMP/hp.server.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.BATTERY.STATUS.WARN:"degraded"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Scanning table of physical drive entries CPQIDA-MIB::cpqDaPhyDrvTable. |
SNMP agent | physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#DISK_LOCATION}: Physical disk status | MIB: CPQIDA-MIB Physical Drive Status. This shows the status of the physical drive. The following values are valid for the physical drive status: other (1) Indicates that the instrument agent does not recognize the drive. You may need to upgrade your instrument agent and/or driver software. ok (2) Indicates the drive is functioning properly. failed (3) Indicates that the drive is no longer operating and should be replaced. predictiveFailure(4) Indicates that the drive has a predictive failure error and should be replaced. |
SNMP agent | hp.server.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk S.M.A.R.T. status | MIB: CPQIDA-MIB Physical Drive S.M.A.R.T Status. The following values are defined: other(1) The agent is unable to determine if the status of S.M.A.R.T predictive failure monitoring for this drive. ok(2) Indicates the drive is functioning properly. replaceDrive(3) Indicates that the drive has a S.M.A.R.T predictive failure error and should be replaced. |
SNMP agent | hp.server.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk serial number | MIB: CPQIDA-MIB Physical Drive Serial Number. This is the serial number assigned to the physical drive. This value is based upon the serial number as returned by the SCSI inquiry command but may have been modified due to space limitations. This can be used for identification purposes. |
SNMP agent | hp.server.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk model name | MIB: CPQIDA-MIB Physical Drive Model. This is a text description of the physical drive. The text that appears depends upon who manufactured the drive and the drive type. If a drive fails, note the model to identify the type of drive necessary for replacement. If a model number is not present, you may not have properly initialized the drive array to which the physical drive is attached for monitoring. |
SNMP agent | hp.server.hw.physicaldisk.model[cpqDaPhyDrvModel.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk media type | MIB: CPQIDA-MIB Drive Array Physical Drive Media Type. The following values are defined: other(1) The instrument agent is unable to determine the physical drive's media type. rotatingPlatters(2) The physical drive media is composed of rotating platters. solidState(3) The physical drive media is composed of solid state electronics. |
SNMP agent | hp.server.hw.physicaldisk.media_type[cpqDaPhyDrvMediaType.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Disk size | MIB: CPQIDA-MIB Physical Drive Size in MB. This is the size of the physical drive in megabytes. This value is calculated using the value 1,048,576 (2^20) as a megabyte. Drive manufacturers sometimes use the number 1,000,000 as a megabyte when giving drive capacities so this value may differ from the advertised size of a drive. This field is only applicable for controllers which support SCSI drives, and therefore is not supported by the IDA or IDA-2 controllers. The field will contain 0xFFFFFFFF if the drive capacity cannot be calculated or if the controller does not support SCSI drives. |
SNMP agent | hp.server.hw.physicaldisk.size[cpqDaPhyDrvMediaType.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#DISK_LOCATION}: Physical disk failed | Please check physical disk for warnings or errors. |
last(/HPE ProLiant BL920 by SNMP/hp.server.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}])={$DISK.STATUS.FAIL} |High |
||
{#DISK_LOCATION}: Physical disk is in warning state | Please check physical disk for warnings or errors. |
last(/HPE ProLiant BL920 by SNMP/hp.server.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}])={$DISK.STATUS.WARN} |Warning |
Depends on:
|
|
{#DISK_LOCATION}: Physical disk S.M.A.R.T. failed | Disk probably requires replacement. |
last(/HPE ProLiant BL920 by SNMP/hp.server.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}])={$DISK.SMART.STATUS.FAIL:"replaceDrive"} or last(/HPE ProLiant BL920 by SNMP/hp.server.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}])={$DISK.SMART.STATUS.FAIL:"replaceDriveSSDWearOut"} |High |
Depends on:
|
|
{#DISK_LOCATION}: Disk has been replaced | Disk serial number has changed. Acknowledge to close the problem manually. |
last(/HPE ProLiant BL920 by SNMP/hp.server.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}],#1)<>last(/HPE ProLiant BL920 by SNMP/hp.server.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}],#2) and length(last(/HPE ProLiant BL920 by SNMP/hp.server.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | CPQIDA-MIB::cpqDaLogDrvTable |
SNMP agent | virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Disk {#SNMPINDEX}({#DISK_NAME}): Status | Logical Drive Status. |
SNMP agent | hp.server.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}] Preprocessing
|
Disk {#SNMPINDEX}({#DISK_NAME}): Layout type | Logical Drive Fault Tolerance. This shows the fault tolerance mode of the logical drive. |
SNMP agent | hp.server.hw.virtualdisk.layout[cpqDaLogDrvFaultTol.{#SNMPINDEX}] Preprocessing
|
Disk {#SNMPINDEX}({#DISK_NAME}): Disk size | Logical Drive Size. This is the size of the logical drive in megabytes. This value is calculated using the value 1,048,576 (2^20) as a megabyte. Drive manufacturers sometimes use the number 1,000,000 as a megabyte when giving drive capacities so this value may differ from the advertised size of a drive. |
SNMP agent | hp.server.hw.virtualdisk.size[cpqDaLogDrvSize.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Disk {#SNMPINDEX}({#DISK_NAME}): Virtual disk failed | Please check virtual disk for warnings or errors. |
last(/HPE ProLiant BL920 by SNMP/hp.server.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}])={$VDISK.STATUS.CRIT} |High |
||
Disk {#SNMPINDEX}({#DISK_NAME}): Virtual disk is not in OK state | Please check virtual disk for warnings or errors. |
last(/HPE ProLiant BL920 by SNMP/hp.server.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}])<>{$VDISK.STATUS.OK} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | CPQIDA-MIB::cpqNicIfPhysAdapterTable |
SNMP agent | net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#ADAPTERNAME} port {#ADAPTERINDEX}: Status | MIB: CPQNIC-MIB The physical adapter status. The following values are valid: unknown(1) The instrument agent was not able to determine the status of the adapter. The instrument agent may need to be upgraded. ok(2) The physical adapter is operating properly. generalFailure(3) The physical adapter has failed. linkFailure(4) The physical adapter has lost link. Check the cable connections to this adapter. |
SNMP agent | hp.server.net.if.status[cpqNicIfPhysAdapterStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#ADAPTERNAME} port {#ADAPTERINDEX}: Adapter has failed | Please check the physical adapter. |
last(/HPE ProLiant BL920 by SNMP/hp.server.net.if.status[cpqNicIfPhysAdapterStatus.{#SNMPINDEX}])=3 |High |
||
{#ADAPTERNAME} port {#ADAPTERINDEX}: Adapter has lost link | Please check the cable connections to this adapter. |
last(/HPE ProLiant BL920 by SNMP/hp.server.net.if.status[cpqNicIfPhysAdapterStatus.{#SNMPINDEX}])=4 |Average |
Depends on:
|
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring HPE ProLiant BL460 servers with HP iLO version 4 and later via Zabbix SNMP agent that works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Default |
---|---|---|
{$HEALTH.STATUS.CRIT} | The critical status of the health for trigger expression. |
4 |
{$HEALTH.STATUS.WARN} | The warning status of the health for trigger expression. |
3 |
{$PSU.STATUS.CRIT} | The critical value of the PSU sensor for trigger expression. |
4 |
{$PSU.STATUS.WARN} | The warning value of the PSU sensor for trigger expression. |
3 |
{$FAN.STATUS.CRIT} | The critical value of the FAN sensor for trigger expression. |
4 |
{$FAN.STATUS.WARN} | The warning value of the FAN sensor for trigger expression. |
3 |
{$DISK.ARRAY.STATUS.CRIT} | The critical status of the disk array for trigger expression. |
4 |
{$DISK.ARRAY.STATUS.WARN} | The warning status of the disk array for trigger expression. |
3 |
{$DISK.ARRAY.CACHE.STATUS.CRIT:"cacheModCriticalFailure"} | The critical status of the disk array cache for trigger expression. |
8 |
{$DISK.ARRAY.CACHE.STATUS.WARN:"invalid"} | The warning status of the disk array cache for trigger expression. |
2 |
{$DISK.ARRAY.CACHE.STATUS.WARN:"cacheModDegradedFailsafeSpeed"} | The warning status of the disk array cache for trigger expression. |
7 |
{$DISK.ARRAY.CACHE.STATUS.WARN:"cacheReadCacheNotMapped"} | The warning status of the disk array cache for trigger expression. |
9 |
{$DISK.ARRAY.CACHE.STATUS.WARN:"cacheModFlashMemNotAttached"} | The warning status of the disk array cache for trigger expression. |
6 |
{$DISK.ARRAY.CACHE.STATUS.OK:"enabled"} | The normal status of the disk array cache for trigger expression. |
3 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT:"failed"} | The critical status of the disk array cache battery for trigger expression. |
4 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT:"capacitorFailed"} | The critical status of the disk array cache battery for trigger expression. |
7 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.WARN:"degraded"} | The warning status of the disk array cache battery for trigger expression. |
5 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.WARN:"notPresent"} | The warning status of the disk array cache battery for trigger expression. |
6 |
{$VDISK.STATUS.CRIT} | The critical status of the virtual disk for trigger expression. |
3 |
{$VDISK.STATUS.OK} | The normal status of the virtual disk for trigger expression. |
2 |
{$DISK.STATUS.WARN} | The warning status of the disk for trigger expression. |
4 |
{$DISK.STATUS.FAIL} | The critical status of the disk for trigger expression. |
3 |
{$DISK.SMART.STATUS.FAIL:"replaceDrive"} | The critical S.M.A.R.T status of the disk for trigger expression. |
3 |
{$DISK.SMART.STATUS.FAIL:"replaceDriveSSDWearOut"} | The critical S.M.A.R.T status of the disk for trigger expression. |
4 |
{$SNMP.TIMEOUT} | The time interval for SNMP agent availability trigger expression. |
5m |
Name | Description | Type | Key and additional info |
---|---|---|---|
HPE ProLiant BL460: Overall system health status | MIB: CPQHLTH-MIB The overall condition. This object represents the overall status of the server information represented by this MIB. |
SNMP agent | hp.server.status[cpqHeMibCondition] Preprocessing
|
HPE ProLiant BL460: Hardware model name | MIB: CPQSINFO-MIB The machine product name. The name of the machine used in this system. |
SNMP agent | hp.server.hw.model[cpqSiProductName] Preprocessing
|
HPE ProLiant BL460: Hardware serial number | MIB: CPQSINFO-MIB The serial number of the physical system unit. The string will be empty if the system does not report the serial number function. |
SNMP agent | hp.server.hw.serialnumber[cpqSiSysSerialNum] Preprocessing
|
HPE ProLiant BL460: System temperature status | MIB: CPQHLTH-MIB This value specifies the overall condition of the system's thermal environment. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. |
SNMP agent | hp.server.sensor.temp.status[cpqHeThermalCondition] Preprocessing
|
HPE ProLiant BL460: Uptime (network) | MIB: SNMPv2-MIB The time (in hundredths of a second) since the network management portion of the system was last re-initialized. |
SNMP agent | hp.server.net.uptime[sysUpTime] Preprocessing
|
HPE ProLiant BL460: Uptime (hardware) | MIB: HOST-RESOURCES-MIB The amount of time since this host was last initialized. Note that this is different from sysUpTime in the SNMPv2-MIB [RFC1907] because sysUpTime is the uptime of the network management portion of the system. |
SNMP agent | hp.server.hw.uptime[hrSystemUptime] Preprocessing
|
HPE ProLiant BL460: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other snmptrap items. |
SNMP trap | snmptrap.fallback |
HPE ProLiant BL460: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., `telephone closet, 3rd floor'). If the location is unknown, the value is the zero-length string. |
SNMP agent | hp.server.location[sysLocation] Preprocessing
|
HPE ProLiant BL460: System contact details | MIB: SNMPv2-MIB The textual identification of the contact person for this managed node, together with information on how to contact this person. If no contact information is known, the value is the zero-length string. |
SNMP agent | hp.server.contact[sysContact] Preprocessing
|
HPE ProLiant BL460: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the network management subsystem contained in the entity. This value is allocated within the SMI enterprises subtree (1.3.6.1.4.1) and provides an easy and unambiguous means for determining |
SNMP agent | hp.server.objectid[sysObjectID] Preprocessing
|
HPE ProLiant BL460: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node.By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is the zero-length string. |
SNMP agent | hp.server.name[sysName] Preprocessing
|
HPE ProLiant BL460: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating-system, and networking software. |
SNMP agent | hp.server.descr[sysDescr] Preprocessing
|
HPE ProLiant BL460: SNMP agent availability | Zabbix internal | zabbix[host,snmp,available] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
HPE ProLiant BL460: System status is in critical state | Please check the device for errors. |
last(/HPE ProLiant BL460 by SNMP/hp.server.status[cpqHeMibCondition])={$HEALTH.STATUS.CRIT} |High |
||
HPE ProLiant BL460: System status is in warning state | Please check the device for warnings. |
last(/HPE ProLiant BL460 by SNMP/hp.server.status[cpqHeMibCondition])={$HEALTH.STATUS.WARN} |Warning |
Depends on:
|
|
HPE ProLiant BL460: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/HPE ProLiant BL460 by SNMP/hp.server.hw.serialnumber[cpqSiSysSerialNum],#1)<>last(/HPE ProLiant BL460 by SNMP/hp.server.hw.serialnumber[cpqSiSysSerialNum],#2) and length(last(/HPE ProLiant BL460 by SNMP/hp.server.hw.serialnumber[cpqSiSysSerialNum]))>0 |Info |
Manual close: Yes | |
HPE ProLiant BL460: Host has been restarted | Uptime is less than 10 minutes. |
(last(/HPE ProLiant BL460 by SNMP/hp.server.hw.uptime[hrSystemUptime])>0 and last(/HPE ProLiant BL460 by SNMP/hp.server.hw.uptime[hrSystemUptime])<10m) or (last(/HPE ProLiant BL460 by SNMP/hp.server.hw.uptime[hrSystemUptime])=0 and last(/HPE ProLiant BL460 by SNMP/hp.server.net.uptime[sysUpTime])<10m) |Warning |
Manual close: Yes | |
HPE ProLiant BL460: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/HPE ProLiant BL460 by SNMP/hp.server.name[sysName],#1)<>last(/HPE ProLiant BL460 by SNMP/hp.server.name[sysName],#2) and length(last(/HPE ProLiant BL460 by SNMP/hp.server.name[sysName]))>0 |Info |
Manual close: Yes | |
HPE ProLiant BL460: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/HPE ProLiant BL460 by SNMP/zabbix[host,snmp,available],{$SNMP.TIMEOUT})=0 |Warning |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable |
SNMP agent | temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: {#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.{#SNMPINDEX}] Preprocessing
|
{#SNMPINDEX}: Temperature sensor location | MIB: CPQHLTH-MIB This specifies the location of the temperature sensor present in the system. |
SNMP agent | hp.server.sensor.temp.locale[cpqHeTemperatureLocale.{#SNMPINDEX}] Preprocessing
|
{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 3 |Warning |
||
{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature ambient discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with ambient(11) and 0.1 index filter |
SNMP agent | temp.ambient.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Ambient: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: Ambient |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.Ambient.{#SNMPINDEX}] Preprocessing
|
Ambient: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Ambient: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
Ambient: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 3 |Warning |
||
Ambient: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Ambient.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature CPU discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with cpu(6) filter |
SNMP agent | temp.cpu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
CPU-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: CPU-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.CPU.{#SNMPINDEX}] Preprocessing
|
CPU-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
CPU-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
CPU-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 3 |Warning |
||
CPU-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.CPU.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature memory discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with memory(7) filter |
SNMP agent | temp.memory.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Memory-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: Memory-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.Memory.{#SNMPINDEX}] Preprocessing
|
Memory-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Memory-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
Memory-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 3 |Warning |
||
Memory-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.Memory.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature PSU discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with powerSupply(10) filter |
SNMP agent | temp.psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: PSU-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.PSU.{#SNMPINDEX}] Preprocessing
|
PSU-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
PSU-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
PSU-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 3 |Warning |
||
PSU-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.PSU.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature I/O discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with ioBoard(5) filter |
SNMP agent | temp.io.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
I/O-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: I/O-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius."I/O.{#SNMPINDEX}"] Preprocessing
|
I/O-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
I/O-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 1 |Info |
Manual close: Yes | |
I/O-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 3 |Warning |
||
I/O-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition."I/O.{#SNMPINDEX}"]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature system discovery | Scanning table of Temperature Sensor Entries: CPQHLTH-MIB::cpqHeTemperatureTable with system(3) filter |
SNMP agent | temp.system.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
System-{#SNMPINDEX}: Temperature | MIB: CPQHLTH-MIB Temperature readings of testpoint: System-{#SNMPINDEX} |
SNMP agent | hp.server.sensor.temp.value[cpqHeTemperatureCelsius.System.{#SNMPINDEX}] Preprocessing
|
System-{#SNMPINDEX}: Temperature sensor condition | MIB: CPQHLTH-MIB The Temperature sensor condition. This value will be one of the following: other(1) Temperature could not be determined. ok(2) The temperature sensor is within normal operating range. degraded(3) The temperature sensor is outside of normal operating range. failed(4) The temperature sensor detects a condition that could permanently damage the system. The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
SNMP agent | hp.server.sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
System-{#SNMPINDEX}: Temperature could not be determined | Temperature could not be determined. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 1 |Info |
Manual close: Yes | |
System-{#SNMPINDEX}: The temperature sensor is outside of normal operating range | If the cpqHeThermalDegradedAction is set to shutdown(3) the system will be shutdown if the degraded(3) condition occurs. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 3 |Warning |
||
System-{#SNMPINDEX}: The temperature sensor detects a condition that could permanently damage the system. | The system will automatically shutdown if the failed(4) condition results, so it is unlikely that this value will ever be returned by the agent. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.temp.condition[cpqHeTemperatureCondition.System.{#SNMPINDEX}]) = 4 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | CPQHLTH-MIB::cpqHeFltTolPowerSupplyStatus |
SNMP agent | psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Chassis {#CHASSISNUM}, bay {#BAYNUM}: Power supply status | MIB: CPQHLTH-MIB The condition of the power supply. This value will be one of the following: other(1) The status could not be determined or not present. ok(2) The power supply is operating normally. degraded(3) A temperature sensor, fan or other power supply component is outside of normal operating range. failed(4) A power supply component detects a condition that could permanently damage the system. |
SNMP agent | hp.server.sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Chassis {#CHASSISNUM}, bay {#BAYNUM}: Power supply is in critical state | Please check the power supply unit for errors. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}])={$PSU.STATUS.CRIT} |Average |
||
Chassis {#CHASSISNUM}, bay {#BAYNUM}: Power supply is in warning state | Please check the power supply unit for errors. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.psu.status[cpqHeFltTolPowerSupplyCondition.{#SNMPINDEX}])={$PSU.STATUS.WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN discovery | CPQHLTH-MIB::cpqHeFltTolFanCondition |
SNMP agent | fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Fan {#SNMPINDEX}: Fan status | MIB: CPQHLTH-MIB The condition of the fan. This value will be one of the following: other(1) Fan status detection is not supported by this system or driver. ok(2) The fan is operating properly. degraded(2) A redundant fan is not operating properly. failed(4) A non-redundant fan is not operating properly. |
SNMP agent | hp.server.sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Fan {#SNMPINDEX}: Fan is in critical state | Please check the fan unit. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}])={$FAN.STATUS.CRIT} |Average |
||
Fan {#SNMPINDEX}: Fan is in warning state | Please check the fan unit. |
last(/HPE ProLiant BL460 by SNMP/hp.server.sensor.fan.status[cpqHeFltTolFanCondition.{#SNMPINDEX}])={$FAN.STATUS.WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Scanning table of Array controllers: CPQIDA-MIB::cpqDaCntlrTable |
SNMP agent | array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#CNTLR_LOCATION}: Disk array controller status | MIB: CPQIDA-MIB This value represents the overall condition of this controller, and any associated logical drives, physical drives, and array accelerators. |
SNMP agent | hp.server.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}] Preprocessing
|
{#CNTLR_LOCATION}: Disk array controller model | MIB: CPQIDA-MIB Array Controller Model. The type of controller card. |
SNMP agent | hp.server.hw.diskarray.model[cpqDaCntlrModel.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#CNTLR_LOCATION}: Disk array controller is in critical state | Please check the device for faults. |
last(/HPE ProLiant BL460 by SNMP/hp.server.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}])={$DISK.ARRAY.STATUS.CRIT} |High |
||
{#CNTLR_LOCATION}: Disk array controller is in warning state | Please check the device for faults. |
last(/HPE ProLiant BL460 by SNMP/hp.server.hw.diskarray.status[cpqDaCntlrCondition.{#SNMPINDEX}])={$DISK.ARRAY.STATUS.WARN} |Average |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller cache discovery | Scanning table of Array controllers: CPQIDA-MIB::cpqDaAccelTable |
SNMP agent | array.cache.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
#{#CACHECNTRLINDEX}: Disk array cache controller status | MIB: CPQIDA-MIB Cache Module/Operations Status. This describes the status of the cache module and/or cache operations. Note that for some controller models, a cache module board that physically attaches to the controller or chipset may not be an available option. The status can be: Other (1) Indicates that the instrument agent does not recognize the status of the cache module. You may need to upgrade the instrument agent. Invalid (2) Indicates that a cache module board has not been installed in this system or is present but not configured. Enabled (3) Indicates that cache operations are currently configured and enabled for at least one logical drive. Temporarily Disabled (4) Indicates that cache operations have been temporarily disabled. View the cache module board error code object to determine why the write cache operations have been temporarily disabled. Permanently Disabled (5) Indicates that cache operations have been permanently disabled. View the cache module board error code object to determine why the write cache operations have been disabled. Cache Module Flash Memory Not Attached (6) Indicates that the flash memory component of the flash backed cache module is not attached. This status will be set when the flash memory is not attached and the Supercap is attached. This value is only used on flash backed cache modules that support removable flash memory. Cache Module Degraded Failsafe Speed (7) Indicates that the cache module board is currently degraded and operating at a failsafe speed. View variables cpqDaCacheMemoryDataWidth and cpqDaCacheMemoryTransferRate to obtain the cache module board`s current memory data width and memory transfer rate. Cache Module Critical Failure (8) Indicates that the cache module board has encountered a critical failure. The controller is currently operating in Zero Memory Raid mode. Read Cache Could Not Be Mapped (9) Indicates that the read cache memory in a split cache configuration could not be mapped by the operating system and as a result is not available. This status may be caused by virtual space limitations in certain operating systems and is only applicable to B-Series controllers. |
SNMP agent | hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}] Preprocessing
|
#{#CACHECNTRLINDEX}: Disk array cache controller battery status | MIB: CPQIDA-MIB Cache Module Board Backup Power Status. This monitors the status of each backup power source on the board. The backup power source can only recharge when the system has power applied. The type of backup power source used is indicated by cpqDaAccelBackupPowerSource. The following values are valid: Other (1) Indicates that the instrument agent does not recognize backup power status. You may need to update your software. Ok (2) The backup power source is fully charged. Recharging (3) The array controller has one or more cache module backup power sources that are recharging. Cache module operations such as Battery/Flash Backed Write Cache, Expansion, Extension and Migration are temporarily suspended until the backup power source is fully charged. Cache module operations will automatically resume when charging is complete. Failed (4) The battery pack is below the sufficient voltage level and has not recharged in 36 hours. Your Cache Module board needs to be serviced. Degraded (5) The battery is still operating, however, one of the batteries in the pack has failed to recharge properly. Your Cache Module board should be serviced as soon as possible. NotPresent (6) A backup power source is not present on the cache module board. Some controllers do not have backup power sources. Capacitor Failed (7) The flash backed cache module capacitor is below the sufficient voltage level and has not recharged in 10 minutes. Your Cache Module board needs to be serviced. |
SNMP agent | hp.server.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
#{#CACHECNTRLINDEX}: Disk array cache controller is in critical state! | Please check the device for faults. |
last(/HPE ProLiant BL460 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.STATUS.CRIT:"cacheModCriticalFailure"} |Average |
||
#{#CACHECNTRLINDEX}: Disk array cache controller is in warning state | Please check the device for faults. |
last(/HPE ProLiant BL460 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.STATUS.WARN:"cacheModDegradedFailsafeSpeed"} or last(/HPE ProLiant BL460 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.STATUS.WARN:"cacheReadCacheNotMapped"} or last(/HPE ProLiant BL460 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.STATUS.WARN:"cacheModFlashMemNotAttached"} |Warning |
Depends on:
|
|
#{#CACHECNTRLINDEX}: Disk array cache controller is not in optimal state | Please check the device for faults. |
last(/HPE ProLiant BL460 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])<>{$DISK.ARRAY.CACHE.STATUS.OK:"enabled"} and last(/HPE ProLiant BL460 by SNMP/hp.server.hw.diskarray.cache.status[cpqDaAccelStatus.{#SNMPINDEX}])<>{$DISK.ARRAY.CACHE.STATUS.WARN:"invalid"} |Warning |
Depends on:
|
|
#{#CACHECNTRLINDEX}: Disk array cache controller battery is in critical state | Please check the device for faults. |
last(/HPE ProLiant BL460 by SNMP/hp.server.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT:"failed"} or last(/HPE ProLiant BL460 by SNMP/hp.server.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT:"capacitorFailed"} |Average |
||
#{#CACHECNTRLINDEX}: Disk array cache controller battery is in warning state | Please check the device for faults. |
last(/HPE ProLiant BL460 by SNMP/hp.server.hw.diskarray.cache.battery.status[cpqDaAccelBattery.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.BATTERY.STATUS.WARN:"degraded"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Scanning table of physical drive entries CPQIDA-MIB::cpqDaPhyDrvTable. |
SNMP agent | physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#DISK_LOCATION}: Physical disk status | MIB: CPQIDA-MIB Physical Drive Status. This shows the status of the physical drive. The following values are valid for the physical drive status: other (1) Indicates that the instrument agent does not recognize the drive. You may need to upgrade your instrument agent and/or driver software. ok (2) Indicates the drive is functioning properly. failed (3) Indicates that the drive is no longer operating and should be replaced. predictiveFailure(4) Indicates that the drive has a predictive failure error and should be replaced. |
SNMP agent | hp.server.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk S.M.A.R.T. status | MIB: CPQIDA-MIB Physical Drive S.M.A.R.T Status. The following values are defined: other(1) The agent is unable to determine if the status of S.M.A.R.T predictive failure monitoring for this drive. ok(2) Indicates the drive is functioning properly. replaceDrive(3) Indicates that the drive has a S.M.A.R.T predictive failure error and should be replaced. |
SNMP agent | hp.server.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk serial number | MIB: CPQIDA-MIB Physical Drive Serial Number. This is the serial number assigned to the physical drive. This value is based upon the serial number as returned by the SCSI inquiry command but may have been modified due to space limitations. This can be used for identification purposes. |
SNMP agent | hp.server.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk model name | MIB: CPQIDA-MIB Physical Drive Model. This is a text description of the physical drive. The text that appears depends upon who manufactured the drive and the drive type. If a drive fails, note the model to identify the type of drive necessary for replacement. If a model number is not present, you may not have properly initialized the drive array to which the physical drive is attached for monitoring. |
SNMP agent | hp.server.hw.physicaldisk.model[cpqDaPhyDrvModel.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk media type | MIB: CPQIDA-MIB Drive Array Physical Drive Media Type. The following values are defined: other(1) The instrument agent is unable to determine the physical drive's media type. rotatingPlatters(2) The physical drive media is composed of rotating platters. solidState(3) The physical drive media is composed of solid state electronics. |
SNMP agent | hp.server.hw.physicaldisk.media_type[cpqDaPhyDrvMediaType.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Disk size | MIB: CPQIDA-MIB Physical Drive Size in MB. This is the size of the physical drive in megabytes. This value is calculated using the value 1,048,576 (2^20) as a megabyte. Drive manufacturers sometimes use the number 1,000,000 as a megabyte when giving drive capacities so this value may differ from the advertised size of a drive. This field is only applicable for controllers which support SCSI drives, and therefore is not supported by the IDA or IDA-2 controllers. The field will contain 0xFFFFFFFF if the drive capacity cannot be calculated or if the controller does not support SCSI drives. |
SNMP agent | hp.server.hw.physicaldisk.size[cpqDaPhyDrvMediaType.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#DISK_LOCATION}: Physical disk failed | Please check physical disk for warnings or errors. |
last(/HPE ProLiant BL460 by SNMP/hp.server.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}])={$DISK.STATUS.FAIL} |High |
||
{#DISK_LOCATION}: Physical disk is in warning state | Please check physical disk for warnings or errors. |
last(/HPE ProLiant BL460 by SNMP/hp.server.hw.physicaldisk.status[cpqDaPhyDrvStatus.{#SNMPINDEX}])={$DISK.STATUS.WARN} |Warning |
Depends on:
|
|
{#DISK_LOCATION}: Physical disk S.M.A.R.T. failed | Disk probably requires replacement. |
last(/HPE ProLiant BL460 by SNMP/hp.server.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}])={$DISK.SMART.STATUS.FAIL:"replaceDrive"} or last(/HPE ProLiant BL460 by SNMP/hp.server.hw.physicaldisk.smart_status[cpqDaPhyDrvSmartStatus.{#SNMPINDEX}])={$DISK.SMART.STATUS.FAIL:"replaceDriveSSDWearOut"} |High |
Depends on:
|
|
{#DISK_LOCATION}: Disk has been replaced | Disk serial number has changed. Acknowledge to close the problem manually. |
last(/HPE ProLiant BL460 by SNMP/hp.server.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}],#1)<>last(/HPE ProLiant BL460 by SNMP/hp.server.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}],#2) and length(last(/HPE ProLiant BL460 by SNMP/hp.server.hw.physicaldisk.serialnumber[cpqDaPhyDrvSerialNum.{#SNMPINDEX}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | CPQIDA-MIB::cpqDaLogDrvTable |
SNMP agent | virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Disk {#SNMPINDEX}({#DISK_NAME}): Status | Logical Drive Status. |
SNMP agent | hp.server.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}] Preprocessing
|
Disk {#SNMPINDEX}({#DISK_NAME}): Layout type | Logical Drive Fault Tolerance. This shows the fault tolerance mode of the logical drive. |
SNMP agent | hp.server.hw.virtualdisk.layout[cpqDaLogDrvFaultTol.{#SNMPINDEX}] Preprocessing
|
Disk {#SNMPINDEX}({#DISK_NAME}): Disk size | Logical Drive Size. This is the size of the logical drive in megabytes. This value is calculated using the value 1,048,576 (2^20) as a megabyte. Drive manufacturers sometimes use the number 1,000,000 as a megabyte when giving drive capacities so this value may differ from the advertised size of a drive. |
SNMP agent | hp.server.hw.virtualdisk.size[cpqDaLogDrvSize.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Disk {#SNMPINDEX}({#DISK_NAME}): Virtual disk failed | Please check virtual disk for warnings or errors. |
last(/HPE ProLiant BL460 by SNMP/hp.server.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}])={$VDISK.STATUS.CRIT} |High |
||
Disk {#SNMPINDEX}({#DISK_NAME}): Virtual disk is not in OK state | Please check virtual disk for warnings or errors. |
last(/HPE ProLiant BL460 by SNMP/hp.server.hw.virtualdisk.status[cpqDaLogDrvStatus.{#SNMPINDEX}])<>{$VDISK.STATUS.OK} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | CPQIDA-MIB::cpqNicIfPhysAdapterTable |
SNMP agent | net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
{#ADAPTERNAME} port {#ADAPTERINDEX}: Status | MIB: CPQNIC-MIB The physical adapter status. The following values are valid: unknown(1) The instrument agent was not able to determine the status of the adapter. The instrument agent may need to be upgraded. ok(2) The physical adapter is operating properly. generalFailure(3) The physical adapter has failed. linkFailure(4) The physical adapter has lost link. Check the cable connections to this adapter. |
SNMP agent | hp.server.net.if.status[cpqNicIfPhysAdapterStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#ADAPTERNAME} port {#ADAPTERINDEX}: Adapter has failed | Please check the physical adapter. |
last(/HPE ProLiant BL460 by SNMP/hp.server.net.if.status[cpqNicIfPhysAdapterStatus.{#SNMPINDEX}])=3 |High |
||
{#ADAPTERNAME} port {#ADAPTERINDEX}: Adapter has lost link | Please check the cable connections to this adapter. |
last(/HPE ProLiant BL460 by SNMP/hp.server.net.if.status[cpqNicIfPhysAdapterStatus.{#SNMPINDEX}])=4 |Average |
Depends on:
|
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring Dell iDRAC servers with iDRAC version 7 (and later) via Zabbix SNMP agent that works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Default |
---|---|---|
{$DELL.SNMP.DISCOVERY.VOLTAGE.NAME.MATCHES} | Sets the regex string of voltage probe names to allow in discovery. |
^.*Voltage.*$ |
{$DELL.SNMP.DISCOVERY.VOLTAGE.NAME.NOT_MATCHES} | Sets the regex string of voltage probe names to ignore in discovery. |
CHANGE_IF_NEEDED |
{$DELL.SNMP.DISCOVERY.VOLTAGE.TYPE.MATCHES} | Sets the regex string of voltage probe types to allow in discovery. |
18|16 |
{$DELL.SNMP.DISCOVERY.VOLTAGE.TYPE.NOT_MATCHES} | Sets the regex string of voltage probe types to ignore in discovery. |
CHANGE_IF_NEEDED |
{$DELL.SNMP.SENSOR.TEMP.STATUS.OK} | The OK status of the temperature probe for the trigger expression. |
3 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalUpper"} | The warning status of the temperature probe for the trigger expression. |
4 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalLower"} | The warning status of the temperature probe for the trigger expression. |
7 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalUpper"} | The critical status of the temperature probe for the trigger expression. |
5 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableUpper"} | The critical status of the temperature probe for the trigger expression. |
6 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalLower"} | The critical status of the temperature probe for the trigger expression. |
8 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableLower"} | The critical status of the temperature probe for the trigger expression. |
9 |
{$DELL.SNMP.HEALTH.STATUS.DISASTER} | The disaster status of health for the trigger expression. |
6 |
{$DELL.SNMP.HEALTH.STATUS.CRIT} | The critical status of health for the trigger expression. |
5 |
{$DELL.SNMP.HEALTH.STATUS.WARN} | The warning status of health for the trigger expression. |
4 |
{$DELL.SNMP.PSU.STATUS.WARN:"nonCritical"} | The warning value of the PSU sensor for the trigger expression. |
4 |
{$DELL.SNMP.PSU.STATUS.CRIT:"critical"} | The critical value of the PSU sensor for the trigger expression. |
5 |
{$DELL.SNMP.PSU.STATUS.CRIT:"nonRecoverable"} | The critical value of the PSU sensor for the trigger expression. |
6 |
{$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalUpper"} | The warning value of the FAN sensor for the trigger expression. |
4 |
{$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalLower"} | The warning value of the FAN sensor for the trigger expression. |
7 |
{$DELL.SNMP.FAN.STATUS.CRIT:"criticalUpper"} | The critical value of the FAN sensor for the trigger expression. |
5 |
{$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableUpper"} | The critical value of the FAN sensor for the trigger expression. |
6 |
{$DELL.SNMP.FAN.STATUS.CRIT:"criticalLower"} | The critical value of the FAN sensor for the trigger expression. |
8 |
{$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableLower"} | The critical value of the FAN sensor for the trigger expression. |
9 |
{$DELL.SNMP.FAN.STATUS.CRIT:"failed"} | The critical value of the FAN sensor for the trigger expression. |
10 |
{$DELL.SNMP.DISK.ARRAY.STATUS.FAIL} | The disaster status of the disk array for the trigger expression. |
6 |
{$DELL.SNMP.DISK.ARRAY.STATUS.CRIT} | The critical status of the disk array for the trigger expression. |
5 |
{$DELL.SNMP.DISK.ARRAY.STATUS.WARN} | The warning status of the disk array for the trigger expression. |
4 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT} | The critical status of the disk array cache battery for the trigger expression. |
3 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.WARN} | The warning status of the disk array cache battery for the trigger expression. |
4 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.OK} | The OK status of the disk array cache battery for the trigger expression. |
2 |
{$DELL.SNMP.VDISK.STATUS.CRIT:"failed"} | The critical status of the virtual disk for the trigger expression. |
3 |
{$DELL.SNMP.VDISK.STATUS.WARN:"degraded"} | The warning status of the virtual disk for the trigger expression. |
4 |
{$DELL.SNMP.DISK.STATUS.WARN:"nonCritical"} | The warning status of the disk for the trigger expression. |
4 |
{$DELL.SNMP.DISK.STATUS.FAIL:"critical"} | The critical status of the disk for the trigger expression. |
5 |
{$DELL.SNMP.DISK.STATUS.FAIL:"nonRecoverable"} | The critical status of the disk for the trigger expression. |
6 |
{$DELL.SNMP.DISK.SMART.STATUS.FAIL} | The critical S.M.A.R.T status of the disk for the trigger expression. |
1 |
{$DELL.SNMP.TIMEOUT} | The time interval for the SNMP agent availability trigger expression. |
5m |
{$DELL.SNMP.IFCONTROL} | The link status trigger will be fired only for interfaces that have the context macro equal to "1". |
1 |
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell iDRAC: Overall system health status | MIB: IDRAC-MIB-SMIv2 This attribute defines the overall rollup status of all the components in the system monitored by the remote access card. Includes system, storage, IO devices, iDRAC, CPU, memory, etc. |
SNMP agent | dell.server.status[globalSystemStatus] Preprocessing
|
Dell iDRAC: Hardware model name | MIB: IDRAC-MIB-SMIv2 This attribute defines the model name of the system. |
SNMP agent | dell.server.hw.model[systemModelName] Preprocessing
|
Dell iDRAC: Hardware serial number | MIB: IDRAC-MIB-SMIv2 This attribute defines the service tag of the system. |
SNMP agent | dell.server.hw.serialnumber[systemServiceTag] Preprocessing
|
Dell iDRAC: Operating system | MIB: IDRAC-MIB-SMIv2 This attribute defines the name of the operating system that the host is running. |
SNMP agent | dell.server.sw.os[systemOSName] Preprocessing
|
Dell iDRAC: Firmware version | MIB: IDRAC-MIB-SMIv2 This attribute defines the firmware version of a remote access card. |
SNMP agent | dell.server.hw.firmware[racFirmwareVersion] Preprocessing
|
Dell iDRAC: Uptime (network) | MIB: SNMP-FRAMEWORK-MIB The number of seconds since the value of the snmpEngineBoots object last changed. |
SNMP agent | dell.server.net.uptime[snmpEngineTime] |
Dell iDRAC: Uptime (hardware) | MIB: IDRAC-MIB-SMIv2 This attribute defines the power-up time of the system in seconds. |
SNMP agent | dell.server.hw.uptime[systemPowerUpTime] |
Dell iDRAC: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other |
SNMP trap | snmptrap.fallback |
Dell iDRAC: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., 'telephone closet, 3rd floor'). If the location is unknown, the value is a zero-length string. |
SNMP agent | dell.server.location[sysLocation] Preprocessing
|
Dell iDRAC: System contact details | MIB: SNMPv2-MIB Name and contact information of the contact person for the node. If not provided, the value is a zero-length string. |
SNMP agent | dell.server.contact[sysContact] Preprocessing
|
Dell iDRAC: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the entity as part of the vendor's SMI enterprises subtree with the prefix 1.3.6.1.4.1 (e.g., a vendor with the identifier 1.3.6.1.4.1.4242 might assign a system object with the OID 1.3.6.1.4.1.4242.1.1). |
SNMP agent | dell.server.objectid[sysObjectID] Preprocessing
|
Dell iDRAC: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node. By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is a zero-length string. |
SNMP agent | dell.server.name[sysName] Preprocessing
|
Dell iDRAC: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating system, and networking software. |
SNMP agent | dell.server.descr[sysDescr] Preprocessing
|
Dell iDRAC: SNMP agent availability | Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list. Possible values: 0 - not available 1 - available 2 - unknown |
Zabbix internal | zabbix[host,snmp,available] Preprocessing
|
Dell iDRAC: Memory, total size | Total memory amount on the device. |
Calculated | dell.server.memory.size.total Preprocessing
|
Dell iDRAC: BIOS version | MIB: IDRAC-MIB-SMIv2 This attribute defines the version name of the system BIOS. |
SNMP agent | dell.server.bios.version Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell iDRAC: System is in unrecoverable state | Please check the device for faults. |
last(/Dell iDRAC by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.DISASTER} |High |
||
Dell iDRAC: System status is in critical state | Please check the device for errors. |
last(/Dell iDRAC by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.CRIT} |Average |
||
Dell iDRAC: System status is in warning state | Please check the device for warnings. |
last(/Dell iDRAC by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.WARN} |Warning |
Depends on:
|
|
Dell iDRAC: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/Dell iDRAC by SNMP/dell.server.hw.serialnumber[systemServiceTag],#1)<>last(/Dell iDRAC by SNMP/dell.server.hw.serialnumber[systemServiceTag],#2) and length(last(/Dell iDRAC by SNMP/dell.server.hw.serialnumber[systemServiceTag]))>0 |Info |
Manual close: Yes | |
Dell iDRAC: Operating system description has changed | Operating system description has changed. Possibly, the system has been updated or replaced. Acknowledge to close the problem manually. |
last(/Dell iDRAC by SNMP/dell.server.sw.os[systemOSName],#1)<>last(/Dell iDRAC by SNMP/dell.server.sw.os[systemOSName],#2) and length(last(/Dell iDRAC by SNMP/dell.server.sw.os[systemOSName]))>0 |Info |
Manual close: Yes | |
Dell iDRAC: Firmware has changed | Firmware version has changed. Acknowledge to close the problem manually. |
last(/Dell iDRAC by SNMP/dell.server.hw.firmware[racFirmwareVersion],#1)<>last(/Dell iDRAC by SNMP/dell.server.hw.firmware[racFirmwareVersion],#2) and length(last(/Dell iDRAC by SNMP/dell.server.hw.firmware[racFirmwareVersion]))>0 |Info |
Manual close: Yes | |
Dell iDRAC: Host has been restarted | Uptime is less than 10 minutes. |
(last(/Dell iDRAC by SNMP/dell.server.hw.uptime[systemPowerUpTime])>0 and last(/Dell iDRAC by SNMP/dell.server.hw.uptime[systemPowerUpTime])<10m) or (last(/Dell iDRAC by SNMP/dell.server.hw.uptime[systemPowerUpTime])=0 and last(/Dell iDRAC by SNMP/dell.server.net.uptime[snmpEngineTime])<10m) |Warning |
Manual close: Yes | |
Dell iDRAC: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/Dell iDRAC by SNMP/dell.server.name[sysName],#1)<>last(/Dell iDRAC by SNMP/dell.server.name[sysName],#2) and length(last(/Dell iDRAC by SNMP/dell.server.name[sysName]))>0 |Info |
Manual close: Yes | |
Dell iDRAC: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/Dell iDRAC by SNMP/zabbix[host,snmp,available],{$DELL.SNMP.TIMEOUT})=0 |Warning |
||
Dell iDRAC: Memory amount has changed | change(/Dell iDRAC by SNMP/dell.server.memory.size.total)>0 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Scanning table of Temperature Probe Table IDRAC-MIB-SMIv2::temperatureProbeTable |
SNMP agent | temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell iDRAC: Probe [{#SENSOR_LOCALE}]: Value | MIB: IDRAC-MIB-SMIv2 This attribute defines the reading for a temperature probe of type other than When the value for When the value for |
SNMP agent | dell.server.sensor.temp.value[temperatureProbeReading.{#SNMPINDEX}] Preprocessing
|
Dell iDRAC: Probe [{#SENSOR_LOCALE}]: Status | MIB: IDRAC-MIB-SMIv2 This attribute defines the probe status of the temperature probe. Possible values: other(1), -- probe status is not one of the following: unknown(2), -- probe status is unknown (not known or monitored) ok(3), -- probe is reporting a value within the thresholds nonCriticalUpper(4), -- probe has crossed the upper noncritical threshold criticalUpper(5), -- probe has crossed the upper critical threshold nonRecoverableUpper(6), -- probe has crossed the upper non-recoverable threshold nonCriticalLower(7), -- probe has crossed the lower noncritical threshold criticalLower(8), -- probe has crossed the lower critical threshold nonRecoverableLower(9), -- probe has crossed the lower non-recoverable threshold failed(10) -- probe is not functional |
SNMP agent | dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell iDRAC: Probe [{#SENSOR_LOCALE}]: Critical status | Please check the device for faults. |
last(/Dell iDRAC by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalUpper"} or last(/Dell iDRAC by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableUpper"} or last(/Dell iDRAC by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalLower"} or last(/Dell iDRAC by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableLower"} |Average |
||
Dell iDRAC: Probe [{#SENSOR_LOCALE}]: Warning status | Please check the device for faults. |
last(/Dell iDRAC by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalUpper"} or last(/Dell iDRAC by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalLower"} |Warning |
Depends on:
|
|
Dell iDRAC: Probe [{#SENSOR_LOCALE}]: Not in optimal status | Please check the device for faults. |
last(/Dell iDRAC by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])<>{$DELL.SNMP.SENSOR.TEMP.STATUS.OK} |Info |
Manual close: Yes Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | IDRAC-MIB-SMIv2::powerSupplyTable |
SNMP agent | psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell iDRAC: Power supply [{#PSU_DESCR}]: State | MIB: IDRAC-MIB-SMIv2 This attribute defines the status of the power supply. |
SNMP agent | dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell iDRAC: Power supply [{#PSU_DESCR}]: Critical state | Please check the power supply unit for errors. |
last(/Dell iDRAC by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.CRIT:"critical"} or last(/Dell iDRAC by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.CRIT:"nonRecoverable"} |Average |
||
Dell iDRAC: Power supply [{#PSU_DESCR}]: Warning state | Please check the power supply unit for errors. |
last(/Dell iDRAC by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.WARN:"nonCritical"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Fan discovery | IDRAC-MIB-SMIv2::coolingDeviceTable |
SNMP agent | fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell iDRAC: Fan [{#FAN_DESCR}]: Status | MIB: IDRAC-MIB-SMIv2 This attribute defines the probe status of the cooling device. |
SNMP agent | dell.server.sensor.fan.status[{#FAN_DESCR}] Preprocessing
|
Dell iDRAC: Fan [{#FAN_DESCR}]: Speed | MIB: IDRAC-MIB-SMIv2 This attribute defines the reading for a cooling device of a subtype other than When the value for When the value for |
SNMP agent | dell.server.sensor.fan.speed[{#FAN_DESCR}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell iDRAC: Fan [{#FAN_DESCR}]: Critical state | Please check the fan unit. |
last(/Dell iDRAC by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"criticalUpper"} or last(/Dell iDRAC by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableUpper"} or last(/Dell iDRAC by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"criticalLower"} or last(/Dell iDRAC by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableLower"} or last(/Dell iDRAC by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"failed"} |Average |
||
Dell iDRAC: Fan [{#FAN_DESCR}]: Warning state | Please check the fan unit. |
last(/Dell iDRAC by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalUpper"} or last(/Dell iDRAC by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalLower"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Scanning table of Array controllers: IDRAC-MIB-SMIv2::controllerTable |
SNMP agent | array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell iDRAC: Controller [{#CNTLR_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The status of the controller itself without the propagation of any contained component status. Possible values: 1: Other 2: Unknown 3: OK 4: Non-critical 5: Critical 6: Non-recoverable |
SNMP agent | dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}] Preprocessing
|
Dell iDRAC: Controller [{#CNTLR_NAME}]: Model | MIB: IDRAC-MIB-SMIv2 The controller's name as represented in Storage Management. |
SNMP agent | dell.server.hw.diskarray.model[controllerName.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell iDRAC: Controller [{#CNTLR_NAME}]: Unrecoverable state | Please check the device for faults. |
last(/Dell iDRAC by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.FAIL} |High |
||
Dell iDRAC: Controller [{#CNTLR_NAME}]: Critical state | Please check the device for faults. |
last(/Dell iDRAC by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.CRIT} |Average |
Depends on:
|
|
Dell iDRAC: Controller [{#CNTLR_NAME}]: Warning state | Please check the device for faults. |
last(/Dell iDRAC by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Battery discovery | Scanning Battery Table: IDRAC-MIB-SMIv2::batteryTable |
SNMP agent | battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell iDRAC: Battery [{#BATTERY_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 Current state of battery. Possible values: 1: The current state could not be determined. 2: The battery is operating normally. 3: The battery has failed and needs to be replaced. 4: The battery temperature is high or charge level is depleting. 5: The battery is missing or not detected. 6: The battery is undergoing the re-charge phase. 7: The battery voltage or charge level is below the threshold. |
SNMP agent | dell.server.hw.battery.status[batteryState.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell iDRAC: Battery [{#BATTERY_NAME}]: Critical state | Please check the device for faults. |
last(/Dell iDRAC by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT} |Average |
||
Dell iDRAC: Battery [{#BATTERY_NAME}]: Warning state | Please check the device for faults. |
last(/Dell iDRAC by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.WARN} |Warning |
Depends on:
|
|
Dell iDRAC: Battery [{#BATTERY_NAME}]: Not in optimal state | Please check the device for faults. |
last(/Dell iDRAC by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])<>{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.OK} |Info |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Scanning table of physical drive entries IDRAC-MIB-SMIv2::physicalDiskTable. |
SNMP agent | physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell iDRAC: Physical disk [{#DISK_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The status of the physical disk itself without the propagation of any contained component status. Possible values: 1: Other 2: Unknown 3: OK 4: Non-critical 5: Critical 6: Non-recoverable |
SNMP agent | dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}] Preprocessing
|
Dell iDRAC: Physical disk [{#DISK_NAME}]: S.M.A.R.T. Status | MIB: IDRAC-MIB-SMIv2 Indicates whether the physical disk has received a predictive failure alert. |
SNMP agent | dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}] Preprocessing
|
Dell iDRAC: Physical disk [{#DISK_NAME}]: Serial number | MIB: IDRAC-MIB-SMIv2 The physical disk's unique identification number from the manufacturer. |
SNMP agent | dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}] Preprocessing
|
Dell iDRAC: Physical disk [{#DISK_NAME}]: Model name | MIB: IDRAC-MIB-SMIv2 The model number of the physical disk. |
SNMP agent | dell.server.hw.physicaldisk.model[physicalDiskProductID.{#SNMPINDEX}] Preprocessing
|
Dell iDRAC: Physical disk [{#DISK_NAME}]: Media type | MIB: IDRAC-MIB-SMIv2 The media type of the physical disk. Possible Values: 1: The media type could not be determined. 2: Hard Disk Drive (HDD). 3: Solid State Drive (SSD). |
SNMP agent | dell.server.hw.physicaldisk.media_type[physicalDiskMediaType.{#SNMPINDEX}] Preprocessing
|
Dell iDRAC: Physical disk [{#DISK_NAME}]: Size | MIB: IDRAC-MIB-SMIv2 The size of the physical disk in megabytes. |
SNMP agent | dell.server.hw.physicaldisk.size[physicalDiskCapacityInMB.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell iDRAC: Physical disk [{#DISK_NAME}]: Failed state | Please check physical disk for warnings or errors. |
last(/Dell iDRAC by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.FAIL:"critical"} or last(/Dell iDRAC by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.FAIL:"nonRecoverable"} |High |
||
Dell iDRAC: Physical disk [{#DISK_NAME}]: Warning state | Please check physical disk for warnings or errors. |
last(/Dell iDRAC by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.WARN:"nonCritical"} |Warning |
Depends on:
|
|
Dell iDRAC: Physical disk [{#DISK_NAME}]: S.M.A.R.T. failed | Disk probably requires replacement. |
last(/Dell iDRAC by SNMP/dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}])={$DELL.SNMP.DISK.SMART.STATUS.FAIL:"replaceDrive"} or last(/Dell iDRAC by SNMP/dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}])={$DELL.SNMP.DISK.SMART.STATUS.FAIL:"replaceDriveSSDWearOut"} |High |
Depends on:
|
|
Dell iDRAC: Physical disk [{#DISK_NAME}]: Has been replaced | [{#DISK_NAME}] serial number has changed. Acknowledge to close the problem manually. |
last(/Dell iDRAC by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}],#1)<>last(/Dell iDRAC by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}],#2) and length(last(/Dell iDRAC by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | IDRAC-MIB-SMIv2::virtualDiskTable |
SNMP agent | virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell iDRAC: Virtual disk [{#DISK_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The current state of this virtual disk (which includes any member physical disks.) Possible states: 1: The current state could not be determined. 2: The virtual disk is operating normally or optimally. 3: The virtual disk has encountered a failure. Data on the disk is lost or is about to be lost. 4: The virtual disk encountered a failure with one or all of the constituent redundant physical disks. The data on the virtual disk might no longer be fault tolerant. |
SNMP agent | dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}] Preprocessing
|
Dell iDRAC: Virtual disk [{#DISK_NAME}]: Layout type | MIB: IDRAC-MIB-SMIv2 The virtual disk's RAID type. Possible values: 1: Not one of the following 2: RAID-0 3: RAID-1 4: RAID-5 5: RAID-6 6: RAID-10 7: RAID-50 8: RAID-60 9: Concatenated RAID 1 10: Concatenated RAID 5 |
SNMP agent | dell.server.hw.virtualdisk.layout[virtualDiskLayout.{#SNMPINDEX}] Preprocessing
|
Dell iDRAC: Virtual disk [{#DISK_NAME}]: Size | MIB: IDRAC-MIB-SMIv2 The size of the virtual disk in megabytes. |
SNMP agent | dell.server.hw.virtualdisk.size[virtualDiskSizeInMB.{#SNMPINDEX}] Preprocessing
|
Dell iDRAC: Virtual disk [{#DISK_NAME}]: Operational state | MIB: IDRAC-MIB-SMIv2 The state of the virtual disk when there are progressive operations ongoing. Possible values: 1: There is no active operation running. 2: The virtual disk configuration has changed. The physical disks included in the virtual disk are being modified to support the new configuration. 3: A Consistency Check (CC) is being performed on the virtual disk. 4: The virtual disk is being initialized. 5: BackGround Initialization (BGI) is being performed on the virtual disk. |
SNMP agent | dell.server.hw.virtualdisk.state[virtualDiskOperationalState.{#SNMPINDEX}] |
Dell iDRAC: Virtual disk [{#DISK_NAME}]: Read policy | MIB: IDRAC-MIB-SMIv2 The read policy used by the controller for read operations on this virtual disk. Possible values: 1: No Read Ahead. 2: Read Ahead. 3: Adaptive Read Ahead. |
SNMP agent | dell.server.hw.virtualdisk.read_policy[virtualDiskReadPolicy.{#SNMPINDEX}] |
Dell iDRAC: Virtual disk [{#DISK_NAME}]: Write policy | MIB: IDRAC-MIB-SMIv2 The write policy used by the controller for write operations on this virtual disk. Possible values: 1: Write Through. 2: Write Back. 3: Force Write Back. |
SNMP agent | dell.server.hw.virtualdisk.write_policy[virtualDiskWritePolicy.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell iDRAC: Virtual disk [{#DISK_NAME}]: Failed state | Please check the virtual disk for warnings or errors. |
last(/Dell iDRAC by SNMP/dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}])={$DELL.SNMP.VDISK.STATUS.CRIT:"failed"} |High |
||
Dell iDRAC: Virtual disk [{#DISK_NAME}]: Warning state | Please check the virtual disk for warnings or errors. |
last(/Dell iDRAC by SNMP/dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}])={$DELL.SNMP.VDISK.STATUS.WARN:"degraded"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | Discovery of network interfaces. |
SNMP agent | net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell iDRAC: NIC [{#NICFQDD}/{#NICMAC}]: Link status | This attribute defines the connection status of the network device. |
SNMP agent | dell.server.net.if.link[{#NIC_FQDD}] Preprocessing
|
Dell iDRAC: NIC [{#NICFQDD}/{#NICMAC}]: Status | This attribute defines the status of the network device. |
SNMP agent | dell.server.net.if.status[{#NIC_FQDD}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell iDRAC: NIC [{#NICFQDD}/{#NICMAC}]: Link down | This trigger expression works as follows: |
{$DELL.SNMP.IFCONTROL:"{#NIC_FQDD}"}=1 and last(/Dell iDRAC by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#1)<>1 and last(/Dell iDRAC by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#1)<>last(/Dell iDRAC by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#2) |Average |
Manual close: Yes | |
Dell iDRAC: NIC [{#NICFQDD}/{#NICMAC}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/Dell iDRAC by SNMP/dell.server.net.if.status[{#NIC_FQDD}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
CPU status discovery | CPU status discovery. |
SNMP agent | cpu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell iDRAC: CPU [{#CPU_FQDD}]: Status | This attribute defines the status of the processor device status probe. This status will be joined into the |
SNMP agent | dell.server.cpu.status[cpu.{#SNMPINDEX}] Preprocessing
|
Dell iDRAC: CPU [{#CPU_FQDD}]: State | This attribute defines the reading of the processor device status probe. |
SNMP agent | dell.server.cpu.state[cpu.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell iDRAC: CPU [{#CPU_FQDD}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/Dell iDRAC by SNMP/dell.server.cpu.status[cpu.{#SNMPINDEX}],#1)<>3 |Average |
||
Dell iDRAC: CPU [{#CPU_FQDD}]: Reading error | MIB: IDRAC-MIB-SMIv2 |
bitand(last(/Dell iDRAC by SNMP/dell.server.cpu.state[cpu.{#SNMPINDEX}],#1),128)=0 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
System battery discovery | System battery discovery. |
SNMP agent | system.battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell iDRAC: System battery [{#SNMPVALUE}]: Status | This attribute defines the status of the battery. |
SNMP agent | dell.server.system.battery[{#SNMPVALUE}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell iDRAC: System battery [{#SNMPVALUE}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/Dell iDRAC by SNMP/dell.server.system.battery[{#SNMPVALUE}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Memory discovery | Memory discovery. |
SNMP agent | memory.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell iDRAC: Memory [{#SNMPVALUE}]: Status | This attribute defines the status of the memory device. |
SNMP agent | dell.server.memory.status[{#SNMPVALUE}] Preprocessing
|
Dell iDRAC: Memory [{#SNMPVALUE}]: Size | This attribute defines the size, in KB, of the memory device. Zero indicates no memory installed; 2,147,483,647 indicates an unknown memory size. |
SNMP agent | dell.server.memory.size[{#SNMPVALUE}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell iDRAC: Memory [{#SNMPVALUE}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/Dell iDRAC by SNMP/dell.server.memory.status[{#SNMPVALUE}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Voltage probe discovery | Voltage probe discovery. |
SNMP agent | voltage.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell iDRAC: Voltage probe [{#VPROBE_NAME}]: Voltage | This attribute defines the reading for a voltage probe. |
SNMP agent | dell.server.voltage.value[{#VPROBE_NAME}] Preprocessing
|
Dell iDRAC: Voltage probe [{#VPROBE_NAME}]: Status | This attribute defines the status of the voltage probe. |
SNMP agent | dell.server.voltage.status[{#VPROBE_NAME}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell iDRAC: Voltage probe [{#VPROBE_NAME}]: Status is not OK | Please check the device's voltage. |
last(/Dell iDRAC by SNMP/dell.server.voltage.status[{#VPROBE_NAME}])<>3 |Average |
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring DELL PowerEdge R840 servers with iDRAC version 7 (and later) via Zabbix SNMP agent that works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Default |
---|---|---|
{$DELL.SNMP.DISCOVERY.VOLTAGE.NAME.MATCHES} | Sets the regex string of voltage probe names to allow in discovery. |
^.*Voltage.*$ |
{$DELL.SNMP.DISCOVERY.VOLTAGE.NAME.NOT_MATCHES} | Sets the regex string of voltage probe names to ignore in discovery. |
CHANGE_IF_NEEDED |
{$DELL.SNMP.DISCOVERY.VOLTAGE.TYPE.MATCHES} | Sets the regex string of voltage probe types to allow in discovery. |
18|16 |
{$DELL.SNMP.DISCOVERY.VOLTAGE.TYPE.NOT_MATCHES} | Sets the regex string of voltage probe types to ignore in discovery. |
CHANGE_IF_NEEDED |
{$DELL.SNMP.SENSOR.TEMP.STATUS.OK} | The OK status of the temperature probe for the trigger expression. |
3 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalUpper"} | The warning status of the temperature probe for the trigger expression. |
4 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalLower"} | The warning status of the temperature probe for the trigger expression. |
7 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalUpper"} | The critical status of the temperature probe for the trigger expression. |
5 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableUpper"} | The critical status of the temperature probe for the trigger expression. |
6 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalLower"} | The critical status of the temperature probe for the trigger expression. |
8 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableLower"} | The critical status of the temperature probe for the trigger expression. |
9 |
{$DELL.SNMP.HEALTH.STATUS.DISASTER} | The disaster status of health for the trigger expression. |
6 |
{$DELL.SNMP.HEALTH.STATUS.CRIT} | The critical status of health for the trigger expression. |
5 |
{$DELL.SNMP.HEALTH.STATUS.WARN} | The warning status of health for the trigger expression. |
4 |
{$DELL.SNMP.PSU.STATUS.WARN:"nonCritical"} | The warning value of the PSU sensor for the trigger expression. |
4 |
{$DELL.SNMP.PSU.STATUS.CRIT:"critical"} | The critical value of the PSU sensor for the trigger expression. |
5 |
{$DELL.SNMP.PSU.STATUS.CRIT:"nonRecoverable"} | The critical value of the PSU sensor for the trigger expression. |
6 |
{$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalUpper"} | The warning value of the FAN sensor for the trigger expression. |
4 |
{$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalLower"} | The warning value of the FAN sensor for the trigger expression. |
7 |
{$DELL.SNMP.FAN.STATUS.CRIT:"criticalUpper"} | The critical value of the FAN sensor for the trigger expression. |
5 |
{$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableUpper"} | The critical value of the FAN sensor for the trigger expression. |
6 |
{$DELL.SNMP.FAN.STATUS.CRIT:"criticalLower"} | The critical value of the FAN sensor for the trigger expression. |
8 |
{$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableLower"} | The critical value of the FAN sensor for the trigger expression. |
9 |
{$DELL.SNMP.FAN.STATUS.CRIT:"failed"} | The critical value of the FAN sensor for the trigger expression. |
10 |
{$DELL.SNMP.DISK.ARRAY.STATUS.FAIL} | The disaster status of the disk array for the trigger expression. |
6 |
{$DELL.SNMP.DISK.ARRAY.STATUS.CRIT} | The critical status of the disk array for the trigger expression. |
5 |
{$DELL.SNMP.DISK.ARRAY.STATUS.WARN} | The warning status of the disk array for the trigger expression. |
4 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT} | The critical status of the disk array cache battery for the trigger expression. |
3 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.WARN} | The warning status of the disk array cache battery for the trigger expression. |
4 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.OK} | The OK status of the disk array cache battery for the trigger expression. |
2 |
{$DELL.SNMP.VDISK.STATUS.CRIT:"failed"} | The critical status of the virtual disk for the trigger expression. |
3 |
{$DELL.SNMP.VDISK.STATUS.WARN:"degraded"} | The warning status of the virtual disk for the trigger expression. |
4 |
{$DELL.SNMP.DISK.STATUS.WARN:"nonCritical"} | The warning status of the disk for the trigger expression. |
4 |
{$DELL.SNMP.DISK.STATUS.FAIL:"critical"} | The critical status of the disk for the trigger expression. |
5 |
{$DELL.SNMP.DISK.STATUS.FAIL:"nonRecoverable"} | The critical status of the disk for the trigger expression. |
6 |
{$DELL.SNMP.DISK.SMART.STATUS.FAIL} | The critical S.M.A.R.T status of the disk for the trigger expression. |
1 |
{$DELL.SNMP.TIMEOUT} | The time interval for the SNMP agent availability trigger expression. |
5m |
{$DELL.SNMP.IFCONTROL} | The link status trigger will be fired only for interfaces that have the context macro equal to "1". |
1 |
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Overall system health status | MIB: IDRAC-MIB-SMIv2 This attribute defines the overall rollup status of all the components in the system monitored by the remote access card. Includes system, storage, IO devices, iDRAC, CPU, memory, etc. |
SNMP agent | dell.server.status[globalSystemStatus] Preprocessing
|
Dell R840: Hardware model name | MIB: IDRAC-MIB-SMIv2 This attribute defines the model name of the system. |
SNMP agent | dell.server.hw.model[systemModelName] Preprocessing
|
Dell R840: Hardware serial number | MIB: IDRAC-MIB-SMIv2 This attribute defines the service tag of the system. |
SNMP agent | dell.server.hw.serialnumber[systemServiceTag] Preprocessing
|
Dell R840: Operating system | MIB: IDRAC-MIB-SMIv2 This attribute defines the name of the operating system that the host is running. |
SNMP agent | dell.server.sw.os[systemOSName] Preprocessing
|
Dell R840: Firmware version | MIB: IDRAC-MIB-SMIv2 This attribute defines the firmware version of a remote access card. |
SNMP agent | dell.server.hw.firmware[racFirmwareVersion] Preprocessing
|
Dell R840: Uptime (network) | MIB: SNMP-FRAMEWORK-MIB The number of seconds since the value of the snmpEngineBoots object last changed. |
SNMP agent | dell.server.net.uptime[snmpEngineTime] |
Dell R840: Uptime (hardware) | MIB: IDRAC-MIB-SMIv2 This attribute defines the power-up time of the system in seconds. |
SNMP agent | dell.server.hw.uptime[systemPowerUpTime] |
Dell R840: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other |
SNMP trap | snmptrap.fallback |
Dell R840: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., 'telephone closet, 3rd floor'). If the location is unknown, the value is a zero-length string. |
SNMP agent | dell.server.location[sysLocation] Preprocessing
|
Dell R840: System contact details | MIB: SNMPv2-MIB Name and contact information of the contact person for the node. If not provided, the value is a zero-length string. |
SNMP agent | dell.server.contact[sysContact] Preprocessing
|
Dell R840: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the entity as part of the vendor's SMI enterprises subtree with the prefix 1.3.6.1.4.1 (e.g., a vendor with the identifier 1.3.6.1.4.1.4242 might assign a system object with the OID 1.3.6.1.4.1.4242.1.1). |
SNMP agent | dell.server.objectid[sysObjectID] Preprocessing
|
Dell R840: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node. By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is a zero-length string. |
SNMP agent | dell.server.name[sysName] Preprocessing
|
Dell R840: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating system, and networking software. |
SNMP agent | dell.server.descr[sysDescr] Preprocessing
|
Dell R840: SNMP agent availability | Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list. Possible values: 0 - not available 1 - available 2 - unknown |
Zabbix internal | zabbix[host,snmp,available] Preprocessing
|
Dell R840: Memory, total size | Total memory amount on the device. |
Calculated | dell.server.memory.size.total Preprocessing
|
Dell R840: BIOS version | MIB: IDRAC-MIB-SMIv2 This attribute defines the version name of the system BIOS. |
SNMP agent | dell.server.bios.version Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: System is in unrecoverable state | Please check the device for faults. |
last(/DELL PowerEdge R840 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.DISASTER} |High |
||
Dell R840: System status is in critical state | Please check the device for errors. |
last(/DELL PowerEdge R840 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.CRIT} |Average |
||
Dell R840: System status is in warning state | Please check the device for warnings. |
last(/DELL PowerEdge R840 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.WARN} |Warning |
Depends on:
|
|
Dell R840: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R840 by SNMP/dell.server.hw.serialnumber[systemServiceTag],#1)<>last(/DELL PowerEdge R840 by SNMP/dell.server.hw.serialnumber[systemServiceTag],#2) and length(last(/DELL PowerEdge R840 by SNMP/dell.server.hw.serialnumber[systemServiceTag]))>0 |Info |
Manual close: Yes | |
Dell R840: Operating system description has changed | Operating system description has changed. Possibly, the system has been updated or replaced. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R840 by SNMP/dell.server.sw.os[systemOSName],#1)<>last(/DELL PowerEdge R840 by SNMP/dell.server.sw.os[systemOSName],#2) and length(last(/DELL PowerEdge R840 by SNMP/dell.server.sw.os[systemOSName]))>0 |Info |
Manual close: Yes | |
Dell R840: Firmware has changed | Firmware version has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R840 by SNMP/dell.server.hw.firmware[racFirmwareVersion],#1)<>last(/DELL PowerEdge R840 by SNMP/dell.server.hw.firmware[racFirmwareVersion],#2) and length(last(/DELL PowerEdge R840 by SNMP/dell.server.hw.firmware[racFirmwareVersion]))>0 |Info |
Manual close: Yes | |
Dell R840: Host has been restarted | Uptime is less than 10 minutes. |
(last(/DELL PowerEdge R840 by SNMP/dell.server.hw.uptime[systemPowerUpTime])>0 and last(/DELL PowerEdge R840 by SNMP/dell.server.hw.uptime[systemPowerUpTime])<10m) or (last(/DELL PowerEdge R840 by SNMP/dell.server.hw.uptime[systemPowerUpTime])=0 and last(/DELL PowerEdge R840 by SNMP/dell.server.net.uptime[snmpEngineTime])<10m) |Warning |
Manual close: Yes | |
Dell R840: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R840 by SNMP/dell.server.name[sysName],#1)<>last(/DELL PowerEdge R840 by SNMP/dell.server.name[sysName],#2) and length(last(/DELL PowerEdge R840 by SNMP/dell.server.name[sysName]))>0 |Info |
Manual close: Yes | |
Dell R840: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/DELL PowerEdge R840 by SNMP/zabbix[host,snmp,available],{$DELL.SNMP.TIMEOUT})=0 |Warning |
||
Dell R840: Memory amount has changed | change(/DELL PowerEdge R840 by SNMP/dell.server.memory.size.total)>0 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Scanning table of Temperature Probe Table IDRAC-MIB-SMIv2::temperatureProbeTable |
SNMP agent | temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Probe [{#SENSOR_LOCALE}]: Value | MIB: IDRAC-MIB-SMIv2 This attribute defines the reading for a temperature probe of type other than When the value for When the value for |
SNMP agent | dell.server.sensor.temp.value[temperatureProbeReading.{#SNMPINDEX}] Preprocessing
|
Dell R840: Probe [{#SENSOR_LOCALE}]: Status | MIB: IDRAC-MIB-SMIv2 This attribute defines the probe status of the temperature probe. Possible values: other(1), -- probe status is not one of the following: unknown(2), -- probe status is unknown (not known or monitored) ok(3), -- probe is reporting a value within the thresholds nonCriticalUpper(4), -- probe has crossed the upper noncritical threshold criticalUpper(5), -- probe has crossed the upper critical threshold nonRecoverableUpper(6), -- probe has crossed the upper non-recoverable threshold nonCriticalLower(7), -- probe has crossed the lower noncritical threshold criticalLower(8), -- probe has crossed the lower critical threshold nonRecoverableLower(9), -- probe has crossed the lower non-recoverable threshold failed(10) -- probe is not functional |
SNMP agent | dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Probe [{#SENSOR_LOCALE}]: Critical status | Please check the device for faults. |
last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalUpper"} or last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableUpper"} or last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalLower"} or last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableLower"} |Average |
||
Dell R840: Probe [{#SENSOR_LOCALE}]: Warning status | Please check the device for faults. |
last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalUpper"} or last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalLower"} |Warning |
Depends on:
|
|
Dell R840: Probe [{#SENSOR_LOCALE}]: Not in optimal status | Please check the device for faults. |
last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])<>{$DELL.SNMP.SENSOR.TEMP.STATUS.OK} |Info |
Manual close: Yes Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | IDRAC-MIB-SMIv2::powerSupplyTable |
SNMP agent | psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Power supply [{#PSU_DESCR}]: State | MIB: IDRAC-MIB-SMIv2 This attribute defines the status of the power supply. |
SNMP agent | dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Power supply [{#PSU_DESCR}]: Critical state | Please check the power supply unit for errors. |
last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.CRIT:"critical"} or last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.CRIT:"nonRecoverable"} |Average |
||
Dell R840: Power supply [{#PSU_DESCR}]: Warning state | Please check the power supply unit for errors. |
last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.WARN:"nonCritical"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Fan discovery | IDRAC-MIB-SMIv2::coolingDeviceTable |
SNMP agent | fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Fan [{#FAN_DESCR}]: Status | MIB: IDRAC-MIB-SMIv2 This attribute defines the probe status of the cooling device. |
SNMP agent | dell.server.sensor.fan.status[{#FAN_DESCR}] Preprocessing
|
Dell R840: Fan [{#FAN_DESCR}]: Speed | MIB: IDRAC-MIB-SMIv2 This attribute defines the reading for a cooling device of a subtype other than When the value for When the value for |
SNMP agent | dell.server.sensor.fan.speed[{#FAN_DESCR}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Fan [{#FAN_DESCR}]: Critical state | Please check the fan unit. |
last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"criticalUpper"} or last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableUpper"} or last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"criticalLower"} or last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableLower"} or last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"failed"} |Average |
||
Dell R840: Fan [{#FAN_DESCR}]: Warning state | Please check the fan unit. |
last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalUpper"} or last(/DELL PowerEdge R840 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalLower"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Scanning table of Array controllers: IDRAC-MIB-SMIv2::controllerTable |
SNMP agent | array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Controller [{#CNTLR_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The status of the controller itself without the propagation of any contained component status. Possible values: 1: Other 2: Unknown 3: OK 4: Non-critical 5: Critical 6: Non-recoverable |
SNMP agent | dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}] Preprocessing
|
Dell R840: Controller [{#CNTLR_NAME}]: Model | MIB: IDRAC-MIB-SMIv2 The controller's name as represented in Storage Management. |
SNMP agent | dell.server.hw.diskarray.model[controllerName.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Controller [{#CNTLR_NAME}]: Unrecoverable state | Please check the device for faults. |
last(/DELL PowerEdge R840 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.FAIL} |High |
||
Dell R840: Controller [{#CNTLR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R840 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.CRIT} |Average |
Depends on:
|
|
Dell R840: Controller [{#CNTLR_NAME}]: Warning state | Please check the device for faults. |
last(/DELL PowerEdge R840 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Battery discovery | Scanning Battery Table: IDRAC-MIB-SMIv2::batteryTable |
SNMP agent | battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Battery [{#BATTERY_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 Current state of battery. Possible values: 1: The current state could not be determined. 2: The battery is operating normally. 3: The battery has failed and needs to be replaced. 4: The battery temperature is high or charge level is depleting. 5: The battery is missing or not detected. 6: The battery is undergoing the re-charge phase. 7: The battery voltage or charge level is below the threshold. |
SNMP agent | dell.server.hw.battery.status[batteryState.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Battery [{#BATTERY_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R840 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT} |Average |
||
Dell R840: Battery [{#BATTERY_NAME}]: Warning state | Please check the device for faults. |
last(/DELL PowerEdge R840 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.WARN} |Warning |
Depends on:
|
|
Dell R840: Battery [{#BATTERY_NAME}]: Not in optimal state | Please check the device for faults. |
last(/DELL PowerEdge R840 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])<>{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.OK} |Info |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Scanning table of physical drive entries IDRAC-MIB-SMIv2::physicalDiskTable. |
SNMP agent | physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Physical disk [{#DISK_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The status of the physical disk itself without the propagation of any contained component status. Possible values: 1: Other 2: Unknown 3: OK 4: Non-critical 5: Critical 6: Non-recoverable |
SNMP agent | dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}] Preprocessing
|
Dell R840: Physical disk [{#DISK_NAME}]: S.M.A.R.T. Status | MIB: IDRAC-MIB-SMIv2 Indicates whether the physical disk has received a predictive failure alert. |
SNMP agent | dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}] Preprocessing
|
Dell R840: Physical disk [{#DISK_NAME}]: Serial number | MIB: IDRAC-MIB-SMIv2 The physical disk's unique identification number from the manufacturer. |
SNMP agent | dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}] Preprocessing
|
Dell R840: Physical disk [{#DISK_NAME}]: Model name | MIB: IDRAC-MIB-SMIv2 The model number of the physical disk. |
SNMP agent | dell.server.hw.physicaldisk.model[physicalDiskProductID.{#SNMPINDEX}] Preprocessing
|
Dell R840: Physical disk [{#DISK_NAME}]: Media type | MIB: IDRAC-MIB-SMIv2 The media type of the physical disk. Possible Values: 1: The media type could not be determined. 2: Hard Disk Drive (HDD). 3: Solid State Drive (SSD). |
SNMP agent | dell.server.hw.physicaldisk.media_type[physicalDiskMediaType.{#SNMPINDEX}] Preprocessing
|
Dell R840: Physical disk [{#DISK_NAME}]: Size | MIB: IDRAC-MIB-SMIv2 The size of the physical disk in megabytes. |
SNMP agent | dell.server.hw.physicaldisk.size[physicalDiskCapacityInMB.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Physical disk [{#DISK_NAME}]: Failed state | Please check physical disk for warnings or errors. |
last(/DELL PowerEdge R840 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.FAIL:"critical"} or last(/DELL PowerEdge R840 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.FAIL:"nonRecoverable"} |High |
||
Dell R840: Physical disk [{#DISK_NAME}]: Warning state | Please check physical disk for warnings or errors. |
last(/DELL PowerEdge R840 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.WARN:"nonCritical"} |Warning |
Depends on:
|
|
Dell R840: Physical disk [{#DISK_NAME}]: S.M.A.R.T. failed | Disk probably requires replacement. |
last(/DELL PowerEdge R840 by SNMP/dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}])={$DELL.SNMP.DISK.SMART.STATUS.FAIL:"replaceDrive"} or last(/DELL PowerEdge R840 by SNMP/dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}])={$DELL.SNMP.DISK.SMART.STATUS.FAIL:"replaceDriveSSDWearOut"} |High |
Depends on:
|
|
Dell R840: Physical disk [{#DISK_NAME}]: Has been replaced | [{#DISK_NAME}] serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R840 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}],#1)<>last(/DELL PowerEdge R840 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}],#2) and length(last(/DELL PowerEdge R840 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | IDRAC-MIB-SMIv2::virtualDiskTable |
SNMP agent | virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Virtual disk [{#DISK_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The current state of this virtual disk (which includes any member physical disks.) Possible states: 1: The current state could not be determined. 2: The virtual disk is operating normally or optimally. 3: The virtual disk has encountered a failure. Data on the disk is lost or is about to be lost. 4: The virtual disk encountered a failure with one or all of the constituent redundant physical disks. The data on the virtual disk might no longer be fault tolerant. |
SNMP agent | dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}] Preprocessing
|
Dell R840: Virtual disk [{#DISK_NAME}]: Layout type | MIB: IDRAC-MIB-SMIv2 The virtual disk's RAID type. Possible values: 1: Not one of the following 2: RAID-0 3: RAID-1 4: RAID-5 5: RAID-6 6: RAID-10 7: RAID-50 8: RAID-60 9: Concatenated RAID 1 10: Concatenated RAID 5 |
SNMP agent | dell.server.hw.virtualdisk.layout[virtualDiskLayout.{#SNMPINDEX}] Preprocessing
|
Dell R840: Virtual disk [{#DISK_NAME}]: Size | MIB: IDRAC-MIB-SMIv2 The size of the virtual disk in megabytes. |
SNMP agent | dell.server.hw.virtualdisk.size[virtualDiskSizeInMB.{#SNMPINDEX}] Preprocessing
|
Dell R840: Virtual disk [{#DISK_NAME}]: Operational state | MIB: IDRAC-MIB-SMIv2 The state of the virtual disk when there are progressive operations ongoing. Possible values: 1: There is no active operation running. 2: The virtual disk configuration has changed. The physical disks included in the virtual disk are being modified to support the new configuration. 3: A Consistency Check (CC) is being performed on the virtual disk. 4: The virtual disk is being initialized. 5: BackGround Initialization (BGI) is being performed on the virtual disk. |
SNMP agent | dell.server.hw.virtualdisk.state[virtualDiskOperationalState.{#SNMPINDEX}] |
Dell R840: Virtual disk [{#DISK_NAME}]: Read policy | MIB: IDRAC-MIB-SMIv2 The read policy used by the controller for read operations on this virtual disk. Possible values: 1: No Read Ahead. 2: Read Ahead. 3: Adaptive Read Ahead. |
SNMP agent | dell.server.hw.virtualdisk.read_policy[virtualDiskReadPolicy.{#SNMPINDEX}] |
Dell R840: Virtual disk [{#DISK_NAME}]: Write policy | MIB: IDRAC-MIB-SMIv2 The write policy used by the controller for write operations on this virtual disk. Possible values: 1: Write Through. 2: Write Back. 3: Force Write Back. |
SNMP agent | dell.server.hw.virtualdisk.write_policy[virtualDiskWritePolicy.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Virtual disk [{#DISK_NAME}]: Failed state | Please check the virtual disk for warnings or errors. |
last(/DELL PowerEdge R840 by SNMP/dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}])={$DELL.SNMP.VDISK.STATUS.CRIT:"failed"} |High |
||
Dell R840: Virtual disk [{#DISK_NAME}]: Warning state | Please check the virtual disk for warnings or errors. |
last(/DELL PowerEdge R840 by SNMP/dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}])={$DELL.SNMP.VDISK.STATUS.WARN:"degraded"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | Discovery of network interfaces. |
SNMP agent | net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: NIC [{#NICFQDD}/{#NICMAC}]: Link status | This attribute defines the connection status of the network device. |
SNMP agent | dell.server.net.if.link[{#NIC_FQDD}] Preprocessing
|
Dell R840: NIC [{#NICFQDD}/{#NICMAC}]: Status | This attribute defines the status of the network device. |
SNMP agent | dell.server.net.if.status[{#NIC_FQDD}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: NIC [{#NICFQDD}/{#NICMAC}]: Link down | This trigger expression works as follows: |
{$DELL.SNMP.IFCONTROL:"{#NIC_FQDD}"}=1 and last(/DELL PowerEdge R840 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#1)<>1 and last(/DELL PowerEdge R840 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#1)<>last(/DELL PowerEdge R840 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#2) |Average |
Manual close: Yes | |
Dell R840: NIC [{#NICFQDD}/{#NICMAC}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R840 by SNMP/dell.server.net.if.status[{#NIC_FQDD}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
CPU status discovery | CPU status discovery. |
SNMP agent | cpu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: CPU [{#CPU_FQDD}]: Status | This attribute defines the status of the processor device status probe. This status will be joined into the |
SNMP agent | dell.server.cpu.status[cpu.{#SNMPINDEX}] Preprocessing
|
Dell R840: CPU [{#CPU_FQDD}]: State | This attribute defines the reading of the processor device status probe. |
SNMP agent | dell.server.cpu.state[cpu.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: CPU [{#CPU_FQDD}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R840 by SNMP/dell.server.cpu.status[cpu.{#SNMPINDEX}],#1)<>3 |Average |
||
Dell R840: CPU [{#CPU_FQDD}]: Reading error | MIB: IDRAC-MIB-SMIv2 |
bitand(last(/DELL PowerEdge R840 by SNMP/dell.server.cpu.state[cpu.{#SNMPINDEX}],#1),128)=0 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
System battery discovery | System battery discovery. |
SNMP agent | system.battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: System battery [{#SNMPVALUE}]: Status | This attribute defines the status of the battery. |
SNMP agent | dell.server.system.battery[{#SNMPVALUE}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: System battery [{#SNMPVALUE}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R840 by SNMP/dell.server.system.battery[{#SNMPVALUE}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Memory discovery | Memory discovery. |
SNMP agent | memory.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Memory [{#SNMPVALUE}]: Status | This attribute defines the status of the memory device. |
SNMP agent | dell.server.memory.status[{#SNMPVALUE}] Preprocessing
|
Dell R840: Memory [{#SNMPVALUE}]: Size | This attribute defines the size, in KB, of the memory device. Zero indicates no memory installed; 2,147,483,647 indicates an unknown memory size. |
SNMP agent | dell.server.memory.size[{#SNMPVALUE}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Memory [{#SNMPVALUE}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R840 by SNMP/dell.server.memory.status[{#SNMPVALUE}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Voltage probe discovery | Voltage probe discovery. |
SNMP agent | voltage.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Voltage probe [{#VPROBE_NAME}]: Voltage | This attribute defines the reading for a voltage probe. |
SNMP agent | dell.server.voltage.value[{#VPROBE_NAME}] Preprocessing
|
Dell R840: Voltage probe [{#VPROBE_NAME}]: Status | This attribute defines the status of the voltage probe. |
SNMP agent | dell.server.voltage.status[{#VPROBE_NAME}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Voltage probe [{#VPROBE_NAME}]: Status is not OK | Please check the device's voltage. |
last(/DELL PowerEdge R840 by SNMP/dell.server.voltage.status[{#VPROBE_NAME}])<>3 |Average |
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring DELL PowerEdge R840 servers with iDRAC 8/9 firmware 4.32 (and later) with Redfish API enabled via Zabbix script items. This template works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
1. Enable Redfish API in the Dell iDRAC interface of your server.
2. Create a user for monitoring with read-only permissions in the Dell iDRAC interface.
3. Create a host for Dell server with iDRAC IP as the Zabbix agent interface.
4. Link the template to the host.
5. Customize the values of the {$DELL.HTTP.API.URL}
, {$DELL.HTTP.API.USER}
, and {$DELL.HTTP.API.PASSWORD}
macros.
NOTE! If you are experiencing timeouts on some of the items that are executing requests, adjust the
{$DELL.HTTP.REQUEST.TIMEOUT}
macro accordingly.
Name | Description | Default |
---|---|---|
{$DELL.HTTP.API.URL} | The Dell iDRAC Redfish API URL in the format |
<Put your URL here> |
{$DELL.HTTP.API.USER} | The Dell iDRAC username. |
<Put your username here> |
{$DELL.HTTP.API.PASSWORD} | The Dell iDRAC user password. |
<Put your password here> |
{$DELL.HTTP.PROXY} | Set an HTTP proxy for Redfish API requests if needed. |
|
{$DELL.HTTP.RETURN.CODE.OK} | Set the HTTP return code that represents an OK response from the API. The default is "200", but can vary, for example, if a proxy is used. |
200 |
{$DELL.HTTP.REQUEST.TIMEOUT} | Set the timeout for HTTP requests. |
10s |
{$DELL.HTTP.IFCONTROL} | Link status trigger will be fired only for interfaces that have the context macro equal to "1". |
1 |
{$DELL.HTTP.CPU.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about CPU utilization. |
90 |
{$DELL.HTTP.CPU.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about CPU utilization. |
75 |
{$DELL.HTTP.MEM.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about memory utilization. |
90 |
{$DELL.HTTP.MEM.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about memory utilization. |
75 |
{$DELL.HTTP.IO.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about IO utilization. |
90 |
{$DELL.HTTP.IO.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about IO utilization. |
75 |
{$DELL.HTTP.SYS.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about SYS utilization. |
90 |
{$DELL.HTTP.SYS.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about SYS utilization. |
75 |
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Get system | Returns system metrics. |
Script | dell.server.system.get |
Dell R840: Get sensors | Returns sensors. |
Script | dell.server.sensors.get |
Dell R840: Get array controller resources | Returns array controller resources. |
Script | dell.server.array.resources.get |
Dell R840: Get disks | Returns storage resources. |
Script | dell.server.disks.get |
Dell R840: Get network interfaces | Returns network interfaces. |
Script | dell.server.net.iface.get |
Dell R840: CPU utilization, in % | CPU utilization. |
Dependent item | dell.server.util.cpu Preprocessing
|
Dell R840: Memory utilization, in % | Memory utilization. |
Dependent item | dell.server.util.mem Preprocessing
|
Dell R840: IO utilization, in % | IO utilization. |
Dependent item | dell.server.util.io Preprocessing
|
Dell R840: SYS utilization, in % | SYS utilization. |
Dependent item | dell.server.util.sys Preprocessing
|
Dell R840: Overall system health status | This attribute defines the overall rollup status of all the components in the system monitored by the remote access card. Includes system, storage, IO devices, iDRAC, CPU, memory, etc. |
Dependent item | dell.server.status Preprocessing
|
Dell R840: Hardware model name | This attribute defines the model name of the system. |
Dependent item | dell.server.hw.model Preprocessing
|
Dell R840: Hardware serial number | This attribute defines the service tag of the system. |
Dependent item | dell.server.hw.serialnumber Preprocessing
|
Dell R840: Firmware version | This attribute defines the firmware version of a remote access card. |
Dependent item | dell.server.hw.firmware Preprocessing
|
Dell R840: Redfish API status | Availability of Redfish API on the server. Possible values: 0 - Unavailable 1 - Available |
Simple check | net.tcp.service[https] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: CPU utilization is too high | Current CPU utilization has exceeded |
min(/DELL PowerEdge R840 by HTTP/dell.server.util.cpu,5m)>={$DELL.HTTP.CPU.UTIL.HIGH} |High |
||
Dell R840: CPU utilization is high | Current CPU utilization has exceeded |
min(/DELL PowerEdge R840 by HTTP/dell.server.util.cpu,5m)>={$DELL.HTTP.CPU.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R840: Memory utilization is too high | Current memory utilization has exceeded |
min(/DELL PowerEdge R840 by HTTP/dell.server.util.mem,5m)>={$DELL.HTTP.MEM.UTIL.HIGH} |High |
||
Dell R840: Memory utilization is high | Current memory utilization has exceeded |
min(/DELL PowerEdge R840 by HTTP/dell.server.util.mem,5m)>={$DELL.HTTP.MEM.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R840: IO utilization is too high | Current IO utilization has exceeded |
min(/DELL PowerEdge R840 by HTTP/dell.server.util.io,5m)>={$DELL.HTTP.IO.UTIL.HIGH} |High |
||
Dell R840: IO utilization is high | Current IO utilization has exceeded |
min(/DELL PowerEdge R840 by HTTP/dell.server.util.io,5m)>={$DELL.HTTP.IO.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R840: SYS utilization is too high | Current SYS utilization has exceeded |
min(/DELL PowerEdge R840 by HTTP/dell.server.util.sys,5m)>={$DELL.HTTP.SYS.UTIL.HIGH} |High |
||
Dell R840: SYS utilization is high | Current SYS utilization has exceeded |
min(/DELL PowerEdge R840 by HTTP/dell.server.util.sys,5m)>={$DELL.HTTP.SYS.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R840: Server is in a critical state | Please check the device for faults. |
last(/DELL PowerEdge R840 by HTTP/dell.server.status,)=3 |Average |
||
Dell R840: Server is in a warning state | Please check the device for warnings. |
last(/DELL PowerEdge R840 by HTTP/dell.server.status,)=2 |Warning |
Depends on:
|
|
Dell R840: Device has been replaced | The device serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R840 by HTTP/dell.server.hw.serialnumber,#1)<>last(/DELL PowerEdge R840 by HTTP/dell.server.hw.serialnumber,#2) and length(last(/DELL PowerEdge R840 by HTTP/dell.server.hw.serialnumber))>0 |Info |
Manual close: Yes | |
Dell R840: Firmware has changed | The firmware version has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R840 by HTTP/dell.server.hw.firmware,#1)<>last(/DELL PowerEdge R840 by HTTP/dell.server.hw.firmware,#2) and length(last(/DELL PowerEdge R840 by HTTP/dell.server.hw.firmware))>0 |Info |
Manual close: Yes | |
Dell R840: Redfish API service is unavailable | The service is unavailable or does not accept TCP connections. |
last(/DELL PowerEdge R840 by HTTP/net.tcp.service[https])=0 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Discovery of temperature sensors. |
Dependent item | dell.server.temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Probe [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.temp.get[{#SENSOR_NAME}] Preprocessing
|
Dell R840: Probe [{#SENSOR_NAME}]: Value | Sensor value. |
Dependent item | dell.server.sensor.temp.value[{#SENSOR_NAME}] Preprocessing
|
Dell R840: Probe [{#SENSOR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.temp.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Probe [{#SENSOR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R840 by HTTP/dell.server.sensor.temp.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R840: Probe [{#SENSOR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R840 by HTTP/dell.server.sensor.temp.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | Discovery of PSU sensors. |
Dependent item | dell.server.psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Power supply [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.psu.get[{#SENSOR_NAME}] Preprocessing
|
Dell R840: Power supply [{#SENSOR_NAME}]: Voltage | Sensor value. |
Dependent item | dell.server.sensor.psu.voltage[{#SENSOR_NAME}] Preprocessing
|
Dell R840: Power supply [{#SENSOR_NAME}]: Voltage sensor status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}] Preprocessing
|
Dell R840: Power supply [{#SENSOR_NAME}]: Current | Sensor value. |
Dependent item | dell.server.sensor.psu.current[{#SENSOR_NAME}] Preprocessing
|
Dell R840: Power supply [{#SENSOR_NAME}]: Current sensor status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.psu.current.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Power supply [{#SENSOR_NAME}]: Voltage sensor: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R840 by HTTP/dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R840: Power supply [{#SENSOR_NAME}]: Voltage sensor: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R840 by HTTP/dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R840: Power supply [{#SENSOR_NAME}]: Current sensor: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R840 by HTTP/dell.server.sensor.psu.current.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R840: Power supply [{#SENSOR_NAME}]: Current sensor: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R840 by HTTP/dell.server.sensor.psu.current.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN discovery | Discovery of FAN sensors. |
Dependent item | dell.server.fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Fan [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.fan.get[{#SENSOR_NAME}] Preprocessing
|
Dell R840: Fan [{#SENSOR_NAME}]: Speed | Sensor value. |
Dependent item | dell.server.sensor.fan.speed[{#SENSOR_NAME}] Preprocessing
|
Dell R840: Fan [{#SENSOR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.fan.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Fan [{#SENSOR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R840 by HTTP/dell.server.sensor.fan.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R840: Fan [{#SENSOR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R840 by HTTP/dell.server.sensor.fan.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Discovery of disk array controllers. |
Dependent item | dell.server.array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Controller [{#CNTLR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.array.status[{#ID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Controller [{#CNTLR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R840 by HTTP/dell.server.array.status[{#ID}],)=3 |Average |
||
Dell R840: Controller [{#CNTLR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R840 by HTTP/dell.server.array.status[{#ID}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Battery discovery | Discovery of battery controllers. |
Dependent item | dell.server.controller.battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Battery [{#BATTERY_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.controller.battery.status[{#ID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Battery [{#BATTERY_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R840 by HTTP/dell.server.controller.battery.status[{#ID}],)=3 |Average |
||
Dell R840: Battery [{#BATTERY_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R840 by HTTP/dell.server.controller.battery.status[{#ID}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Discovery of physical disks. |
Dependent item | dell.server.physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Physical disk [{#DISK_NAME}]: Get disk | Returns the metrics of a physical disk. |
Script | dell.server.hw.physicaldisk.get[{#DISK_NAME}] |
Dell R840: Physical disk [{#DISK_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.hw.physicaldisk.status[{#DISK_NAME}] Preprocessing
|
Dell R840: Physical disk [{#DISK_NAME}]: Serial number | The serial number of this drive. |
Dependent item | dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}] Preprocessing
|
Dell R840: Physical disk [{#DISK_NAME}]: Model name | The model number of the drive. |
Dependent item | dell.server.hw.physicaldisk.model[{#DISK_NAME}] Preprocessing
|
Dell R840: Physical disk [{#DISK_NAME}]: Media type | The type of media contained in this drive. Possible values: HDD, SSD, SMR, null. |
Dependent item | dell.server.hw.physicaldisk.mediatype[{#DISKNAME}] Preprocessing
|
Dell R840: Physical disk [{#DISK_NAME}]: Size | The size, in bytes, of this drive. |
Dependent item | dell.server.hw.physicaldisk.size[{#DISK_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Physical disk [{#DISK_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R840 by HTTP/dell.server.hw.physicaldisk.status[{#DISK_NAME}],)=3 |Average |
||
Dell R840: Physical disk [{#DISK_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R840 by HTTP/dell.server.hw.physicaldisk.status[{#DISK_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R840: Physical disk [{#DISK_NAME}]: Has been replaced | [{#DISK_NAME}] serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R840 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}],#1)<>last(/DELL PowerEdge R840 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}],#2) and length(last(/DELL PowerEdge R840 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | Discovery of virtual disks. |
Dependent item | dell.server.virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Virtual disk [{#DISK_NAME}]: Get disk | Returns the metrics of a virtual disk. |
Script | dell.server.hw.virtualdisk.get[{#DISK_NAME}] |
Dell R840: Virtual disk [{#DISK_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.hw.virtualdisk.status[{#DISK_NAME}] Preprocessing
|
Dell R840: Virtual disk [{#DISK_NAME}]: RAID status | This property represents the RAID specific status. Possible values: Blocked, Degraded, Failed, Foreign, Offline, Online, Ready, Unknown, null. |
Dependent item | dell.server.hw.virtualdisk.raidstatus[{#DISKNAME}] Preprocessing
|
Dell R840: Virtual disk [{#DISK_NAME}]: Size | The size in bytes of this Volume. |
Dependent item | dell.server.hw.virtualdisk.size[{#DISK_NAME}] Preprocessing
|
Dell R840: Virtual disk [{#DISK_NAME}]: Current state | The known state of the Resource, for example, Enabled. Possible values: Enabled, Disabled, StandbyOffline, StandbySpare, InTest, Starting, Absent, UnavailableOffline, Deferring, Quiesced, Updating, Qualified. |
Dependent item | dell.server.hw.virtualdisk.state[{#DISK_NAME}] Preprocessing
|
Dell R840: Virtual disk [{#DISK_NAME}]: Read policy | Indicates the read cache policy setting for the Volume. Possible values: ReadAhead, NoReadAhead, AdaptiveReadAhead. |
Dependent item | dell.server.hw.virtualdisk.readpolicy[{#DISKNAME}] Preprocessing
|
Dell R840: Virtual disk [{#DISK_NAME}]: Write policy | Indicates the write cache policy setting for the Volume. Possible values: WriteThrough, WriteBack, ProtectedWriteBack, UnprotectedWriteBack. |
Dependent item | dell.server.hw.virtualdisk.writepolicy[{#DISKNAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Virtual disk [{#DISK_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R840 by HTTP/dell.server.hw.virtualdisk.status[{#DISK_NAME}],)=3 |Average |
||
Dell R840: Virtual disk [{#DISK_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R840 by HTTP/dell.server.hw.virtualdisk.status[{#DISK_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R840: Virtual disk [{#DISK_NAME}]: RAID status not OK | Please check the disk for faults. |
last(/DELL PowerEdge R840 by HTTP/dell.server.hw.virtualdisk.raid_status[{#DISK_NAME}],)<8 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | Discovery of network interfaces. |
Dependent item | dell.server.net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R840: Interface [{#IFNAME}]: Get interface | Returns the metrics of a network interface. |
Script | dell.server.net.if.get[{#IFNAME}] |
Dell R840: Interface [{#IFNAME}]: Speed | The network port current link speed. |
Dependent item | dell.server.net.if.speed[{#IFNAME}] Preprocessing
|
Dell R840: Interface [{#IFNAME}]: Link status | The status of the link between this port and its link partner. Possible values: Down, Up, null. |
Dependent item | dell.server.net.if.status[{#IFNAME}] Preprocessing
|
Dell R840: Interface [{#IFNAME}]: State | The known state of the Resource, for example, Enabled. Possible values: Enabled, Disabled, StandbyOffline, StandbySpare, InTest, Starting, Absent, UnavailableOffline, Deferring, Quiesced, Updating, Qualified. |
Dependent item | dell.server.net.if.state[{#IFNAME}] Preprocessing
|
Dell R840: Interface [{#IFNAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.net.if.health[{#IFNAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R840: Interface [{#IFNAME}]: Link down | This trigger expression works as follows: |
{$DELL.HTTP.IFCONTROL:"{#IFNAME}"}=1 and (last(/DELL PowerEdge R840 by HTTP/dell.server.net.if.status[{#IFNAME}],)=2 and last(/DELL PowerEdge R840 by HTTP/dell.server.net.if.status[{#IFNAME}],#1)<>last(/DELL PowerEdge R840 by HTTP/dell.server.net.if.status[{#IFNAME}],#2)) |Average |
Manual close: Yes | |
Dell R840: Interface [{#IFNAME}]: Link status issue | This trigger expression works as follows: |
{$DELL.HTTP.IFCONTROL:"{#IFNAME}"}=1 and (last(/DELL PowerEdge R840 by HTTP/dell.server.net.if.status[{#IFNAME}],)<2 and last(/DELL PowerEdge R840 by HTTP/dell.server.net.if.status[{#IFNAME}],#1)<>last(/DELL PowerEdge R840 by HTTP/dell.server.net.if.status[{#IFNAME}],#2)) |Average |
Manual close: Yes | |
Dell R840: Interface [{#IFNAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R840 by HTTP/dell.server.net.if.health[{#IFNAME}],)=3 |Average |
||
Dell R840: Interface [{#IFNAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R840 by HTTP/dell.server.net.if.health[{#IFNAME}],)=2 |Warning |
Depends on:
|
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring DELL PowerEdge R820 servers with iDRAC version 7 (and later) via Zabbix SNMP agent that works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Default |
---|---|---|
{$DELL.SNMP.DISCOVERY.VOLTAGE.NAME.MATCHES} | Sets the regex string of voltage probe names to allow in discovery. |
^.*Voltage.*$ |
{$DELL.SNMP.DISCOVERY.VOLTAGE.NAME.NOT_MATCHES} | Sets the regex string of voltage probe names to ignore in discovery. |
CHANGE_IF_NEEDED |
{$DELL.SNMP.DISCOVERY.VOLTAGE.TYPE.MATCHES} | Sets the regex string of voltage probe types to allow in discovery. |
18|16 |
{$DELL.SNMP.DISCOVERY.VOLTAGE.TYPE.NOT_MATCHES} | Sets the regex string of voltage probe types to ignore in discovery. |
CHANGE_IF_NEEDED |
{$DELL.SNMP.SENSOR.TEMP.STATUS.OK} | The OK status of the temperature probe for the trigger expression. |
3 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalUpper"} | The warning status of the temperature probe for the trigger expression. |
4 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalLower"} | The warning status of the temperature probe for the trigger expression. |
7 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalUpper"} | The critical status of the temperature probe for the trigger expression. |
5 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableUpper"} | The critical status of the temperature probe for the trigger expression. |
6 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalLower"} | The critical status of the temperature probe for the trigger expression. |
8 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableLower"} | The critical status of the temperature probe for the trigger expression. |
9 |
{$DELL.SNMP.HEALTH.STATUS.DISASTER} | The disaster status of health for the trigger expression. |
6 |
{$DELL.SNMP.HEALTH.STATUS.CRIT} | The critical status of health for the trigger expression. |
5 |
{$DELL.SNMP.HEALTH.STATUS.WARN} | The warning status of health for the trigger expression. |
4 |
{$DELL.SNMP.PSU.STATUS.WARN:"nonCritical"} | The warning value of the PSU sensor for the trigger expression. |
4 |
{$DELL.SNMP.PSU.STATUS.CRIT:"critical"} | The critical value of the PSU sensor for the trigger expression. |
5 |
{$DELL.SNMP.PSU.STATUS.CRIT:"nonRecoverable"} | The critical value of the PSU sensor for the trigger expression. |
6 |
{$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalUpper"} | The warning value of the FAN sensor for the trigger expression. |
4 |
{$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalLower"} | The warning value of the FAN sensor for the trigger expression. |
7 |
{$DELL.SNMP.FAN.STATUS.CRIT:"criticalUpper"} | The critical value of the FAN sensor for the trigger expression. |
5 |
{$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableUpper"} | The critical value of the FAN sensor for the trigger expression. |
6 |
{$DELL.SNMP.FAN.STATUS.CRIT:"criticalLower"} | The critical value of the FAN sensor for the trigger expression. |
8 |
{$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableLower"} | The critical value of the FAN sensor for the trigger expression. |
9 |
{$DELL.SNMP.FAN.STATUS.CRIT:"failed"} | The critical value of the FAN sensor for the trigger expression. |
10 |
{$DELL.SNMP.DISK.ARRAY.STATUS.FAIL} | The disaster status of the disk array for the trigger expression. |
6 |
{$DELL.SNMP.DISK.ARRAY.STATUS.CRIT} | The critical status of the disk array for the trigger expression. |
5 |
{$DELL.SNMP.DISK.ARRAY.STATUS.WARN} | The warning status of the disk array for the trigger expression. |
4 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT} | The critical status of the disk array cache battery for the trigger expression. |
3 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.WARN} | The warning status of the disk array cache battery for the trigger expression. |
4 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.OK} | The OK status of the disk array cache battery for the trigger expression. |
2 |
{$DELL.SNMP.VDISK.STATUS.CRIT:"failed"} | The critical status of the virtual disk for the trigger expression. |
3 |
{$DELL.SNMP.VDISK.STATUS.WARN:"degraded"} | The warning status of the virtual disk for the trigger expression. |
4 |
{$DELL.SNMP.DISK.STATUS.WARN:"nonCritical"} | The warning status of the disk for the trigger expression. |
4 |
{$DELL.SNMP.DISK.STATUS.FAIL:"critical"} | The critical status of the disk for the trigger expression. |
5 |
{$DELL.SNMP.DISK.STATUS.FAIL:"nonRecoverable"} | The critical status of the disk for the trigger expression. |
6 |
{$DELL.SNMP.DISK.SMART.STATUS.FAIL} | The critical S.M.A.R.T status of the disk for the trigger expression. |
1 |
{$DELL.SNMP.TIMEOUT} | The time interval for the SNMP agent availability trigger expression. |
5m |
{$DELL.SNMP.IFCONTROL} | The link status trigger will be fired only for interfaces that have the context macro equal to "1". |
1 |
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Overall system health status | MIB: IDRAC-MIB-SMIv2 This attribute defines the overall rollup status of all the components in the system monitored by the remote access card. Includes system, storage, IO devices, iDRAC, CPU, memory, etc. |
SNMP agent | dell.server.status[globalSystemStatus] Preprocessing
|
Dell R820: Hardware model name | MIB: IDRAC-MIB-SMIv2 This attribute defines the model name of the system. |
SNMP agent | dell.server.hw.model[systemModelName] Preprocessing
|
Dell R820: Hardware serial number | MIB: IDRAC-MIB-SMIv2 This attribute defines the service tag of the system. |
SNMP agent | dell.server.hw.serialnumber[systemServiceTag] Preprocessing
|
Dell R820: Operating system | MIB: IDRAC-MIB-SMIv2 This attribute defines the name of the operating system that the host is running. |
SNMP agent | dell.server.sw.os[systemOSName] Preprocessing
|
Dell R820: Firmware version | MIB: IDRAC-MIB-SMIv2 This attribute defines the firmware version of a remote access card. |
SNMP agent | dell.server.hw.firmware[racFirmwareVersion] Preprocessing
|
Dell R820: Uptime (network) | MIB: SNMP-FRAMEWORK-MIB The number of seconds since the value of the snmpEngineBoots object last changed. |
SNMP agent | dell.server.net.uptime[snmpEngineTime] |
Dell R820: Uptime (hardware) | MIB: IDRAC-MIB-SMIv2 This attribute defines the power-up time of the system in seconds. |
SNMP agent | dell.server.hw.uptime[systemPowerUpTime] |
Dell R820: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other |
SNMP trap | snmptrap.fallback |
Dell R820: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., 'telephone closet, 3rd floor'). If the location is unknown, the value is a zero-length string. |
SNMP agent | dell.server.location[sysLocation] Preprocessing
|
Dell R820: System contact details | MIB: SNMPv2-MIB Name and contact information of the contact person for the node. If not provided, the value is a zero-length string. |
SNMP agent | dell.server.contact[sysContact] Preprocessing
|
Dell R820: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the entity as part of the vendor's SMI enterprises subtree with the prefix 1.3.6.1.4.1 (e.g., a vendor with the identifier 1.3.6.1.4.1.4242 might assign a system object with the OID 1.3.6.1.4.1.4242.1.1). |
SNMP agent | dell.server.objectid[sysObjectID] Preprocessing
|
Dell R820: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node. By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is a zero-length string. |
SNMP agent | dell.server.name[sysName] Preprocessing
|
Dell R820: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating system, and networking software. |
SNMP agent | dell.server.descr[sysDescr] Preprocessing
|
Dell R820: SNMP agent availability | Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list. Possible values: 0 - not available 1 - available 2 - unknown |
Zabbix internal | zabbix[host,snmp,available] Preprocessing
|
Dell R820: Memory, total size | Total memory amount on the device. |
Calculated | dell.server.memory.size.total Preprocessing
|
Dell R820: BIOS version | MIB: IDRAC-MIB-SMIv2 This attribute defines the version name of the system BIOS. |
SNMP agent | dell.server.bios.version Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: System is in unrecoverable state | Please check the device for faults. |
last(/DELL PowerEdge R820 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.DISASTER} |High |
||
Dell R820: System status is in critical state | Please check the device for errors. |
last(/DELL PowerEdge R820 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.CRIT} |Average |
||
Dell R820: System status is in warning state | Please check the device for warnings. |
last(/DELL PowerEdge R820 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.WARN} |Warning |
Depends on:
|
|
Dell R820: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R820 by SNMP/dell.server.hw.serialnumber[systemServiceTag],#1)<>last(/DELL PowerEdge R820 by SNMP/dell.server.hw.serialnumber[systemServiceTag],#2) and length(last(/DELL PowerEdge R820 by SNMP/dell.server.hw.serialnumber[systemServiceTag]))>0 |Info |
Manual close: Yes | |
Dell R820: Operating system description has changed | Operating system description has changed. Possibly, the system has been updated or replaced. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R820 by SNMP/dell.server.sw.os[systemOSName],#1)<>last(/DELL PowerEdge R820 by SNMP/dell.server.sw.os[systemOSName],#2) and length(last(/DELL PowerEdge R820 by SNMP/dell.server.sw.os[systemOSName]))>0 |Info |
Manual close: Yes | |
Dell R820: Firmware has changed | Firmware version has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R820 by SNMP/dell.server.hw.firmware[racFirmwareVersion],#1)<>last(/DELL PowerEdge R820 by SNMP/dell.server.hw.firmware[racFirmwareVersion],#2) and length(last(/DELL PowerEdge R820 by SNMP/dell.server.hw.firmware[racFirmwareVersion]))>0 |Info |
Manual close: Yes | |
Dell R820: Host has been restarted | Uptime is less than 10 minutes. |
(last(/DELL PowerEdge R820 by SNMP/dell.server.hw.uptime[systemPowerUpTime])>0 and last(/DELL PowerEdge R820 by SNMP/dell.server.hw.uptime[systemPowerUpTime])<10m) or (last(/DELL PowerEdge R820 by SNMP/dell.server.hw.uptime[systemPowerUpTime])=0 and last(/DELL PowerEdge R820 by SNMP/dell.server.net.uptime[snmpEngineTime])<10m) |Warning |
Manual close: Yes | |
Dell R820: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R820 by SNMP/dell.server.name[sysName],#1)<>last(/DELL PowerEdge R820 by SNMP/dell.server.name[sysName],#2) and length(last(/DELL PowerEdge R820 by SNMP/dell.server.name[sysName]))>0 |Info |
Manual close: Yes | |
Dell R820: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/DELL PowerEdge R820 by SNMP/zabbix[host,snmp,available],{$DELL.SNMP.TIMEOUT})=0 |Warning |
||
Dell R820: Memory amount has changed | change(/DELL PowerEdge R820 by SNMP/dell.server.memory.size.total)>0 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Scanning table of Temperature Probe Table IDRAC-MIB-SMIv2::temperatureProbeTable |
SNMP agent | temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Probe [{#SENSOR_LOCALE}]: Value | MIB: IDRAC-MIB-SMIv2 This attribute defines the reading for a temperature probe of type other than When the value for When the value for |
SNMP agent | dell.server.sensor.temp.value[temperatureProbeReading.{#SNMPINDEX}] Preprocessing
|
Dell R820: Probe [{#SENSOR_LOCALE}]: Status | MIB: IDRAC-MIB-SMIv2 This attribute defines the probe status of the temperature probe. Possible values: other(1), -- probe status is not one of the following: unknown(2), -- probe status is unknown (not known or monitored) ok(3), -- probe is reporting a value within the thresholds nonCriticalUpper(4), -- probe has crossed the upper noncritical threshold criticalUpper(5), -- probe has crossed the upper critical threshold nonRecoverableUpper(6), -- probe has crossed the upper non-recoverable threshold nonCriticalLower(7), -- probe has crossed the lower noncritical threshold criticalLower(8), -- probe has crossed the lower critical threshold nonRecoverableLower(9), -- probe has crossed the lower non-recoverable threshold failed(10) -- probe is not functional |
SNMP agent | dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Probe [{#SENSOR_LOCALE}]: Critical status | Please check the device for faults. |
last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalUpper"} or last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableUpper"} or last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalLower"} or last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableLower"} |Average |
||
Dell R820: Probe [{#SENSOR_LOCALE}]: Warning status | Please check the device for faults. |
last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalUpper"} or last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalLower"} |Warning |
Depends on:
|
|
Dell R820: Probe [{#SENSOR_LOCALE}]: Not in optimal status | Please check the device for faults. |
last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])<>{$DELL.SNMP.SENSOR.TEMP.STATUS.OK} |Info |
Manual close: Yes Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | IDRAC-MIB-SMIv2::powerSupplyTable |
SNMP agent | psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Power supply [{#PSU_DESCR}]: State | MIB: IDRAC-MIB-SMIv2 This attribute defines the status of the power supply. |
SNMP agent | dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Power supply [{#PSU_DESCR}]: Critical state | Please check the power supply unit for errors. |
last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.CRIT:"critical"} or last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.CRIT:"nonRecoverable"} |Average |
||
Dell R820: Power supply [{#PSU_DESCR}]: Warning state | Please check the power supply unit for errors. |
last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.WARN:"nonCritical"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Fan discovery | IDRAC-MIB-SMIv2::coolingDeviceTable |
SNMP agent | fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Fan [{#FAN_DESCR}]: Status | MIB: IDRAC-MIB-SMIv2 This attribute defines the probe status of the cooling device. |
SNMP agent | dell.server.sensor.fan.status[{#FAN_DESCR}] Preprocessing
|
Dell R820: Fan [{#FAN_DESCR}]: Speed | MIB: IDRAC-MIB-SMIv2 This attribute defines the reading for a cooling device of a subtype other than When the value for When the value for |
SNMP agent | dell.server.sensor.fan.speed[{#FAN_DESCR}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Fan [{#FAN_DESCR}]: Critical state | Please check the fan unit. |
last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"criticalUpper"} or last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableUpper"} or last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"criticalLower"} or last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableLower"} or last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"failed"} |Average |
||
Dell R820: Fan [{#FAN_DESCR}]: Warning state | Please check the fan unit. |
last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalUpper"} or last(/DELL PowerEdge R820 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalLower"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Scanning table of Array controllers: IDRAC-MIB-SMIv2::controllerTable |
SNMP agent | array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Controller [{#CNTLR_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The status of the controller itself without the propagation of any contained component status. Possible values: 1: Other 2: Unknown 3: OK 4: Non-critical 5: Critical 6: Non-recoverable |
SNMP agent | dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}] Preprocessing
|
Dell R820: Controller [{#CNTLR_NAME}]: Model | MIB: IDRAC-MIB-SMIv2 The controller's name as represented in Storage Management. |
SNMP agent | dell.server.hw.diskarray.model[controllerName.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Controller [{#CNTLR_NAME}]: Unrecoverable state | Please check the device for faults. |
last(/DELL PowerEdge R820 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.FAIL} |High |
||
Dell R820: Controller [{#CNTLR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R820 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.CRIT} |Average |
Depends on:
|
|
Dell R820: Controller [{#CNTLR_NAME}]: Warning state | Please check the device for faults. |
last(/DELL PowerEdge R820 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Battery discovery | Scanning Battery Table: IDRAC-MIB-SMIv2::batteryTable |
SNMP agent | battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Battery [{#BATTERY_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 Current state of battery. Possible values: 1: The current state could not be determined. 2: The battery is operating normally. 3: The battery has failed and needs to be replaced. 4: The battery temperature is high or charge level is depleting. 5: The battery is missing or not detected. 6: The battery is undergoing the re-charge phase. 7: The battery voltage or charge level is below the threshold. |
SNMP agent | dell.server.hw.battery.status[batteryState.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Battery [{#BATTERY_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R820 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT} |Average |
||
Dell R820: Battery [{#BATTERY_NAME}]: Warning state | Please check the device for faults. |
last(/DELL PowerEdge R820 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.WARN} |Warning |
Depends on:
|
|
Dell R820: Battery [{#BATTERY_NAME}]: Not in optimal state | Please check the device for faults. |
last(/DELL PowerEdge R820 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])<>{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.OK} |Info |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Scanning table of physical drive entries IDRAC-MIB-SMIv2::physicalDiskTable. |
SNMP agent | physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Physical disk [{#DISK_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The status of the physical disk itself without the propagation of any contained component status. Possible values: 1: Other 2: Unknown 3: OK 4: Non-critical 5: Critical 6: Non-recoverable |
SNMP agent | dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}] Preprocessing
|
Dell R820: Physical disk [{#DISK_NAME}]: S.M.A.R.T. Status | MIB: IDRAC-MIB-SMIv2 Indicates whether the physical disk has received a predictive failure alert. |
SNMP agent | dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}] Preprocessing
|
Dell R820: Physical disk [{#DISK_NAME}]: Serial number | MIB: IDRAC-MIB-SMIv2 The physical disk's unique identification number from the manufacturer. |
SNMP agent | dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}] Preprocessing
|
Dell R820: Physical disk [{#DISK_NAME}]: Model name | MIB: IDRAC-MIB-SMIv2 The model number of the physical disk. |
SNMP agent | dell.server.hw.physicaldisk.model[physicalDiskProductID.{#SNMPINDEX}] Preprocessing
|
Dell R820: Physical disk [{#DISK_NAME}]: Media type | MIB: IDRAC-MIB-SMIv2 The media type of the physical disk. Possible Values: 1: The media type could not be determined. 2: Hard Disk Drive (HDD). 3: Solid State Drive (SSD). |
SNMP agent | dell.server.hw.physicaldisk.media_type[physicalDiskMediaType.{#SNMPINDEX}] Preprocessing
|
Dell R820: Physical disk [{#DISK_NAME}]: Size | MIB: IDRAC-MIB-SMIv2 The size of the physical disk in megabytes. |
SNMP agent | dell.server.hw.physicaldisk.size[physicalDiskCapacityInMB.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Physical disk [{#DISK_NAME}]: Failed state | Please check physical disk for warnings or errors. |
last(/DELL PowerEdge R820 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.FAIL:"critical"} or last(/DELL PowerEdge R820 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.FAIL:"nonRecoverable"} |High |
||
Dell R820: Physical disk [{#DISK_NAME}]: Warning state | Please check physical disk for warnings or errors. |
last(/DELL PowerEdge R820 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.WARN:"nonCritical"} |Warning |
Depends on:
|
|
Dell R820: Physical disk [{#DISK_NAME}]: S.M.A.R.T. failed | Disk probably requires replacement. |
last(/DELL PowerEdge R820 by SNMP/dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}])={$DELL.SNMP.DISK.SMART.STATUS.FAIL:"replaceDrive"} or last(/DELL PowerEdge R820 by SNMP/dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}])={$DELL.SNMP.DISK.SMART.STATUS.FAIL:"replaceDriveSSDWearOut"} |High |
Depends on:
|
|
Dell R820: Physical disk [{#DISK_NAME}]: Has been replaced | [{#DISK_NAME}] serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R820 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}],#1)<>last(/DELL PowerEdge R820 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}],#2) and length(last(/DELL PowerEdge R820 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | IDRAC-MIB-SMIv2::virtualDiskTable |
SNMP agent | virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Virtual disk [{#DISK_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The current state of this virtual disk (which includes any member physical disks.) Possible states: 1: The current state could not be determined. 2: The virtual disk is operating normally or optimally. 3: The virtual disk has encountered a failure. Data on the disk is lost or is about to be lost. 4: The virtual disk encountered a failure with one or all of the constituent redundant physical disks. The data on the virtual disk might no longer be fault tolerant. |
SNMP agent | dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}] Preprocessing
|
Dell R820: Virtual disk [{#DISK_NAME}]: Layout type | MIB: IDRAC-MIB-SMIv2 The virtual disk's RAID type. Possible values: 1: Not one of the following 2: RAID-0 3: RAID-1 4: RAID-5 5: RAID-6 6: RAID-10 7: RAID-50 8: RAID-60 9: Concatenated RAID 1 10: Concatenated RAID 5 |
SNMP agent | dell.server.hw.virtualdisk.layout[virtualDiskLayout.{#SNMPINDEX}] Preprocessing
|
Dell R820: Virtual disk [{#DISK_NAME}]: Size | MIB: IDRAC-MIB-SMIv2 The size of the virtual disk in megabytes. |
SNMP agent | dell.server.hw.virtualdisk.size[virtualDiskSizeInMB.{#SNMPINDEX}] Preprocessing
|
Dell R820: Virtual disk [{#DISK_NAME}]: Operational state | MIB: IDRAC-MIB-SMIv2 The state of the virtual disk when there are progressive operations ongoing. Possible values: 1: There is no active operation running. 2: The virtual disk configuration has changed. The physical disks included in the virtual disk are being modified to support the new configuration. 3: A Consistency Check (CC) is being performed on the virtual disk. 4: The virtual disk is being initialized. 5: BackGround Initialization (BGI) is being performed on the virtual disk. |
SNMP agent | dell.server.hw.virtualdisk.state[virtualDiskOperationalState.{#SNMPINDEX}] |
Dell R820: Virtual disk [{#DISK_NAME}]: Read policy | MIB: IDRAC-MIB-SMIv2 The read policy used by the controller for read operations on this virtual disk. Possible values: 1: No Read Ahead. 2: Read Ahead. 3: Adaptive Read Ahead. |
SNMP agent | dell.server.hw.virtualdisk.read_policy[virtualDiskReadPolicy.{#SNMPINDEX}] |
Dell R820: Virtual disk [{#DISK_NAME}]: Write policy | MIB: IDRAC-MIB-SMIv2 The write policy used by the controller for write operations on this virtual disk. Possible values: 1: Write Through. 2: Write Back. 3: Force Write Back. |
SNMP agent | dell.server.hw.virtualdisk.write_policy[virtualDiskWritePolicy.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Virtual disk [{#DISK_NAME}]: Failed state | Please check the virtual disk for warnings or errors. |
last(/DELL PowerEdge R820 by SNMP/dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}])={$DELL.SNMP.VDISK.STATUS.CRIT:"failed"} |High |
||
Dell R820: Virtual disk [{#DISK_NAME}]: Warning state | Please check the virtual disk for warnings or errors. |
last(/DELL PowerEdge R820 by SNMP/dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}])={$DELL.SNMP.VDISK.STATUS.WARN:"degraded"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | Discovery of network interfaces. |
SNMP agent | net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: NIC [{#NICFQDD}/{#NICMAC}]: Link status | This attribute defines the connection status of the network device. |
SNMP agent | dell.server.net.if.link[{#NIC_FQDD}] Preprocessing
|
Dell R820: NIC [{#NICFQDD}/{#NICMAC}]: Status | This attribute defines the status of the network device. |
SNMP agent | dell.server.net.if.status[{#NIC_FQDD}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: NIC [{#NICFQDD}/{#NICMAC}]: Link down | This trigger expression works as follows: |
{$DELL.SNMP.IFCONTROL:"{#NIC_FQDD}"}=1 and last(/DELL PowerEdge R820 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#1)<>1 and last(/DELL PowerEdge R820 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#1)<>last(/DELL PowerEdge R820 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#2) |Average |
Manual close: Yes | |
Dell R820: NIC [{#NICFQDD}/{#NICMAC}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R820 by SNMP/dell.server.net.if.status[{#NIC_FQDD}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
CPU status discovery | CPU status discovery. |
SNMP agent | cpu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: CPU [{#CPU_FQDD}]: Status | This attribute defines the status of the processor device status probe. This status will be joined into the |
SNMP agent | dell.server.cpu.status[cpu.{#SNMPINDEX}] Preprocessing
|
Dell R820: CPU [{#CPU_FQDD}]: State | This attribute defines the reading of the processor device status probe. |
SNMP agent | dell.server.cpu.state[cpu.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: CPU [{#CPU_FQDD}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R820 by SNMP/dell.server.cpu.status[cpu.{#SNMPINDEX}],#1)<>3 |Average |
||
Dell R820: CPU [{#CPU_FQDD}]: Reading error | MIB: IDRAC-MIB-SMIv2 |
bitand(last(/DELL PowerEdge R820 by SNMP/dell.server.cpu.state[cpu.{#SNMPINDEX}],#1),128)=0 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
System battery discovery | System battery discovery. |
SNMP agent | system.battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: System battery [{#SNMPVALUE}]: Status | This attribute defines the status of the battery. |
SNMP agent | dell.server.system.battery[{#SNMPVALUE}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: System battery [{#SNMPVALUE}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R820 by SNMP/dell.server.system.battery[{#SNMPVALUE}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Memory discovery | Memory discovery. |
SNMP agent | memory.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Memory [{#SNMPVALUE}]: Status | This attribute defines the status of the memory device. |
SNMP agent | dell.server.memory.status[{#SNMPVALUE}] Preprocessing
|
Dell R820: Memory [{#SNMPVALUE}]: Size | This attribute defines the size, in KB, of the memory device. Zero indicates no memory installed; 2,147,483,647 indicates an unknown memory size. |
SNMP agent | dell.server.memory.size[{#SNMPVALUE}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Memory [{#SNMPVALUE}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R820 by SNMP/dell.server.memory.status[{#SNMPVALUE}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Voltage probe discovery | Voltage probe discovery. |
SNMP agent | voltage.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Voltage probe [{#VPROBE_NAME}]: Voltage | This attribute defines the reading for a voltage probe. |
SNMP agent | dell.server.voltage.value[{#VPROBE_NAME}] Preprocessing
|
Dell R820: Voltage probe [{#VPROBE_NAME}]: Status | This attribute defines the status of the voltage probe. |
SNMP agent | dell.server.voltage.status[{#VPROBE_NAME}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Voltage probe [{#VPROBE_NAME}]: Status is not OK | Please check the device's voltage. |
last(/DELL PowerEdge R820 by SNMP/dell.server.voltage.status[{#VPROBE_NAME}])<>3 |Average |
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring DELL PowerEdge R820 servers with iDRAC 8/9 firmware 4.32 (and later) with Redfish API enabled via Zabbix script items. This template works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
1. Enable Redfish API in the Dell iDRAC interface of your server.
2. Create a user for monitoring with read-only permissions in the Dell iDRAC interface.
3. Create a host for Dell server with iDRAC IP as the Zabbix agent interface.
4. Link the template to the host.
5. Customize the values of the {$DELL.HTTP.API.URL}
, {$DELL.HTTP.API.USER}
, and {$DELL.HTTP.API.PASSWORD}
macros.
NOTE! If you are experiencing timeouts on some of the items that are executing requests, adjust the
{$DELL.HTTP.REQUEST.TIMEOUT}
macro accordingly.
Name | Description | Default |
---|---|---|
{$DELL.HTTP.API.URL} | The Dell iDRAC Redfish API URL in the format |
<Put your URL here> |
{$DELL.HTTP.API.USER} | The Dell iDRAC username. |
<Put your username here> |
{$DELL.HTTP.API.PASSWORD} | The Dell iDRAC user password. |
<Put your password here> |
{$DELL.HTTP.PROXY} | Set an HTTP proxy for Redfish API requests if needed. |
|
{$DELL.HTTP.RETURN.CODE.OK} | Set the HTTP return code that represents an OK response from the API. The default is "200", but can vary, for example, if a proxy is used. |
200 |
{$DELL.HTTP.REQUEST.TIMEOUT} | Set the timeout for HTTP requests. |
10s |
{$DELL.HTTP.IFCONTROL} | Link status trigger will be fired only for interfaces that have the context macro equal to "1". |
1 |
{$DELL.HTTP.CPU.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about CPU utilization. |
90 |
{$DELL.HTTP.CPU.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about CPU utilization. |
75 |
{$DELL.HTTP.MEM.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about memory utilization. |
90 |
{$DELL.HTTP.MEM.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about memory utilization. |
75 |
{$DELL.HTTP.IO.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about IO utilization. |
90 |
{$DELL.HTTP.IO.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about IO utilization. |
75 |
{$DELL.HTTP.SYS.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about SYS utilization. |
90 |
{$DELL.HTTP.SYS.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about SYS utilization. |
75 |
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Get system | Returns system metrics. |
Script | dell.server.system.get |
Dell R820: Get sensors | Returns sensors. |
Script | dell.server.sensors.get |
Dell R820: Get array controller resources | Returns array controller resources. |
Script | dell.server.array.resources.get |
Dell R820: Get disks | Returns storage resources. |
Script | dell.server.disks.get |
Dell R820: Get network interfaces | Returns network interfaces. |
Script | dell.server.net.iface.get |
Dell R820: CPU utilization, in % | CPU utilization. |
Dependent item | dell.server.util.cpu Preprocessing
|
Dell R820: Memory utilization, in % | Memory utilization. |
Dependent item | dell.server.util.mem Preprocessing
|
Dell R820: IO utilization, in % | IO utilization. |
Dependent item | dell.server.util.io Preprocessing
|
Dell R820: SYS utilization, in % | SYS utilization. |
Dependent item | dell.server.util.sys Preprocessing
|
Dell R820: Overall system health status | This attribute defines the overall rollup status of all the components in the system monitored by the remote access card. Includes system, storage, IO devices, iDRAC, CPU, memory, etc. |
Dependent item | dell.server.status Preprocessing
|
Dell R820: Hardware model name | This attribute defines the model name of the system. |
Dependent item | dell.server.hw.model Preprocessing
|
Dell R820: Hardware serial number | This attribute defines the service tag of the system. |
Dependent item | dell.server.hw.serialnumber Preprocessing
|
Dell R820: Firmware version | This attribute defines the firmware version of a remote access card. |
Dependent item | dell.server.hw.firmware Preprocessing
|
Dell R820: Redfish API status | Availability of Redfish API on the server. Possible values: 0 - Unavailable 1 - Available |
Simple check | net.tcp.service[https] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: CPU utilization is too high | Current CPU utilization has exceeded |
min(/DELL PowerEdge R820 by HTTP/dell.server.util.cpu,5m)>={$DELL.HTTP.CPU.UTIL.HIGH} |High |
||
Dell R820: CPU utilization is high | Current CPU utilization has exceeded |
min(/DELL PowerEdge R820 by HTTP/dell.server.util.cpu,5m)>={$DELL.HTTP.CPU.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R820: Memory utilization is too high | Current memory utilization has exceeded |
min(/DELL PowerEdge R820 by HTTP/dell.server.util.mem,5m)>={$DELL.HTTP.MEM.UTIL.HIGH} |High |
||
Dell R820: Memory utilization is high | Current memory utilization has exceeded |
min(/DELL PowerEdge R820 by HTTP/dell.server.util.mem,5m)>={$DELL.HTTP.MEM.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R820: IO utilization is too high | Current IO utilization has exceeded |
min(/DELL PowerEdge R820 by HTTP/dell.server.util.io,5m)>={$DELL.HTTP.IO.UTIL.HIGH} |High |
||
Dell R820: IO utilization is high | Current IO utilization has exceeded |
min(/DELL PowerEdge R820 by HTTP/dell.server.util.io,5m)>={$DELL.HTTP.IO.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R820: SYS utilization is too high | Current SYS utilization has exceeded |
min(/DELL PowerEdge R820 by HTTP/dell.server.util.sys,5m)>={$DELL.HTTP.SYS.UTIL.HIGH} |High |
||
Dell R820: SYS utilization is high | Current SYS utilization has exceeded |
min(/DELL PowerEdge R820 by HTTP/dell.server.util.sys,5m)>={$DELL.HTTP.SYS.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R820: Server is in a critical state | Please check the device for faults. |
last(/DELL PowerEdge R820 by HTTP/dell.server.status,)=3 |Average |
||
Dell R820: Server is in a warning state | Please check the device for warnings. |
last(/DELL PowerEdge R820 by HTTP/dell.server.status,)=2 |Warning |
Depends on:
|
|
Dell R820: Device has been replaced | The device serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R820 by HTTP/dell.server.hw.serialnumber,#1)<>last(/DELL PowerEdge R820 by HTTP/dell.server.hw.serialnumber,#2) and length(last(/DELL PowerEdge R820 by HTTP/dell.server.hw.serialnumber))>0 |Info |
Manual close: Yes | |
Dell R820: Firmware has changed | The firmware version has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R820 by HTTP/dell.server.hw.firmware,#1)<>last(/DELL PowerEdge R820 by HTTP/dell.server.hw.firmware,#2) and length(last(/DELL PowerEdge R820 by HTTP/dell.server.hw.firmware))>0 |Info |
Manual close: Yes | |
Dell R820: Redfish API service is unavailable | The service is unavailable or does not accept TCP connections. |
last(/DELL PowerEdge R820 by HTTP/net.tcp.service[https])=0 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Discovery of temperature sensors. |
Dependent item | dell.server.temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Probe [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.temp.get[{#SENSOR_NAME}] Preprocessing
|
Dell R820: Probe [{#SENSOR_NAME}]: Value | Sensor value. |
Dependent item | dell.server.sensor.temp.value[{#SENSOR_NAME}] Preprocessing
|
Dell R820: Probe [{#SENSOR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.temp.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Probe [{#SENSOR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R820 by HTTP/dell.server.sensor.temp.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R820: Probe [{#SENSOR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R820 by HTTP/dell.server.sensor.temp.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | Discovery of PSU sensors. |
Dependent item | dell.server.psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Power supply [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.psu.get[{#SENSOR_NAME}] Preprocessing
|
Dell R820: Power supply [{#SENSOR_NAME}]: Voltage | Sensor value. |
Dependent item | dell.server.sensor.psu.voltage[{#SENSOR_NAME}] Preprocessing
|
Dell R820: Power supply [{#SENSOR_NAME}]: Voltage sensor status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}] Preprocessing
|
Dell R820: Power supply [{#SENSOR_NAME}]: Current | Sensor value. |
Dependent item | dell.server.sensor.psu.current[{#SENSOR_NAME}] Preprocessing
|
Dell R820: Power supply [{#SENSOR_NAME}]: Current sensor status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.psu.current.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Power supply [{#SENSOR_NAME}]: Voltage sensor: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R820 by HTTP/dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R820: Power supply [{#SENSOR_NAME}]: Voltage sensor: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R820 by HTTP/dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R820: Power supply [{#SENSOR_NAME}]: Current sensor: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R820 by HTTP/dell.server.sensor.psu.current.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R820: Power supply [{#SENSOR_NAME}]: Current sensor: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R820 by HTTP/dell.server.sensor.psu.current.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN discovery | Discovery of FAN sensors. |
Dependent item | dell.server.fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Fan [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.fan.get[{#SENSOR_NAME}] Preprocessing
|
Dell R820: Fan [{#SENSOR_NAME}]: Speed | Sensor value. |
Dependent item | dell.server.sensor.fan.speed[{#SENSOR_NAME}] Preprocessing
|
Dell R820: Fan [{#SENSOR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.fan.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Fan [{#SENSOR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R820 by HTTP/dell.server.sensor.fan.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R820: Fan [{#SENSOR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R820 by HTTP/dell.server.sensor.fan.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Discovery of disk array controllers. |
Dependent item | dell.server.array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Controller [{#CNTLR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.array.status[{#ID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Controller [{#CNTLR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R820 by HTTP/dell.server.array.status[{#ID}],)=3 |Average |
||
Dell R820: Controller [{#CNTLR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R820 by HTTP/dell.server.array.status[{#ID}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Battery discovery | Discovery of battery controllers. |
Dependent item | dell.server.controller.battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Battery [{#BATTERY_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.controller.battery.status[{#ID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Battery [{#BATTERY_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R820 by HTTP/dell.server.controller.battery.status[{#ID}],)=3 |Average |
||
Dell R820: Battery [{#BATTERY_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R820 by HTTP/dell.server.controller.battery.status[{#ID}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Discovery of physical disks. |
Dependent item | dell.server.physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Physical disk [{#DISK_NAME}]: Get disk | Returns the metrics of a physical disk. |
Script | dell.server.hw.physicaldisk.get[{#DISK_NAME}] |
Dell R820: Physical disk [{#DISK_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.hw.physicaldisk.status[{#DISK_NAME}] Preprocessing
|
Dell R820: Physical disk [{#DISK_NAME}]: Serial number | The serial number of this drive. |
Dependent item | dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}] Preprocessing
|
Dell R820: Physical disk [{#DISK_NAME}]: Model name | The model number of the drive. |
Dependent item | dell.server.hw.physicaldisk.model[{#DISK_NAME}] Preprocessing
|
Dell R820: Physical disk [{#DISK_NAME}]: Media type | The type of media contained in this drive. Possible values: HDD, SSD, SMR, null. |
Dependent item | dell.server.hw.physicaldisk.mediatype[{#DISKNAME}] Preprocessing
|
Dell R820: Physical disk [{#DISK_NAME}]: Size | The size, in bytes, of this drive. |
Dependent item | dell.server.hw.physicaldisk.size[{#DISK_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Physical disk [{#DISK_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R820 by HTTP/dell.server.hw.physicaldisk.status[{#DISK_NAME}],)=3 |Average |
||
Dell R820: Physical disk [{#DISK_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R820 by HTTP/dell.server.hw.physicaldisk.status[{#DISK_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R820: Physical disk [{#DISK_NAME}]: Has been replaced | [{#DISK_NAME}] serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R820 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}],#1)<>last(/DELL PowerEdge R820 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}],#2) and length(last(/DELL PowerEdge R820 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | Discovery of virtual disks. |
Dependent item | dell.server.virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Virtual disk [{#DISK_NAME}]: Get disk | Returns the metrics of a virtual disk. |
Script | dell.server.hw.virtualdisk.get[{#DISK_NAME}] |
Dell R820: Virtual disk [{#DISK_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.hw.virtualdisk.status[{#DISK_NAME}] Preprocessing
|
Dell R820: Virtual disk [{#DISK_NAME}]: RAID status | This property represents the RAID specific status. Possible values: Blocked, Degraded, Failed, Foreign, Offline, Online, Ready, Unknown, null. |
Dependent item | dell.server.hw.virtualdisk.raidstatus[{#DISKNAME}] Preprocessing
|
Dell R820: Virtual disk [{#DISK_NAME}]: Size | The size in bytes of this Volume. |
Dependent item | dell.server.hw.virtualdisk.size[{#DISK_NAME}] Preprocessing
|
Dell R820: Virtual disk [{#DISK_NAME}]: Current state | The known state of the Resource, for example, Enabled. Possible values: Enabled, Disabled, StandbyOffline, StandbySpare, InTest, Starting, Absent, UnavailableOffline, Deferring, Quiesced, Updating, Qualified. |
Dependent item | dell.server.hw.virtualdisk.state[{#DISK_NAME}] Preprocessing
|
Dell R820: Virtual disk [{#DISK_NAME}]: Read policy | Indicates the read cache policy setting for the Volume. Possible values: ReadAhead, NoReadAhead, AdaptiveReadAhead. |
Dependent item | dell.server.hw.virtualdisk.readpolicy[{#DISKNAME}] Preprocessing
|
Dell R820: Virtual disk [{#DISK_NAME}]: Write policy | Indicates the write cache policy setting for the Volume. Possible values: WriteThrough, WriteBack, ProtectedWriteBack, UnprotectedWriteBack. |
Dependent item | dell.server.hw.virtualdisk.writepolicy[{#DISKNAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Virtual disk [{#DISK_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R820 by HTTP/dell.server.hw.virtualdisk.status[{#DISK_NAME}],)=3 |Average |
||
Dell R820: Virtual disk [{#DISK_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R820 by HTTP/dell.server.hw.virtualdisk.status[{#DISK_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R820: Virtual disk [{#DISK_NAME}]: RAID status not OK | Please check the disk for faults. |
last(/DELL PowerEdge R820 by HTTP/dell.server.hw.virtualdisk.raid_status[{#DISK_NAME}],)<8 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | Discovery of network interfaces. |
Dependent item | dell.server.net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R820: Interface [{#IFNAME}]: Get interface | Returns the metrics of a network interface. |
Script | dell.server.net.if.get[{#IFNAME}] |
Dell R820: Interface [{#IFNAME}]: Speed | The network port current link speed. |
Dependent item | dell.server.net.if.speed[{#IFNAME}] Preprocessing
|
Dell R820: Interface [{#IFNAME}]: Link status | The status of the link between this port and its link partner. Possible values: Down, Up, null. |
Dependent item | dell.server.net.if.status[{#IFNAME}] Preprocessing
|
Dell R820: Interface [{#IFNAME}]: State | The known state of the Resource, for example, Enabled. Possible values: Enabled, Disabled, StandbyOffline, StandbySpare, InTest, Starting, Absent, UnavailableOffline, Deferring, Quiesced, Updating, Qualified. |
Dependent item | dell.server.net.if.state[{#IFNAME}] Preprocessing
|
Dell R820: Interface [{#IFNAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.net.if.health[{#IFNAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R820: Interface [{#IFNAME}]: Link down | This trigger expression works as follows: |
{$DELL.HTTP.IFCONTROL:"{#IFNAME}"}=1 and (last(/DELL PowerEdge R820 by HTTP/dell.server.net.if.status[{#IFNAME}],)=2 and last(/DELL PowerEdge R820 by HTTP/dell.server.net.if.status[{#IFNAME}],#1)<>last(/DELL PowerEdge R820 by HTTP/dell.server.net.if.status[{#IFNAME}],#2)) |Average |
Manual close: Yes | |
Dell R820: Interface [{#IFNAME}]: Link status issue | This trigger expression works as follows: |
{$DELL.HTTP.IFCONTROL:"{#IFNAME}"}=1 and (last(/DELL PowerEdge R820 by HTTP/dell.server.net.if.status[{#IFNAME}],)<2 and last(/DELL PowerEdge R820 by HTTP/dell.server.net.if.status[{#IFNAME}],#1)<>last(/DELL PowerEdge R820 by HTTP/dell.server.net.if.status[{#IFNAME}],#2)) |Average |
Manual close: Yes | |
Dell R820: Interface [{#IFNAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R820 by HTTP/dell.server.net.if.health[{#IFNAME}],)=3 |Average |
||
Dell R820: Interface [{#IFNAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R820 by HTTP/dell.server.net.if.health[{#IFNAME}],)=2 |Warning |
Depends on:
|
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring DELL PowerEdge R750 servers with iDRAC version 7 (and later) via Zabbix SNMP agent that works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Default |
---|---|---|
{$DELL.SNMP.DISCOVERY.VOLTAGE.NAME.MATCHES} | Sets the regex string of voltage probe names to allow in discovery. |
^.*Voltage.*$ |
{$DELL.SNMP.DISCOVERY.VOLTAGE.NAME.NOT_MATCHES} | Sets the regex string of voltage probe names to ignore in discovery. |
CHANGE_IF_NEEDED |
{$DELL.SNMP.DISCOVERY.VOLTAGE.TYPE.MATCHES} | Sets the regex string of voltage probe types to allow in discovery. |
18|16 |
{$DELL.SNMP.DISCOVERY.VOLTAGE.TYPE.NOT_MATCHES} | Sets the regex string of voltage probe types to ignore in discovery. |
CHANGE_IF_NEEDED |
{$DELL.SNMP.SENSOR.TEMP.STATUS.OK} | The OK status of the temperature probe for the trigger expression. |
3 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalUpper"} | The warning status of the temperature probe for the trigger expression. |
4 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalLower"} | The warning status of the temperature probe for the trigger expression. |
7 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalUpper"} | The critical status of the temperature probe for the trigger expression. |
5 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableUpper"} | The critical status of the temperature probe for the trigger expression. |
6 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalLower"} | The critical status of the temperature probe for the trigger expression. |
8 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableLower"} | The critical status of the temperature probe for the trigger expression. |
9 |
{$DELL.SNMP.HEALTH.STATUS.DISASTER} | The disaster status of health for the trigger expression. |
6 |
{$DELL.SNMP.HEALTH.STATUS.CRIT} | The critical status of health for the trigger expression. |
5 |
{$DELL.SNMP.HEALTH.STATUS.WARN} | The warning status of health for the trigger expression. |
4 |
{$DELL.SNMP.PSU.STATUS.WARN:"nonCritical"} | The warning value of the PSU sensor for the trigger expression. |
4 |
{$DELL.SNMP.PSU.STATUS.CRIT:"critical"} | The critical value of the PSU sensor for the trigger expression. |
5 |
{$DELL.SNMP.PSU.STATUS.CRIT:"nonRecoverable"} | The critical value of the PSU sensor for the trigger expression. |
6 |
{$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalUpper"} | The warning value of the FAN sensor for the trigger expression. |
4 |
{$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalLower"} | The warning value of the FAN sensor for the trigger expression. |
7 |
{$DELL.SNMP.FAN.STATUS.CRIT:"criticalUpper"} | The critical value of the FAN sensor for the trigger expression. |
5 |
{$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableUpper"} | The critical value of the FAN sensor for the trigger expression. |
6 |
{$DELL.SNMP.FAN.STATUS.CRIT:"criticalLower"} | The critical value of the FAN sensor for the trigger expression. |
8 |
{$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableLower"} | The critical value of the FAN sensor for the trigger expression. |
9 |
{$DELL.SNMP.FAN.STATUS.CRIT:"failed"} | The critical value of the FAN sensor for the trigger expression. |
10 |
{$DELL.SNMP.DISK.ARRAY.STATUS.FAIL} | The disaster status of the disk array for the trigger expression. |
6 |
{$DELL.SNMP.DISK.ARRAY.STATUS.CRIT} | The critical status of the disk array for the trigger expression. |
5 |
{$DELL.SNMP.DISK.ARRAY.STATUS.WARN} | The warning status of the disk array for the trigger expression. |
4 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT} | The critical status of the disk array cache battery for the trigger expression. |
3 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.WARN} | The warning status of the disk array cache battery for the trigger expression. |
4 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.OK} | The OK status of the disk array cache battery for the trigger expression. |
2 |
{$DELL.SNMP.VDISK.STATUS.CRIT:"failed"} | The critical status of the virtual disk for the trigger expression. |
3 |
{$DELL.SNMP.VDISK.STATUS.WARN:"degraded"} | The warning status of the virtual disk for the trigger expression. |
4 |
{$DELL.SNMP.DISK.STATUS.WARN:"nonCritical"} | The warning status of the disk for the trigger expression. |
4 |
{$DELL.SNMP.DISK.STATUS.FAIL:"critical"} | The critical status of the disk for the trigger expression. |
5 |
{$DELL.SNMP.DISK.STATUS.FAIL:"nonRecoverable"} | The critical status of the disk for the trigger expression. |
6 |
{$DELL.SNMP.DISK.SMART.STATUS.FAIL} | The critical S.M.A.R.T status of the disk for the trigger expression. |
1 |
{$DELL.SNMP.TIMEOUT} | The time interval for the SNMP agent availability trigger expression. |
5m |
{$DELL.SNMP.IFCONTROL} | The link status trigger will be fired only for interfaces that have the context macro equal to "1". |
1 |
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Overall system health status | MIB: IDRAC-MIB-SMIv2 This attribute defines the overall rollup status of all the components in the system monitored by the remote access card. Includes system, storage, IO devices, iDRAC, CPU, memory, etc. |
SNMP agent | dell.server.status[globalSystemStatus] Preprocessing
|
Dell R750: Hardware model name | MIB: IDRAC-MIB-SMIv2 This attribute defines the model name of the system. |
SNMP agent | dell.server.hw.model[systemModelName] Preprocessing
|
Dell R750: Hardware serial number | MIB: IDRAC-MIB-SMIv2 This attribute defines the service tag of the system. |
SNMP agent | dell.server.hw.serialnumber[systemServiceTag] Preprocessing
|
Dell R750: Operating system | MIB: IDRAC-MIB-SMIv2 This attribute defines the name of the operating system that the host is running. |
SNMP agent | dell.server.sw.os[systemOSName] Preprocessing
|
Dell R750: Firmware version | MIB: IDRAC-MIB-SMIv2 This attribute defines the firmware version of a remote access card. |
SNMP agent | dell.server.hw.firmware[racFirmwareVersion] Preprocessing
|
Dell R750: Uptime (network) | MIB: SNMP-FRAMEWORK-MIB The number of seconds since the value of the snmpEngineBoots object last changed. |
SNMP agent | dell.server.net.uptime[snmpEngineTime] |
Dell R750: Uptime (hardware) | MIB: IDRAC-MIB-SMIv2 This attribute defines the power-up time of the system in seconds. |
SNMP agent | dell.server.hw.uptime[systemPowerUpTime] |
Dell R750: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other |
SNMP trap | snmptrap.fallback |
Dell R750: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., 'telephone closet, 3rd floor'). If the location is unknown, the value is a zero-length string. |
SNMP agent | dell.server.location[sysLocation] Preprocessing
|
Dell R750: System contact details | MIB: SNMPv2-MIB Name and contact information of the contact person for the node. If not provided, the value is a zero-length string. |
SNMP agent | dell.server.contact[sysContact] Preprocessing
|
Dell R750: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the entity as part of the vendor's SMI enterprises subtree with the prefix 1.3.6.1.4.1 (e.g., a vendor with the identifier 1.3.6.1.4.1.4242 might assign a system object with the OID 1.3.6.1.4.1.4242.1.1). |
SNMP agent | dell.server.objectid[sysObjectID] Preprocessing
|
Dell R750: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node. By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is a zero-length string. |
SNMP agent | dell.server.name[sysName] Preprocessing
|
Dell R750: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating system, and networking software. |
SNMP agent | dell.server.descr[sysDescr] Preprocessing
|
Dell R750: SNMP agent availability | Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list. Possible values: 0 - not available 1 - available 2 - unknown |
Zabbix internal | zabbix[host,snmp,available] Preprocessing
|
Dell R750: Memory, total size | Total memory amount on the device. |
Calculated | dell.server.memory.size.total Preprocessing
|
Dell R750: BIOS version | MIB: IDRAC-MIB-SMIv2 This attribute defines the version name of the system BIOS. |
SNMP agent | dell.server.bios.version Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: System is in unrecoverable state | Please check the device for faults. |
last(/DELL PowerEdge R750 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.DISASTER} |High |
||
Dell R750: System status is in critical state | Please check the device for errors. |
last(/DELL PowerEdge R750 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.CRIT} |Average |
||
Dell R750: System status is in warning state | Please check the device for warnings. |
last(/DELL PowerEdge R750 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.WARN} |Warning |
Depends on:
|
|
Dell R750: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R750 by SNMP/dell.server.hw.serialnumber[systemServiceTag],#1)<>last(/DELL PowerEdge R750 by SNMP/dell.server.hw.serialnumber[systemServiceTag],#2) and length(last(/DELL PowerEdge R750 by SNMP/dell.server.hw.serialnumber[systemServiceTag]))>0 |Info |
Manual close: Yes | |
Dell R750: Operating system description has changed | Operating system description has changed. Possibly, the system has been updated or replaced. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R750 by SNMP/dell.server.sw.os[systemOSName],#1)<>last(/DELL PowerEdge R750 by SNMP/dell.server.sw.os[systemOSName],#2) and length(last(/DELL PowerEdge R750 by SNMP/dell.server.sw.os[systemOSName]))>0 |Info |
Manual close: Yes | |
Dell R750: Firmware has changed | Firmware version has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R750 by SNMP/dell.server.hw.firmware[racFirmwareVersion],#1)<>last(/DELL PowerEdge R750 by SNMP/dell.server.hw.firmware[racFirmwareVersion],#2) and length(last(/DELL PowerEdge R750 by SNMP/dell.server.hw.firmware[racFirmwareVersion]))>0 |Info |
Manual close: Yes | |
Dell R750: Host has been restarted | Uptime is less than 10 minutes. |
(last(/DELL PowerEdge R750 by SNMP/dell.server.hw.uptime[systemPowerUpTime])>0 and last(/DELL PowerEdge R750 by SNMP/dell.server.hw.uptime[systemPowerUpTime])<10m) or (last(/DELL PowerEdge R750 by SNMP/dell.server.hw.uptime[systemPowerUpTime])=0 and last(/DELL PowerEdge R750 by SNMP/dell.server.net.uptime[snmpEngineTime])<10m) |Warning |
Manual close: Yes | |
Dell R750: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R750 by SNMP/dell.server.name[sysName],#1)<>last(/DELL PowerEdge R750 by SNMP/dell.server.name[sysName],#2) and length(last(/DELL PowerEdge R750 by SNMP/dell.server.name[sysName]))>0 |Info |
Manual close: Yes | |
Dell R750: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/DELL PowerEdge R750 by SNMP/zabbix[host,snmp,available],{$DELL.SNMP.TIMEOUT})=0 |Warning |
||
Dell R750: Memory amount has changed | change(/DELL PowerEdge R750 by SNMP/dell.server.memory.size.total)>0 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Scanning table of Temperature Probe Table IDRAC-MIB-SMIv2::temperatureProbeTable |
SNMP agent | temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Probe [{#SENSOR_LOCALE}]: Value | MIB: IDRAC-MIB-SMIv2 This attribute defines the reading for a temperature probe of type other than When the value for When the value for |
SNMP agent | dell.server.sensor.temp.value[temperatureProbeReading.{#SNMPINDEX}] Preprocessing
|
Dell R750: Probe [{#SENSOR_LOCALE}]: Status | MIB: IDRAC-MIB-SMIv2 This attribute defines the probe status of the temperature probe. Possible values: other(1), -- probe status is not one of the following: unknown(2), -- probe status is unknown (not known or monitored) ok(3), -- probe is reporting a value within the thresholds nonCriticalUpper(4), -- probe has crossed the upper noncritical threshold criticalUpper(5), -- probe has crossed the upper critical threshold nonRecoverableUpper(6), -- probe has crossed the upper non-recoverable threshold nonCriticalLower(7), -- probe has crossed the lower noncritical threshold criticalLower(8), -- probe has crossed the lower critical threshold nonRecoverableLower(9), -- probe has crossed the lower non-recoverable threshold failed(10) -- probe is not functional |
SNMP agent | dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Probe [{#SENSOR_LOCALE}]: Critical status | Please check the device for faults. |
last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalUpper"} or last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableUpper"} or last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalLower"} or last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableLower"} |Average |
||
Dell R750: Probe [{#SENSOR_LOCALE}]: Warning status | Please check the device for faults. |
last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalUpper"} or last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalLower"} |Warning |
Depends on:
|
|
Dell R750: Probe [{#SENSOR_LOCALE}]: Not in optimal status | Please check the device for faults. |
last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])<>{$DELL.SNMP.SENSOR.TEMP.STATUS.OK} |Info |
Manual close: Yes Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | IDRAC-MIB-SMIv2::powerSupplyTable |
SNMP agent | psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Power supply [{#PSU_DESCR}]: State | MIB: IDRAC-MIB-SMIv2 This attribute defines the status of the power supply. |
SNMP agent | dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Power supply [{#PSU_DESCR}]: Critical state | Please check the power supply unit for errors. |
last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.CRIT:"critical"} or last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.CRIT:"nonRecoverable"} |Average |
||
Dell R750: Power supply [{#PSU_DESCR}]: Warning state | Please check the power supply unit for errors. |
last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.WARN:"nonCritical"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Fan discovery | IDRAC-MIB-SMIv2::coolingDeviceTable |
SNMP agent | fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Fan [{#FAN_DESCR}]: Status | MIB: IDRAC-MIB-SMIv2 This attribute defines the probe status of the cooling device. |
SNMP agent | dell.server.sensor.fan.status[{#FAN_DESCR}] Preprocessing
|
Dell R750: Fan [{#FAN_DESCR}]: Speed | MIB: IDRAC-MIB-SMIv2 This attribute defines the reading for a cooling device of a subtype other than When the value for When the value for |
SNMP agent | dell.server.sensor.fan.speed[{#FAN_DESCR}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Fan [{#FAN_DESCR}]: Critical state | Please check the fan unit. |
last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"criticalUpper"} or last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableUpper"} or last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"criticalLower"} or last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableLower"} or last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"failed"} |Average |
||
Dell R750: Fan [{#FAN_DESCR}]: Warning state | Please check the fan unit. |
last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalUpper"} or last(/DELL PowerEdge R750 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalLower"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Scanning table of Array controllers: IDRAC-MIB-SMIv2::controllerTable |
SNMP agent | array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Controller [{#CNTLR_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The status of the controller itself without the propagation of any contained component status. Possible values: 1: Other 2: Unknown 3: OK 4: Non-critical 5: Critical 6: Non-recoverable |
SNMP agent | dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}] Preprocessing
|
Dell R750: Controller [{#CNTLR_NAME}]: Model | MIB: IDRAC-MIB-SMIv2 The controller's name as represented in Storage Management. |
SNMP agent | dell.server.hw.diskarray.model[controllerName.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Controller [{#CNTLR_NAME}]: Unrecoverable state | Please check the device for faults. |
last(/DELL PowerEdge R750 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.FAIL} |High |
||
Dell R750: Controller [{#CNTLR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R750 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.CRIT} |Average |
Depends on:
|
|
Dell R750: Controller [{#CNTLR_NAME}]: Warning state | Please check the device for faults. |
last(/DELL PowerEdge R750 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Battery discovery | Scanning Battery Table: IDRAC-MIB-SMIv2::batteryTable |
SNMP agent | battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Battery [{#BATTERY_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 Current state of battery. Possible values: 1: The current state could not be determined. 2: The battery is operating normally. 3: The battery has failed and needs to be replaced. 4: The battery temperature is high or charge level is depleting. 5: The battery is missing or not detected. 6: The battery is undergoing the re-charge phase. 7: The battery voltage or charge level is below the threshold. |
SNMP agent | dell.server.hw.battery.status[batteryState.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Battery [{#BATTERY_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R750 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT} |Average |
||
Dell R750: Battery [{#BATTERY_NAME}]: Warning state | Please check the device for faults. |
last(/DELL PowerEdge R750 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.WARN} |Warning |
Depends on:
|
|
Dell R750: Battery [{#BATTERY_NAME}]: Not in optimal state | Please check the device for faults. |
last(/DELL PowerEdge R750 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])<>{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.OK} |Info |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Scanning table of physical drive entries IDRAC-MIB-SMIv2::physicalDiskTable. |
SNMP agent | physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Physical disk [{#DISK_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The status of the physical disk itself without the propagation of any contained component status. Possible values: 1: Other 2: Unknown 3: OK 4: Non-critical 5: Critical 6: Non-recoverable |
SNMP agent | dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}] Preprocessing
|
Dell R750: Physical disk [{#DISK_NAME}]: S.M.A.R.T. Status | MIB: IDRAC-MIB-SMIv2 Indicates whether the physical disk has received a predictive failure alert. |
SNMP agent | dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}] Preprocessing
|
Dell R750: Physical disk [{#DISK_NAME}]: Serial number | MIB: IDRAC-MIB-SMIv2 The physical disk's unique identification number from the manufacturer. |
SNMP agent | dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}] Preprocessing
|
Dell R750: Physical disk [{#DISK_NAME}]: Model name | MIB: IDRAC-MIB-SMIv2 The model number of the physical disk. |
SNMP agent | dell.server.hw.physicaldisk.model[physicalDiskProductID.{#SNMPINDEX}] Preprocessing
|
Dell R750: Physical disk [{#DISK_NAME}]: Media type | MIB: IDRAC-MIB-SMIv2 The media type of the physical disk. Possible Values: 1: The media type could not be determined. 2: Hard Disk Drive (HDD). 3: Solid State Drive (SSD). |
SNMP agent | dell.server.hw.physicaldisk.media_type[physicalDiskMediaType.{#SNMPINDEX}] Preprocessing
|
Dell R750: Physical disk [{#DISK_NAME}]: Size | MIB: IDRAC-MIB-SMIv2 The size of the physical disk in megabytes. |
SNMP agent | dell.server.hw.physicaldisk.size[physicalDiskCapacityInMB.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Physical disk [{#DISK_NAME}]: Failed state | Please check physical disk for warnings or errors. |
last(/DELL PowerEdge R750 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.FAIL:"critical"} or last(/DELL PowerEdge R750 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.FAIL:"nonRecoverable"} |High |
||
Dell R750: Physical disk [{#DISK_NAME}]: Warning state | Please check physical disk for warnings or errors. |
last(/DELL PowerEdge R750 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.WARN:"nonCritical"} |Warning |
Depends on:
|
|
Dell R750: Physical disk [{#DISK_NAME}]: S.M.A.R.T. failed | Disk probably requires replacement. |
last(/DELL PowerEdge R750 by SNMP/dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}])={$DELL.SNMP.DISK.SMART.STATUS.FAIL:"replaceDrive"} or last(/DELL PowerEdge R750 by SNMP/dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}])={$DELL.SNMP.DISK.SMART.STATUS.FAIL:"replaceDriveSSDWearOut"} |High |
Depends on:
|
|
Dell R750: Physical disk [{#DISK_NAME}]: Has been replaced | [{#DISK_NAME}] serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R750 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}],#1)<>last(/DELL PowerEdge R750 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}],#2) and length(last(/DELL PowerEdge R750 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | IDRAC-MIB-SMIv2::virtualDiskTable |
SNMP agent | virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Virtual disk [{#DISK_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The current state of this virtual disk (which includes any member physical disks.) Possible states: 1: The current state could not be determined. 2: The virtual disk is operating normally or optimally. 3: The virtual disk has encountered a failure. Data on the disk is lost or is about to be lost. 4: The virtual disk encountered a failure with one or all of the constituent redundant physical disks. The data on the virtual disk might no longer be fault tolerant. |
SNMP agent | dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}] Preprocessing
|
Dell R750: Virtual disk [{#DISK_NAME}]: Layout type | MIB: IDRAC-MIB-SMIv2 The virtual disk's RAID type. Possible values: 1: Not one of the following 2: RAID-0 3: RAID-1 4: RAID-5 5: RAID-6 6: RAID-10 7: RAID-50 8: RAID-60 9: Concatenated RAID 1 10: Concatenated RAID 5 |
SNMP agent | dell.server.hw.virtualdisk.layout[virtualDiskLayout.{#SNMPINDEX}] Preprocessing
|
Dell R750: Virtual disk [{#DISK_NAME}]: Size | MIB: IDRAC-MIB-SMIv2 The size of the virtual disk in megabytes. |
SNMP agent | dell.server.hw.virtualdisk.size[virtualDiskSizeInMB.{#SNMPINDEX}] Preprocessing
|
Dell R750: Virtual disk [{#DISK_NAME}]: Operational state | MIB: IDRAC-MIB-SMIv2 The state of the virtual disk when there are progressive operations ongoing. Possible values: 1: There is no active operation running. 2: The virtual disk configuration has changed. The physical disks included in the virtual disk are being modified to support the new configuration. 3: A Consistency Check (CC) is being performed on the virtual disk. 4: The virtual disk is being initialized. 5: BackGround Initialization (BGI) is being performed on the virtual disk. |
SNMP agent | dell.server.hw.virtualdisk.state[virtualDiskOperationalState.{#SNMPINDEX}] |
Dell R750: Virtual disk [{#DISK_NAME}]: Read policy | MIB: IDRAC-MIB-SMIv2 The read policy used by the controller for read operations on this virtual disk. Possible values: 1: No Read Ahead. 2: Read Ahead. 3: Adaptive Read Ahead. |
SNMP agent | dell.server.hw.virtualdisk.read_policy[virtualDiskReadPolicy.{#SNMPINDEX}] |
Dell R750: Virtual disk [{#DISK_NAME}]: Write policy | MIB: IDRAC-MIB-SMIv2 The write policy used by the controller for write operations on this virtual disk. Possible values: 1: Write Through. 2: Write Back. 3: Force Write Back. |
SNMP agent | dell.server.hw.virtualdisk.write_policy[virtualDiskWritePolicy.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Virtual disk [{#DISK_NAME}]: Failed state | Please check the virtual disk for warnings or errors. |
last(/DELL PowerEdge R750 by SNMP/dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}])={$DELL.SNMP.VDISK.STATUS.CRIT:"failed"} |High |
||
Dell R750: Virtual disk [{#DISK_NAME}]: Warning state | Please check the virtual disk for warnings or errors. |
last(/DELL PowerEdge R750 by SNMP/dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}])={$DELL.SNMP.VDISK.STATUS.WARN:"degraded"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | Discovery of network interfaces. |
SNMP agent | net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: NIC [{#NICFQDD}/{#NICMAC}]: Link status | This attribute defines the connection status of the network device. |
SNMP agent | dell.server.net.if.link[{#NIC_FQDD}] Preprocessing
|
Dell R750: NIC [{#NICFQDD}/{#NICMAC}]: Status | This attribute defines the status of the network device. |
SNMP agent | dell.server.net.if.status[{#NIC_FQDD}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: NIC [{#NICFQDD}/{#NICMAC}]: Link down | This trigger expression works as follows: |
{$DELL.SNMP.IFCONTROL:"{#NIC_FQDD}"}=1 and last(/DELL PowerEdge R750 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#1)<>1 and last(/DELL PowerEdge R750 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#1)<>last(/DELL PowerEdge R750 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#2) |Average |
Manual close: Yes | |
Dell R750: NIC [{#NICFQDD}/{#NICMAC}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R750 by SNMP/dell.server.net.if.status[{#NIC_FQDD}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
CPU status discovery | CPU status discovery. |
SNMP agent | cpu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: CPU [{#CPU_FQDD}]: Status | This attribute defines the status of the processor device status probe. This status will be joined into the |
SNMP agent | dell.server.cpu.status[cpu.{#SNMPINDEX}] Preprocessing
|
Dell R750: CPU [{#CPU_FQDD}]: State | This attribute defines the reading of the processor device status probe. |
SNMP agent | dell.server.cpu.state[cpu.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: CPU [{#CPU_FQDD}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R750 by SNMP/dell.server.cpu.status[cpu.{#SNMPINDEX}],#1)<>3 |Average |
||
Dell R750: CPU [{#CPU_FQDD}]: Reading error | MIB: IDRAC-MIB-SMIv2 |
bitand(last(/DELL PowerEdge R750 by SNMP/dell.server.cpu.state[cpu.{#SNMPINDEX}],#1),128)=0 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
System battery discovery | System battery discovery. |
SNMP agent | system.battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: System battery [{#SNMPVALUE}]: Status | This attribute defines the status of the battery. |
SNMP agent | dell.server.system.battery[{#SNMPVALUE}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: System battery [{#SNMPVALUE}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R750 by SNMP/dell.server.system.battery[{#SNMPVALUE}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Memory discovery | Memory discovery. |
SNMP agent | memory.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Memory [{#SNMPVALUE}]: Status | This attribute defines the status of the memory device. |
SNMP agent | dell.server.memory.status[{#SNMPVALUE}] Preprocessing
|
Dell R750: Memory [{#SNMPVALUE}]: Size | This attribute defines the size, in KB, of the memory device. Zero indicates no memory installed; 2,147,483,647 indicates an unknown memory size. |
SNMP agent | dell.server.memory.size[{#SNMPVALUE}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Memory [{#SNMPVALUE}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R750 by SNMP/dell.server.memory.status[{#SNMPVALUE}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Voltage probe discovery | Voltage probe discovery. |
SNMP agent | voltage.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Voltage probe [{#VPROBE_NAME}]: Voltage | This attribute defines the reading for a voltage probe. |
SNMP agent | dell.server.voltage.value[{#VPROBE_NAME}] Preprocessing
|
Dell R750: Voltage probe [{#VPROBE_NAME}]: Status | This attribute defines the status of the voltage probe. |
SNMP agent | dell.server.voltage.status[{#VPROBE_NAME}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Voltage probe [{#VPROBE_NAME}]: Status is not OK | Please check the device's voltage. |
last(/DELL PowerEdge R750 by SNMP/dell.server.voltage.status[{#VPROBE_NAME}])<>3 |Average |
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring DELL PowerEdge R750 servers with iDRAC 8/9 firmware 4.32 (and later) with Redfish API enabled via Zabbix script items. This template works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
1. Enable Redfish API in the Dell iDRAC interface of your server.
2. Create a user for monitoring with read-only permissions in the Dell iDRAC interface.
3. Create a host for Dell server with iDRAC IP as the Zabbix agent interface.
4. Link the template to the host.
5. Customize the values of the {$DELL.HTTP.API.URL}
, {$DELL.HTTP.API.USER}
, and {$DELL.HTTP.API.PASSWORD}
macros.
NOTE! If you are experiencing timeouts on some of the items that are executing requests, adjust the
{$DELL.HTTP.REQUEST.TIMEOUT}
macro accordingly.
Name | Description | Default |
---|---|---|
{$DELL.HTTP.API.URL} | The Dell iDRAC Redfish API URL in the format |
<Put your URL here> |
{$DELL.HTTP.API.USER} | The Dell iDRAC username. |
<Put your username here> |
{$DELL.HTTP.API.PASSWORD} | The Dell iDRAC user password. |
<Put your password here> |
{$DELL.HTTP.PROXY} | Set an HTTP proxy for Redfish API requests if needed. |
|
{$DELL.HTTP.RETURN.CODE.OK} | Set the HTTP return code that represents an OK response from the API. The default is "200", but can vary, for example, if a proxy is used. |
200 |
{$DELL.HTTP.REQUEST.TIMEOUT} | Set the timeout for HTTP requests. |
10s |
{$DELL.HTTP.IFCONTROL} | Link status trigger will be fired only for interfaces that have the context macro equal to "1". |
1 |
{$DELL.HTTP.CPU.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about CPU utilization. |
90 |
{$DELL.HTTP.CPU.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about CPU utilization. |
75 |
{$DELL.HTTP.MEM.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about memory utilization. |
90 |
{$DELL.HTTP.MEM.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about memory utilization. |
75 |
{$DELL.HTTP.IO.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about IO utilization. |
90 |
{$DELL.HTTP.IO.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about IO utilization. |
75 |
{$DELL.HTTP.SYS.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about SYS utilization. |
90 |
{$DELL.HTTP.SYS.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about SYS utilization. |
75 |
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Get system | Returns system metrics. |
Script | dell.server.system.get |
Dell R750: Get sensors | Returns sensors. |
Script | dell.server.sensors.get |
Dell R750: Get array controller resources | Returns array controller resources. |
Script | dell.server.array.resources.get |
Dell R750: Get disks | Returns storage resources. |
Script | dell.server.disks.get |
Dell R750: Get network interfaces | Returns network interfaces. |
Script | dell.server.net.iface.get |
Dell R750: CPU utilization, in % | CPU utilization. |
Dependent item | dell.server.util.cpu Preprocessing
|
Dell R750: Memory utilization, in % | Memory utilization. |
Dependent item | dell.server.util.mem Preprocessing
|
Dell R750: IO utilization, in % | IO utilization. |
Dependent item | dell.server.util.io Preprocessing
|
Dell R750: SYS utilization, in % | SYS utilization. |
Dependent item | dell.server.util.sys Preprocessing
|
Dell R750: Overall system health status | This attribute defines the overall rollup status of all the components in the system monitored by the remote access card. Includes system, storage, IO devices, iDRAC, CPU, memory, etc. |
Dependent item | dell.server.status Preprocessing
|
Dell R750: Hardware model name | This attribute defines the model name of the system. |
Dependent item | dell.server.hw.model Preprocessing
|
Dell R750: Hardware serial number | This attribute defines the service tag of the system. |
Dependent item | dell.server.hw.serialnumber Preprocessing
|
Dell R750: Firmware version | This attribute defines the firmware version of a remote access card. |
Dependent item | dell.server.hw.firmware Preprocessing
|
Dell R750: Redfish API status | Availability of Redfish API on the server. Possible values: 0 - Unavailable 1 - Available |
Simple check | net.tcp.service[https] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: CPU utilization is too high | Current CPU utilization has exceeded |
min(/DELL PowerEdge R750 by HTTP/dell.server.util.cpu,5m)>={$DELL.HTTP.CPU.UTIL.HIGH} |High |
||
Dell R750: CPU utilization is high | Current CPU utilization has exceeded |
min(/DELL PowerEdge R750 by HTTP/dell.server.util.cpu,5m)>={$DELL.HTTP.CPU.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R750: Memory utilization is too high | Current memory utilization has exceeded |
min(/DELL PowerEdge R750 by HTTP/dell.server.util.mem,5m)>={$DELL.HTTP.MEM.UTIL.HIGH} |High |
||
Dell R750: Memory utilization is high | Current memory utilization has exceeded |
min(/DELL PowerEdge R750 by HTTP/dell.server.util.mem,5m)>={$DELL.HTTP.MEM.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R750: IO utilization is too high | Current IO utilization has exceeded |
min(/DELL PowerEdge R750 by HTTP/dell.server.util.io,5m)>={$DELL.HTTP.IO.UTIL.HIGH} |High |
||
Dell R750: IO utilization is high | Current IO utilization has exceeded |
min(/DELL PowerEdge R750 by HTTP/dell.server.util.io,5m)>={$DELL.HTTP.IO.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R750: SYS utilization is too high | Current SYS utilization has exceeded |
min(/DELL PowerEdge R750 by HTTP/dell.server.util.sys,5m)>={$DELL.HTTP.SYS.UTIL.HIGH} |High |
||
Dell R750: SYS utilization is high | Current SYS utilization has exceeded |
min(/DELL PowerEdge R750 by HTTP/dell.server.util.sys,5m)>={$DELL.HTTP.SYS.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R750: Server is in a critical state | Please check the device for faults. |
last(/DELL PowerEdge R750 by HTTP/dell.server.status,)=3 |Average |
||
Dell R750: Server is in a warning state | Please check the device for warnings. |
last(/DELL PowerEdge R750 by HTTP/dell.server.status,)=2 |Warning |
Depends on:
|
|
Dell R750: Device has been replaced | The device serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R750 by HTTP/dell.server.hw.serialnumber,#1)<>last(/DELL PowerEdge R750 by HTTP/dell.server.hw.serialnumber,#2) and length(last(/DELL PowerEdge R750 by HTTP/dell.server.hw.serialnumber))>0 |Info |
Manual close: Yes | |
Dell R750: Firmware has changed | The firmware version has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R750 by HTTP/dell.server.hw.firmware,#1)<>last(/DELL PowerEdge R750 by HTTP/dell.server.hw.firmware,#2) and length(last(/DELL PowerEdge R750 by HTTP/dell.server.hw.firmware))>0 |Info |
Manual close: Yes | |
Dell R750: Redfish API service is unavailable | The service is unavailable or does not accept TCP connections. |
last(/DELL PowerEdge R750 by HTTP/net.tcp.service[https])=0 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Discovery of temperature sensors. |
Dependent item | dell.server.temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Probe [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.temp.get[{#SENSOR_NAME}] Preprocessing
|
Dell R750: Probe [{#SENSOR_NAME}]: Value | Sensor value. |
Dependent item | dell.server.sensor.temp.value[{#SENSOR_NAME}] Preprocessing
|
Dell R750: Probe [{#SENSOR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.temp.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Probe [{#SENSOR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R750 by HTTP/dell.server.sensor.temp.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R750: Probe [{#SENSOR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R750 by HTTP/dell.server.sensor.temp.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | Discovery of PSU sensors. |
Dependent item | dell.server.psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Power supply [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.psu.get[{#SENSOR_NAME}] Preprocessing
|
Dell R750: Power supply [{#SENSOR_NAME}]: Voltage | Sensor value. |
Dependent item | dell.server.sensor.psu.voltage[{#SENSOR_NAME}] Preprocessing
|
Dell R750: Power supply [{#SENSOR_NAME}]: Voltage sensor status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}] Preprocessing
|
Dell R750: Power supply [{#SENSOR_NAME}]: Current | Sensor value. |
Dependent item | dell.server.sensor.psu.current[{#SENSOR_NAME}] Preprocessing
|
Dell R750: Power supply [{#SENSOR_NAME}]: Current sensor status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.psu.current.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Power supply [{#SENSOR_NAME}]: Voltage sensor: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R750 by HTTP/dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R750: Power supply [{#SENSOR_NAME}]: Voltage sensor: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R750 by HTTP/dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R750: Power supply [{#SENSOR_NAME}]: Current sensor: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R750 by HTTP/dell.server.sensor.psu.current.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R750: Power supply [{#SENSOR_NAME}]: Current sensor: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R750 by HTTP/dell.server.sensor.psu.current.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN discovery | Discovery of FAN sensors. |
Dependent item | dell.server.fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Fan [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.fan.get[{#SENSOR_NAME}] Preprocessing
|
Dell R750: Fan [{#SENSOR_NAME}]: Speed | Sensor value. |
Dependent item | dell.server.sensor.fan.speed[{#SENSOR_NAME}] Preprocessing
|
Dell R750: Fan [{#SENSOR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.fan.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Fan [{#SENSOR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R750 by HTTP/dell.server.sensor.fan.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R750: Fan [{#SENSOR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R750 by HTTP/dell.server.sensor.fan.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Discovery of disk array controllers. |
Dependent item | dell.server.array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Controller [{#CNTLR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.array.status[{#ID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Controller [{#CNTLR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R750 by HTTP/dell.server.array.status[{#ID}],)=3 |Average |
||
Dell R750: Controller [{#CNTLR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R750 by HTTP/dell.server.array.status[{#ID}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Battery discovery | Discovery of battery controllers. |
Dependent item | dell.server.controller.battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Battery [{#BATTERY_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.controller.battery.status[{#ID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Battery [{#BATTERY_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R750 by HTTP/dell.server.controller.battery.status[{#ID}],)=3 |Average |
||
Dell R750: Battery [{#BATTERY_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R750 by HTTP/dell.server.controller.battery.status[{#ID}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Discovery of physical disks. |
Dependent item | dell.server.physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Physical disk [{#DISK_NAME}]: Get disk | Returns the metrics of a physical disk. |
Script | dell.server.hw.physicaldisk.get[{#DISK_NAME}] |
Dell R750: Physical disk [{#DISK_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.hw.physicaldisk.status[{#DISK_NAME}] Preprocessing
|
Dell R750: Physical disk [{#DISK_NAME}]: Serial number | The serial number of this drive. |
Dependent item | dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}] Preprocessing
|
Dell R750: Physical disk [{#DISK_NAME}]: Model name | The model number of the drive. |
Dependent item | dell.server.hw.physicaldisk.model[{#DISK_NAME}] Preprocessing
|
Dell R750: Physical disk [{#DISK_NAME}]: Media type | The type of media contained in this drive. Possible values: HDD, SSD, SMR, null. |
Dependent item | dell.server.hw.physicaldisk.mediatype[{#DISKNAME}] Preprocessing
|
Dell R750: Physical disk [{#DISK_NAME}]: Size | The size, in bytes, of this drive. |
Dependent item | dell.server.hw.physicaldisk.size[{#DISK_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Physical disk [{#DISK_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R750 by HTTP/dell.server.hw.physicaldisk.status[{#DISK_NAME}],)=3 |Average |
||
Dell R750: Physical disk [{#DISK_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R750 by HTTP/dell.server.hw.physicaldisk.status[{#DISK_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R750: Physical disk [{#DISK_NAME}]: Has been replaced | [{#DISK_NAME}] serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R750 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}],#1)<>last(/DELL PowerEdge R750 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}],#2) and length(last(/DELL PowerEdge R750 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | Discovery of virtual disks. |
Dependent item | dell.server.virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Virtual disk [{#DISK_NAME}]: Get disk | Returns the metrics of a virtual disk. |
Script | dell.server.hw.virtualdisk.get[{#DISK_NAME}] |
Dell R750: Virtual disk [{#DISK_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.hw.virtualdisk.status[{#DISK_NAME}] Preprocessing
|
Dell R750: Virtual disk [{#DISK_NAME}]: RAID status | This property represents the RAID specific status. Possible values: Blocked, Degraded, Failed, Foreign, Offline, Online, Ready, Unknown, null. |
Dependent item | dell.server.hw.virtualdisk.raidstatus[{#DISKNAME}] Preprocessing
|
Dell R750: Virtual disk [{#DISK_NAME}]: Size | The size in bytes of this Volume. |
Dependent item | dell.server.hw.virtualdisk.size[{#DISK_NAME}] Preprocessing
|
Dell R750: Virtual disk [{#DISK_NAME}]: Current state | The known state of the Resource, for example, Enabled. Possible values: Enabled, Disabled, StandbyOffline, StandbySpare, InTest, Starting, Absent, UnavailableOffline, Deferring, Quiesced, Updating, Qualified. |
Dependent item | dell.server.hw.virtualdisk.state[{#DISK_NAME}] Preprocessing
|
Dell R750: Virtual disk [{#DISK_NAME}]: Read policy | Indicates the read cache policy setting for the Volume. Possible values: ReadAhead, NoReadAhead, AdaptiveReadAhead. |
Dependent item | dell.server.hw.virtualdisk.readpolicy[{#DISKNAME}] Preprocessing
|
Dell R750: Virtual disk [{#DISK_NAME}]: Write policy | Indicates the write cache policy setting for the Volume. Possible values: WriteThrough, WriteBack, ProtectedWriteBack, UnprotectedWriteBack. |
Dependent item | dell.server.hw.virtualdisk.writepolicy[{#DISKNAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Virtual disk [{#DISK_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R750 by HTTP/dell.server.hw.virtualdisk.status[{#DISK_NAME}],)=3 |Average |
||
Dell R750: Virtual disk [{#DISK_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R750 by HTTP/dell.server.hw.virtualdisk.status[{#DISK_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R750: Virtual disk [{#DISK_NAME}]: RAID status not OK | Please check the disk for faults. |
last(/DELL PowerEdge R750 by HTTP/dell.server.hw.virtualdisk.raid_status[{#DISK_NAME}],)<8 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | Discovery of network interfaces. |
Dependent item | dell.server.net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R750: Interface [{#IFNAME}]: Get interface | Returns the metrics of a network interface. |
Script | dell.server.net.if.get[{#IFNAME}] |
Dell R750: Interface [{#IFNAME}]: Speed | The network port current link speed. |
Dependent item | dell.server.net.if.speed[{#IFNAME}] Preprocessing
|
Dell R750: Interface [{#IFNAME}]: Link status | The status of the link between this port and its link partner. Possible values: Down, Up, null. |
Dependent item | dell.server.net.if.status[{#IFNAME}] Preprocessing
|
Dell R750: Interface [{#IFNAME}]: State | The known state of the Resource, for example, Enabled. Possible values: Enabled, Disabled, StandbyOffline, StandbySpare, InTest, Starting, Absent, UnavailableOffline, Deferring, Quiesced, Updating, Qualified. |
Dependent item | dell.server.net.if.state[{#IFNAME}] Preprocessing
|
Dell R750: Interface [{#IFNAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.net.if.health[{#IFNAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R750: Interface [{#IFNAME}]: Link down | This trigger expression works as follows: |
{$DELL.HTTP.IFCONTROL:"{#IFNAME}"}=1 and (last(/DELL PowerEdge R750 by HTTP/dell.server.net.if.status[{#IFNAME}],)=2 and last(/DELL PowerEdge R750 by HTTP/dell.server.net.if.status[{#IFNAME}],#1)<>last(/DELL PowerEdge R750 by HTTP/dell.server.net.if.status[{#IFNAME}],#2)) |Average |
Manual close: Yes | |
Dell R750: Interface [{#IFNAME}]: Link status issue | This trigger expression works as follows: |
{$DELL.HTTP.IFCONTROL:"{#IFNAME}"}=1 and (last(/DELL PowerEdge R750 by HTTP/dell.server.net.if.status[{#IFNAME}],)<2 and last(/DELL PowerEdge R750 by HTTP/dell.server.net.if.status[{#IFNAME}],#1)<>last(/DELL PowerEdge R750 by HTTP/dell.server.net.if.status[{#IFNAME}],#2)) |Average |
Manual close: Yes | |
Dell R750: Interface [{#IFNAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R750 by HTTP/dell.server.net.if.health[{#IFNAME}],)=3 |Average |
||
Dell R750: Interface [{#IFNAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R750 by HTTP/dell.server.net.if.health[{#IFNAME}],)=2 |Warning |
Depends on:
|
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring DELL PowerEdge R740 servers with iDRAC version 7 (and later) via Zabbix SNMP agent that works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Default |
---|---|---|
{$DELL.SNMP.DISCOVERY.VOLTAGE.NAME.MATCHES} | Sets the regex string of voltage probe names to allow in discovery. |
^.*Voltage.*$ |
{$DELL.SNMP.DISCOVERY.VOLTAGE.NAME.NOT_MATCHES} | Sets the regex string of voltage probe names to ignore in discovery. |
CHANGE_IF_NEEDED |
{$DELL.SNMP.DISCOVERY.VOLTAGE.TYPE.MATCHES} | Sets the regex string of voltage probe types to allow in discovery. |
18|16 |
{$DELL.SNMP.DISCOVERY.VOLTAGE.TYPE.NOT_MATCHES} | Sets the regex string of voltage probe types to ignore in discovery. |
CHANGE_IF_NEEDED |
{$DELL.SNMP.SENSOR.TEMP.STATUS.OK} | The OK status of the temperature probe for the trigger expression. |
3 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalUpper"} | The warning status of the temperature probe for the trigger expression. |
4 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalLower"} | The warning status of the temperature probe for the trigger expression. |
7 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalUpper"} | The critical status of the temperature probe for the trigger expression. |
5 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableUpper"} | The critical status of the temperature probe for the trigger expression. |
6 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalLower"} | The critical status of the temperature probe for the trigger expression. |
8 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableLower"} | The critical status of the temperature probe for the trigger expression. |
9 |
{$DELL.SNMP.HEALTH.STATUS.DISASTER} | The disaster status of health for the trigger expression. |
6 |
{$DELL.SNMP.HEALTH.STATUS.CRIT} | The critical status of health for the trigger expression. |
5 |
{$DELL.SNMP.HEALTH.STATUS.WARN} | The warning status of health for the trigger expression. |
4 |
{$DELL.SNMP.PSU.STATUS.WARN:"nonCritical"} | The warning value of the PSU sensor for the trigger expression. |
4 |
{$DELL.SNMP.PSU.STATUS.CRIT:"critical"} | The critical value of the PSU sensor for the trigger expression. |
5 |
{$DELL.SNMP.PSU.STATUS.CRIT:"nonRecoverable"} | The critical value of the PSU sensor for the trigger expression. |
6 |
{$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalUpper"} | The warning value of the FAN sensor for the trigger expression. |
4 |
{$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalLower"} | The warning value of the FAN sensor for the trigger expression. |
7 |
{$DELL.SNMP.FAN.STATUS.CRIT:"criticalUpper"} | The critical value of the FAN sensor for the trigger expression. |
5 |
{$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableUpper"} | The critical value of the FAN sensor for the trigger expression. |
6 |
{$DELL.SNMP.FAN.STATUS.CRIT:"criticalLower"} | The critical value of the FAN sensor for the trigger expression. |
8 |
{$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableLower"} | The critical value of the FAN sensor for the trigger expression. |
9 |
{$DELL.SNMP.FAN.STATUS.CRIT:"failed"} | The critical value of the FAN sensor for the trigger expression. |
10 |
{$DELL.SNMP.DISK.ARRAY.STATUS.FAIL} | The disaster status of the disk array for the trigger expression. |
6 |
{$DELL.SNMP.DISK.ARRAY.STATUS.CRIT} | The critical status of the disk array for the trigger expression. |
5 |
{$DELL.SNMP.DISK.ARRAY.STATUS.WARN} | The warning status of the disk array for the trigger expression. |
4 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT} | The critical status of the disk array cache battery for the trigger expression. |
3 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.WARN} | The warning status of the disk array cache battery for the trigger expression. |
4 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.OK} | The OK status of the disk array cache battery for the trigger expression. |
2 |
{$DELL.SNMP.VDISK.STATUS.CRIT:"failed"} | The critical status of the virtual disk for the trigger expression. |
3 |
{$DELL.SNMP.VDISK.STATUS.WARN:"degraded"} | The warning status of the virtual disk for the trigger expression. |
4 |
{$DELL.SNMP.DISK.STATUS.WARN:"nonCritical"} | The warning status of the disk for the trigger expression. |
4 |
{$DELL.SNMP.DISK.STATUS.FAIL:"critical"} | The critical status of the disk for the trigger expression. |
5 |
{$DELL.SNMP.DISK.STATUS.FAIL:"nonRecoverable"} | The critical status of the disk for the trigger expression. |
6 |
{$DELL.SNMP.DISK.SMART.STATUS.FAIL} | The critical S.M.A.R.T status of the disk for the trigger expression. |
1 |
{$DELL.SNMP.TIMEOUT} | The time interval for the SNMP agent availability trigger expression. |
5m |
{$DELL.SNMP.IFCONTROL} | The link status trigger will be fired only for interfaces that have the context macro equal to "1". |
1 |
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Overall system health status | MIB: IDRAC-MIB-SMIv2 This attribute defines the overall rollup status of all the components in the system monitored by the remote access card. Includes system, storage, IO devices, iDRAC, CPU, memory, etc. |
SNMP agent | dell.server.status[globalSystemStatus] Preprocessing
|
Dell R740: Hardware model name | MIB: IDRAC-MIB-SMIv2 This attribute defines the model name of the system. |
SNMP agent | dell.server.hw.model[systemModelName] Preprocessing
|
Dell R740: Hardware serial number | MIB: IDRAC-MIB-SMIv2 This attribute defines the service tag of the system. |
SNMP agent | dell.server.hw.serialnumber[systemServiceTag] Preprocessing
|
Dell R740: Operating system | MIB: IDRAC-MIB-SMIv2 This attribute defines the name of the operating system that the host is running. |
SNMP agent | dell.server.sw.os[systemOSName] Preprocessing
|
Dell R740: Firmware version | MIB: IDRAC-MIB-SMIv2 This attribute defines the firmware version of a remote access card. |
SNMP agent | dell.server.hw.firmware[racFirmwareVersion] Preprocessing
|
Dell R740: Uptime (network) | MIB: SNMP-FRAMEWORK-MIB The number of seconds since the value of the snmpEngineBoots object last changed. |
SNMP agent | dell.server.net.uptime[snmpEngineTime] |
Dell R740: Uptime (hardware) | MIB: IDRAC-MIB-SMIv2 This attribute defines the power-up time of the system in seconds. |
SNMP agent | dell.server.hw.uptime[systemPowerUpTime] |
Dell R740: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other |
SNMP trap | snmptrap.fallback |
Dell R740: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., 'telephone closet, 3rd floor'). If the location is unknown, the value is a zero-length string. |
SNMP agent | dell.server.location[sysLocation] Preprocessing
|
Dell R740: System contact details | MIB: SNMPv2-MIB Name and contact information of the contact person for the node. If not provided, the value is a zero-length string. |
SNMP agent | dell.server.contact[sysContact] Preprocessing
|
Dell R740: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the entity as part of the vendor's SMI enterprises subtree with the prefix 1.3.6.1.4.1 (e.g., a vendor with the identifier 1.3.6.1.4.1.4242 might assign a system object with the OID 1.3.6.1.4.1.4242.1.1). |
SNMP agent | dell.server.objectid[sysObjectID] Preprocessing
|
Dell R740: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node. By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is a zero-length string. |
SNMP agent | dell.server.name[sysName] Preprocessing
|
Dell R740: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating system, and networking software. |
SNMP agent | dell.server.descr[sysDescr] Preprocessing
|
Dell R740: SNMP agent availability | Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list. Possible values: 0 - not available 1 - available 2 - unknown |
Zabbix internal | zabbix[host,snmp,available] Preprocessing
|
Dell R740: Memory, total size | Total memory amount on the device. |
Calculated | dell.server.memory.size.total Preprocessing
|
Dell R740: BIOS version | MIB: IDRAC-MIB-SMIv2 This attribute defines the version name of the system BIOS. |
SNMP agent | dell.server.bios.version Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: System is in unrecoverable state | Please check the device for faults. |
last(/DELL PowerEdge R740 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.DISASTER} |High |
||
Dell R740: System status is in critical state | Please check the device for errors. |
last(/DELL PowerEdge R740 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.CRIT} |Average |
||
Dell R740: System status is in warning state | Please check the device for warnings. |
last(/DELL PowerEdge R740 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.WARN} |Warning |
Depends on:
|
|
Dell R740: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R740 by SNMP/dell.server.hw.serialnumber[systemServiceTag],#1)<>last(/DELL PowerEdge R740 by SNMP/dell.server.hw.serialnumber[systemServiceTag],#2) and length(last(/DELL PowerEdge R740 by SNMP/dell.server.hw.serialnumber[systemServiceTag]))>0 |Info |
Manual close: Yes | |
Dell R740: Operating system description has changed | Operating system description has changed. Possibly, the system has been updated or replaced. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R740 by SNMP/dell.server.sw.os[systemOSName],#1)<>last(/DELL PowerEdge R740 by SNMP/dell.server.sw.os[systemOSName],#2) and length(last(/DELL PowerEdge R740 by SNMP/dell.server.sw.os[systemOSName]))>0 |Info |
Manual close: Yes | |
Dell R740: Firmware has changed | Firmware version has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R740 by SNMP/dell.server.hw.firmware[racFirmwareVersion],#1)<>last(/DELL PowerEdge R740 by SNMP/dell.server.hw.firmware[racFirmwareVersion],#2) and length(last(/DELL PowerEdge R740 by SNMP/dell.server.hw.firmware[racFirmwareVersion]))>0 |Info |
Manual close: Yes | |
Dell R740: Host has been restarted | Uptime is less than 10 minutes. |
(last(/DELL PowerEdge R740 by SNMP/dell.server.hw.uptime[systemPowerUpTime])>0 and last(/DELL PowerEdge R740 by SNMP/dell.server.hw.uptime[systemPowerUpTime])<10m) or (last(/DELL PowerEdge R740 by SNMP/dell.server.hw.uptime[systemPowerUpTime])=0 and last(/DELL PowerEdge R740 by SNMP/dell.server.net.uptime[snmpEngineTime])<10m) |Warning |
Manual close: Yes | |
Dell R740: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R740 by SNMP/dell.server.name[sysName],#1)<>last(/DELL PowerEdge R740 by SNMP/dell.server.name[sysName],#2) and length(last(/DELL PowerEdge R740 by SNMP/dell.server.name[sysName]))>0 |Info |
Manual close: Yes | |
Dell R740: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/DELL PowerEdge R740 by SNMP/zabbix[host,snmp,available],{$DELL.SNMP.TIMEOUT})=0 |Warning |
||
Dell R740: Memory amount has changed | change(/DELL PowerEdge R740 by SNMP/dell.server.memory.size.total)>0 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Scanning table of Temperature Probe Table IDRAC-MIB-SMIv2::temperatureProbeTable |
SNMP agent | temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Probe [{#SENSOR_LOCALE}]: Value | MIB: IDRAC-MIB-SMIv2 This attribute defines the reading for a temperature probe of type other than When the value for When the value for |
SNMP agent | dell.server.sensor.temp.value[temperatureProbeReading.{#SNMPINDEX}] Preprocessing
|
Dell R740: Probe [{#SENSOR_LOCALE}]: Status | MIB: IDRAC-MIB-SMIv2 This attribute defines the probe status of the temperature probe. Possible values: other(1), -- probe status is not one of the following: unknown(2), -- probe status is unknown (not known or monitored) ok(3), -- probe is reporting a value within the thresholds nonCriticalUpper(4), -- probe has crossed the upper noncritical threshold criticalUpper(5), -- probe has crossed the upper critical threshold nonRecoverableUpper(6), -- probe has crossed the upper non-recoverable threshold nonCriticalLower(7), -- probe has crossed the lower noncritical threshold criticalLower(8), -- probe has crossed the lower critical threshold nonRecoverableLower(9), -- probe has crossed the lower non-recoverable threshold failed(10) -- probe is not functional |
SNMP agent | dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Probe [{#SENSOR_LOCALE}]: Critical status | Please check the device for faults. |
last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalUpper"} or last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableUpper"} or last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalLower"} or last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableLower"} |Average |
||
Dell R740: Probe [{#SENSOR_LOCALE}]: Warning status | Please check the device for faults. |
last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalUpper"} or last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalLower"} |Warning |
Depends on:
|
|
Dell R740: Probe [{#SENSOR_LOCALE}]: Not in optimal status | Please check the device for faults. |
last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])<>{$DELL.SNMP.SENSOR.TEMP.STATUS.OK} |Info |
Manual close: Yes Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | IDRAC-MIB-SMIv2::powerSupplyTable |
SNMP agent | psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Power supply [{#PSU_DESCR}]: State | MIB: IDRAC-MIB-SMIv2 This attribute defines the status of the power supply. |
SNMP agent | dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Power supply [{#PSU_DESCR}]: Critical state | Please check the power supply unit for errors. |
last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.CRIT:"critical"} or last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.CRIT:"nonRecoverable"} |Average |
||
Dell R740: Power supply [{#PSU_DESCR}]: Warning state | Please check the power supply unit for errors. |
last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.WARN:"nonCritical"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Fan discovery | IDRAC-MIB-SMIv2::coolingDeviceTable |
SNMP agent | fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Fan [{#FAN_DESCR}]: Status | MIB: IDRAC-MIB-SMIv2 This attribute defines the probe status of the cooling device. |
SNMP agent | dell.server.sensor.fan.status[{#FAN_DESCR}] Preprocessing
|
Dell R740: Fan [{#FAN_DESCR}]: Speed | MIB: IDRAC-MIB-SMIv2 This attribute defines the reading for a cooling device of a subtype other than When the value for When the value for |
SNMP agent | dell.server.sensor.fan.speed[{#FAN_DESCR}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Fan [{#FAN_DESCR}]: Critical state | Please check the fan unit. |
last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"criticalUpper"} or last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableUpper"} or last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"criticalLower"} or last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableLower"} or last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"failed"} |Average |
||
Dell R740: Fan [{#FAN_DESCR}]: Warning state | Please check the fan unit. |
last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalUpper"} or last(/DELL PowerEdge R740 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalLower"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Scanning table of Array controllers: IDRAC-MIB-SMIv2::controllerTable |
SNMP agent | array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Controller [{#CNTLR_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The status of the controller itself without the propagation of any contained component status. Possible values: 1: Other 2: Unknown 3: OK 4: Non-critical 5: Critical 6: Non-recoverable |
SNMP agent | dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}] Preprocessing
|
Dell R740: Controller [{#CNTLR_NAME}]: Model | MIB: IDRAC-MIB-SMIv2 The controller's name as represented in Storage Management. |
SNMP agent | dell.server.hw.diskarray.model[controllerName.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Controller [{#CNTLR_NAME}]: Unrecoverable state | Please check the device for faults. |
last(/DELL PowerEdge R740 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.FAIL} |High |
||
Dell R740: Controller [{#CNTLR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R740 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.CRIT} |Average |
Depends on:
|
|
Dell R740: Controller [{#CNTLR_NAME}]: Warning state | Please check the device for faults. |
last(/DELL PowerEdge R740 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Battery discovery | Scanning Battery Table: IDRAC-MIB-SMIv2::batteryTable |
SNMP agent | battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Battery [{#BATTERY_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 Current state of battery. Possible values: 1: The current state could not be determined. 2: The battery is operating normally. 3: The battery has failed and needs to be replaced. 4: The battery temperature is high or charge level is depleting. 5: The battery is missing or not detected. 6: The battery is undergoing the re-charge phase. 7: The battery voltage or charge level is below the threshold. |
SNMP agent | dell.server.hw.battery.status[batteryState.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Battery [{#BATTERY_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R740 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT} |Average |
||
Dell R740: Battery [{#BATTERY_NAME}]: Warning state | Please check the device for faults. |
last(/DELL PowerEdge R740 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.WARN} |Warning |
Depends on:
|
|
Dell R740: Battery [{#BATTERY_NAME}]: Not in optimal state | Please check the device for faults. |
last(/DELL PowerEdge R740 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])<>{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.OK} |Info |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Scanning table of physical drive entries IDRAC-MIB-SMIv2::physicalDiskTable. |
SNMP agent | physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Physical disk [{#DISK_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The status of the physical disk itself without the propagation of any contained component status. Possible values: 1: Other 2: Unknown 3: OK 4: Non-critical 5: Critical 6: Non-recoverable |
SNMP agent | dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}] Preprocessing
|
Dell R740: Physical disk [{#DISK_NAME}]: S.M.A.R.T. Status | MIB: IDRAC-MIB-SMIv2 Indicates whether the physical disk has received a predictive failure alert. |
SNMP agent | dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}] Preprocessing
|
Dell R740: Physical disk [{#DISK_NAME}]: Serial number | MIB: IDRAC-MIB-SMIv2 The physical disk's unique identification number from the manufacturer. |
SNMP agent | dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}] Preprocessing
|
Dell R740: Physical disk [{#DISK_NAME}]: Model name | MIB: IDRAC-MIB-SMIv2 The model number of the physical disk. |
SNMP agent | dell.server.hw.physicaldisk.model[physicalDiskProductID.{#SNMPINDEX}] Preprocessing
|
Dell R740: Physical disk [{#DISK_NAME}]: Media type | MIB: IDRAC-MIB-SMIv2 The media type of the physical disk. Possible Values: 1: The media type could not be determined. 2: Hard Disk Drive (HDD). 3: Solid State Drive (SSD). |
SNMP agent | dell.server.hw.physicaldisk.media_type[physicalDiskMediaType.{#SNMPINDEX}] Preprocessing
|
Dell R740: Physical disk [{#DISK_NAME}]: Size | MIB: IDRAC-MIB-SMIv2 The size of the physical disk in megabytes. |
SNMP agent | dell.server.hw.physicaldisk.size[physicalDiskCapacityInMB.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Physical disk [{#DISK_NAME}]: Failed state | Please check physical disk for warnings or errors. |
last(/DELL PowerEdge R740 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.FAIL:"critical"} or last(/DELL PowerEdge R740 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.FAIL:"nonRecoverable"} |High |
||
Dell R740: Physical disk [{#DISK_NAME}]: Warning state | Please check physical disk for warnings or errors. |
last(/DELL PowerEdge R740 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.WARN:"nonCritical"} |Warning |
Depends on:
|
|
Dell R740: Physical disk [{#DISK_NAME}]: S.M.A.R.T. failed | Disk probably requires replacement. |
last(/DELL PowerEdge R740 by SNMP/dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}])={$DELL.SNMP.DISK.SMART.STATUS.FAIL:"replaceDrive"} or last(/DELL PowerEdge R740 by SNMP/dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}])={$DELL.SNMP.DISK.SMART.STATUS.FAIL:"replaceDriveSSDWearOut"} |High |
Depends on:
|
|
Dell R740: Physical disk [{#DISK_NAME}]: Has been replaced | [{#DISK_NAME}] serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R740 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}],#1)<>last(/DELL PowerEdge R740 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}],#2) and length(last(/DELL PowerEdge R740 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | IDRAC-MIB-SMIv2::virtualDiskTable |
SNMP agent | virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Virtual disk [{#DISK_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The current state of this virtual disk (which includes any member physical disks.) Possible states: 1: The current state could not be determined. 2: The virtual disk is operating normally or optimally. 3: The virtual disk has encountered a failure. Data on the disk is lost or is about to be lost. 4: The virtual disk encountered a failure with one or all of the constituent redundant physical disks. The data on the virtual disk might no longer be fault tolerant. |
SNMP agent | dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}] Preprocessing
|
Dell R740: Virtual disk [{#DISK_NAME}]: Layout type | MIB: IDRAC-MIB-SMIv2 The virtual disk's RAID type. Possible values: 1: Not one of the following 2: RAID-0 3: RAID-1 4: RAID-5 5: RAID-6 6: RAID-10 7: RAID-50 8: RAID-60 9: Concatenated RAID 1 10: Concatenated RAID 5 |
SNMP agent | dell.server.hw.virtualdisk.layout[virtualDiskLayout.{#SNMPINDEX}] Preprocessing
|
Dell R740: Virtual disk [{#DISK_NAME}]: Size | MIB: IDRAC-MIB-SMIv2 The size of the virtual disk in megabytes. |
SNMP agent | dell.server.hw.virtualdisk.size[virtualDiskSizeInMB.{#SNMPINDEX}] Preprocessing
|
Dell R740: Virtual disk [{#DISK_NAME}]: Operational state | MIB: IDRAC-MIB-SMIv2 The state of the virtual disk when there are progressive operations ongoing. Possible values: 1: There is no active operation running. 2: The virtual disk configuration has changed. The physical disks included in the virtual disk are being modified to support the new configuration. 3: A Consistency Check (CC) is being performed on the virtual disk. 4: The virtual disk is being initialized. 5: BackGround Initialization (BGI) is being performed on the virtual disk. |
SNMP agent | dell.server.hw.virtualdisk.state[virtualDiskOperationalState.{#SNMPINDEX}] |
Dell R740: Virtual disk [{#DISK_NAME}]: Read policy | MIB: IDRAC-MIB-SMIv2 The read policy used by the controller for read operations on this virtual disk. Possible values: 1: No Read Ahead. 2: Read Ahead. 3: Adaptive Read Ahead. |
SNMP agent | dell.server.hw.virtualdisk.read_policy[virtualDiskReadPolicy.{#SNMPINDEX}] |
Dell R740: Virtual disk [{#DISK_NAME}]: Write policy | MIB: IDRAC-MIB-SMIv2 The write policy used by the controller for write operations on this virtual disk. Possible values: 1: Write Through. 2: Write Back. 3: Force Write Back. |
SNMP agent | dell.server.hw.virtualdisk.write_policy[virtualDiskWritePolicy.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Virtual disk [{#DISK_NAME}]: Failed state | Please check the virtual disk for warnings or errors. |
last(/DELL PowerEdge R740 by SNMP/dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}])={$DELL.SNMP.VDISK.STATUS.CRIT:"failed"} |High |
||
Dell R740: Virtual disk [{#DISK_NAME}]: Warning state | Please check the virtual disk for warnings or errors. |
last(/DELL PowerEdge R740 by SNMP/dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}])={$DELL.SNMP.VDISK.STATUS.WARN:"degraded"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | Discovery of network interfaces. |
SNMP agent | net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: NIC [{#NICFQDD}/{#NICMAC}]: Link status | This attribute defines the connection status of the network device. |
SNMP agent | dell.server.net.if.link[{#NIC_FQDD}] Preprocessing
|
Dell R740: NIC [{#NICFQDD}/{#NICMAC}]: Status | This attribute defines the status of the network device. |
SNMP agent | dell.server.net.if.status[{#NIC_FQDD}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: NIC [{#NICFQDD}/{#NICMAC}]: Link down | This trigger expression works as follows: |
{$DELL.SNMP.IFCONTROL:"{#NIC_FQDD}"}=1 and last(/DELL PowerEdge R740 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#1)<>1 and last(/DELL PowerEdge R740 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#1)<>last(/DELL PowerEdge R740 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#2) |Average |
Manual close: Yes | |
Dell R740: NIC [{#NICFQDD}/{#NICMAC}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R740 by SNMP/dell.server.net.if.status[{#NIC_FQDD}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
CPU status discovery | CPU status discovery. |
SNMP agent | cpu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: CPU [{#CPU_FQDD}]: Status | This attribute defines the status of the processor device status probe. This status will be joined into the |
SNMP agent | dell.server.cpu.status[cpu.{#SNMPINDEX}] Preprocessing
|
Dell R740: CPU [{#CPU_FQDD}]: State | This attribute defines the reading of the processor device status probe. |
SNMP agent | dell.server.cpu.state[cpu.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: CPU [{#CPU_FQDD}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R740 by SNMP/dell.server.cpu.status[cpu.{#SNMPINDEX}],#1)<>3 |Average |
||
Dell R740: CPU [{#CPU_FQDD}]: Reading error | MIB: IDRAC-MIB-SMIv2 |
bitand(last(/DELL PowerEdge R740 by SNMP/dell.server.cpu.state[cpu.{#SNMPINDEX}],#1),128)=0 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
System battery discovery | System battery discovery. |
SNMP agent | system.battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: System battery [{#SNMPVALUE}]: Status | This attribute defines the status of the battery. |
SNMP agent | dell.server.system.battery[{#SNMPVALUE}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: System battery [{#SNMPVALUE}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R740 by SNMP/dell.server.system.battery[{#SNMPVALUE}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Memory discovery | Memory discovery. |
SNMP agent | memory.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Memory [{#SNMPVALUE}]: Status | This attribute defines the status of the memory device. |
SNMP agent | dell.server.memory.status[{#SNMPVALUE}] Preprocessing
|
Dell R740: Memory [{#SNMPVALUE}]: Size | This attribute defines the size, in KB, of the memory device. Zero indicates no memory installed; 2,147,483,647 indicates an unknown memory size. |
SNMP agent | dell.server.memory.size[{#SNMPVALUE}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Memory [{#SNMPVALUE}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R740 by SNMP/dell.server.memory.status[{#SNMPVALUE}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Voltage probe discovery | Voltage probe discovery. |
SNMP agent | voltage.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Voltage probe [{#VPROBE_NAME}]: Voltage | This attribute defines the reading for a voltage probe. |
SNMP agent | dell.server.voltage.value[{#VPROBE_NAME}] Preprocessing
|
Dell R740: Voltage probe [{#VPROBE_NAME}]: Status | This attribute defines the status of the voltage probe. |
SNMP agent | dell.server.voltage.status[{#VPROBE_NAME}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Voltage probe [{#VPROBE_NAME}]: Status is not OK | Please check the device's voltage. |
last(/DELL PowerEdge R740 by SNMP/dell.server.voltage.status[{#VPROBE_NAME}])<>3 |Average |
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring DELL PowerEdge R740 servers with iDRAC 8/9 firmware 4.32 (and later) with Redfish API enabled via Zabbix script items. This template works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
1. Enable Redfish API in the Dell iDRAC interface of your server.
2. Create a user for monitoring with read-only permissions in the Dell iDRAC interface.
3. Create a host for Dell server with iDRAC IP as the Zabbix agent interface.
4. Link the template to the host.
5. Customize the values of the {$DELL.HTTP.API.URL}
, {$DELL.HTTP.API.USER}
, and {$DELL.HTTP.API.PASSWORD}
macros.
NOTE! If you are experiencing timeouts on some of the items that are executing requests, adjust the
{$DELL.HTTP.REQUEST.TIMEOUT}
macro accordingly.
Name | Description | Default |
---|---|---|
{$DELL.HTTP.API.URL} | The Dell iDRAC Redfish API URL in the format |
<Put your URL here> |
{$DELL.HTTP.API.USER} | The Dell iDRAC username. |
<Put your username here> |
{$DELL.HTTP.API.PASSWORD} | The Dell iDRAC user password. |
<Put your password here> |
{$DELL.HTTP.PROXY} | Set an HTTP proxy for Redfish API requests if needed. |
|
{$DELL.HTTP.RETURN.CODE.OK} | Set the HTTP return code that represents an OK response from the API. The default is "200", but can vary, for example, if a proxy is used. |
200 |
{$DELL.HTTP.REQUEST.TIMEOUT} | Set the timeout for HTTP requests. |
10s |
{$DELL.HTTP.IFCONTROL} | Link status trigger will be fired only for interfaces that have the context macro equal to "1". |
1 |
{$DELL.HTTP.CPU.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about CPU utilization. |
90 |
{$DELL.HTTP.CPU.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about CPU utilization. |
75 |
{$DELL.HTTP.MEM.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about memory utilization. |
90 |
{$DELL.HTTP.MEM.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about memory utilization. |
75 |
{$DELL.HTTP.IO.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about IO utilization. |
90 |
{$DELL.HTTP.IO.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about IO utilization. |
75 |
{$DELL.HTTP.SYS.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about SYS utilization. |
90 |
{$DELL.HTTP.SYS.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about SYS utilization. |
75 |
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Get system | Returns system metrics. |
Script | dell.server.system.get |
Dell R740: Get sensors | Returns sensors. |
Script | dell.server.sensors.get |
Dell R740: Get array controller resources | Returns array controller resources. |
Script | dell.server.array.resources.get |
Dell R740: Get disks | Returns storage resources. |
Script | dell.server.disks.get |
Dell R740: Get network interfaces | Returns network interfaces. |
Script | dell.server.net.iface.get |
Dell R740: CPU utilization, in % | CPU utilization. |
Dependent item | dell.server.util.cpu Preprocessing
|
Dell R740: Memory utilization, in % | Memory utilization. |
Dependent item | dell.server.util.mem Preprocessing
|
Dell R740: IO utilization, in % | IO utilization. |
Dependent item | dell.server.util.io Preprocessing
|
Dell R740: SYS utilization, in % | SYS utilization. |
Dependent item | dell.server.util.sys Preprocessing
|
Dell R740: Overall system health status | This attribute defines the overall rollup status of all the components in the system monitored by the remote access card. Includes system, storage, IO devices, iDRAC, CPU, memory, etc. |
Dependent item | dell.server.status Preprocessing
|
Dell R740: Hardware model name | This attribute defines the model name of the system. |
Dependent item | dell.server.hw.model Preprocessing
|
Dell R740: Hardware serial number | This attribute defines the service tag of the system. |
Dependent item | dell.server.hw.serialnumber Preprocessing
|
Dell R740: Firmware version | This attribute defines the firmware version of a remote access card. |
Dependent item | dell.server.hw.firmware Preprocessing
|
Dell R740: Redfish API status | Availability of Redfish API on the server. Possible values: 0 - Unavailable 1 - Available |
Simple check | net.tcp.service[https] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: CPU utilization is too high | Current CPU utilization has exceeded |
min(/DELL PowerEdge R740 by HTTP/dell.server.util.cpu,5m)>={$DELL.HTTP.CPU.UTIL.HIGH} |High |
||
Dell R740: CPU utilization is high | Current CPU utilization has exceeded |
min(/DELL PowerEdge R740 by HTTP/dell.server.util.cpu,5m)>={$DELL.HTTP.CPU.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R740: Memory utilization is too high | Current memory utilization has exceeded |
min(/DELL PowerEdge R740 by HTTP/dell.server.util.mem,5m)>={$DELL.HTTP.MEM.UTIL.HIGH} |High |
||
Dell R740: Memory utilization is high | Current memory utilization has exceeded |
min(/DELL PowerEdge R740 by HTTP/dell.server.util.mem,5m)>={$DELL.HTTP.MEM.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R740: IO utilization is too high | Current IO utilization has exceeded |
min(/DELL PowerEdge R740 by HTTP/dell.server.util.io,5m)>={$DELL.HTTP.IO.UTIL.HIGH} |High |
||
Dell R740: IO utilization is high | Current IO utilization has exceeded |
min(/DELL PowerEdge R740 by HTTP/dell.server.util.io,5m)>={$DELL.HTTP.IO.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R740: SYS utilization is too high | Current SYS utilization has exceeded |
min(/DELL PowerEdge R740 by HTTP/dell.server.util.sys,5m)>={$DELL.HTTP.SYS.UTIL.HIGH} |High |
||
Dell R740: SYS utilization is high | Current SYS utilization has exceeded |
min(/DELL PowerEdge R740 by HTTP/dell.server.util.sys,5m)>={$DELL.HTTP.SYS.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R740: Server is in a critical state | Please check the device for faults. |
last(/DELL PowerEdge R740 by HTTP/dell.server.status,)=3 |Average |
||
Dell R740: Server is in a warning state | Please check the device for warnings. |
last(/DELL PowerEdge R740 by HTTP/dell.server.status,)=2 |Warning |
Depends on:
|
|
Dell R740: Device has been replaced | The device serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R740 by HTTP/dell.server.hw.serialnumber,#1)<>last(/DELL PowerEdge R740 by HTTP/dell.server.hw.serialnumber,#2) and length(last(/DELL PowerEdge R740 by HTTP/dell.server.hw.serialnumber))>0 |Info |
Manual close: Yes | |
Dell R740: Firmware has changed | The firmware version has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R740 by HTTP/dell.server.hw.firmware,#1)<>last(/DELL PowerEdge R740 by HTTP/dell.server.hw.firmware,#2) and length(last(/DELL PowerEdge R740 by HTTP/dell.server.hw.firmware))>0 |Info |
Manual close: Yes | |
Dell R740: Redfish API service is unavailable | The service is unavailable or does not accept TCP connections. |
last(/DELL PowerEdge R740 by HTTP/net.tcp.service[https])=0 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Discovery of temperature sensors. |
Dependent item | dell.server.temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Probe [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.temp.get[{#SENSOR_NAME}] Preprocessing
|
Dell R740: Probe [{#SENSOR_NAME}]: Value | Sensor value. |
Dependent item | dell.server.sensor.temp.value[{#SENSOR_NAME}] Preprocessing
|
Dell R740: Probe [{#SENSOR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.temp.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Probe [{#SENSOR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R740 by HTTP/dell.server.sensor.temp.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R740: Probe [{#SENSOR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R740 by HTTP/dell.server.sensor.temp.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | Discovery of PSU sensors. |
Dependent item | dell.server.psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Power supply [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.psu.get[{#SENSOR_NAME}] Preprocessing
|
Dell R740: Power supply [{#SENSOR_NAME}]: Voltage | Sensor value. |
Dependent item | dell.server.sensor.psu.voltage[{#SENSOR_NAME}] Preprocessing
|
Dell R740: Power supply [{#SENSOR_NAME}]: Voltage sensor status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}] Preprocessing
|
Dell R740: Power supply [{#SENSOR_NAME}]: Current | Sensor value. |
Dependent item | dell.server.sensor.psu.current[{#SENSOR_NAME}] Preprocessing
|
Dell R740: Power supply [{#SENSOR_NAME}]: Current sensor status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.psu.current.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Power supply [{#SENSOR_NAME}]: Voltage sensor: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R740 by HTTP/dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R740: Power supply [{#SENSOR_NAME}]: Voltage sensor: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R740 by HTTP/dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R740: Power supply [{#SENSOR_NAME}]: Current sensor: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R740 by HTTP/dell.server.sensor.psu.current.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R740: Power supply [{#SENSOR_NAME}]: Current sensor: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R740 by HTTP/dell.server.sensor.psu.current.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN discovery | Discovery of FAN sensors. |
Dependent item | dell.server.fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Fan [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.fan.get[{#SENSOR_NAME}] Preprocessing
|
Dell R740: Fan [{#SENSOR_NAME}]: Speed | Sensor value. |
Dependent item | dell.server.sensor.fan.speed[{#SENSOR_NAME}] Preprocessing
|
Dell R740: Fan [{#SENSOR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.fan.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Fan [{#SENSOR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R740 by HTTP/dell.server.sensor.fan.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R740: Fan [{#SENSOR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R740 by HTTP/dell.server.sensor.fan.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Discovery of disk array controllers. |
Dependent item | dell.server.array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Controller [{#CNTLR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.array.status[{#ID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Controller [{#CNTLR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R740 by HTTP/dell.server.array.status[{#ID}],)=3 |Average |
||
Dell R740: Controller [{#CNTLR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R740 by HTTP/dell.server.array.status[{#ID}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Battery discovery | Discovery of battery controllers. |
Dependent item | dell.server.controller.battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Battery [{#BATTERY_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.controller.battery.status[{#ID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Battery [{#BATTERY_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R740 by HTTP/dell.server.controller.battery.status[{#ID}],)=3 |Average |
||
Dell R740: Battery [{#BATTERY_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R740 by HTTP/dell.server.controller.battery.status[{#ID}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Discovery of physical disks. |
Dependent item | dell.server.physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Physical disk [{#DISK_NAME}]: Get disk | Returns the metrics of a physical disk. |
Script | dell.server.hw.physicaldisk.get[{#DISK_NAME}] |
Dell R740: Physical disk [{#DISK_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.hw.physicaldisk.status[{#DISK_NAME}] Preprocessing
|
Dell R740: Physical disk [{#DISK_NAME}]: Serial number | The serial number of this drive. |
Dependent item | dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}] Preprocessing
|
Dell R740: Physical disk [{#DISK_NAME}]: Model name | The model number of the drive. |
Dependent item | dell.server.hw.physicaldisk.model[{#DISK_NAME}] Preprocessing
|
Dell R740: Physical disk [{#DISK_NAME}]: Media type | The type of media contained in this drive. Possible values: HDD, SSD, SMR, null. |
Dependent item | dell.server.hw.physicaldisk.mediatype[{#DISKNAME}] Preprocessing
|
Dell R740: Physical disk [{#DISK_NAME}]: Size | The size, in bytes, of this drive. |
Dependent item | dell.server.hw.physicaldisk.size[{#DISK_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Physical disk [{#DISK_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R740 by HTTP/dell.server.hw.physicaldisk.status[{#DISK_NAME}],)=3 |Average |
||
Dell R740: Physical disk [{#DISK_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R740 by HTTP/dell.server.hw.physicaldisk.status[{#DISK_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R740: Physical disk [{#DISK_NAME}]: Has been replaced | [{#DISK_NAME}] serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R740 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}],#1)<>last(/DELL PowerEdge R740 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}],#2) and length(last(/DELL PowerEdge R740 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | Discovery of virtual disks. |
Dependent item | dell.server.virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Virtual disk [{#DISK_NAME}]: Get disk | Returns the metrics of a virtual disk. |
Script | dell.server.hw.virtualdisk.get[{#DISK_NAME}] |
Dell R740: Virtual disk [{#DISK_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.hw.virtualdisk.status[{#DISK_NAME}] Preprocessing
|
Dell R740: Virtual disk [{#DISK_NAME}]: RAID status | This property represents the RAID specific status. Possible values: Blocked, Degraded, Failed, Foreign, Offline, Online, Ready, Unknown, null. |
Dependent item | dell.server.hw.virtualdisk.raidstatus[{#DISKNAME}] Preprocessing
|
Dell R740: Virtual disk [{#DISK_NAME}]: Size | The size in bytes of this Volume. |
Dependent item | dell.server.hw.virtualdisk.size[{#DISK_NAME}] Preprocessing
|
Dell R740: Virtual disk [{#DISK_NAME}]: Current state | The known state of the Resource, for example, Enabled. Possible values: Enabled, Disabled, StandbyOffline, StandbySpare, InTest, Starting, Absent, UnavailableOffline, Deferring, Quiesced, Updating, Qualified. |
Dependent item | dell.server.hw.virtualdisk.state[{#DISK_NAME}] Preprocessing
|
Dell R740: Virtual disk [{#DISK_NAME}]: Read policy | Indicates the read cache policy setting for the Volume. Possible values: ReadAhead, NoReadAhead, AdaptiveReadAhead. |
Dependent item | dell.server.hw.virtualdisk.readpolicy[{#DISKNAME}] Preprocessing
|
Dell R740: Virtual disk [{#DISK_NAME}]: Write policy | Indicates the write cache policy setting for the Volume. Possible values: WriteThrough, WriteBack, ProtectedWriteBack, UnprotectedWriteBack. |
Dependent item | dell.server.hw.virtualdisk.writepolicy[{#DISKNAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Virtual disk [{#DISK_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R740 by HTTP/dell.server.hw.virtualdisk.status[{#DISK_NAME}],)=3 |Average |
||
Dell R740: Virtual disk [{#DISK_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R740 by HTTP/dell.server.hw.virtualdisk.status[{#DISK_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R740: Virtual disk [{#DISK_NAME}]: RAID status not OK | Please check the disk for faults. |
last(/DELL PowerEdge R740 by HTTP/dell.server.hw.virtualdisk.raid_status[{#DISK_NAME}],)<8 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | Discovery of network interfaces. |
Dependent item | dell.server.net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R740: Interface [{#IFNAME}]: Get interface | Returns the metrics of a network interface. |
Script | dell.server.net.if.get[{#IFNAME}] |
Dell R740: Interface [{#IFNAME}]: Speed | The network port current link speed. |
Dependent item | dell.server.net.if.speed[{#IFNAME}] Preprocessing
|
Dell R740: Interface [{#IFNAME}]: Link status | The status of the link between this port and its link partner. Possible values: Down, Up, null. |
Dependent item | dell.server.net.if.status[{#IFNAME}] Preprocessing
|
Dell R740: Interface [{#IFNAME}]: State | The known state of the Resource, for example, Enabled. Possible values: Enabled, Disabled, StandbyOffline, StandbySpare, InTest, Starting, Absent, UnavailableOffline, Deferring, Quiesced, Updating, Qualified. |
Dependent item | dell.server.net.if.state[{#IFNAME}] Preprocessing
|
Dell R740: Interface [{#IFNAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.net.if.health[{#IFNAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R740: Interface [{#IFNAME}]: Link down | This trigger expression works as follows: |
{$DELL.HTTP.IFCONTROL:"{#IFNAME}"}=1 and (last(/DELL PowerEdge R740 by HTTP/dell.server.net.if.status[{#IFNAME}],)=2 and last(/DELL PowerEdge R740 by HTTP/dell.server.net.if.status[{#IFNAME}],#1)<>last(/DELL PowerEdge R740 by HTTP/dell.server.net.if.status[{#IFNAME}],#2)) |Average |
Manual close: Yes | |
Dell R740: Interface [{#IFNAME}]: Link status issue | This trigger expression works as follows: |
{$DELL.HTTP.IFCONTROL:"{#IFNAME}"}=1 and (last(/DELL PowerEdge R740 by HTTP/dell.server.net.if.status[{#IFNAME}],)<2 and last(/DELL PowerEdge R740 by HTTP/dell.server.net.if.status[{#IFNAME}],#1)<>last(/DELL PowerEdge R740 by HTTP/dell.server.net.if.status[{#IFNAME}],#2)) |Average |
Manual close: Yes | |
Dell R740: Interface [{#IFNAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R740 by HTTP/dell.server.net.if.health[{#IFNAME}],)=3 |Average |
||
Dell R740: Interface [{#IFNAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R740 by HTTP/dell.server.net.if.health[{#IFNAME}],)=2 |Warning |
Depends on:
|
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring DELL PowerEdge R720 servers with iDRAC version 7 (and later) via Zabbix SNMP agent that works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Default |
---|---|---|
{$DELL.SNMP.DISCOVERY.VOLTAGE.NAME.MATCHES} | Sets the regex string of voltage probe names to allow in discovery. |
^.*Voltage.*$ |
{$DELL.SNMP.DISCOVERY.VOLTAGE.NAME.NOT_MATCHES} | Sets the regex string of voltage probe names to ignore in discovery. |
CHANGE_IF_NEEDED |
{$DELL.SNMP.DISCOVERY.VOLTAGE.TYPE.MATCHES} | Sets the regex string of voltage probe types to allow in discovery. |
18|16 |
{$DELL.SNMP.DISCOVERY.VOLTAGE.TYPE.NOT_MATCHES} | Sets the regex string of voltage probe types to ignore in discovery. |
CHANGE_IF_NEEDED |
{$DELL.SNMP.SENSOR.TEMP.STATUS.OK} | The OK status of the temperature probe for the trigger expression. |
3 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalUpper"} | The warning status of the temperature probe for the trigger expression. |
4 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalLower"} | The warning status of the temperature probe for the trigger expression. |
7 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalUpper"} | The critical status of the temperature probe for the trigger expression. |
5 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableUpper"} | The critical status of the temperature probe for the trigger expression. |
6 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalLower"} | The critical status of the temperature probe for the trigger expression. |
8 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableLower"} | The critical status of the temperature probe for the trigger expression. |
9 |
{$DELL.SNMP.HEALTH.STATUS.DISASTER} | The disaster status of health for the trigger expression. |
6 |
{$DELL.SNMP.HEALTH.STATUS.CRIT} | The critical status of health for the trigger expression. |
5 |
{$DELL.SNMP.HEALTH.STATUS.WARN} | The warning status of health for the trigger expression. |
4 |
{$DELL.SNMP.PSU.STATUS.WARN:"nonCritical"} | The warning value of the PSU sensor for the trigger expression. |
4 |
{$DELL.SNMP.PSU.STATUS.CRIT:"critical"} | The critical value of the PSU sensor for the trigger expression. |
5 |
{$DELL.SNMP.PSU.STATUS.CRIT:"nonRecoverable"} | The critical value of the PSU sensor for the trigger expression. |
6 |
{$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalUpper"} | The warning value of the FAN sensor for the trigger expression. |
4 |
{$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalLower"} | The warning value of the FAN sensor for the trigger expression. |
7 |
{$DELL.SNMP.FAN.STATUS.CRIT:"criticalUpper"} | The critical value of the FAN sensor for the trigger expression. |
5 |
{$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableUpper"} | The critical value of the FAN sensor for the trigger expression. |
6 |
{$DELL.SNMP.FAN.STATUS.CRIT:"criticalLower"} | The critical value of the FAN sensor for the trigger expression. |
8 |
{$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableLower"} | The critical value of the FAN sensor for the trigger expression. |
9 |
{$DELL.SNMP.FAN.STATUS.CRIT:"failed"} | The critical value of the FAN sensor for the trigger expression. |
10 |
{$DELL.SNMP.DISK.ARRAY.STATUS.FAIL} | The disaster status of the disk array for the trigger expression. |
6 |
{$DELL.SNMP.DISK.ARRAY.STATUS.CRIT} | The critical status of the disk array for the trigger expression. |
5 |
{$DELL.SNMP.DISK.ARRAY.STATUS.WARN} | The warning status of the disk array for the trigger expression. |
4 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT} | The critical status of the disk array cache battery for the trigger expression. |
3 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.WARN} | The warning status of the disk array cache battery for the trigger expression. |
4 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.OK} | The OK status of the disk array cache battery for the trigger expression. |
2 |
{$DELL.SNMP.VDISK.STATUS.CRIT:"failed"} | The critical status of the virtual disk for the trigger expression. |
3 |
{$DELL.SNMP.VDISK.STATUS.WARN:"degraded"} | The warning status of the virtual disk for the trigger expression. |
4 |
{$DELL.SNMP.DISK.STATUS.WARN:"nonCritical"} | The warning status of the disk for the trigger expression. |
4 |
{$DELL.SNMP.DISK.STATUS.FAIL:"critical"} | The critical status of the disk for the trigger expression. |
5 |
{$DELL.SNMP.DISK.STATUS.FAIL:"nonRecoverable"} | The critical status of the disk for the trigger expression. |
6 |
{$DELL.SNMP.DISK.SMART.STATUS.FAIL} | The critical S.M.A.R.T status of the disk for the trigger expression. |
1 |
{$DELL.SNMP.TIMEOUT} | The time interval for the SNMP agent availability trigger expression. |
5m |
{$DELL.SNMP.IFCONTROL} | The link status trigger will be fired only for interfaces that have the context macro equal to "1". |
1 |
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Overall system health status | MIB: IDRAC-MIB-SMIv2 This attribute defines the overall rollup status of all the components in the system monitored by the remote access card. Includes system, storage, IO devices, iDRAC, CPU, memory, etc. |
SNMP agent | dell.server.status[globalSystemStatus] Preprocessing
|
Dell R720: Hardware model name | MIB: IDRAC-MIB-SMIv2 This attribute defines the model name of the system. |
SNMP agent | dell.server.hw.model[systemModelName] Preprocessing
|
Dell R720: Hardware serial number | MIB: IDRAC-MIB-SMIv2 This attribute defines the service tag of the system. |
SNMP agent | dell.server.hw.serialnumber[systemServiceTag] Preprocessing
|
Dell R720: Operating system | MIB: IDRAC-MIB-SMIv2 This attribute defines the name of the operating system that the host is running. |
SNMP agent | dell.server.sw.os[systemOSName] Preprocessing
|
Dell R720: Firmware version | MIB: IDRAC-MIB-SMIv2 This attribute defines the firmware version of a remote access card. |
SNMP agent | dell.server.hw.firmware[racFirmwareVersion] Preprocessing
|
Dell R720: Uptime (network) | MIB: SNMP-FRAMEWORK-MIB The number of seconds since the value of the snmpEngineBoots object last changed. |
SNMP agent | dell.server.net.uptime[snmpEngineTime] |
Dell R720: Uptime (hardware) | MIB: IDRAC-MIB-SMIv2 This attribute defines the power-up time of the system in seconds. |
SNMP agent | dell.server.hw.uptime[systemPowerUpTime] |
Dell R720: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other |
SNMP trap | snmptrap.fallback |
Dell R720: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., 'telephone closet, 3rd floor'). If the location is unknown, the value is a zero-length string. |
SNMP agent | dell.server.location[sysLocation] Preprocessing
|
Dell R720: System contact details | MIB: SNMPv2-MIB Name and contact information of the contact person for the node. If not provided, the value is a zero-length string. |
SNMP agent | dell.server.contact[sysContact] Preprocessing
|
Dell R720: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the entity as part of the vendor's SMI enterprises subtree with the prefix 1.3.6.1.4.1 (e.g., a vendor with the identifier 1.3.6.1.4.1.4242 might assign a system object with the OID 1.3.6.1.4.1.4242.1.1). |
SNMP agent | dell.server.objectid[sysObjectID] Preprocessing
|
Dell R720: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node. By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is a zero-length string. |
SNMP agent | dell.server.name[sysName] Preprocessing
|
Dell R720: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating system, and networking software. |
SNMP agent | dell.server.descr[sysDescr] Preprocessing
|
Dell R720: SNMP agent availability | Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list. Possible values: 0 - not available 1 - available 2 - unknown |
Zabbix internal | zabbix[host,snmp,available] Preprocessing
|
Dell R720: Memory, total size | Total memory amount on the device. |
Calculated | dell.server.memory.size.total Preprocessing
|
Dell R720: BIOS version | MIB: IDRAC-MIB-SMIv2 This attribute defines the version name of the system BIOS. |
SNMP agent | dell.server.bios.version Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: System is in unrecoverable state | Please check the device for faults. |
last(/DELL PowerEdge R720 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.DISASTER} |High |
||
Dell R720: System status is in critical state | Please check the device for errors. |
last(/DELL PowerEdge R720 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.CRIT} |Average |
||
Dell R720: System status is in warning state | Please check the device for warnings. |
last(/DELL PowerEdge R720 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.WARN} |Warning |
Depends on:
|
|
Dell R720: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R720 by SNMP/dell.server.hw.serialnumber[systemServiceTag],#1)<>last(/DELL PowerEdge R720 by SNMP/dell.server.hw.serialnumber[systemServiceTag],#2) and length(last(/DELL PowerEdge R720 by SNMP/dell.server.hw.serialnumber[systemServiceTag]))>0 |Info |
Manual close: Yes | |
Dell R720: Operating system description has changed | Operating system description has changed. Possibly, the system has been updated or replaced. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R720 by SNMP/dell.server.sw.os[systemOSName],#1)<>last(/DELL PowerEdge R720 by SNMP/dell.server.sw.os[systemOSName],#2) and length(last(/DELL PowerEdge R720 by SNMP/dell.server.sw.os[systemOSName]))>0 |Info |
Manual close: Yes | |
Dell R720: Firmware has changed | Firmware version has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R720 by SNMP/dell.server.hw.firmware[racFirmwareVersion],#1)<>last(/DELL PowerEdge R720 by SNMP/dell.server.hw.firmware[racFirmwareVersion],#2) and length(last(/DELL PowerEdge R720 by SNMP/dell.server.hw.firmware[racFirmwareVersion]))>0 |Info |
Manual close: Yes | |
Dell R720: Host has been restarted | Uptime is less than 10 minutes. |
(last(/DELL PowerEdge R720 by SNMP/dell.server.hw.uptime[systemPowerUpTime])>0 and last(/DELL PowerEdge R720 by SNMP/dell.server.hw.uptime[systemPowerUpTime])<10m) or (last(/DELL PowerEdge R720 by SNMP/dell.server.hw.uptime[systemPowerUpTime])=0 and last(/DELL PowerEdge R720 by SNMP/dell.server.net.uptime[snmpEngineTime])<10m) |Warning |
Manual close: Yes | |
Dell R720: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R720 by SNMP/dell.server.name[sysName],#1)<>last(/DELL PowerEdge R720 by SNMP/dell.server.name[sysName],#2) and length(last(/DELL PowerEdge R720 by SNMP/dell.server.name[sysName]))>0 |Info |
Manual close: Yes | |
Dell R720: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/DELL PowerEdge R720 by SNMP/zabbix[host,snmp,available],{$DELL.SNMP.TIMEOUT})=0 |Warning |
||
Dell R720: Memory amount has changed | change(/DELL PowerEdge R720 by SNMP/dell.server.memory.size.total)>0 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Scanning table of Temperature Probe Table IDRAC-MIB-SMIv2::temperatureProbeTable |
SNMP agent | temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Probe [{#SENSOR_LOCALE}]: Value | MIB: IDRAC-MIB-SMIv2 This attribute defines the reading for a temperature probe of type other than When the value for When the value for |
SNMP agent | dell.server.sensor.temp.value[temperatureProbeReading.{#SNMPINDEX}] Preprocessing
|
Dell R720: Probe [{#SENSOR_LOCALE}]: Status | MIB: IDRAC-MIB-SMIv2 This attribute defines the probe status of the temperature probe. Possible values: other(1), -- probe status is not one of the following: unknown(2), -- probe status is unknown (not known or monitored) ok(3), -- probe is reporting a value within the thresholds nonCriticalUpper(4), -- probe has crossed the upper noncritical threshold criticalUpper(5), -- probe has crossed the upper critical threshold nonRecoverableUpper(6), -- probe has crossed the upper non-recoverable threshold nonCriticalLower(7), -- probe has crossed the lower noncritical threshold criticalLower(8), -- probe has crossed the lower critical threshold nonRecoverableLower(9), -- probe has crossed the lower non-recoverable threshold failed(10) -- probe is not functional |
SNMP agent | dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Probe [{#SENSOR_LOCALE}]: Critical status | Please check the device for faults. |
last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalUpper"} or last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableUpper"} or last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalLower"} or last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableLower"} |Average |
||
Dell R720: Probe [{#SENSOR_LOCALE}]: Warning status | Please check the device for faults. |
last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalUpper"} or last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalLower"} |Warning |
Depends on:
|
|
Dell R720: Probe [{#SENSOR_LOCALE}]: Not in optimal status | Please check the device for faults. |
last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])<>{$DELL.SNMP.SENSOR.TEMP.STATUS.OK} |Info |
Manual close: Yes Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | IDRAC-MIB-SMIv2::powerSupplyTable |
SNMP agent | psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Power supply [{#PSU_DESCR}]: State | MIB: IDRAC-MIB-SMIv2 This attribute defines the status of the power supply. |
SNMP agent | dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Power supply [{#PSU_DESCR}]: Critical state | Please check the power supply unit for errors. |
last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.CRIT:"critical"} or last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.CRIT:"nonRecoverable"} |Average |
||
Dell R720: Power supply [{#PSU_DESCR}]: Warning state | Please check the power supply unit for errors. |
last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.WARN:"nonCritical"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Fan discovery | IDRAC-MIB-SMIv2::coolingDeviceTable |
SNMP agent | fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Fan [{#FAN_DESCR}]: Status | MIB: IDRAC-MIB-SMIv2 This attribute defines the probe status of the cooling device. |
SNMP agent | dell.server.sensor.fan.status[{#FAN_DESCR}] Preprocessing
|
Dell R720: Fan [{#FAN_DESCR}]: Speed | MIB: IDRAC-MIB-SMIv2 This attribute defines the reading for a cooling device of a subtype other than When the value for When the value for |
SNMP agent | dell.server.sensor.fan.speed[{#FAN_DESCR}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Fan [{#FAN_DESCR}]: Critical state | Please check the fan unit. |
last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"criticalUpper"} or last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableUpper"} or last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"criticalLower"} or last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableLower"} or last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"failed"} |Average |
||
Dell R720: Fan [{#FAN_DESCR}]: Warning state | Please check the fan unit. |
last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalUpper"} or last(/DELL PowerEdge R720 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalLower"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Scanning table of Array controllers: IDRAC-MIB-SMIv2::controllerTable |
SNMP agent | array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Controller [{#CNTLR_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The status of the controller itself without the propagation of any contained component status. Possible values: 1: Other 2: Unknown 3: OK 4: Non-critical 5: Critical 6: Non-recoverable |
SNMP agent | dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}] Preprocessing
|
Dell R720: Controller [{#CNTLR_NAME}]: Model | MIB: IDRAC-MIB-SMIv2 The controller's name as represented in Storage Management. |
SNMP agent | dell.server.hw.diskarray.model[controllerName.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Controller [{#CNTLR_NAME}]: Unrecoverable state | Please check the device for faults. |
last(/DELL PowerEdge R720 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.FAIL} |High |
||
Dell R720: Controller [{#CNTLR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R720 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.CRIT} |Average |
Depends on:
|
|
Dell R720: Controller [{#CNTLR_NAME}]: Warning state | Please check the device for faults. |
last(/DELL PowerEdge R720 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Battery discovery | Scanning Battery Table: IDRAC-MIB-SMIv2::batteryTable |
SNMP agent | battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Battery [{#BATTERY_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 Current state of battery. Possible values: 1: The current state could not be determined. 2: The battery is operating normally. 3: The battery has failed and needs to be replaced. 4: The battery temperature is high or charge level is depleting. 5: The battery is missing or not detected. 6: The battery is undergoing the re-charge phase. 7: The battery voltage or charge level is below the threshold. |
SNMP agent | dell.server.hw.battery.status[batteryState.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Battery [{#BATTERY_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R720 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT} |Average |
||
Dell R720: Battery [{#BATTERY_NAME}]: Warning state | Please check the device for faults. |
last(/DELL PowerEdge R720 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.WARN} |Warning |
Depends on:
|
|
Dell R720: Battery [{#BATTERY_NAME}]: Not in optimal state | Please check the device for faults. |
last(/DELL PowerEdge R720 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])<>{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.OK} |Info |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Scanning table of physical drive entries IDRAC-MIB-SMIv2::physicalDiskTable. |
SNMP agent | physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Physical disk [{#DISK_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The status of the physical disk itself without the propagation of any contained component status. Possible values: 1: Other 2: Unknown 3: OK 4: Non-critical 5: Critical 6: Non-recoverable |
SNMP agent | dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}] Preprocessing
|
Dell R720: Physical disk [{#DISK_NAME}]: S.M.A.R.T. Status | MIB: IDRAC-MIB-SMIv2 Indicates whether the physical disk has received a predictive failure alert. |
SNMP agent | dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}] Preprocessing
|
Dell R720: Physical disk [{#DISK_NAME}]: Serial number | MIB: IDRAC-MIB-SMIv2 The physical disk's unique identification number from the manufacturer. |
SNMP agent | dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}] Preprocessing
|
Dell R720: Physical disk [{#DISK_NAME}]: Model name | MIB: IDRAC-MIB-SMIv2 The model number of the physical disk. |
SNMP agent | dell.server.hw.physicaldisk.model[physicalDiskProductID.{#SNMPINDEX}] Preprocessing
|
Dell R720: Physical disk [{#DISK_NAME}]: Media type | MIB: IDRAC-MIB-SMIv2 The media type of the physical disk. Possible Values: 1: The media type could not be determined. 2: Hard Disk Drive (HDD). 3: Solid State Drive (SSD). |
SNMP agent | dell.server.hw.physicaldisk.media_type[physicalDiskMediaType.{#SNMPINDEX}] Preprocessing
|
Dell R720: Physical disk [{#DISK_NAME}]: Size | MIB: IDRAC-MIB-SMIv2 The size of the physical disk in megabytes. |
SNMP agent | dell.server.hw.physicaldisk.size[physicalDiskCapacityInMB.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Physical disk [{#DISK_NAME}]: Failed state | Please check physical disk for warnings or errors. |
last(/DELL PowerEdge R720 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.FAIL:"critical"} or last(/DELL PowerEdge R720 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.FAIL:"nonRecoverable"} |High |
||
Dell R720: Physical disk [{#DISK_NAME}]: Warning state | Please check physical disk for warnings or errors. |
last(/DELL PowerEdge R720 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.WARN:"nonCritical"} |Warning |
Depends on:
|
|
Dell R720: Physical disk [{#DISK_NAME}]: S.M.A.R.T. failed | Disk probably requires replacement. |
last(/DELL PowerEdge R720 by SNMP/dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}])={$DELL.SNMP.DISK.SMART.STATUS.FAIL:"replaceDrive"} or last(/DELL PowerEdge R720 by SNMP/dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}])={$DELL.SNMP.DISK.SMART.STATUS.FAIL:"replaceDriveSSDWearOut"} |High |
Depends on:
|
|
Dell R720: Physical disk [{#DISK_NAME}]: Has been replaced | [{#DISK_NAME}] serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R720 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}],#1)<>last(/DELL PowerEdge R720 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}],#2) and length(last(/DELL PowerEdge R720 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | IDRAC-MIB-SMIv2::virtualDiskTable |
SNMP agent | virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Virtual disk [{#DISK_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The current state of this virtual disk (which includes any member physical disks.) Possible states: 1: The current state could not be determined. 2: The virtual disk is operating normally or optimally. 3: The virtual disk has encountered a failure. Data on the disk is lost or is about to be lost. 4: The virtual disk encountered a failure with one or all of the constituent redundant physical disks. The data on the virtual disk might no longer be fault tolerant. |
SNMP agent | dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}] Preprocessing
|
Dell R720: Virtual disk [{#DISK_NAME}]: Layout type | MIB: IDRAC-MIB-SMIv2 The virtual disk's RAID type. Possible values: 1: Not one of the following 2: RAID-0 3: RAID-1 4: RAID-5 5: RAID-6 6: RAID-10 7: RAID-50 8: RAID-60 9: Concatenated RAID 1 10: Concatenated RAID 5 |
SNMP agent | dell.server.hw.virtualdisk.layout[virtualDiskLayout.{#SNMPINDEX}] Preprocessing
|
Dell R720: Virtual disk [{#DISK_NAME}]: Size | MIB: IDRAC-MIB-SMIv2 The size of the virtual disk in megabytes. |
SNMP agent | dell.server.hw.virtualdisk.size[virtualDiskSizeInMB.{#SNMPINDEX}] Preprocessing
|
Dell R720: Virtual disk [{#DISK_NAME}]: Operational state | MIB: IDRAC-MIB-SMIv2 The state of the virtual disk when there are progressive operations ongoing. Possible values: 1: There is no active operation running. 2: The virtual disk configuration has changed. The physical disks included in the virtual disk are being modified to support the new configuration. 3: A Consistency Check (CC) is being performed on the virtual disk. 4: The virtual disk is being initialized. 5: BackGround Initialization (BGI) is being performed on the virtual disk. |
SNMP agent | dell.server.hw.virtualdisk.state[virtualDiskOperationalState.{#SNMPINDEX}] |
Dell R720: Virtual disk [{#DISK_NAME}]: Read policy | MIB: IDRAC-MIB-SMIv2 The read policy used by the controller for read operations on this virtual disk. Possible values: 1: No Read Ahead. 2: Read Ahead. 3: Adaptive Read Ahead. |
SNMP agent | dell.server.hw.virtualdisk.read_policy[virtualDiskReadPolicy.{#SNMPINDEX}] |
Dell R720: Virtual disk [{#DISK_NAME}]: Write policy | MIB: IDRAC-MIB-SMIv2 The write policy used by the controller for write operations on this virtual disk. Possible values: 1: Write Through. 2: Write Back. 3: Force Write Back. |
SNMP agent | dell.server.hw.virtualdisk.write_policy[virtualDiskWritePolicy.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Virtual disk [{#DISK_NAME}]: Failed state | Please check the virtual disk for warnings or errors. |
last(/DELL PowerEdge R720 by SNMP/dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}])={$DELL.SNMP.VDISK.STATUS.CRIT:"failed"} |High |
||
Dell R720: Virtual disk [{#DISK_NAME}]: Warning state | Please check the virtual disk for warnings or errors. |
last(/DELL PowerEdge R720 by SNMP/dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}])={$DELL.SNMP.VDISK.STATUS.WARN:"degraded"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | Discovery of network interfaces. |
SNMP agent | net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: NIC [{#NICFQDD}/{#NICMAC}]: Link status | This attribute defines the connection status of the network device. |
SNMP agent | dell.server.net.if.link[{#NIC_FQDD}] Preprocessing
|
Dell R720: NIC [{#NICFQDD}/{#NICMAC}]: Status | This attribute defines the status of the network device. |
SNMP agent | dell.server.net.if.status[{#NIC_FQDD}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: NIC [{#NICFQDD}/{#NICMAC}]: Link down | This trigger expression works as follows: |
{$DELL.SNMP.IFCONTROL:"{#NIC_FQDD}"}=1 and last(/DELL PowerEdge R720 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#1)<>1 and last(/DELL PowerEdge R720 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#1)<>last(/DELL PowerEdge R720 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#2) |Average |
Manual close: Yes | |
Dell R720: NIC [{#NICFQDD}/{#NICMAC}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R720 by SNMP/dell.server.net.if.status[{#NIC_FQDD}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
CPU status discovery | CPU status discovery. |
SNMP agent | cpu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: CPU [{#CPU_FQDD}]: Status | This attribute defines the status of the processor device status probe. This status will be joined into the |
SNMP agent | dell.server.cpu.status[cpu.{#SNMPINDEX}] Preprocessing
|
Dell R720: CPU [{#CPU_FQDD}]: State | This attribute defines the reading of the processor device status probe. |
SNMP agent | dell.server.cpu.state[cpu.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: CPU [{#CPU_FQDD}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R720 by SNMP/dell.server.cpu.status[cpu.{#SNMPINDEX}],#1)<>3 |Average |
||
Dell R720: CPU [{#CPU_FQDD}]: Reading error | MIB: IDRAC-MIB-SMIv2 |
bitand(last(/DELL PowerEdge R720 by SNMP/dell.server.cpu.state[cpu.{#SNMPINDEX}],#1),128)=0 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
System battery discovery | System battery discovery. |
SNMP agent | system.battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: System battery [{#SNMPVALUE}]: Status | This attribute defines the status of the battery. |
SNMP agent | dell.server.system.battery[{#SNMPVALUE}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: System battery [{#SNMPVALUE}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R720 by SNMP/dell.server.system.battery[{#SNMPVALUE}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Memory discovery | Memory discovery. |
SNMP agent | memory.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Memory [{#SNMPVALUE}]: Status | This attribute defines the status of the memory device. |
SNMP agent | dell.server.memory.status[{#SNMPVALUE}] Preprocessing
|
Dell R720: Memory [{#SNMPVALUE}]: Size | This attribute defines the size, in KB, of the memory device. Zero indicates no memory installed; 2,147,483,647 indicates an unknown memory size. |
SNMP agent | dell.server.memory.size[{#SNMPVALUE}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Memory [{#SNMPVALUE}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R720 by SNMP/dell.server.memory.status[{#SNMPVALUE}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Voltage probe discovery | Voltage probe discovery. |
SNMP agent | voltage.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Voltage probe [{#VPROBE_NAME}]: Voltage | This attribute defines the reading for a voltage probe. |
SNMP agent | dell.server.voltage.value[{#VPROBE_NAME}] Preprocessing
|
Dell R720: Voltage probe [{#VPROBE_NAME}]: Status | This attribute defines the status of the voltage probe. |
SNMP agent | dell.server.voltage.status[{#VPROBE_NAME}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Voltage probe [{#VPROBE_NAME}]: Status is not OK | Please check the device's voltage. |
last(/DELL PowerEdge R720 by SNMP/dell.server.voltage.status[{#VPROBE_NAME}])<>3 |Average |
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring DELL PowerEdge R720 servers with iDRAC 8/9 firmware 4.32 (and later) with Redfish API enabled via Zabbix script items. This template works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
1. Enable Redfish API in the Dell iDRAC interface of your server.
2. Create a user for monitoring with read-only permissions in the Dell iDRAC interface.
3. Create a host for Dell server with iDRAC IP as the Zabbix agent interface.
4. Link the template to the host.
5. Customize the values of the {$DELL.HTTP.API.URL}
, {$DELL.HTTP.API.USER}
, and {$DELL.HTTP.API.PASSWORD}
macros.
NOTE! If you are experiencing timeouts on some of the items that are executing requests, adjust the
{$DELL.HTTP.REQUEST.TIMEOUT}
macro accordingly.
Name | Description | Default |
---|---|---|
{$DELL.HTTP.API.URL} | The Dell iDRAC Redfish API URL in the format |
<Put your URL here> |
{$DELL.HTTP.API.USER} | The Dell iDRAC username. |
<Put your username here> |
{$DELL.HTTP.API.PASSWORD} | The Dell iDRAC user password. |
<Put your password here> |
{$DELL.HTTP.PROXY} | Set an HTTP proxy for Redfish API requests if needed. |
|
{$DELL.HTTP.RETURN.CODE.OK} | Set the HTTP return code that represents an OK response from the API. The default is "200", but can vary, for example, if a proxy is used. |
200 |
{$DELL.HTTP.REQUEST.TIMEOUT} | Set the timeout for HTTP requests. |
10s |
{$DELL.HTTP.IFCONTROL} | Link status trigger will be fired only for interfaces that have the context macro equal to "1". |
1 |
{$DELL.HTTP.CPU.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about CPU utilization. |
90 |
{$DELL.HTTP.CPU.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about CPU utilization. |
75 |
{$DELL.HTTP.MEM.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about memory utilization. |
90 |
{$DELL.HTTP.MEM.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about memory utilization. |
75 |
{$DELL.HTTP.IO.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about IO utilization. |
90 |
{$DELL.HTTP.IO.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about IO utilization. |
75 |
{$DELL.HTTP.SYS.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about SYS utilization. |
90 |
{$DELL.HTTP.SYS.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about SYS utilization. |
75 |
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Get system | Returns system metrics. |
Script | dell.server.system.get |
Dell R720: Get sensors | Returns sensors. |
Script | dell.server.sensors.get |
Dell R720: Get array controller resources | Returns array controller resources. |
Script | dell.server.array.resources.get |
Dell R720: Get disks | Returns storage resources. |
Script | dell.server.disks.get |
Dell R720: Get network interfaces | Returns network interfaces. |
Script | dell.server.net.iface.get |
Dell R720: CPU utilization, in % | CPU utilization. |
Dependent item | dell.server.util.cpu Preprocessing
|
Dell R720: Memory utilization, in % | Memory utilization. |
Dependent item | dell.server.util.mem Preprocessing
|
Dell R720: IO utilization, in % | IO utilization. |
Dependent item | dell.server.util.io Preprocessing
|
Dell R720: SYS utilization, in % | SYS utilization. |
Dependent item | dell.server.util.sys Preprocessing
|
Dell R720: Overall system health status | This attribute defines the overall rollup status of all the components in the system monitored by the remote access card. Includes system, storage, IO devices, iDRAC, CPU, memory, etc. |
Dependent item | dell.server.status Preprocessing
|
Dell R720: Hardware model name | This attribute defines the model name of the system. |
Dependent item | dell.server.hw.model Preprocessing
|
Dell R720: Hardware serial number | This attribute defines the service tag of the system. |
Dependent item | dell.server.hw.serialnumber Preprocessing
|
Dell R720: Firmware version | This attribute defines the firmware version of a remote access card. |
Dependent item | dell.server.hw.firmware Preprocessing
|
Dell R720: Redfish API status | Availability of Redfish API on the server. Possible values: 0 - Unavailable 1 - Available |
Simple check | net.tcp.service[https] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: CPU utilization is too high | Current CPU utilization has exceeded |
min(/DELL PowerEdge R720 by HTTP/dell.server.util.cpu,5m)>={$DELL.HTTP.CPU.UTIL.HIGH} |High |
||
Dell R720: CPU utilization is high | Current CPU utilization has exceeded |
min(/DELL PowerEdge R720 by HTTP/dell.server.util.cpu,5m)>={$DELL.HTTP.CPU.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R720: Memory utilization is too high | Current memory utilization has exceeded |
min(/DELL PowerEdge R720 by HTTP/dell.server.util.mem,5m)>={$DELL.HTTP.MEM.UTIL.HIGH} |High |
||
Dell R720: Memory utilization is high | Current memory utilization has exceeded |
min(/DELL PowerEdge R720 by HTTP/dell.server.util.mem,5m)>={$DELL.HTTP.MEM.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R720: IO utilization is too high | Current IO utilization has exceeded |
min(/DELL PowerEdge R720 by HTTP/dell.server.util.io,5m)>={$DELL.HTTP.IO.UTIL.HIGH} |High |
||
Dell R720: IO utilization is high | Current IO utilization has exceeded |
min(/DELL PowerEdge R720 by HTTP/dell.server.util.io,5m)>={$DELL.HTTP.IO.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R720: SYS utilization is too high | Current SYS utilization has exceeded |
min(/DELL PowerEdge R720 by HTTP/dell.server.util.sys,5m)>={$DELL.HTTP.SYS.UTIL.HIGH} |High |
||
Dell R720: SYS utilization is high | Current SYS utilization has exceeded |
min(/DELL PowerEdge R720 by HTTP/dell.server.util.sys,5m)>={$DELL.HTTP.SYS.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R720: Server is in a critical state | Please check the device for faults. |
last(/DELL PowerEdge R720 by HTTP/dell.server.status,)=3 |Average |
||
Dell R720: Server is in a warning state | Please check the device for warnings. |
last(/DELL PowerEdge R720 by HTTP/dell.server.status,)=2 |Warning |
Depends on:
|
|
Dell R720: Device has been replaced | The device serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R720 by HTTP/dell.server.hw.serialnumber,#1)<>last(/DELL PowerEdge R720 by HTTP/dell.server.hw.serialnumber,#2) and length(last(/DELL PowerEdge R720 by HTTP/dell.server.hw.serialnumber))>0 |Info |
Manual close: Yes | |
Dell R720: Firmware has changed | The firmware version has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R720 by HTTP/dell.server.hw.firmware,#1)<>last(/DELL PowerEdge R720 by HTTP/dell.server.hw.firmware,#2) and length(last(/DELL PowerEdge R720 by HTTP/dell.server.hw.firmware))>0 |Info |
Manual close: Yes | |
Dell R720: Redfish API service is unavailable | The service is unavailable or does not accept TCP connections. |
last(/DELL PowerEdge R720 by HTTP/net.tcp.service[https])=0 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Discovery of temperature sensors. |
Dependent item | dell.server.temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Probe [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.temp.get[{#SENSOR_NAME}] Preprocessing
|
Dell R720: Probe [{#SENSOR_NAME}]: Value | Sensor value. |
Dependent item | dell.server.sensor.temp.value[{#SENSOR_NAME}] Preprocessing
|
Dell R720: Probe [{#SENSOR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.temp.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Probe [{#SENSOR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R720 by HTTP/dell.server.sensor.temp.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R720: Probe [{#SENSOR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R720 by HTTP/dell.server.sensor.temp.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | Discovery of PSU sensors. |
Dependent item | dell.server.psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Power supply [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.psu.get[{#SENSOR_NAME}] Preprocessing
|
Dell R720: Power supply [{#SENSOR_NAME}]: Voltage | Sensor value. |
Dependent item | dell.server.sensor.psu.voltage[{#SENSOR_NAME}] Preprocessing
|
Dell R720: Power supply [{#SENSOR_NAME}]: Voltage sensor status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}] Preprocessing
|
Dell R720: Power supply [{#SENSOR_NAME}]: Current | Sensor value. |
Dependent item | dell.server.sensor.psu.current[{#SENSOR_NAME}] Preprocessing
|
Dell R720: Power supply [{#SENSOR_NAME}]: Current sensor status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.psu.current.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Power supply [{#SENSOR_NAME}]: Voltage sensor: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R720 by HTTP/dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R720: Power supply [{#SENSOR_NAME}]: Voltage sensor: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R720 by HTTP/dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R720: Power supply [{#SENSOR_NAME}]: Current sensor: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R720 by HTTP/dell.server.sensor.psu.current.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R720: Power supply [{#SENSOR_NAME}]: Current sensor: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R720 by HTTP/dell.server.sensor.psu.current.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN discovery | Discovery of FAN sensors. |
Dependent item | dell.server.fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Fan [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.fan.get[{#SENSOR_NAME}] Preprocessing
|
Dell R720: Fan [{#SENSOR_NAME}]: Speed | Sensor value. |
Dependent item | dell.server.sensor.fan.speed[{#SENSOR_NAME}] Preprocessing
|
Dell R720: Fan [{#SENSOR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.fan.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Fan [{#SENSOR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R720 by HTTP/dell.server.sensor.fan.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R720: Fan [{#SENSOR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R720 by HTTP/dell.server.sensor.fan.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Discovery of disk array controllers. |
Dependent item | dell.server.array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Controller [{#CNTLR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.array.status[{#ID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Controller [{#CNTLR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R720 by HTTP/dell.server.array.status[{#ID}],)=3 |Average |
||
Dell R720: Controller [{#CNTLR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R720 by HTTP/dell.server.array.status[{#ID}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Battery discovery | Discovery of battery controllers. |
Dependent item | dell.server.controller.battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Battery [{#BATTERY_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.controller.battery.status[{#ID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Battery [{#BATTERY_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R720 by HTTP/dell.server.controller.battery.status[{#ID}],)=3 |Average |
||
Dell R720: Battery [{#BATTERY_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R720 by HTTP/dell.server.controller.battery.status[{#ID}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Discovery of physical disks. |
Dependent item | dell.server.physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Physical disk [{#DISK_NAME}]: Get disk | Returns the metrics of a physical disk. |
Script | dell.server.hw.physicaldisk.get[{#DISK_NAME}] |
Dell R720: Physical disk [{#DISK_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.hw.physicaldisk.status[{#DISK_NAME}] Preprocessing
|
Dell R720: Physical disk [{#DISK_NAME}]: Serial number | The serial number of this drive. |
Dependent item | dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}] Preprocessing
|
Dell R720: Physical disk [{#DISK_NAME}]: Model name | The model number of the drive. |
Dependent item | dell.server.hw.physicaldisk.model[{#DISK_NAME}] Preprocessing
|
Dell R720: Physical disk [{#DISK_NAME}]: Media type | The type of media contained in this drive. Possible values: HDD, SSD, SMR, null. |
Dependent item | dell.server.hw.physicaldisk.mediatype[{#DISKNAME}] Preprocessing
|
Dell R720: Physical disk [{#DISK_NAME}]: Size | The size, in bytes, of this drive. |
Dependent item | dell.server.hw.physicaldisk.size[{#DISK_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Physical disk [{#DISK_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R720 by HTTP/dell.server.hw.physicaldisk.status[{#DISK_NAME}],)=3 |Average |
||
Dell R720: Physical disk [{#DISK_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R720 by HTTP/dell.server.hw.physicaldisk.status[{#DISK_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R720: Physical disk [{#DISK_NAME}]: Has been replaced | [{#DISK_NAME}] serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R720 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}],#1)<>last(/DELL PowerEdge R720 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}],#2) and length(last(/DELL PowerEdge R720 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | Discovery of virtual disks. |
Dependent item | dell.server.virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Virtual disk [{#DISK_NAME}]: Get disk | Returns the metrics of a virtual disk. |
Script | dell.server.hw.virtualdisk.get[{#DISK_NAME}] |
Dell R720: Virtual disk [{#DISK_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.hw.virtualdisk.status[{#DISK_NAME}] Preprocessing
|
Dell R720: Virtual disk [{#DISK_NAME}]: RAID status | This property represents the RAID specific status. Possible values: Blocked, Degraded, Failed, Foreign, Offline, Online, Ready, Unknown, null. |
Dependent item | dell.server.hw.virtualdisk.raidstatus[{#DISKNAME}] Preprocessing
|
Dell R720: Virtual disk [{#DISK_NAME}]: Size | The size in bytes of this Volume. |
Dependent item | dell.server.hw.virtualdisk.size[{#DISK_NAME}] Preprocessing
|
Dell R720: Virtual disk [{#DISK_NAME}]: Current state | The known state of the Resource, for example, Enabled. Possible values: Enabled, Disabled, StandbyOffline, StandbySpare, InTest, Starting, Absent, UnavailableOffline, Deferring, Quiesced, Updating, Qualified. |
Dependent item | dell.server.hw.virtualdisk.state[{#DISK_NAME}] Preprocessing
|
Dell R720: Virtual disk [{#DISK_NAME}]: Read policy | Indicates the read cache policy setting for the Volume. Possible values: ReadAhead, NoReadAhead, AdaptiveReadAhead. |
Dependent item | dell.server.hw.virtualdisk.readpolicy[{#DISKNAME}] Preprocessing
|
Dell R720: Virtual disk [{#DISK_NAME}]: Write policy | Indicates the write cache policy setting for the Volume. Possible values: WriteThrough, WriteBack, ProtectedWriteBack, UnprotectedWriteBack. |
Dependent item | dell.server.hw.virtualdisk.writepolicy[{#DISKNAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Virtual disk [{#DISK_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R720 by HTTP/dell.server.hw.virtualdisk.status[{#DISK_NAME}],)=3 |Average |
||
Dell R720: Virtual disk [{#DISK_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R720 by HTTP/dell.server.hw.virtualdisk.status[{#DISK_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R720: Virtual disk [{#DISK_NAME}]: RAID status not OK | Please check the disk for faults. |
last(/DELL PowerEdge R720 by HTTP/dell.server.hw.virtualdisk.raid_status[{#DISK_NAME}],)<8 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | Discovery of network interfaces. |
Dependent item | dell.server.net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R720: Interface [{#IFNAME}]: Get interface | Returns the metrics of a network interface. |
Script | dell.server.net.if.get[{#IFNAME}] |
Dell R720: Interface [{#IFNAME}]: Speed | The network port current link speed. |
Dependent item | dell.server.net.if.speed[{#IFNAME}] Preprocessing
|
Dell R720: Interface [{#IFNAME}]: Link status | The status of the link between this port and its link partner. Possible values: Down, Up, null. |
Dependent item | dell.server.net.if.status[{#IFNAME}] Preprocessing
|
Dell R720: Interface [{#IFNAME}]: State | The known state of the Resource, for example, Enabled. Possible values: Enabled, Disabled, StandbyOffline, StandbySpare, InTest, Starting, Absent, UnavailableOffline, Deferring, Quiesced, Updating, Qualified. |
Dependent item | dell.server.net.if.state[{#IFNAME}] Preprocessing
|
Dell R720: Interface [{#IFNAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.net.if.health[{#IFNAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R720: Interface [{#IFNAME}]: Link down | This trigger expression works as follows: |
{$DELL.HTTP.IFCONTROL:"{#IFNAME}"}=1 and (last(/DELL PowerEdge R720 by HTTP/dell.server.net.if.status[{#IFNAME}],)=2 and last(/DELL PowerEdge R720 by HTTP/dell.server.net.if.status[{#IFNAME}],#1)<>last(/DELL PowerEdge R720 by HTTP/dell.server.net.if.status[{#IFNAME}],#2)) |Average |
Manual close: Yes | |
Dell R720: Interface [{#IFNAME}]: Link status issue | This trigger expression works as follows: |
{$DELL.HTTP.IFCONTROL:"{#IFNAME}"}=1 and (last(/DELL PowerEdge R720 by HTTP/dell.server.net.if.status[{#IFNAME}],)<2 and last(/DELL PowerEdge R720 by HTTP/dell.server.net.if.status[{#IFNAME}],#1)<>last(/DELL PowerEdge R720 by HTTP/dell.server.net.if.status[{#IFNAME}],#2)) |Average |
Manual close: Yes | |
Dell R720: Interface [{#IFNAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R720 by HTTP/dell.server.net.if.health[{#IFNAME}],)=3 |Average |
||
Dell R720: Interface [{#IFNAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R720 by HTTP/dell.server.net.if.health[{#IFNAME}],)=2 |Warning |
Depends on:
|
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring DELL PowerEdge R660 servers with iDRAC version 7 (and later) via Zabbix SNMP agent that works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Default |
---|---|---|
{$DELL.SNMP.DISCOVERY.VOLTAGE.NAME.MATCHES} | Sets the regex string of voltage probe names to allow in discovery. |
^.*Voltage.*$ |
{$DELL.SNMP.DISCOVERY.VOLTAGE.NAME.NOT_MATCHES} | Sets the regex string of voltage probe names to ignore in discovery. |
CHANGE_IF_NEEDED |
{$DELL.SNMP.DISCOVERY.VOLTAGE.TYPE.MATCHES} | Sets the regex string of voltage probe types to allow in discovery. |
18|16 |
{$DELL.SNMP.DISCOVERY.VOLTAGE.TYPE.NOT_MATCHES} | Sets the regex string of voltage probe types to ignore in discovery. |
CHANGE_IF_NEEDED |
{$DELL.SNMP.SENSOR.TEMP.STATUS.OK} | The OK status of the temperature probe for the trigger expression. |
3 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalUpper"} | The warning status of the temperature probe for the trigger expression. |
4 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalLower"} | The warning status of the temperature probe for the trigger expression. |
7 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalUpper"} | The critical status of the temperature probe for the trigger expression. |
5 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableUpper"} | The critical status of the temperature probe for the trigger expression. |
6 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalLower"} | The critical status of the temperature probe for the trigger expression. |
8 |
{$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableLower"} | The critical status of the temperature probe for the trigger expression. |
9 |
{$DELL.SNMP.HEALTH.STATUS.DISASTER} | The disaster status of health for the trigger expression. |
6 |
{$DELL.SNMP.HEALTH.STATUS.CRIT} | The critical status of health for the trigger expression. |
5 |
{$DELL.SNMP.HEALTH.STATUS.WARN} | The warning status of health for the trigger expression. |
4 |
{$DELL.SNMP.PSU.STATUS.WARN:"nonCritical"} | The warning value of the PSU sensor for the trigger expression. |
4 |
{$DELL.SNMP.PSU.STATUS.CRIT:"critical"} | The critical value of the PSU sensor for the trigger expression. |
5 |
{$DELL.SNMP.PSU.STATUS.CRIT:"nonRecoverable"} | The critical value of the PSU sensor for the trigger expression. |
6 |
{$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalUpper"} | The warning value of the FAN sensor for the trigger expression. |
4 |
{$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalLower"} | The warning value of the FAN sensor for the trigger expression. |
7 |
{$DELL.SNMP.FAN.STATUS.CRIT:"criticalUpper"} | The critical value of the FAN sensor for the trigger expression. |
5 |
{$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableUpper"} | The critical value of the FAN sensor for the trigger expression. |
6 |
{$DELL.SNMP.FAN.STATUS.CRIT:"criticalLower"} | The critical value of the FAN sensor for the trigger expression. |
8 |
{$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableLower"} | The critical value of the FAN sensor for the trigger expression. |
9 |
{$DELL.SNMP.FAN.STATUS.CRIT:"failed"} | The critical value of the FAN sensor for the trigger expression. |
10 |
{$DELL.SNMP.DISK.ARRAY.STATUS.FAIL} | The disaster status of the disk array for the trigger expression. |
6 |
{$DELL.SNMP.DISK.ARRAY.STATUS.CRIT} | The critical status of the disk array for the trigger expression. |
5 |
{$DELL.SNMP.DISK.ARRAY.STATUS.WARN} | The warning status of the disk array for the trigger expression. |
4 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT} | The critical status of the disk array cache battery for the trigger expression. |
3 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.WARN} | The warning status of the disk array cache battery for the trigger expression. |
4 |
{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.OK} | The OK status of the disk array cache battery for the trigger expression. |
2 |
{$DELL.SNMP.VDISK.STATUS.CRIT:"failed"} | The critical status of the virtual disk for the trigger expression. |
3 |
{$DELL.SNMP.VDISK.STATUS.WARN:"degraded"} | The warning status of the virtual disk for the trigger expression. |
4 |
{$DELL.SNMP.DISK.STATUS.WARN:"nonCritical"} | The warning status of the disk for the trigger expression. |
4 |
{$DELL.SNMP.DISK.STATUS.FAIL:"critical"} | The critical status of the disk for the trigger expression. |
5 |
{$DELL.SNMP.DISK.STATUS.FAIL:"nonRecoverable"} | The critical status of the disk for the trigger expression. |
6 |
{$DELL.SNMP.DISK.SMART.STATUS.FAIL} | The critical S.M.A.R.T status of the disk for the trigger expression. |
1 |
{$DELL.SNMP.TIMEOUT} | The time interval for the SNMP agent availability trigger expression. |
5m |
{$DELL.SNMP.IFCONTROL} | The link status trigger will be fired only for interfaces that have the context macro equal to "1". |
1 |
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Overall system health status | MIB: IDRAC-MIB-SMIv2 This attribute defines the overall rollup status of all the components in the system monitored by the remote access card. Includes system, storage, IO devices, iDRAC, CPU, memory, etc. |
SNMP agent | dell.server.status[globalSystemStatus] Preprocessing
|
Dell R660: Hardware model name | MIB: IDRAC-MIB-SMIv2 This attribute defines the model name of the system. |
SNMP agent | dell.server.hw.model[systemModelName] Preprocessing
|
Dell R660: Hardware serial number | MIB: IDRAC-MIB-SMIv2 This attribute defines the service tag of the system. |
SNMP agent | dell.server.hw.serialnumber[systemServiceTag] Preprocessing
|
Dell R660: Operating system | MIB: IDRAC-MIB-SMIv2 This attribute defines the name of the operating system that the host is running. |
SNMP agent | dell.server.sw.os[systemOSName] Preprocessing
|
Dell R660: Firmware version | MIB: IDRAC-MIB-SMIv2 This attribute defines the firmware version of a remote access card. |
SNMP agent | dell.server.hw.firmware[racFirmwareVersion] Preprocessing
|
Dell R660: Uptime (network) | MIB: SNMP-FRAMEWORK-MIB The number of seconds since the value of the snmpEngineBoots object last changed. |
SNMP agent | dell.server.net.uptime[snmpEngineTime] |
Dell R660: Uptime (hardware) | MIB: IDRAC-MIB-SMIv2 This attribute defines the power-up time of the system in seconds. |
SNMP agent | dell.server.hw.uptime[systemPowerUpTime] |
Dell R660: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other |
SNMP trap | snmptrap.fallback |
Dell R660: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., 'telephone closet, 3rd floor'). If the location is unknown, the value is a zero-length string. |
SNMP agent | dell.server.location[sysLocation] Preprocessing
|
Dell R660: System contact details | MIB: SNMPv2-MIB Name and contact information of the contact person for the node. If not provided, the value is a zero-length string. |
SNMP agent | dell.server.contact[sysContact] Preprocessing
|
Dell R660: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the entity as part of the vendor's SMI enterprises subtree with the prefix 1.3.6.1.4.1 (e.g., a vendor with the identifier 1.3.6.1.4.1.4242 might assign a system object with the OID 1.3.6.1.4.1.4242.1.1). |
SNMP agent | dell.server.objectid[sysObjectID] Preprocessing
|
Dell R660: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node. By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is a zero-length string. |
SNMP agent | dell.server.name[sysName] Preprocessing
|
Dell R660: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating system, and networking software. |
SNMP agent | dell.server.descr[sysDescr] Preprocessing
|
Dell R660: SNMP agent availability | Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list. Possible values: 0 - not available 1 - available 2 - unknown |
Zabbix internal | zabbix[host,snmp,available] Preprocessing
|
Dell R660: Memory, total size | Total memory amount on the device. |
Calculated | dell.server.memory.size.total Preprocessing
|
Dell R660: BIOS version | MIB: IDRAC-MIB-SMIv2 This attribute defines the version name of the system BIOS. |
SNMP agent | dell.server.bios.version Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: System is in unrecoverable state | Please check the device for faults. |
last(/DELL PowerEdge R660 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.DISASTER} |High |
||
Dell R660: System status is in critical state | Please check the device for errors. |
last(/DELL PowerEdge R660 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.CRIT} |Average |
||
Dell R660: System status is in warning state | Please check the device for warnings. |
last(/DELL PowerEdge R660 by SNMP/dell.server.status[globalSystemStatus])={$DELL.SNMP.HEALTH.STATUS.WARN} |Warning |
Depends on:
|
|
Dell R660: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R660 by SNMP/dell.server.hw.serialnumber[systemServiceTag],#1)<>last(/DELL PowerEdge R660 by SNMP/dell.server.hw.serialnumber[systemServiceTag],#2) and length(last(/DELL PowerEdge R660 by SNMP/dell.server.hw.serialnumber[systemServiceTag]))>0 |Info |
Manual close: Yes | |
Dell R660: Operating system description has changed | Operating system description has changed. Possibly, the system has been updated or replaced. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R660 by SNMP/dell.server.sw.os[systemOSName],#1)<>last(/DELL PowerEdge R660 by SNMP/dell.server.sw.os[systemOSName],#2) and length(last(/DELL PowerEdge R660 by SNMP/dell.server.sw.os[systemOSName]))>0 |Info |
Manual close: Yes | |
Dell R660: Firmware has changed | Firmware version has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R660 by SNMP/dell.server.hw.firmware[racFirmwareVersion],#1)<>last(/DELL PowerEdge R660 by SNMP/dell.server.hw.firmware[racFirmwareVersion],#2) and length(last(/DELL PowerEdge R660 by SNMP/dell.server.hw.firmware[racFirmwareVersion]))>0 |Info |
Manual close: Yes | |
Dell R660: Host has been restarted | Uptime is less than 10 minutes. |
(last(/DELL PowerEdge R660 by SNMP/dell.server.hw.uptime[systemPowerUpTime])>0 and last(/DELL PowerEdge R660 by SNMP/dell.server.hw.uptime[systemPowerUpTime])<10m) or (last(/DELL PowerEdge R660 by SNMP/dell.server.hw.uptime[systemPowerUpTime])=0 and last(/DELL PowerEdge R660 by SNMP/dell.server.net.uptime[snmpEngineTime])<10m) |Warning |
Manual close: Yes | |
Dell R660: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R660 by SNMP/dell.server.name[sysName],#1)<>last(/DELL PowerEdge R660 by SNMP/dell.server.name[sysName],#2) and length(last(/DELL PowerEdge R660 by SNMP/dell.server.name[sysName]))>0 |Info |
Manual close: Yes | |
Dell R660: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/DELL PowerEdge R660 by SNMP/zabbix[host,snmp,available],{$DELL.SNMP.TIMEOUT})=0 |Warning |
||
Dell R660: Memory amount has changed | change(/DELL PowerEdge R660 by SNMP/dell.server.memory.size.total)>0 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Scanning table of Temperature Probe Table IDRAC-MIB-SMIv2::temperatureProbeTable |
SNMP agent | temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Probe [{#SENSOR_LOCALE}]: Value | MIB: IDRAC-MIB-SMIv2 This attribute defines the reading for a temperature probe of type other than When the value for When the value for |
SNMP agent | dell.server.sensor.temp.value[temperatureProbeReading.{#SNMPINDEX}] Preprocessing
|
Dell R660: Probe [{#SENSOR_LOCALE}]: Status | MIB: IDRAC-MIB-SMIv2 This attribute defines the probe status of the temperature probe. Possible values: other(1), -- probe status is not one of the following: unknown(2), -- probe status is unknown (not known or monitored) ok(3), -- probe is reporting a value within the thresholds nonCriticalUpper(4), -- probe has crossed the upper noncritical threshold criticalUpper(5), -- probe has crossed the upper critical threshold nonRecoverableUpper(6), -- probe has crossed the upper non-recoverable threshold nonCriticalLower(7), -- probe has crossed the lower noncritical threshold criticalLower(8), -- probe has crossed the lower critical threshold nonRecoverableLower(9), -- probe has crossed the lower non-recoverable threshold failed(10) -- probe is not functional |
SNMP agent | dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Probe [{#SENSOR_LOCALE}]: Critical status | Please check the device for faults. |
last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalUpper"} or last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableUpper"} or last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"criticalLower"} or last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.CRIT:"nonRecoverableLower"} |Average |
||
Dell R660: Probe [{#SENSOR_LOCALE}]: Warning status | Please check the device for faults. |
last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalUpper"} or last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])={$DELL.SNMP.SENSOR.TEMP.STATUS.WARN:"nonCriticalLower"} |Warning |
Depends on:
|
|
Dell R660: Probe [{#SENSOR_LOCALE}]: Not in optimal status | Please check the device for faults. |
last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.temp.status[temperatureProbeStatus.{#SNMPINDEX}])<>{$DELL.SNMP.SENSOR.TEMP.STATUS.OK} |Info |
Manual close: Yes Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | IDRAC-MIB-SMIv2::powerSupplyTable |
SNMP agent | psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Power supply [{#PSU_DESCR}]: State | MIB: IDRAC-MIB-SMIv2 This attribute defines the status of the power supply. |
SNMP agent | dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Power supply [{#PSU_DESCR}]: Critical state | Please check the power supply unit for errors. |
last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.CRIT:"critical"} or last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.CRIT:"nonRecoverable"} |Average |
||
Dell R660: Power supply [{#PSU_DESCR}]: Warning state | Please check the power supply unit for errors. |
last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.psu.status[powerSupplyStatus.{#SNMPINDEX}])={$DELL.SNMP.PSU.STATUS.WARN:"nonCritical"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Fan discovery | IDRAC-MIB-SMIv2::coolingDeviceTable |
SNMP agent | fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Fan [{#FAN_DESCR}]: Status | MIB: IDRAC-MIB-SMIv2 This attribute defines the probe status of the cooling device. |
SNMP agent | dell.server.sensor.fan.status[{#FAN_DESCR}] Preprocessing
|
Dell R660: Fan [{#FAN_DESCR}]: Speed | MIB: IDRAC-MIB-SMIv2 This attribute defines the reading for a cooling device of a subtype other than When the value for When the value for |
SNMP agent | dell.server.sensor.fan.speed[{#FAN_DESCR}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Fan [{#FAN_DESCR}]: Critical state | Please check the fan unit. |
last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"criticalUpper"} or last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableUpper"} or last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"criticalLower"} or last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"nonRecoverableLower"} or last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.CRIT:"failed"} |Average |
||
Dell R660: Fan [{#FAN_DESCR}]: Warning state | Please check the fan unit. |
last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalUpper"} or last(/DELL PowerEdge R660 by SNMP/dell.server.sensor.fan.status[{#FAN_DESCR}])={$DELL.SNMP.FAN.STATUS.WARN:"nonCriticalLower"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Scanning table of Array controllers: IDRAC-MIB-SMIv2::controllerTable |
SNMP agent | array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Controller [{#CNTLR_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The status of the controller itself without the propagation of any contained component status. Possible values: 1: Other 2: Unknown 3: OK 4: Non-critical 5: Critical 6: Non-recoverable |
SNMP agent | dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}] Preprocessing
|
Dell R660: Controller [{#CNTLR_NAME}]: Model | MIB: IDRAC-MIB-SMIv2 The controller's name as represented in Storage Management. |
SNMP agent | dell.server.hw.diskarray.model[controllerName.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Controller [{#CNTLR_NAME}]: Unrecoverable state | Please check the device for faults. |
last(/DELL PowerEdge R660 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.FAIL} |High |
||
Dell R660: Controller [{#CNTLR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R660 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.CRIT} |Average |
Depends on:
|
|
Dell R660: Controller [{#CNTLR_NAME}]: Warning state | Please check the device for faults. |
last(/DELL PowerEdge R660 by SNMP/dell.server.hw.diskarray.status[controllerComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.STATUS.WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Battery discovery | Scanning Battery Table: IDRAC-MIB-SMIv2::batteryTable |
SNMP agent | battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Battery [{#BATTERY_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 Current state of battery. Possible values: 1: The current state could not be determined. 2: The battery is operating normally. 3: The battery has failed and needs to be replaced. 4: The battery temperature is high or charge level is depleting. 5: The battery is missing or not detected. 6: The battery is undergoing the re-charge phase. 7: The battery voltage or charge level is below the threshold. |
SNMP agent | dell.server.hw.battery.status[batteryState.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Battery [{#BATTERY_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R660 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT} |Average |
||
Dell R660: Battery [{#BATTERY_NAME}]: Warning state | Please check the device for faults. |
last(/DELL PowerEdge R660 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])={$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.WARN} |Warning |
Depends on:
|
|
Dell R660: Battery [{#BATTERY_NAME}]: Not in optimal state | Please check the device for faults. |
last(/DELL PowerEdge R660 by SNMP/dell.server.hw.battery.status[batteryState.{#SNMPINDEX}])<>{$DELL.SNMP.DISK.ARRAY.CACHE.BATTERY.STATUS.OK} |Info |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Scanning table of physical drive entries IDRAC-MIB-SMIv2::physicalDiskTable. |
SNMP agent | physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Physical disk [{#DISK_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The status of the physical disk itself without the propagation of any contained component status. Possible values: 1: Other 2: Unknown 3: OK 4: Non-critical 5: Critical 6: Non-recoverable |
SNMP agent | dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}] Preprocessing
|
Dell R660: Physical disk [{#DISK_NAME}]: S.M.A.R.T. Status | MIB: IDRAC-MIB-SMIv2 Indicates whether the physical disk has received a predictive failure alert. |
SNMP agent | dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}] Preprocessing
|
Dell R660: Physical disk [{#DISK_NAME}]: Serial number | MIB: IDRAC-MIB-SMIv2 The physical disk's unique identification number from the manufacturer. |
SNMP agent | dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}] Preprocessing
|
Dell R660: Physical disk [{#DISK_NAME}]: Model name | MIB: IDRAC-MIB-SMIv2 The model number of the physical disk. |
SNMP agent | dell.server.hw.physicaldisk.model[physicalDiskProductID.{#SNMPINDEX}] Preprocessing
|
Dell R660: Physical disk [{#DISK_NAME}]: Media type | MIB: IDRAC-MIB-SMIv2 The media type of the physical disk. Possible Values: 1: The media type could not be determined. 2: Hard Disk Drive (HDD). 3: Solid State Drive (SSD). |
SNMP agent | dell.server.hw.physicaldisk.media_type[physicalDiskMediaType.{#SNMPINDEX}] Preprocessing
|
Dell R660: Physical disk [{#DISK_NAME}]: Size | MIB: IDRAC-MIB-SMIv2 The size of the physical disk in megabytes. |
SNMP agent | dell.server.hw.physicaldisk.size[physicalDiskCapacityInMB.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Physical disk [{#DISK_NAME}]: Failed state | Please check physical disk for warnings or errors. |
last(/DELL PowerEdge R660 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.FAIL:"critical"} or last(/DELL PowerEdge R660 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.FAIL:"nonRecoverable"} |High |
||
Dell R660: Physical disk [{#DISK_NAME}]: Warning state | Please check physical disk for warnings or errors. |
last(/DELL PowerEdge R660 by SNMP/dell.server.hw.physicaldisk.status[physicalDiskComponentStatus.{#SNMPINDEX}])={$DELL.SNMP.DISK.STATUS.WARN:"nonCritical"} |Warning |
Depends on:
|
|
Dell R660: Physical disk [{#DISK_NAME}]: S.M.A.R.T. failed | Disk probably requires replacement. |
last(/DELL PowerEdge R660 by SNMP/dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}])={$DELL.SNMP.DISK.SMART.STATUS.FAIL:"replaceDrive"} or last(/DELL PowerEdge R660 by SNMP/dell.server.hw.physicaldisk.smart_status[physicalDiskSmartAlertIndication.{#SNMPINDEX}])={$DELL.SNMP.DISK.SMART.STATUS.FAIL:"replaceDriveSSDWearOut"} |High |
Depends on:
|
|
Dell R660: Physical disk [{#DISK_NAME}]: Has been replaced | [{#DISK_NAME}] serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R660 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}],#1)<>last(/DELL PowerEdge R660 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}],#2) and length(last(/DELL PowerEdge R660 by SNMP/dell.server.hw.physicaldisk.serialnumber[physicalDiskSerialNo.{#SNMPINDEX}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | IDRAC-MIB-SMIv2::virtualDiskTable |
SNMP agent | virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Virtual disk [{#DISK_NAME}]: Status | MIB: IDRAC-MIB-SMIv2 The current state of this virtual disk (which includes any member physical disks.) Possible states: 1: The current state could not be determined. 2: The virtual disk is operating normally or optimally. 3: The virtual disk has encountered a failure. Data on the disk is lost or is about to be lost. 4: The virtual disk encountered a failure with one or all of the constituent redundant physical disks. The data on the virtual disk might no longer be fault tolerant. |
SNMP agent | dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}] Preprocessing
|
Dell R660: Virtual disk [{#DISK_NAME}]: Layout type | MIB: IDRAC-MIB-SMIv2 The virtual disk's RAID type. Possible values: 1: Not one of the following 2: RAID-0 3: RAID-1 4: RAID-5 5: RAID-6 6: RAID-10 7: RAID-50 8: RAID-60 9: Concatenated RAID 1 10: Concatenated RAID 5 |
SNMP agent | dell.server.hw.virtualdisk.layout[virtualDiskLayout.{#SNMPINDEX}] Preprocessing
|
Dell R660: Virtual disk [{#DISK_NAME}]: Size | MIB: IDRAC-MIB-SMIv2 The size of the virtual disk in megabytes. |
SNMP agent | dell.server.hw.virtualdisk.size[virtualDiskSizeInMB.{#SNMPINDEX}] Preprocessing
|
Dell R660: Virtual disk [{#DISK_NAME}]: Operational state | MIB: IDRAC-MIB-SMIv2 The state of the virtual disk when there are progressive operations ongoing. Possible values: 1: There is no active operation running. 2: The virtual disk configuration has changed. The physical disks included in the virtual disk are being modified to support the new configuration. 3: A Consistency Check (CC) is being performed on the virtual disk. 4: The virtual disk is being initialized. 5: BackGround Initialization (BGI) is being performed on the virtual disk. |
SNMP agent | dell.server.hw.virtualdisk.state[virtualDiskOperationalState.{#SNMPINDEX}] |
Dell R660: Virtual disk [{#DISK_NAME}]: Read policy | MIB: IDRAC-MIB-SMIv2 The read policy used by the controller for read operations on this virtual disk. Possible values: 1: No Read Ahead. 2: Read Ahead. 3: Adaptive Read Ahead. |
SNMP agent | dell.server.hw.virtualdisk.read_policy[virtualDiskReadPolicy.{#SNMPINDEX}] |
Dell R660: Virtual disk [{#DISK_NAME}]: Write policy | MIB: IDRAC-MIB-SMIv2 The write policy used by the controller for write operations on this virtual disk. Possible values: 1: Write Through. 2: Write Back. 3: Force Write Back. |
SNMP agent | dell.server.hw.virtualdisk.write_policy[virtualDiskWritePolicy.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Virtual disk [{#DISK_NAME}]: Failed state | Please check the virtual disk for warnings or errors. |
last(/DELL PowerEdge R660 by SNMP/dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}])={$DELL.SNMP.VDISK.STATUS.CRIT:"failed"} |High |
||
Dell R660: Virtual disk [{#DISK_NAME}]: Warning state | Please check the virtual disk for warnings or errors. |
last(/DELL PowerEdge R660 by SNMP/dell.server.hw.virtualdisk.status[virtualDiskState.{#SNMPINDEX}])={$DELL.SNMP.VDISK.STATUS.WARN:"degraded"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | Discovery of network interfaces. |
SNMP agent | net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: NIC [{#NICFQDD}/{#NICMAC}]: Link status | This attribute defines the connection status of the network device. |
SNMP agent | dell.server.net.if.link[{#NIC_FQDD}] Preprocessing
|
Dell R660: NIC [{#NICFQDD}/{#NICMAC}]: Status | This attribute defines the status of the network device. |
SNMP agent | dell.server.net.if.status[{#NIC_FQDD}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: NIC [{#NICFQDD}/{#NICMAC}]: Link down | This trigger expression works as follows: |
{$DELL.SNMP.IFCONTROL:"{#NIC_FQDD}"}=1 and last(/DELL PowerEdge R660 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#1)<>1 and last(/DELL PowerEdge R660 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#1)<>last(/DELL PowerEdge R660 by SNMP/dell.server.net.if.link[{#NIC_FQDD}],#2) |Average |
Manual close: Yes | |
Dell R660: NIC [{#NICFQDD}/{#NICMAC}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R660 by SNMP/dell.server.net.if.status[{#NIC_FQDD}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
CPU status discovery | CPU status discovery. |
SNMP agent | cpu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: CPU [{#CPU_FQDD}]: Status | This attribute defines the status of the processor device status probe. This status will be joined into the |
SNMP agent | dell.server.cpu.status[cpu.{#SNMPINDEX}] Preprocessing
|
Dell R660: CPU [{#CPU_FQDD}]: State | This attribute defines the reading of the processor device status probe. |
SNMP agent | dell.server.cpu.state[cpu.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: CPU [{#CPU_FQDD}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R660 by SNMP/dell.server.cpu.status[cpu.{#SNMPINDEX}],#1)<>3 |Average |
||
Dell R660: CPU [{#CPU_FQDD}]: Reading error | MIB: IDRAC-MIB-SMIv2 |
bitand(last(/DELL PowerEdge R660 by SNMP/dell.server.cpu.state[cpu.{#SNMPINDEX}],#1),128)=0 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
System battery discovery | System battery discovery. |
SNMP agent | system.battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: System battery [{#SNMPVALUE}]: Status | This attribute defines the status of the battery. |
SNMP agent | dell.server.system.battery[{#SNMPVALUE}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: System battery [{#SNMPVALUE}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R660 by SNMP/dell.server.system.battery[{#SNMPVALUE}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Memory discovery | Memory discovery. |
SNMP agent | memory.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Memory [{#SNMPVALUE}]: Status | This attribute defines the status of the memory device. |
SNMP agent | dell.server.memory.status[{#SNMPVALUE}] Preprocessing
|
Dell R660: Memory [{#SNMPVALUE}]: Size | This attribute defines the size, in KB, of the memory device. Zero indicates no memory installed; 2,147,483,647 indicates an unknown memory size. |
SNMP agent | dell.server.memory.size[{#SNMPVALUE}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Memory [{#SNMPVALUE}]: Status is not OK | MIB: IDRAC-MIB-SMIv2 |
last(/DELL PowerEdge R660 by SNMP/dell.server.memory.status[{#SNMPVALUE}],#1)<>3 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Voltage probe discovery | Voltage probe discovery. |
SNMP agent | voltage.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Voltage probe [{#VPROBE_NAME}]: Voltage | This attribute defines the reading for a voltage probe. |
SNMP agent | dell.server.voltage.value[{#VPROBE_NAME}] Preprocessing
|
Dell R660: Voltage probe [{#VPROBE_NAME}]: Status | This attribute defines the status of the voltage probe. |
SNMP agent | dell.server.voltage.status[{#VPROBE_NAME}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Voltage probe [{#VPROBE_NAME}]: Status is not OK | Please check the device's voltage. |
last(/DELL PowerEdge R660 by SNMP/dell.server.voltage.status[{#VPROBE_NAME}])<>3 |Average |
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
This is a template for monitoring DELL PowerEdge R660 servers with iDRAC 8/9 firmware 4.32 (and later) with Redfish API enabled via Zabbix script items. This template works without any external scripts.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
1. Enable Redfish API in the Dell iDRAC interface of your server.
2. Create a user for monitoring with read-only permissions in the Dell iDRAC interface.
3. Create a host for Dell server with iDRAC IP as the Zabbix agent interface.
4. Link the template to the host.
5. Customize the values of the {$DELL.HTTP.API.URL}
, {$DELL.HTTP.API.USER}
, and {$DELL.HTTP.API.PASSWORD}
macros.
NOTE! If you are experiencing timeouts on some of the items that are executing requests, adjust the
{$DELL.HTTP.REQUEST.TIMEOUT}
macro accordingly.
Name | Description | Default |
---|---|---|
{$DELL.HTTP.API.URL} | The Dell iDRAC Redfish API URL in the format |
<Put your URL here> |
{$DELL.HTTP.API.USER} | The Dell iDRAC username. |
<Put your username here> |
{$DELL.HTTP.API.PASSWORD} | The Dell iDRAC user password. |
<Put your password here> |
{$DELL.HTTP.PROXY} | Set an HTTP proxy for Redfish API requests if needed. |
|
{$DELL.HTTP.RETURN.CODE.OK} | Set the HTTP return code that represents an OK response from the API. The default is "200", but can vary, for example, if a proxy is used. |
200 |
{$DELL.HTTP.REQUEST.TIMEOUT} | Set the timeout for HTTP requests. |
10s |
{$DELL.HTTP.IFCONTROL} | Link status trigger will be fired only for interfaces that have the context macro equal to "1". |
1 |
{$DELL.HTTP.CPU.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about CPU utilization. |
90 |
{$DELL.HTTP.CPU.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about CPU utilization. |
75 |
{$DELL.HTTP.MEM.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about memory utilization. |
90 |
{$DELL.HTTP.MEM.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about memory utilization. |
75 |
{$DELL.HTTP.IO.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about IO utilization. |
90 |
{$DELL.HTTP.IO.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about IO utilization. |
75 |
{$DELL.HTTP.SYS.UTIL.HIGH} | Sets the percentage threshold for creating a "high" severity event about SYS utilization. |
90 |
{$DELL.HTTP.SYS.UTIL.WARN} | Sets the percentage threshold for creating a "warning" severity event about SYS utilization. |
75 |
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Get system | Returns system metrics. |
Script | dell.server.system.get |
Dell R660: Get sensors | Returns sensors. |
Script | dell.server.sensors.get |
Dell R660: Get array controller resources | Returns array controller resources. |
Script | dell.server.array.resources.get |
Dell R660: Get disks | Returns storage resources. |
Script | dell.server.disks.get |
Dell R660: Get network interfaces | Returns network interfaces. |
Script | dell.server.net.iface.get |
Dell R660: CPU utilization, in % | CPU utilization. |
Dependent item | dell.server.util.cpu Preprocessing
|
Dell R660: Memory utilization, in % | Memory utilization. |
Dependent item | dell.server.util.mem Preprocessing
|
Dell R660: IO utilization, in % | IO utilization. |
Dependent item | dell.server.util.io Preprocessing
|
Dell R660: SYS utilization, in % | SYS utilization. |
Dependent item | dell.server.util.sys Preprocessing
|
Dell R660: Overall system health status | This attribute defines the overall rollup status of all the components in the system monitored by the remote access card. Includes system, storage, IO devices, iDRAC, CPU, memory, etc. |
Dependent item | dell.server.status Preprocessing
|
Dell R660: Hardware model name | This attribute defines the model name of the system. |
Dependent item | dell.server.hw.model Preprocessing
|
Dell R660: Hardware serial number | This attribute defines the service tag of the system. |
Dependent item | dell.server.hw.serialnumber Preprocessing
|
Dell R660: Firmware version | This attribute defines the firmware version of a remote access card. |
Dependent item | dell.server.hw.firmware Preprocessing
|
Dell R660: Redfish API status | Availability of Redfish API on the server. Possible values: 0 - Unavailable 1 - Available |
Simple check | net.tcp.service[https] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: CPU utilization is too high | Current CPU utilization has exceeded |
min(/DELL PowerEdge R660 by HTTP/dell.server.util.cpu,5m)>={$DELL.HTTP.CPU.UTIL.HIGH} |High |
||
Dell R660: CPU utilization is high | Current CPU utilization has exceeded |
min(/DELL PowerEdge R660 by HTTP/dell.server.util.cpu,5m)>={$DELL.HTTP.CPU.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R660: Memory utilization is too high | Current memory utilization has exceeded |
min(/DELL PowerEdge R660 by HTTP/dell.server.util.mem,5m)>={$DELL.HTTP.MEM.UTIL.HIGH} |High |
||
Dell R660: Memory utilization is high | Current memory utilization has exceeded |
min(/DELL PowerEdge R660 by HTTP/dell.server.util.mem,5m)>={$DELL.HTTP.MEM.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R660: IO utilization is too high | Current IO utilization has exceeded |
min(/DELL PowerEdge R660 by HTTP/dell.server.util.io,5m)>={$DELL.HTTP.IO.UTIL.HIGH} |High |
||
Dell R660: IO utilization is high | Current IO utilization has exceeded |
min(/DELL PowerEdge R660 by HTTP/dell.server.util.io,5m)>={$DELL.HTTP.IO.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R660: SYS utilization is too high | Current SYS utilization has exceeded |
min(/DELL PowerEdge R660 by HTTP/dell.server.util.sys,5m)>={$DELL.HTTP.SYS.UTIL.HIGH} |High |
||
Dell R660: SYS utilization is high | Current SYS utilization has exceeded |
min(/DELL PowerEdge R660 by HTTP/dell.server.util.sys,5m)>={$DELL.HTTP.SYS.UTIL.WARN} |Warning |
Depends on:
|
|
Dell R660: Server is in a critical state | Please check the device for faults. |
last(/DELL PowerEdge R660 by HTTP/dell.server.status,)=3 |Average |
||
Dell R660: Server is in a warning state | Please check the device for warnings. |
last(/DELL PowerEdge R660 by HTTP/dell.server.status,)=2 |Warning |
Depends on:
|
|
Dell R660: Device has been replaced | The device serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R660 by HTTP/dell.server.hw.serialnumber,#1)<>last(/DELL PowerEdge R660 by HTTP/dell.server.hw.serialnumber,#2) and length(last(/DELL PowerEdge R660 by HTTP/dell.server.hw.serialnumber))>0 |Info |
Manual close: Yes | |
Dell R660: Firmware has changed | The firmware version has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R660 by HTTP/dell.server.hw.firmware,#1)<>last(/DELL PowerEdge R660 by HTTP/dell.server.hw.firmware,#2) and length(last(/DELL PowerEdge R660 by HTTP/dell.server.hw.firmware))>0 |Info |
Manual close: Yes | |
Dell R660: Redfish API service is unavailable | The service is unavailable or does not accept TCP connections. |
last(/DELL PowerEdge R660 by HTTP/net.tcp.service[https])=0 |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | Discovery of temperature sensors. |
Dependent item | dell.server.temp.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Probe [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.temp.get[{#SENSOR_NAME}] Preprocessing
|
Dell R660: Probe [{#SENSOR_NAME}]: Value | Sensor value. |
Dependent item | dell.server.sensor.temp.value[{#SENSOR_NAME}] Preprocessing
|
Dell R660: Probe [{#SENSOR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.temp.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Probe [{#SENSOR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R660 by HTTP/dell.server.sensor.temp.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R660: Probe [{#SENSOR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R660 by HTTP/dell.server.sensor.temp.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | Discovery of PSU sensors. |
Dependent item | dell.server.psu.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Power supply [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.psu.get[{#SENSOR_NAME}] Preprocessing
|
Dell R660: Power supply [{#SENSOR_NAME}]: Voltage | Sensor value. |
Dependent item | dell.server.sensor.psu.voltage[{#SENSOR_NAME}] Preprocessing
|
Dell R660: Power supply [{#SENSOR_NAME}]: Voltage sensor status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}] Preprocessing
|
Dell R660: Power supply [{#SENSOR_NAME}]: Current | Sensor value. |
Dependent item | dell.server.sensor.psu.current[{#SENSOR_NAME}] Preprocessing
|
Dell R660: Power supply [{#SENSOR_NAME}]: Current sensor status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.psu.current.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Power supply [{#SENSOR_NAME}]: Voltage sensor: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R660 by HTTP/dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R660: Power supply [{#SENSOR_NAME}]: Voltage sensor: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R660 by HTTP/dell.server.sensor.psu.voltage.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R660: Power supply [{#SENSOR_NAME}]: Current sensor: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R660 by HTTP/dell.server.sensor.psu.current.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R660: Power supply [{#SENSOR_NAME}]: Current sensor: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R660 by HTTP/dell.server.sensor.psu.current.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN discovery | Discovery of FAN sensors. |
Dependent item | dell.server.fan.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Fan [{#SENSOR_NAME}]: Get sensor | Returns the metrics of a sensor. |
Dependent item | dell.server.sensor.fan.get[{#SENSOR_NAME}] Preprocessing
|
Dell R660: Fan [{#SENSOR_NAME}]: Speed | Sensor value. |
Dependent item | dell.server.sensor.fan.speed[{#SENSOR_NAME}] Preprocessing
|
Dell R660: Fan [{#SENSOR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.sensor.fan.status[{#SENSOR_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Fan [{#SENSOR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R660 by HTTP/dell.server.sensor.fan.status[{#SENSOR_NAME}],)=3 |Average |
||
Dell R660: Fan [{#SENSOR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R660 by HTTP/dell.server.sensor.fan.status[{#SENSOR_NAME}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Discovery of disk array controllers. |
Dependent item | dell.server.array.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Controller [{#CNTLR_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.array.status[{#ID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Controller [{#CNTLR_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R660 by HTTP/dell.server.array.status[{#ID}],)=3 |Average |
||
Dell R660: Controller [{#CNTLR_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R660 by HTTP/dell.server.array.status[{#ID}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Battery discovery | Discovery of battery controllers. |
Dependent item | dell.server.controller.battery.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Battery [{#BATTERY_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.controller.battery.status[{#ID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Battery [{#BATTERY_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R660 by HTTP/dell.server.controller.battery.status[{#ID}],)=3 |Average |
||
Dell R660: Battery [{#BATTERY_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R660 by HTTP/dell.server.controller.battery.status[{#ID}],)=2 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Discovery of physical disks. |
Dependent item | dell.server.physicaldisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Physical disk [{#DISK_NAME}]: Get disk | Returns the metrics of a physical disk. |
Script | dell.server.hw.physicaldisk.get[{#DISK_NAME}] |
Dell R660: Physical disk [{#DISK_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.hw.physicaldisk.status[{#DISK_NAME}] Preprocessing
|
Dell R660: Physical disk [{#DISK_NAME}]: Serial number | The serial number of this drive. |
Dependent item | dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}] Preprocessing
|
Dell R660: Physical disk [{#DISK_NAME}]: Model name | The model number of the drive. |
Dependent item | dell.server.hw.physicaldisk.model[{#DISK_NAME}] Preprocessing
|
Dell R660: Physical disk [{#DISK_NAME}]: Media type | The type of media contained in this drive. Possible values: HDD, SSD, SMR, null. |
Dependent item | dell.server.hw.physicaldisk.mediatype[{#DISKNAME}] Preprocessing
|
Dell R660: Physical disk [{#DISK_NAME}]: Size | The size, in bytes, of this drive. |
Dependent item | dell.server.hw.physicaldisk.size[{#DISK_NAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Physical disk [{#DISK_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R660 by HTTP/dell.server.hw.physicaldisk.status[{#DISK_NAME}],)=3 |Average |
||
Dell R660: Physical disk [{#DISK_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R660 by HTTP/dell.server.hw.physicaldisk.status[{#DISK_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R660: Physical disk [{#DISK_NAME}]: Has been replaced | [{#DISK_NAME}] serial number has changed. Acknowledge to close the problem manually. |
last(/DELL PowerEdge R660 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}],#1)<>last(/DELL PowerEdge R660 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}],#2) and length(last(/DELL PowerEdge R660 by HTTP/dell.server.hw.physicaldisk.serialnumber[{#DISK_NAME}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | Discovery of virtual disks. |
Dependent item | dell.server.virtualdisk.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Virtual disk [{#DISK_NAME}]: Get disk | Returns the metrics of a virtual disk. |
Script | dell.server.hw.virtualdisk.get[{#DISK_NAME}] |
Dell R660: Virtual disk [{#DISK_NAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.hw.virtualdisk.status[{#DISK_NAME}] Preprocessing
|
Dell R660: Virtual disk [{#DISK_NAME}]: RAID status | This property represents the RAID specific status. Possible values: Blocked, Degraded, Failed, Foreign, Offline, Online, Ready, Unknown, null. |
Dependent item | dell.server.hw.virtualdisk.raidstatus[{#DISKNAME}] Preprocessing
|
Dell R660: Virtual disk [{#DISK_NAME}]: Size | The size in bytes of this Volume. |
Dependent item | dell.server.hw.virtualdisk.size[{#DISK_NAME}] Preprocessing
|
Dell R660: Virtual disk [{#DISK_NAME}]: Current state | The known state of the Resource, for example, Enabled. Possible values: Enabled, Disabled, StandbyOffline, StandbySpare, InTest, Starting, Absent, UnavailableOffline, Deferring, Quiesced, Updating, Qualified. |
Dependent item | dell.server.hw.virtualdisk.state[{#DISK_NAME}] Preprocessing
|
Dell R660: Virtual disk [{#DISK_NAME}]: Read policy | Indicates the read cache policy setting for the Volume. Possible values: ReadAhead, NoReadAhead, AdaptiveReadAhead. |
Dependent item | dell.server.hw.virtualdisk.readpolicy[{#DISKNAME}] Preprocessing
|
Dell R660: Virtual disk [{#DISK_NAME}]: Write policy | Indicates the write cache policy setting for the Volume. Possible values: WriteThrough, WriteBack, ProtectedWriteBack, UnprotectedWriteBack. |
Dependent item | dell.server.hw.virtualdisk.writepolicy[{#DISKNAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Virtual disk [{#DISK_NAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R660 by HTTP/dell.server.hw.virtualdisk.status[{#DISK_NAME}],)=3 |Average |
||
Dell R660: Virtual disk [{#DISK_NAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R660 by HTTP/dell.server.hw.virtualdisk.status[{#DISK_NAME}],)=2 |Warning |
Depends on:
|
|
Dell R660: Virtual disk [{#DISK_NAME}]: RAID status not OK | Please check the disk for faults. |
last(/DELL PowerEdge R660 by HTTP/dell.server.hw.virtualdisk.raid_status[{#DISK_NAME}],)<8 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | Discovery of network interfaces. |
Dependent item | dell.server.net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Dell R660: Interface [{#IFNAME}]: Get interface | Returns the metrics of a network interface. |
Script | dell.server.net.if.get[{#IFNAME}] |
Dell R660: Interface [{#IFNAME}]: Speed | The network port current link speed. |
Dependent item | dell.server.net.if.speed[{#IFNAME}] Preprocessing
|
Dell R660: Interface [{#IFNAME}]: Link status | The status of the link between this port and its link partner. Possible values: Down, Up, null. |
Dependent item | dell.server.net.if.status[{#IFNAME}] Preprocessing
|
Dell R660: Interface [{#IFNAME}]: State | The known state of the Resource, for example, Enabled. Possible values: Enabled, Disabled, StandbyOffline, StandbySpare, InTest, Starting, Absent, UnavailableOffline, Deferring, Quiesced, Updating, Qualified. |
Dependent item | dell.server.net.if.state[{#IFNAME}] Preprocessing
|
Dell R660: Interface [{#IFNAME}]: Status | The status of the job. Possible values: OK, Warning, Critical. |
Dependent item | dell.server.net.if.health[{#IFNAME}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Dell R660: Interface [{#IFNAME}]: Link down | This trigger expression works as follows: |
{$DELL.HTTP.IFCONTROL:"{#IFNAME}"}=1 and (last(/DELL PowerEdge R660 by HTTP/dell.server.net.if.status[{#IFNAME}],)=2 and last(/DELL PowerEdge R660 by HTTP/dell.server.net.if.status[{#IFNAME}],#1)<>last(/DELL PowerEdge R660 by HTTP/dell.server.net.if.status[{#IFNAME}],#2)) |Average |
Manual close: Yes | |
Dell R660: Interface [{#IFNAME}]: Link status issue | This trigger expression works as follows: |
{$DELL.HTTP.IFCONTROL:"{#IFNAME}"}=1 and (last(/DELL PowerEdge R660 by HTTP/dell.server.net.if.status[{#IFNAME}],)<2 and last(/DELL PowerEdge R660 by HTTP/dell.server.net.if.status[{#IFNAME}],#1)<>last(/DELL PowerEdge R660 by HTTP/dell.server.net.if.status[{#IFNAME}],#2)) |Average |
Manual close: Yes | |
Dell R660: Interface [{#IFNAME}]: Critical state | Please check the device for faults. |
last(/DELL PowerEdge R660 by HTTP/dell.server.net.if.health[{#IFNAME}],)=3 |Average |
||
Dell R660: Interface [{#IFNAME}]: Warning state | Please check the device for warnings. |
last(/DELL PowerEdge R660 by HTTP/dell.server.net.if.health[{#IFNAME}],)=2 |Warning |
Depends on:
|
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
Template for Cisco UCS monitoring via Integrated Management Controller
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Refer to the vendor documentation.
Name | Description | Default |
---|---|---|
{$PSUCRITSTATUS:"inoperable"} | 2 |
|
{$PSUWARNSTATUS:"degraded"} | 3 |
|
{$FANCRITSTATUS:"inoperable"} | 2 |
|
{$FANWARNSTATUS:"degraded"} | 3 |
|
{$TEMP_CRIT:"Ambient"} | 35 |
|
{$TEMP_WARN:"Ambient"} | 30 |
|
{$DISKARRAYCACHEBATTERYOK_STATUS} | 1 |
|
{$DISKARRAYCACHEBATTERYCRIT_STATUS} | 2 |
|
{$DISKARRAYCRIT_STATUS:"inoperable"} | 2 |
|
{$DISKARRAYWARN_STATUS:"degraded"} | 3 |
|
{$DISKARRAYOK_STATUS:"operable"} | 1 |
|
{$DISKFAILSTATUS:"failed"} | 9 |
|
{$DISKCRITSTATUS:"predictiveFailure"} | 11 |
|
{$DISKCRITSTATUS:"bad"} | 16 |
|
{$VDISKOKSTATUS:"equipped"} | 10 |
|
{$HEALTHCRITSTATUS:"computeFailed"} | 30 |
|
{$HEALTHCRITSTATUS:"configFailure"} | 33 |
|
{$HEALTHCRITSTATUS:"unconfigFailure"} | 34 |
|
{$HEALTHCRITSTATUS:"inoperable"} | 60 |
|
{$HEALTHWARNSTATUS:"testFailed"} | 35 |
|
{$HEALTHWARNSTATUS:"thermalProblem"} | 60 |
|
{$HEALTHWARNSTATUS:"powerProblem"} | 62 |
|
{$HEALTHWARNSTATUS:"voltageProblem"} | 62 |
|
{$HEALTHWARNSTATUS:"diagnosticsFailed"} | 204 |
|
{$TEMP_CRIT} | 60 |
|
{$TEMPCRITLOW} | 5 |
|
{$TEMP_WARN} | 50 |
|
{$SNMP.TIMEOUT} | 5m |
|
{$ICMPLOSSWARN} | 20 |
|
{$ICMPRESPONSETIME_WARN} | 0.15 |
Name | Description | Type | Key and additional info |
---|---|---|---|
Cisco UCS: Uptime (network) | MIB: SNMPv2-MIB The time (in hundredths of a second) since the network management portion of the system was last re-initialized. |
SNMP agent | system.net.uptime[sysUpTime.0] Preprocessing
|
Cisco UCS: Uptime (hardware) | MIB: HOST-RESOURCES-MIB The amount of time since this host was last initialized. Note that this is different from sysUpTime in the SNMPv2-MIB [RFC1907] because sysUpTime is the uptime of the network management portion of the system. |
SNMP agent | system.hw.uptime[hrSystemUptime.0] Preprocessing
|
Cisco UCS: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other snmptrap items |
SNMP trap | snmptrap.fallback |
Cisco UCS: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., `telephone closet, 3rd floor'). If the location is unknown, the value is the zero-length string. |
SNMP agent | system.location[sysLocation.0] Preprocessing
|
Cisco UCS: System contact details | MIB: SNMPv2-MIB The textual identification of the contact person for this managed node, together with information on how to contact this person. If no contact information is known, the value is the zero-length string. |
SNMP agent | system.contact[sysContact.0] Preprocessing
|
Cisco UCS: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the network management subsystem contained in the entity. This value is allocated within the SMI enterprises subtree (1.3.6.1.4.1) and provides an easy and unambiguous means for determining |
SNMP agent | system.objectid[sysObjectID.0] Preprocessing
|
Cisco UCS: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node.By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is the zero-length string. |
SNMP agent | system.name Preprocessing
|
Cisco UCS: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating-system, and networking software. |
SNMP agent | system.descr[sysDescr.0] Preprocessing
|
Cisco UCS: SNMP agent availability | Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list. Possible value: 0 - not available 1 - available 2 - unknown |
Zabbix internal | zabbix[host,snmp,available] |
Cisco UCS: ICMP ping | Simple check | icmpping | |
Cisco UCS: ICMP loss | Simple check | icmppingloss | |
Cisco UCS: ICMP response time | Simple check | icmppingsec |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Cisco UCS: Host has been restarted | Uptime is less than 10 minutes. |
(last(/Cisco UCS by SNMP/system.hw.uptime[hrSystemUptime.0])>0 and last(/Cisco UCS by SNMP/system.hw.uptime[hrSystemUptime.0])<10m) or (last(/Cisco UCS by SNMP/system.hw.uptime[hrSystemUptime.0])=0 and last(/Cisco UCS by SNMP/system.net.uptime[sysUpTime.0])<10m) |Warning |
Manual close: Yes Depends on:
|
|
Cisco UCS: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/Cisco UCS by SNMP/system.name,#1)<>last(/Cisco UCS by SNMP/system.name,#2) and length(last(/Cisco UCS by SNMP/system.name))>0 |Info |
Manual close: Yes | |
Cisco UCS: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/Cisco UCS by SNMP/zabbix[host,snmp,available],{$SNMP.TIMEOUT})=0 |Warning |
Depends on:
|
|
Cisco UCS: Unavailable by ICMP ping | Last three attempts returned timeout. Please check device connectivity. |
max(/Cisco UCS by SNMP/icmpping,#3)=0 |High |
||
Cisco UCS: High ICMP ping loss | min(/Cisco UCS by SNMP/icmppingloss,5m)>{$ICMP_LOSS_WARN} and min(/Cisco UCS by SNMP/icmppingloss,5m)<100 |Warning |
Depends on:
|
||
Cisco UCS: High ICMP ping response time | avg(/Cisco UCS by SNMP/icmppingsec,5m)>{$ICMP_RESPONSE_TIME_WARN} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature Discovery | SNMP agent | temp.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SENSOR_LOCATION}.Ambient: Temperature | MIB: CISCO-UNIFIED-COMPUTING-COMPUTE-MIB Temperature readings of testpoint: {#SENSOR_LOCATION}.Ambient |
SNMP agent | sensor.temp.value[cucsComputeRackUnitMbTempStatsAmbientTemp.{#SNMPINDEX}] |
{#SENSOR_LOCATION}.Front: Temperature | MIB: CISCO-UNIFIED-COMPUTING-COMPUTE-MIB Cisco UCS compute:RackUnitMbTempStats:frontTemp managed object property |
SNMP agent | sensor.temp.value[cucsComputeRackUnitMbTempStatsFrontTemp.{#SNMPINDEX}] |
{#SENSOR_LOCATION}.Rear: Temperature | MIB: CISCO-UNIFIED-COMPUTING-COMPUTE-MIB Cisco UCS compute:RackUnitMbTempStats:rearTemp managed object property |
SNMP agent | sensor.temp.value[cucsComputeRackUnitMbTempStatsRearTemp.{#SNMPINDEX}] |
{#SENSOR_LOCATION}.IOH: Temperature | MIB: CISCO-UNIFIED-COMPUTING-COMPUTE-MIB Cisco UCS compute:RackUnitMbTempStats:ioh1Temp managed object property |
SNMP agent | sensor.temp.value[cucsComputeRackUnitMbTempSltatsIoh1Temp.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SENSOR_LOCATION}.Ambient: Temperature is above warning threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS by SNMP/sensor.temp.value[cucsComputeRackUnitMbTempStatsAmbientTemp.{#SNMPINDEX}],5m)>{$TEMP_WARN:"Ambient"} |Warning |
Depends on:
|
|
{#SENSOR_LOCATION}.Ambient: Temperature is above critical threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS by SNMP/sensor.temp.value[cucsComputeRackUnitMbTempStatsAmbientTemp.{#SNMPINDEX}],5m)>{$TEMP_CRIT:"Ambient"} |High |
||
{#SENSOR_LOCATION}.Ambient: Temperature is too low | avg(/Cisco UCS by SNMP/sensor.temp.value[cucsComputeRackUnitMbTempStatsAmbientTemp.{#SNMPINDEX}],5m)<{$TEMP_CRIT_LOW:"Ambient"} |Average |
|||
{#SENSOR_LOCATION}.Front: Temperature is above warning threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS by SNMP/sensor.temp.value[cucsComputeRackUnitMbTempStatsFrontTemp.{#SNMPINDEX}],5m)>{$TEMP_WARN:"Ambient"} |Warning |
Depends on:
|
|
{#SENSOR_LOCATION}.Front: Temperature is above critical threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS by SNMP/sensor.temp.value[cucsComputeRackUnitMbTempStatsFrontTemp.{#SNMPINDEX}],5m)>{$TEMP_CRIT:"Ambient"} |High |
||
{#SENSOR_LOCATION}.Front: Temperature is too low | avg(/Cisco UCS by SNMP/sensor.temp.value[cucsComputeRackUnitMbTempStatsFrontTemp.{#SNMPINDEX}],5m)<{$TEMP_CRIT_LOW:"Ambient"} |Average |
|||
{#SENSOR_LOCATION}.Rear: Temperature is above warning threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS by SNMP/sensor.temp.value[cucsComputeRackUnitMbTempStatsRearTemp.{#SNMPINDEX}],5m)>{$TEMP_WARN:"Ambient"} |Warning |
Depends on:
|
|
{#SENSOR_LOCATION}.Rear: Temperature is above critical threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS by SNMP/sensor.temp.value[cucsComputeRackUnitMbTempStatsRearTemp.{#SNMPINDEX}],5m)>{$TEMP_CRIT:"Ambient"} |High |
||
{#SENSOR_LOCATION}.Rear: Temperature is too low | avg(/Cisco UCS by SNMP/sensor.temp.value[cucsComputeRackUnitMbTempStatsRearTemp.{#SNMPINDEX}],5m)<{$TEMP_CRIT_LOW:"Ambient"} |Average |
|||
{#SENSOR_LOCATION}.IOH: Temperature is above warning threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS by SNMP/sensor.temp.value[cucsComputeRackUnitMbTempSltatsIoh1Temp.{#SNMPINDEX}],5m)>{$TEMP_WARN:"Ambient"} |Warning |
Depends on:
|
|
{#SENSOR_LOCATION}.IOH: Temperature is above critical threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS by SNMP/sensor.temp.value[cucsComputeRackUnitMbTempSltatsIoh1Temp.{#SNMPINDEX}],5m)>{$TEMP_CRIT:"Ambient"} |High |
||
{#SENSOR_LOCATION}.IOH: Temperature is too low | avg(/Cisco UCS by SNMP/sensor.temp.value[cucsComputeRackUnitMbTempSltatsIoh1Temp.{#SNMPINDEX}],5m)<{$TEMP_CRIT_LOW:"Ambient"} |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature CPU Discovery | SNMP agent | temp.cpu.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SENSOR_LOCATION}: Temperature | MIB: CISCO-UNIFIED-COMPUTING-PROCESSOR-MIB Cisco UCS processor:EnvStats:temperature managed object property |
SNMP agent | sensor.temp.value[cucsProcessorEnvStatsTemperature.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SENSOR_LOCATION}: Temperature is above warning threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS by SNMP/sensor.temp.value[cucsProcessorEnvStatsTemperature.{#SNMPINDEX}],5m)>{$TEMP_WARN:"CPU"} |Warning |
Depends on:
|
|
{#SENSOR_LOCATION}: Temperature is above critical threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS by SNMP/sensor.temp.value[cucsProcessorEnvStatsTemperature.{#SNMPINDEX}],5m)>{$TEMP_CRIT:"CPU"} |High |
||
{#SENSOR_LOCATION}: Temperature is too low | avg(/Cisco UCS by SNMP/sensor.temp.value[cucsProcessorEnvStatsTemperature.{#SNMPINDEX}],5m)<{$TEMP_CRIT_LOW:"CPU"} |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU Discovery | SNMP agent | psu.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#PSU_LOCATION}: Power supply status | MIB: CISCO-UNIFIED-COMPUTING-EQUIPMENT-MIB Cisco UCS equipment:Psu:operState managed object property |
SNMP agent | sensor.psu.status[cucsEquipmentPsuOperState.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#PSU_LOCATION}: Power supply is in critical state | Please check the power supply unit for errors |
count(/Cisco UCS by SNMP/sensor.psu.status[cucsEquipmentPsuOperState.{#SNMPINDEX}],#1,"eq","{$PSU_CRIT_STATUS:\"inoperable\"}")=1 |Average |
||
{#PSU_LOCATION}: Power supply is in warning state | Please check the power supply unit for errors |
count(/Cisco UCS by SNMP/sensor.psu.status[cucsEquipmentPsuOperState.{#SNMPINDEX}],#1,"eq","{$PSU_WARN_STATUS:\"degraded\"}")=1 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Unit Discovery | SNMP agent | unit.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#UNIT_LOCATION}: Overall system health status | MIB: CISCO-UNIFIED-COMPUTING-COMPUTE-MIB Cisco UCS compute:RackUnit:operState managed object property |
SNMP agent | system.status[cucsComputeRackUnitOperState.{#SNMPINDEX}] |
{#UNIT_LOCATION}: Hardware model name | MIB: CISCO-UNIFIED-COMPUTING-COMPUTE-MIB Cisco UCS compute:RackUnit:model managed object property |
SNMP agent | system.hw.model[cucsComputeRackUnitModel.{#SNMPINDEX}] Preprocessing
|
{#UNIT_LOCATION}: Hardware serial number | MIB: CISCO-UNIFIED-COMPUTING-COMPUTE-MIB Cisco UCS compute:RackUnit:serial managed object property |
SNMP agent | system.hw.serialnumber[cucsComputeRackUnitSerial.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#UNIT_LOCATION}: System status is in critical state | Please check the device for errors |
count(/Cisco UCS by SNMP/system.status[cucsComputeRackUnitOperState.{#SNMPINDEX}],#1,"eq","{$HEALTH_CRIT_STATUS:\"computeFailed\"}")=1 or count(/Cisco UCS by SNMP/system.status[cucsComputeRackUnitOperState.{#SNMPINDEX}],#1,"eq","{$HEALTH_CRIT_STATUS:\"configFailure\"}")=1 or count(/Cisco UCS by SNMP/system.status[cucsComputeRackUnitOperState.{#SNMPINDEX}],#1,"eq","{$HEALTH_CRIT_STATUS:\"unconfigFailure\"}")=1 or count(/Cisco UCS by SNMP/system.status[cucsComputeRackUnitOperState.{#SNMPINDEX}],#1,"eq","{$HEALTH_CRIT_STATUS:\"inoperable\"}")=1 |High |
||
{#UNIT_LOCATION}: System status is in warning state | Please check the device for warnings |
count(/Cisco UCS by SNMP/system.status[cucsComputeRackUnitOperState.{#SNMPINDEX}],#1,"eq","{$HEALTH_WARN_STATUS:\"testFailed\"}")=1 or count(/Cisco UCS by SNMP/system.status[cucsComputeRackUnitOperState.{#SNMPINDEX}],#1,"eq","{$HEALTH_WARN_STATUS:\"thermalProblem\"}")=1 or count(/Cisco UCS by SNMP/system.status[cucsComputeRackUnitOperState.{#SNMPINDEX}],#1,"eq","{$HEALTH_WARN_STATUS:\"powerProblem\"}")=1 or count(/Cisco UCS by SNMP/system.status[cucsComputeRackUnitOperState.{#SNMPINDEX}],#1,"eq","{$HEALTH_WARN_STATUS:\"voltageProblem\"}")=1 or count(/Cisco UCS by SNMP/system.status[cucsComputeRackUnitOperState.{#SNMPINDEX}],#1,"eq","{$HEALTH_WARN_STATUS:\"diagnosticsFailed\"}")=1 |Warning |
Depends on:
|
|
{#UNIT_LOCATION}: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/Cisco UCS by SNMP/system.hw.serialnumber[cucsComputeRackUnitSerial.{#SNMPINDEX}],#1)<>last(/Cisco UCS by SNMP/system.hw.serialnumber[cucsComputeRackUnitSerial.{#SNMPINDEX}],#2) and length(last(/Cisco UCS by SNMP/system.hw.serialnumber[cucsComputeRackUnitSerial.{#SNMPINDEX}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN Discovery | SNMP agent | fan.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#FAN_LOCATION}: Fan status | MIB: CISCO-UNIFIED-COMPUTING-EQUIPMENT-MIB Cisco UCS equipment:Fan:operState managed object property |
SNMP agent | sensor.fan.status[cucsEquipmentFanOperState.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#FAN_LOCATION}: Fan is in warning state | Please check the fan unit |
count(/Cisco UCS by SNMP/sensor.fan.status[cucsEquipmentFanOperState.{#SNMPINDEX}],#1,"eq","{$FAN_WARN_STATUS:\"degraded\"}")=1 |Warning |
Depends on:
|
|
{#FAN_LOCATION}: Fan is in critical state | Please check the fan unit |
count(/Cisco UCS by SNMP/sensor.fan.status[cucsEquipmentFanOperState.{#SNMPINDEX}],#1,"eq","{$FAN_CRIT_STATUS:\"inoperable\"}")=1 |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical Disk Discovery | Scanning table of physical drive entries CISCO-UNIFIED-COMPUTING-STORAGE-MIB::cucsStorageLocalDiskTable. |
SNMP agent | physicalDisk.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#DISK_LOCATION}: Physical disk status | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB Cisco UCS storage:LocalDisk:diskState managed object property. |
SNMP agent | system.hw.physicaldisk.status[cucsStorageLocalDiskDiskState.{#SNMPINDEX}] |
{#DISK_LOCATION}: Physical disk model name | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB Cisco UCS storage:LocalDisk:serial managed object property. Actually returns part number code |
SNMP agent | system.hw.physicaldisk.model[cucsStorageLocalDiskSerial.{#SNMPINDEX}] |
{#DISK_LOCATION}: Physical disk media type | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB Cisco UCS storage:LocalDisk:model managed object property. Actually returns 'HDD' or 'SSD' |
SNMP agent | system.hw.physicaldisk.media_type[cucsStorageLocalDiskModel.{#SNMPINDEX}] |
{#DISK_LOCATION}: Disk size | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB Cisco UCS storage:LocalDisk:size managed object property. In MB. |
SNMP agent | system.hw.physicaldisk.size[cucsStorageLocalDiskSize.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#DISK_LOCATION}: Physical disk failed | Please check physical disk for warnings or errors |
count(/Cisco UCS by SNMP/system.hw.physicaldisk.status[cucsStorageLocalDiskDiskState.{#SNMPINDEX}],#1,"eq","{$DISK_FAIL_STATUS:\"failed\"}")=1 |High |
||
{#DISK_LOCATION}: Physical disk error | Please check physical disk for warnings or errors |
count(/Cisco UCS by SNMP/system.hw.physicaldisk.status[cucsStorageLocalDiskDiskState.{#SNMPINDEX}],#1,"eq","{$DISK_CRIT_STATUS:\"bad\"}")=1 or count(/Cisco UCS by SNMP/system.hw.physicaldisk.status[cucsStorageLocalDiskDiskState.{#SNMPINDEX}],#1,"eq","{$DISK_CRIT_STATUS:\"predictiveFailure\"}")=1 |Average |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual Disk Discovery | CISCO-UNIFIED-COMPUTING-STORAGE-MIB::cucsStorageLocalLunTable |
SNMP agent | virtualdisk.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#VDISK_LOCATION}: Status | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB Cisco UCS storage:LocalLun:presence managed object property |
SNMP agent | system.hw.virtualdisk.status[cucsStorageLocalLunPresence.{#SNMPINDEX}] |
{#VDISK_LOCATION}: Layout type | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB Cisco UCS storage:LocalLun:type managed object property |
SNMP agent | system.hw.virtualdisk.layout[cucsStorageLocalLunType.{#SNMPINDEX}] |
{#VDISK_LOCATION}: Disk size | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB Cisco UCS storage:LocalLun:size managed object property in MB. |
SNMP agent | system.hw.virtualdisk.size[cucsStorageLocalLunSize.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#VDISK_LOCATION}: Virtual disk is not in OK state | Please check virtual disk for warnings or errors |
count(/Cisco UCS by SNMP/system.hw.virtualdisk.status[cucsStorageLocalLunPresence.{#SNMPINDEX}],#1,"ne","{$VDISK_OK_STATUS:\"equipped\"}")=1 |Warning |
Name | Description | Type | Key and additional info |
---|---|---|---|
Array Controller Discovery | Scanning table of Array controllers: CISCO-UNIFIED-COMPUTING-STORAGE-MIB::cucsStorageControllerTable. |
SNMP agent | array.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#DISKARRAY_LOCATION}: Disk array controller status | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB |
SNMP agent | system.hw.diskarray.status[cucsStorageControllerOperState.{#SNMPINDEX}] |
{#DISKARRAY_LOCATION}: Disk array controller model | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB |
SNMP agent | system.hw.diskarray.model[cucsStorageControllerModel.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#DISKARRAY_LOCATION}: Disk array controller is in critical state | Please check the device for faults |
count(/Cisco UCS by SNMP/system.hw.diskarray.status[cucsStorageControllerOperState.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CRIT_STATUS:\"inoperable\"}")=1 |High |
||
{#DISKARRAY_LOCATION}: Disk array controller is in warning state | Please check the device for faults |
count(/Cisco UCS by SNMP/system.hw.diskarray.status[cucsStorageControllerOperState.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_WARN_STATUS:\"degraded\"}")=1 |Average |
Depends on:
|
|
{#DISKARRAY_LOCATION}: Disk array controller is not in optimal state | Please check the device for faults |
count(/Cisco UCS by SNMP/system.hw.diskarray.status[cucsStorageControllerOperState.{#SNMPINDEX}],#1,"ne","{$DISK_ARRAY_OK_STATUS:\"operable\"}")=1 |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array Controller Cache Discovery | Scanning table of Array controllers: CISCO-UNIFIED-COMPUTING-STORAGE-MIB::cucsStorageControllerTable. |
SNMP agent | array.cache.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#DISKARRAYCACHELOCATION}: Disk array cache controller battery status | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB |
SNMP agent | system.hw.diskarray.cache.battery.status[cucsStorageRaidBatteryOperability.{#SNMPINDEX}] |
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#DISKARRAYCACHELOCATION}: Disk array cache controller battery is in critical state! | Please check the device for faults |
count(/Cisco UCS by SNMP/system.hw.diskarray.cache.battery.status[cucsStorageRaidBatteryOperability.{#SNMPINDEX}],#1,"eq","{$DISK_ARRAY_CACHE_BATTERY_CRIT_STATUS}")=1 |Average |
||
{#DISKARRAYCACHELOCATION}: Disk array cache controller battery is not in optimal state | Please check the device for faults |
count(/Cisco UCS by SNMP/system.hw.diskarray.cache.battery.status[cucsStorageRaidBatteryOperability.{#SNMPINDEX}],#1,"ne","{$DISK_ARRAY_CACHE_BATTERY_OK_STATUS}")=1 |Warning |
Depends on:
|
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
Cisco UCS® Manager provides unified, embedded management of all software and hardware components of the Cisco Unified Computing System™ (Cisco UCS) across multiple chassis and rack servers. It enables server, fabric, and storage provisioning as well as, device discovery, inventory, configuration, diagnostics, monitoring, fault detection, auditing, and statistics collection. This is a template for Cisco UCS Manager monitoring via Zabbix SNMP Agent that works without any external scripts. You can download UCS MIB files there ftp://ftp.cisco.com/pub/mibs/ucs-mibs/.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
Name | Description | Default |
---|---|---|
{$PSU.STATUS.CRIT:"inoperable"} | The critical value of the PSU sensor for trigger expression. |
2 |
{$PSU.STATUS.WARN:"degraded"} | The warning value of the PSU sensor for trigger expression. |
3 |
{$FAN.STATUS.CRIT:"inoperable"} | The critical value of the FAN sensor for trigger expression. |
2 |
{$FAN.STATUS.WARN:"degraded"} | The warning value of the FAN sensor for trigger expression. |
3 |
{$TEMP.MAX.CRIT:"Ambient"} | The temperature maximum critical value for trigger expression. |
35 |
{$TEMP.MAX.WARN:"Ambient"} | The temperature maximum warning value for trigger expression. |
30 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.OK} | The cache battery normal state for trigger expression. |
1 |
{$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT} | The cache battery critical state for trigger expression. |
2 |
{$DISK.ARRAY.STATUS.CRIT:"inoperable"} | The array controller critical state for trigger expression. |
2 |
{$DISK.ARRAY.STATUS.WARN:"degraded"} | The array controller warning state for trigger expression. |
3 |
{$DISK.ARRAY.STATUS.OK:"operable"} | The array controller normal state for trigger expression. |
1 |
{$DISK.STATUS.FAIL:"failed"} | The disk fail state for trigger expression. |
9 |
{$DISK.STATUS.CRIT:"predictiveFailure"} | The disk critical state for trigger expression. |
11 |
{$DISK.STATUS.CRIT:"bad"} | The disk critical state for trigger expression. |
16 |
{$VDISK.STATUS.OK:"equipped"} | The vdisk normal state for trigger expression. |
10 |
{$HEALTH.STATUS.CRIT:"computeFailed"} | The unit health critical state for trigger expression. |
30 |
{$HEALTH.STATUS.CRIT:"configFailure"} | The unit health critical state for trigger expression. |
33 |
{$HEALTH.STATUS.CRIT:"unconfigFailure"} | The unit health critical state for trigger expression. |
34 |
{$HEALTH.STATUS.CRIT:"inoperable"} | The unit health critical state for trigger expression. |
60 |
{$HEALTH.STATUS.WARN:"testFailed"} | The unit health warning state for trigger expression. |
35 |
{$HEALTH.STATUS.WARN:"thermalProblem"} | The unit health warning state for trigger expression. |
60 |
{$HEALTH.STATUS.WARN:"powerProblem"} | The unit health warning state for trigger expression. |
62 |
{$HEALTH.STATUS.WARN:"voltageProblem"} | The unit health warning state for trigger expression. |
62 |
{$IF.ERRORS.WARN} | 2 |
|
{$HEALTH.STATUS.WARN:"diagnosticsFailed"} | The unit health warning state for trigger expression. |
204 |
{$NET.IFNAME.MATCHES} | This macro is used in network interface discovery. Can be overridden on the host level. |
^.*$ |
{$NET.IFNAME.NOT_MATCHES} | This macro is used in network interface discovery. Can be overridden on the host level. Filter out loopbacks, sup-fc0, nulls, docker veth links and docker0 bridge by default. |
Macro too long. Please see the template. |
{$NET.IFOPERSTATUS.MATCHES} | This macro is used in network interface discovery. Can be overridden on the host level. |
^.*$ |
{$NET.IFOPERSTATUS.NOT_MATCHES} | This macro is used in network interface discovery. Can be overridden on the host level. Ignore notPresent(6) by default. |
^6$ |
{$NET.IFADMINSTATUS.MATCHES} | This macro is used in network interface discovery. Can be overridden on the host level. |
^.* |
{$NET.IFADMINSTATUS.NOT_MATCHES} | This macro is used in network interface discovery. Can be overridden on the host level. Ignore down(2) administrative status by default. |
^2$ |
{$NET.IFDESCR.MATCHES} | This macro is used in network interface discovery. Can be overridden on the host level. |
.* |
{$NET.IFDESCR.NOT_MATCHES} | This macro is used in network interface discovery. Can be overridden on the host level. |
CHANGE_IF_NEEDED |
{$NET.IFALIAS.MATCHES} | This macro is used in network interface discovery. Can be overridden on the host level. |
.* |
{$NET.IFALIAS.NOT_MATCHES} | This macro is used in network interface discovery. Can be overridden on the host level. |
CHANGE_IF_NEEDED |
{$NET.IFTYPE.MATCHES} | This macro is used in network interface discovery. Can be overridden on the host level. |
.* |
{$NET.IFTYPE.NOT_MATCHES} | This macro is used in network interface discovery. Can be overridden on the host level. |
CHANGE_IF_NEEDED |
{$SNMP.TIMEOUT} | The time interval for SNMP agent availability trigger expression. |
5m |
{$IFCONTROL} | 1 |
Name | Description | Type | Key and additional info |
---|---|---|---|
Cisco UCS Manager: Uptime (network) | MIB: SNMPv2-MIB The time in seconds since the network management portion of the system was last re-initialized. |
SNMP agent | cisco.ucs.net.uptime[sysUpTime.0] Preprocessing
|
Cisco UCS Manager: Uptime (hardware) | MIB: HOST-RESOURCES-MIB The amount of time since this host was last initialized. Note that this is different from sysUpTime in the SNMPv2-MIB [RFC1907] because sysUpTime is the uptime of the network management portion of the system. |
SNMP agent | cisco.ucs.hw.uptime[hrSystemUptime.0] Preprocessing
|
Cisco UCS Manager: SNMP traps (fallback) | The item is used to collect all SNMP traps unmatched by other snmptrap items |
SNMP trap | snmptrap.fallback |
Cisco UCS Manager: System location | MIB: SNMPv2-MIB The physical location of this node (e.g., `telephone closet, 3rd floor'). If the location is unknown, the value is the zero-length string. |
SNMP agent | cisco.ucs.location[sysLocation.0] Preprocessing
|
Cisco UCS Manager: System contact details | MIB: SNMPv2-MIB The textual identification of the contact person for this managed node, together with information on how to contact this person. If no contact information is known, the value is the zero-length string. |
SNMP agent | cisco.ucs.contact[sysContact.0] Preprocessing
|
Cisco UCS Manager: System object ID | MIB: SNMPv2-MIB The vendor's authoritative identification of the network management subsystem contained in the entity. This value is allocated within the SMI enterprises subtree (1.3.6.1.4.1) and provides an easy and unambiguous means for determining 'what kind of box' is being managed. For example, if vendor 'Flintstones, Inc.' was assigned the subtree1.3.6.1.4.1.4242, it could assign the identifier 1.3.6.1.4.1.4242.1.1 to its 'Fred Router'. |
SNMP agent | cisco.ucs.objectid[sysObjectID.0] Preprocessing
|
Cisco UCS Manager: System name | MIB: SNMPv2-MIB An administratively-assigned name for this managed node.By convention, this is the node's fully-qualified domain name. If the name is unknown, the value is the zero-length string. |
SNMP agent | cisco.ucs.name[sysName.0] Preprocessing
|
Cisco UCS Manager: System description | MIB: SNMPv2-MIB A textual description of the entity. This value should include the full name and version identification of the system's hardware type, software operating-system, and networking software. |
SNMP agent | cisco.ucs.descr[sysDescr.0] Preprocessing
|
Cisco UCS Manager: SNMP agent availability | Availability of SNMP checks on the host. The value of this item corresponds to availability icons in the host list. Possible value: 0 - not available 1 - available 2 - unknown |
Zabbix internal | zabbix[host,snmp,available] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Cisco UCS Manager: Host has been restarted | Uptime is less than 10 minutes. |
(last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.uptime[hrSystemUptime.0])>0 and last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.uptime[hrSystemUptime.0])<10m) or (last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.uptime[hrSystemUptime.0])=0 and last(/Cisco UCS Manager by SNMP/cisco.ucs.net.uptime[sysUpTime.0])<10m) |Warning |
Manual close: Yes Depends on:
|
|
Cisco UCS Manager: System name has changed | The name of the system has changed. Acknowledge to close the problem manually. |
last(/Cisco UCS Manager by SNMP/cisco.ucs.name[sysName.0],#1)<>last(/Cisco UCS Manager by SNMP/cisco.ucs.name[sysName.0],#2) and length(last(/Cisco UCS Manager by SNMP/cisco.ucs.name[sysName.0]))>0 |Info |
Manual close: Yes | |
Cisco UCS Manager: No SNMP data collection | SNMP is not available for polling. Please check device connectivity and SNMP settings. |
max(/Cisco UCS Manager by SNMP/zabbix[host,snmp,available],{$SNMP.TIMEOUT})=0 |Warning |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature discovery | SNMP agent | cisco.ucs.temp.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SENSOR_LOCATION}.Ambient: Temperature | MIB: CISCO-UNIFIED-COMPUTING-COMPUTE-MIB Temperature readings of testpoint: {#SENSOR_LOCATION}.Ambient |
SNMP agent | cisco.ucs.sensor.temp.value[cucsComputeRackUnitMbTempStatsAmbientTemp.{#SNMPINDEX}] Preprocessing
|
{#SENSOR_LOCATION}.Front: Temperature | MIB: CISCO-UNIFIED-COMPUTING-COMPUTE-MIB Cisco UCS compute:RackUnitMbTempStats:frontTemp managed object property |
SNMP agent | cisco.ucs.sensor.temp.value[cucsComputeRackUnitMbTempStatsFrontTemp.{#SNMPINDEX}] Preprocessing
|
{#SENSOR_LOCATION}.Rear: Temperature | MIB: CISCO-UNIFIED-COMPUTING-COMPUTE-MIB Cisco UCS compute:RackUnitMbTempStats:rearTemp managed object property |
SNMP agent | cisco.ucs.sensor.temp.value[cucsComputeRackUnitMbTempStatsRearTemp.{#SNMPINDEX}] Preprocessing
|
{#SENSOR_LOCATION}.IOH: Temperature | MIB: CISCO-UNIFIED-COMPUTING-COMPUTE-MIB Cisco UCS compute:RackUnitMbTempStats:ioh1Temp managed object property |
SNMP agent | cisco.ucs.sensor.temp.value[cucsComputeRackUnitMbTempSltatsIoh1Temp.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SENSOR_LOCATION}.Ambient: Temperature is above warning threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS Manager by SNMP/cisco.ucs.sensor.temp.value[cucsComputeRackUnitMbTempStatsAmbientTemp.{#SNMPINDEX}],5m)>{$TEMP.MAX.WARN:"Ambient"} |Warning |
Depends on:
|
|
{#SENSOR_LOCATION}.Ambient: Temperature is above critical threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS Manager by SNMP/cisco.ucs.sensor.temp.value[cucsComputeRackUnitMbTempStatsAmbientTemp.{#SNMPINDEX}],5m)>{$TEMP.MAX.CRIT:"Ambient"} |High |
||
{#SENSOR_LOCATION}.Front: Temperature is above warning threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS Manager by SNMP/cisco.ucs.sensor.temp.value[cucsComputeRackUnitMbTempStatsFrontTemp.{#SNMPINDEX}],5m)>{$TEMP.MAX.WARN:"Ambient"} |Warning |
Depends on:
|
|
{#SENSOR_LOCATION}.Front: Temperature is above critical threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS Manager by SNMP/cisco.ucs.sensor.temp.value[cucsComputeRackUnitMbTempStatsFrontTemp.{#SNMPINDEX}],5m)>{$TEMP.MAX.CRIT:"Ambient"} |High |
||
{#SENSOR_LOCATION}.Rear: Temperature is above warning threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS Manager by SNMP/cisco.ucs.sensor.temp.value[cucsComputeRackUnitMbTempStatsRearTemp.{#SNMPINDEX}],5m)>{$TEMP.MAX.WARN:"Ambient"} |Warning |
Depends on:
|
|
{#SENSOR_LOCATION}.Rear: Temperature is above critical threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS Manager by SNMP/cisco.ucs.sensor.temp.value[cucsComputeRackUnitMbTempStatsRearTemp.{#SNMPINDEX}],5m)>{$TEMP.MAX.CRIT:"Ambient"} |High |
||
{#SENSOR_LOCATION}.IOH: Temperature is above warning threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS Manager by SNMP/cisco.ucs.sensor.temp.value[cucsComputeRackUnitMbTempSltatsIoh1Temp.{#SNMPINDEX}],5m)>{$TEMP.MAX.WARN:"Ambient"} |Warning |
Depends on:
|
|
{#SENSOR_LOCATION}.IOH: Temperature is above critical threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS Manager by SNMP/cisco.ucs.sensor.temp.value[cucsComputeRackUnitMbTempSltatsIoh1Temp.{#SNMPINDEX}],5m)>{$TEMP.MAX.CRIT:"Ambient"} |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Temperature CPU discovery | SNMP agent | cisco.ucs.temp.cpu.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#SENSOR_LOCATION}: Temperature | MIB: CISCO-UNIFIED-COMPUTING-PROCESSOR-MIB Cisco UCS processor:EnvStats:temperature managed object property |
SNMP agent | cisco.ucs.sensor.temp.value[cucsProcessorEnvStatsTemperature.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#SENSOR_LOCATION}: Temperature is above warning threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS Manager by SNMP/cisco.ucs.sensor.temp.value[cucsProcessorEnvStatsTemperature.{#SNMPINDEX}],5m)>{$TEMP.MAX.WARN:"CPU"} |Warning |
Depends on:
|
|
{#SENSOR_LOCATION}: Temperature is above critical threshold | This trigger uses temperature sensor values as well as temperature sensor status if available |
avg(/Cisco UCS Manager by SNMP/cisco.ucs.sensor.temp.value[cucsProcessorEnvStatsTemperature.{#SNMPINDEX}],5m)>{$TEMP.MAX.CRIT:"CPU"} |High |
Name | Description | Type | Key and additional info |
---|---|---|---|
Network interface discovery | Discovering interfaces from IF-MIB. |
SNMP agent | cisco.ucs.net.if.discovery Preprocessing
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Interface {#IFNAME}({#IFALIAS}): Operational status | MIB: IF-MIB The current operational state of the interface. - The testing(3) state indicates that no operational packet scan be passed - If ifAdminStatus is down(2) then ifOperStatus should be down(2) - If ifAdminStatus is changed to up(1) then ifOperStatus should change to up(1) if the interface is ready to transmit and receive network traffic - It should change todormant(5) if the interface is waiting for external actions (such as a serial line waiting for an incoming connection) - It should remain in the down(2) state if and only if there is a fault that prevents it from going to the up(1) state - It should remain in the notPresent(6) state if the interface has missing(typically, hardware) components. |
SNMP agent | cisco.ucs.if.status[ifOperStatus.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Bits received | MIB: IF-MIB The total number of octets received on the interface, including framing characters. This object is a 64-bit version of ifInOctets. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP agent | cisco.ucs.if.in[ifHCInOctets.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Bits sent | MIB: IF-MIB The total number of octets transmitted out of the interface, including framing characters. This object is a 64-bit version of ifOutOctets.Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP agent | cisco.ucs.if.out[ifHCOutOctets.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Inbound packets with errors | MIB: IF-MIB For packet-oriented interfaces, the number of inbound packets that contained errors preventing them from being deliverable to a higher-layer protocol. For character-oriented or fixed-length interfaces, the number of inbound transmission units that contained errors preventing them from being deliverable to a higher-layer protocol. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP agent | cisco.ucs.if.in.errors[ifInErrors.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Outbound packets with errors | MIB: IF-MIB For packet-oriented interfaces, the number of outbound packets that contained errors preventing them from being deliverable to a higher-layer protocol. For character-oriented or fixed-length interfaces, the number of outbound transmission units that contained errors preventing them from being deliverable to a higher-layer protocol. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP agent | cisco.ucs.if.out.errors[ifOutErrors.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Outbound packets discarded | MIB: IF-MIB The number of outbound packets which were chosen to be discarded even though no errors had been detected to prevent their being deliverable to a higher-layer protocol. One possible reason for discarding such a packet could be to free up buffer space. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP agent | cisco.ucs.if.out.discards[ifOutDiscards.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Inbound packets discarded | MIB: IF-MIB The number of inbound packets which were chosen to be discarded even though no errors had been detected to prevent their being deliverable to a higher-layer protocol. One possible reason for discarding such a packet could be to free up buffer space. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP agent | cisco.ucs.if.in.discards[ifInDiscards.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Multicast packets received | MIB: IF-MIB The number of packets, delivered by this sub-layer to a higher (sub-)layer, which were addressed to a multicast address at this sub-layer. For a MAC layer protocol, this includes both Group and Functional addresses. This object is a 64-bit version of ifInMulticastPkts. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP agent | cisco.ucs.if.in.multicast[ifHCInMulticastPkts.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Multicast packets sent | MIB: IF-MIB The total number of packets that higher-level protocols requested be transmitted, and which were addressed to a multicast address at this sub-layer, including those that were discarded or not sent. For a MAC layer protocol, this includes both Group and Functional addresses. This object is a 64-bit version of ifOutMulticastPkts. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP agent | cisco.ucs.if.out.multicast[ifHCOutMulticastPkts.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Broadcast packets received | MIB: IF-MIB The number of packets, delivered by this sub-layer to a higher (sub-)layer, which were addressed to a broadcast address at this sub-layer. This object is a 64-bit version of ifInBroadcastPkts. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP agent | cisco.ucs.if.in.broadcast[ifHCInBroadcastPkts.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Broadcast packets sent | MIB: IF-MIB The total number of packets that higher-level protocols requested be transmitted, and which were addressed to a broadcast address at this sub-layer, including those that were discarded or not sent. This object is a 64-bit version of ifOutBroadcastPkts. Discontinuities in the value of this counter can occur at re-initialization of the management system, and at other times as indicated by the value of ifCounterDiscontinuityTime. |
SNMP agent | cisco.ucs.if.out.broadcast[ifHCOutBroadcastPkts.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Interface type | MIB: IF-MIB The type of interface. Additional values for ifType are assigned by the Internet Assigned Numbers Authority (IANA), through updating the syntax of the IANAifType textual convention. |
SNMP agent | cisco.ucs.if.type[ifType.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Interface description | MIB: IF-MIB A textual string containing information about the interface. This string should include the name of the manufacturer, the product name and the version of the interface hardware/software. |
SNMP agent | cisco.ucs.if.descr[ifDescr.{#SNMPINDEX}] Preprocessing
|
Interface {#IFNAME}({#IFALIAS}): Speed | MIB: IF-MIB An estimate of the interface's current bandwidth in units of 1,000,000 bits per second. If this object reports a value of |
SNMP agent | cisco.ucs.if.speed[ifHighSpeed.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
Interface {#IFNAME}({#IFALIAS}): Link down | This trigger expression works as follows: |
{$IFCONTROL:"{#IFNAME}"}=1 and last(/Cisco UCS Manager by SNMP/cisco.ucs.if.status[ifOperStatus.{#SNMPINDEX}])=2 and (last(/Cisco UCS Manager by SNMP/cisco.ucs.if.status[ifOperStatus.{#SNMPINDEX}],#1)<>last(/Cisco UCS Manager by SNMP/cisco.ucs.if.status[ifOperStatus.{#SNMPINDEX}],#2)) |Average |
Manual close: Yes | |
Interface {#IFNAME}({#IFALIAS}): High error rate on {#IFNAME} | Recovers when value below {$IF.ERRORS.WARN:"{#IFNAME}"} threshold. |
min(/Cisco UCS Manager by SNMP/cisco.ucs.if.in.errors[ifInErrors.{#SNMPINDEX}],5m)>{$IF.ERRORS.WARN:"{#IFNAME}"} or min(/Cisco UCS Manager by SNMP/cisco.ucs.if.out.errors[ifOutErrors.{#SNMPINDEX}],5m)>{$IF.ERRORS.WARN:"{#IFNAME}"} |Warning |
Manual close: Yes Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
PSU discovery | SNMP agent | cisco.ucs.psu.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#PSU_LOCATION}: Power supply status | MIB: CISCO-UNIFIED-COMPUTING-EQUIPMENT-MIB Cisco UCS equipment:Psu:operState managed object property |
SNMP agent | cisco.ucs.sensor.psu.status[cucsEquipmentPsuOperState.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#PSU_LOCATION}: Power supply is in critical state | Please check the power supply unit for errors |
last(/Cisco UCS Manager by SNMP/cisco.ucs.sensor.psu.status[cucsEquipmentPsuOperState.{#SNMPINDEX}])={$PSU.STATUS.CRIT:"inoperable"} |Average |
||
{#PSU_LOCATION}: Power supply is in warning state | Please check the power supply unit for errors |
last(/Cisco UCS Manager by SNMP/cisco.ucs.sensor.psu.status[cucsEquipmentPsuOperState.{#SNMPINDEX}])={$PSU.STATUS.WARN:"degraded"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Unit discovery | SNMP agent | cisco.ucs.unit.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#UNIT_LOCATION}: Overall system health status | MIB: CISCO-UNIFIED-COMPUTING-COMPUTE-MIB Cisco UCS compute:RackUnit:operState managed object property |
SNMP agent | cisco.ucs.status[cucsComputeRackUnitOperState.{#SNMPINDEX}] Preprocessing
|
{#UNIT_LOCATION}: Hardware model name | MIB: CISCO-UNIFIED-COMPUTING-COMPUTE-MIB Cisco UCS compute:RackUnit:model managed object property |
SNMP agent | cisco.ucs.hw.model[cucsComputeRackUnitModel.{#SNMPINDEX}] Preprocessing
|
{#UNIT_LOCATION}: Hardware serial number | MIB: CISCO-UNIFIED-COMPUTING-COMPUTE-MIB Cisco UCS compute:RackUnit:serial managed object property |
SNMP agent | cisco.ucs.hw.serialnumber[cucsComputeRackUnitSerial.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#UNIT_LOCATION}: System status is in critical state | Please check the device for errors |
last(/Cisco UCS Manager by SNMP/cisco.ucs.status[cucsComputeRackUnitOperState.{#SNMPINDEX}])={$HEALTH.STATUS.CRIT:"computeFailed"} or last(/Cisco UCS Manager by SNMP/cisco.ucs.status[cucsComputeRackUnitOperState.{#SNMPINDEX}])={$HEALTH.STATUS.CRIT:"configFailure"} or last(/Cisco UCS Manager by SNMP/cisco.ucs.status[cucsComputeRackUnitOperState.{#SNMPINDEX}])={$HEALTH.STATUS.CRIT:"unconfigFailure"} or last(/Cisco UCS Manager by SNMP/cisco.ucs.status[cucsComputeRackUnitOperState.{#SNMPINDEX}])={$HEALTH.STATUS.CRIT:"inoperable"} |High |
||
{#UNIT_LOCATION}: System status is in warning state | Please check the device for warnings |
last(/Cisco UCS Manager by SNMP/cisco.ucs.status[cucsComputeRackUnitOperState.{#SNMPINDEX}])={$HEALTH.STATUS.WARN:"testFailed"} or last(/Cisco UCS Manager by SNMP/cisco.ucs.status[cucsComputeRackUnitOperState.{#SNMPINDEX}])={$HEALTH.STATUS.WARN:"thermalProblem"} or last(/Cisco UCS Manager by SNMP/cisco.ucs.status[cucsComputeRackUnitOperState.{#SNMPINDEX}])={$HEALTH.STATUS.WARN:"powerProblem"} or last(/Cisco UCS Manager by SNMP/cisco.ucs.status[cucsComputeRackUnitOperState.{#SNMPINDEX}])={$HEALTH.STATUS.WARN:"voltageProblem"} or last(/Cisco UCS Manager by SNMP/cisco.ucs.status[cucsComputeRackUnitOperState.{#SNMPINDEX}])={$HEALTH.STATUS.WARN:"diagnosticsFailed"} |Warning |
Depends on:
|
|
{#UNIT_LOCATION}: Device has been replaced | Device serial number has changed. Acknowledge to close the problem manually. |
last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.serialnumber[cucsComputeRackUnitSerial.{#SNMPINDEX}],#1)<>last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.serialnumber[cucsComputeRackUnitSerial.{#SNMPINDEX}],#2) and length(last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.serialnumber[cucsComputeRackUnitSerial.{#SNMPINDEX}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
FAN discovery | SNMP agent | cisco.ucs.fan.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#FAN_LOCATION}: Fan status | MIB: CISCO-UNIFIED-COMPUTING-EQUIPMENT-MIB Cisco UCS equipment:Fan:operState managed object property |
SNMP agent | cisco.ucs.sensor.fan.status[cucsEquipmentFanOperState.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#FAN_LOCATION}: Fan is in warning state | Please check the fan unit |
last(/Cisco UCS Manager by SNMP/cisco.ucs.sensor.fan.status[cucsEquipmentFanOperState.{#SNMPINDEX}])={$FAN.STATUS.WARN:"degraded"} |Warning |
Depends on:
|
|
{#FAN_LOCATION}: Fan is in critical state | Please check the fan unit |
last(/Cisco UCS Manager by SNMP/cisco.ucs.sensor.fan.status[cucsEquipmentFanOperState.{#SNMPINDEX}])={$FAN.STATUS.CRIT:"inoperable"} |Average |
Name | Description | Type | Key and additional info |
---|---|---|---|
Physical disk discovery | Scanning table of physical drive entries CISCO-UNIFIED-COMPUTING-STORAGE-MIB::cucsStorageLocalDiskTable. |
SNMP agent | cisco.ucs.physicalDisk.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#DISK_LOCATION}: Physical disk status | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB Cisco UCS storage:LocalDisk:diskState managed object property. |
SNMP agent | cisco.ucs.hw.physicaldisk.status[cucsStorageLocalDiskDiskState.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk model name | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB Cisco UCS storage:LocalDisk:model managed object property. |
SNMP agent | cisco.ucs.hw.physicaldisk.model[cucsStorageLocalDiskModel.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk serial number | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB Cisco UCS storage:LocalDisk:serial managed object property. Actually returns part number code. |
SNMP agent | cisco.ucs.hw.physicaldisk.serialnumber[cucsStorageLocalDiskSerial.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Physical disk media type | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB Cisco UCS storage:LocalDisk:deviceType managed object property. Actually returns 'HDD' or 'SSD'. |
SNMP agent | cisco.ucs.hw.physicaldisk.media_type[cucsStorageLocalDiskDeviceType.{#SNMPINDEX}] Preprocessing
|
{#DISK_LOCATION}: Disk size | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB Cisco UCS storage:LocalDisk:size managed object property. In MB. |
SNMP agent | cisco.ucs.hw.physicaldisk.size[cucsStorageLocalDiskSize.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#DISK_LOCATION}: Physical disk failed | Please check physical disk for warnings or errors |
last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.physicaldisk.status[cucsStorageLocalDiskDiskState.{#SNMPINDEX}])={$DISK.STATUS.FAIL:"failed"} |High |
||
{#DISK_LOCATION}: Physical disk error | Please check physical disk for warnings or errors |
last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.physicaldisk.status[cucsStorageLocalDiskDiskState.{#SNMPINDEX}])={$DISK.STATUS.CRIT:"bad"} or last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.physicaldisk.status[cucsStorageLocalDiskDiskState.{#SNMPINDEX}])={$DISK.STATUS.CRIT:"predictiveFailure"} |Average |
Depends on:
|
|
{#DISK_LOCATION}: Disk has been replaced | Disk serial number has changed. Acknowledge to close the problem manually. |
last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.physicaldisk.serialnumber[cucsStorageLocalDiskSerial.{#SNMPINDEX}],#1)<>last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.physicaldisk.serialnumber[cucsStorageLocalDiskSerial.{#SNMPINDEX}],#2) and length(last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.physicaldisk.serialnumber[cucsStorageLocalDiskSerial.{#SNMPINDEX}]))>0 |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Virtual disk discovery | CISCO-UNIFIED-COMPUTING-STORAGE-MIB::cucsStorageLocalLunTable |
SNMP agent | cisco.ucs.virtualDisk.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#VDISK_LOCATION}: Status | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB Cisco UCS storage:LocalLun:presence managed object property |
SNMP agent | cisco.ucs.hw.virtualdisk.status[cucsStorageLocalLunPresence.{#SNMPINDEX}] Preprocessing
|
{#VDISK_LOCATION}: Layout type | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB Cisco UCS storage:LocalLun:type managed object property |
SNMP agent | cisco.ucs.hw.virtualdisk.layout[cucsStorageLocalLunType.{#SNMPINDEX}] Preprocessing
|
{#VDISK_LOCATION}: Disk size | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB Cisco UCS storage:LocalLun:size managed object property in MB. |
SNMP agent | cisco.ucs.hw.virtualdisk.size[cucsStorageLocalLunSize.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#VDISK_LOCATION}: Virtual disk is not in OK state | Please check virtual disk for warnings or errors |
last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.virtualdisk.status[cucsStorageLocalLunPresence.{#SNMPINDEX}])<>{$VDISK.STATUS.OK:"equipped"} |Warning |
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller discovery | Scanning table of Array controllers: CISCO-UNIFIED-COMPUTING-STORAGE-MIB::cucsStorageControllerTable. |
SNMP agent | cisco.ucs.array.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#DISKARRAY_LOCATION}: Disk array controller status | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB Cisco UCS storage:RaidBattery:operability managed object property. |
SNMP agent | cisco.ucs.hw.diskarray.status[cucsStorageControllerOperState.{#SNMPINDEX}] Preprocessing
|
{#DISKARRAY_LOCATION}: Disk array controller model | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB |
SNMP agent | cisco.ucs.hw.diskarray.model[cucsStorageControllerModel.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#DISKARRAY_LOCATION}: Disk array controller is in critical state | Please check the device for faults |
last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.diskarray.status[cucsStorageControllerOperState.{#SNMPINDEX}])={$DISK.ARRAY.STATUS.CRIT:"inoperable"} |High |
||
{#DISKARRAY_LOCATION}: Disk array controller is in warning state | Please check the device for faults |
last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.diskarray.status[cucsStorageControllerOperState.{#SNMPINDEX}])={$DISK.ARRAY.STATUS.WARN:"degraded"} |Average |
Depends on:
|
|
{#DISKARRAY_LOCATION}: Disk array controller is not in optimal state | Please check the device for faults |
last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.diskarray.status[cucsStorageControllerOperState.{#SNMPINDEX}])>{$DISK.ARRAY.STATUS.OK:"operable"} |Warning |
Depends on:
|
Name | Description | Type | Key and additional info |
---|---|---|---|
Array controller cache discovery | Scanning table of Array controllers: CISCO-UNIFIED-COMPUTING-STORAGE-MIB::cucsStorageControllerTable. |
SNMP agent | cisco.ucs.array.cache.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
{#DISKARRAYCACHELOCATION}: Disk array cache controller battery status | MIB: CISCO-UNIFIED-COMPUTING-STORAGE-MIB |
SNMP agent | cisco.ucs.hw.diskarray.cache.battery.status[cucsStorageRaidBatteryOperability.{#SNMPINDEX}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
{#DISKARRAYCACHELOCATION}: Disk array cache controller battery is in critical state! | Please check the device for faults |
last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.diskarray.cache.battery.status[cucsStorageRaidBatteryOperability.{#SNMPINDEX}])={$DISK.ARRAY.CACHE.BATTERY.STATUS.CRIT} |Average |
||
{#DISKARRAYCACHELOCATION}: Disk array cache controller battery is not in optimal state | Please check the device for faults |
last(/Cisco UCS Manager by SNMP/cisco.ucs.hw.diskarray.cache.battery.status[cucsStorageRaidBatteryOperability.{#SNMPINDEX}])<>{$DISK.ARRAY.CACHE.BATTERY.STATUS.OK} |Warning |
Depends on:
|
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums
Template for monitoring servers with BMC over IPMI that work without any external scripts. All metrics are collected at once, thanks to Zabbix's bulk data collection. It collects metrics by polling BMC remotely using an IPMI agent.
Zabbix version: 6.0 and higher.
This template has been tested on:
Zabbix should be configured according to the instructions in the Templates out of the box section.
You can set {$IPMI.USER} and {$IPMI.PASSWORD} macros in the template for using on the host level.
Name | Description | Default |
---|---|---|
{$IPMI.USER} | This macro is used for access to BMC. It can be overridden on the host or linked template level. |
|
{$IPMI.PASSWORD} | This macro is used for access to BMC. It can be overridden on the host or linked template level. |
|
{$IPMI.SENSOR_TYPE.MATCHES} | This macro is used in sensors discovery. It can be overridden on the host or linked template level. |
.* |
{$IPMI.SENSORTYPE.NOTMATCHES} | This macro is used in sensors discovery. It can be overridden on the host or linked template level. |
invalid |
Name | Description | Type | Key and additional info |
---|---|---|---|
IPMI: Get IPMI sensors | The master item that receives all sensors with values for LLD and dependent elements from BMC. |
IPMI agent | ipmi.get |
Name | Description | Type | Key and additional info |
---|---|---|---|
Discrete sensors discovery | Discovery of the discrete IPMI sensors. |
Dependent item | ipmi.discrete.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
IPMI: {#SENSOR_ID} | It is a state of the discrete IPMI sensor. |
Dependent item | ipmi.statetext[{#SENSORID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
IPMI: {#SENSOR_ID} value has changed | The trigger is informing about changes in a state of the discrete IPMI sensor. A problem generated by this trigger can be manually closed. |
last(/Chassis by IPMI/ipmi.state_text[{#SENSOR_ID}],#1)<>last(/Chassis by IPMI/ipmi.state_text[{#SENSOR_ID}],#2) |Info |
Manual close: Yes |
Name | Description | Type | Key and additional info |
---|---|---|---|
Threshold sensors discovery | Discovery of the threshold IPMI sensors. |
Dependent item | ipmi.sensors.discovery |
Name | Description | Type | Key and additional info |
---|---|---|---|
IPMI: {#SENSORID}, {#SENSORUNIT} | It is a state of the threshold IPMI sensor. |
Dependent item | ipmi.value[{#SENSOR_ID}] Preprocessing
|
Name | Description | Expression | Severity | Dependencies and additional info |
---|---|---|---|---|
IPMI: {#SENSOR_ID} value is below non-critical low | The trigger is informing that a value less than the lower non-critical threshold has been reached. |
min(/Chassis by IPMI/ipmi.value[{#SENSOR_ID}],5m)<{#SENSOR_LO_WARN} |Warning |
Depends on:
|
|
IPMI: {#SENSOR_ID} value is below critical low | The trigger is informing that a value less than the lower critical threshold has been reached. |
min(/Chassis by IPMI/ipmi.value[{#SENSOR_ID}],5m)<{#SENSOR_LO_CRIT} |High |
Depends on:
|
|
IPMI: {#SENSOR_ID} value is below non-recoverable low | The trigger is informing that a value less than the lower non-recoverable threshold has been reached. |
min(/Chassis by IPMI/ipmi.value[{#SENSOR_ID}],5m)<{#SENSOR_LO_DISAST} |Disaster |
||
IPMI: {#SENSOR_ID} value is above non-critical high | The trigger is informing that a value higher than the upper non-critical threshold has been reached. |
min(/Chassis by IPMI/ipmi.value[{#SENSOR_ID}],5m)>{#SENSOR_HI_WARN} |Warning |
Depends on:
|
|
IPMI: {#SENSOR_ID} value is above critical high | The trigger is informing that a value higher than the upper critical threshold has been reached. |
min(/Chassis by IPMI/ipmi.value[{#SENSOR_ID}],5m)>{#SENSOR_HI_CRIT} |High |
Depends on:
|
|
IPMI: {#SENSOR_ID} value is above non-recoverable high | The trigger is informing that a value higher than the upper non-recoverable threshold has been reached. |
min(/Chassis by IPMI/ipmi.value[{#SENSOR_ID}],5m)>{#SENSOR_HI_DISAST} |Disaster |
Please report any issues with the template at https://support.zabbix.com
You can also provide feedback, discuss the template, or ask for help at ZABBIX forums