Hewlett-Packard
Event 200
- Severity: MAJOR
- Event Summary: Bad OS MCA checksum
- Event Class: System
- Problem Description:
The OS has registered an OS_MCA vector,
but it has not passed the checksum
- Cause / Action:
Cause: OS has registered a bad OS_MCA vector or
the data has been lost. Action: Reboot system to allow vector to be
re-registered.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 201
- Severity: MAJOR
- Event Summary: BMC interface to IPMI failed
- Event Class: System
- Problem Description:
The BMC has failed testing and has been
disabled.
- Cause / Action:
Cause: BMC firmware has locked up or the BMC is
disabled. Action: Cycle system power and attempt boot again. If error
re-occurs contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 203
- Severity: FATAL
- Event Summary: Boot cell launch EFI failure
- Event Class: System
- Problem Description:
SFW failed to launch EFI
- Cause / Action:
Cause: The system has failed to launch EFI
because of an internal error.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 204
- Severity: MAJOR
- Event Summary: Monarch selection failure
- Event Class: System
- Problem Description:
0x11 = Calibration Failure 0x22 = Select
Code Failure
- Cause / Action:
Cause: An internal error has caused monarch
selection to fail. Action: Reboot system, swap processors if failure
persists.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 205
- Severity: MAJOR
- Event Summary: CPU monarch collision
- Event Class: System
- Problem Description:
Monarch Collision has occurred
- Cause / Action:
Cause: Unexpected error has occurred during
monarch selection. Action: Reboot, if problem persists contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 207
- Severity: FATAL
- Event Summary: Boot cell virtualize EFI failure
- Event Class: System
- Problem Description:
SFW attempted to virtualize EFI and failed
- Cause / Action:
Cause: An internal error has occurred that
prevented EFI from virtualizing. Action: Reboot, if problem persists contact
your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 208
- Severity: FATAL
- Event Summary: Boot cell virtualize PAL failure
- Event Class: System
- Problem Description:
SFW was unable to virtualize PAL
- Cause / Action:
Cause: SFW was unable to virtualize PAL.
Action: Reboot, if problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 209
- Severity: FATAL
- Event Summary: Boot cell virtualize SAL failure
- Event Class: System
- Problem Description:
SFW was unable to virtualize SAL
- Cause / Action:
Cause: SFW was unable to virtualize SAL.
Action: Reboot, if problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 210
- Severity: FATAL
- Event Summary: Boot cell virtualize SALPROC failure
- Event Class: System
- Problem Description:
SFW was unable to virtualize SALPROC
- Cause / Action:
Cause: SFW was unable to virtualize SALPROC.
Action: Reboot, if problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 211
- Severity: MAJOR
- Event Summary: CPU struct init failed
- Event Class: System
- Problem Description:
SFW has failed initializing the CPU
Struct.
- Cause / Action:
Cause: A CPU has failed the configuration
process. Action: Replace CPU. If problem persists contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 212
- Severity: MAJOR
- Event Summary: CPU failed early config
- Event Class: System
- Problem Description:
A CPU has failed early config.
- Cause / Action:
Cause: A CPU has failed the early configuration
process. Action: Replace CPU. If problem persists contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 213
- Severity: MAJOR
- Event Summary: CPU failed early selftest
- Event Class: System
- Problem Description:
A CPU has failed early self test. Data:
PAL Test State.
- Cause / Action:
Cause: A CPU has failed early self test.
Action: Replace CPU. If problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 214
- Severity: MAJOR
- Event Summary: CPU failed
- Event Class: System
- Problem Description:
SFW has detected that a CPU has failed.
Data: the local cpu number that failed.
- Cause / Action:
Cause: A CPU has failed. Action: Replace CPU.
If problem persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 215
- Severity: MAJOR
- Event Summary: CPU failed late selftest
- Event Class: System
- Problem Description:
SFW has determined a CPU or Memory has
failed late test. This could be related to a CPU error or a Correctable Single
Bit Memory error. See Cause/Action.
- Cause / Action:
Cause 1: A Correctable Single Bit Memory error
has caused CPU late self test to fail. It is possible the CPU is not faulty in
this case. Action 1: Look for the event "MEM_CORR_ERR" from the last time the
system was running. If you find these events, replace that DIMM(s) before
replacing the CPU's. Replace DIMMs with excessive "MEM_CORR_ERR" first. If
after replacing all suspect DIMMs this event is still seen, replace the CPU.
Cause2: A CPU has failed. Action2: Replace CPU. If problem persists contact
your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 216
- Severity: MAJOR
- Event Summary: CPU not enough late test memory
- Event Class: System
- Problem Description:
The CPU late test has failed because of
insufficient memory
- Cause / Action:
Cause: Insufficient memory Action: Increase
memory and reboot.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 217
- Severity: FATAL
- Event Summary: Could not allocate memory for EFI image
- Event Class: System
- Problem Description:
Could not allocate memory for EFI image
- Cause / Action:
Cause: SFW could not allocate enough memory for
EFI image. Action: Replace/Add memory.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 218
- Severity: FATAL
- Event Summary: EFI image corrupted
- Event Class: System
- Problem Description:
EFI image is corrupted
- Cause / Action:
Cause: EFI image is corrupted. Action: Reflash
ROM if applicable, if problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 219
- Severity: FATAL
- Event Summary: EFI not in fit table
- Event Class: System
- Problem Description:
EFI fit error
- Cause / Action:
Cause: EFI image is not in FIT. Action: Reflash
ROM if applicable, if problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 220
- Severity: FATAL
- Event Summary: NVRAM test fail
- Event Class: System
- Problem Description:
EFI NVM has failed testing. The cell will
now halt.
- Cause / Action:
Cause: NVM is corrupted or bad. Action: Clear
NVM, if problem persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 221
- Severity: FATAL
- Event Summary: EFI Rom size bad
- Event Class: System
- Problem Description:
EFI Image Error
- Cause / Action:
Cause: EFI image is corrupt. Action: Reflash
ROM if applicable, if problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 222
- Severity: FATAL
- Event Summary: EFI Rom checksum error
- Event Class: System
- Problem Description:
EFI Image Error.
- Cause / Action:
Cause: EFI image is corrupt. Action: Reflash
ROM if applicable, if problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 223
- Severity: FATAL
- Event Summary: External interruption nest limit exceeded
- Event Class: System
- Problem Description:
The IVT interrupting nesting depth has
been exceeded. This processor will be halted Data: Number of the offending
vector
- Cause / Action:
Cause: Internal FW error.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 224
- Severity: FATAL
- Event Summary: External interrupt not serviced
- Event Class: System
- Problem Description:
An external interrupt has been requested
and not serviced. Data: Number of the vector
- Cause / Action:
Cause: Internal FW error.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 225
- Severity: FATAL
- Event Summary: Ext int taken
- Event Class: System
- Problem Description:
An external interrupt has been taken.
Data: Number of the vector taken.
- Cause / Action:
Cause: An external interrupt has been taken
Action: None
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 226
- Severity: MAJOR
- Event Summary: Forward Progress Log (FPL) access failed
- Event Class: System
- Problem Description:
Access to the FPL has failed.
- Cause / Action:
Cause: FPL access has failed.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 227
- Severity: FATAL
- Event Summary: PSR fetch failure
- Event Class: System
- Problem Description:
SFW was unable to read the CPU PSR. Data:
Local CPU number
- Cause / Action:
Cause: SFW was unable to read the CPU PSR.
Action: Replace CPU.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 228
- Severity: FATAL
- Event Summary: Cell halt
- Event Class: System
- Problem Description:
SFW has halted the cell
- Cause / Action:
Cause: Internal Error Action: contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 229
- Severity: MAJOR
- Event Summary: CPU PAL incompatible with cpu
- Event Class: System
- Problem Description:
SFW has determined that PAL is not
compatible with the current processors.
- Cause / Action:
Cause: Incompatible PAL. Action: Update PAL or
change processors
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 230
- Severity: MAJOR
- Event Summary: Slave is incompatible with monarch
- Event Class: System
- Problem Description:
SFW has determined that a slave processor
is incompatible with the monarch. Data: Physical location of the incompatible
processor.
- Cause / Action:
Cause: Incompatible processors. Action: Replace
processors.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 231
- Severity: MAJOR
- Event Summary: Interrupt clear failure
- Event Class: System
- Problem Description:
Interrupt clear failed during cell config
- Cause / Action:
Cause: Interrupt clear failed. Action: Reboot,
if problem persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 232
- Severity: MAJOR
- Event Summary: System Event Log (SEL) access failed
- Event Class: System
- Problem Description:
SFW has determined that an IPMI event
failed.
- Cause / Action:
Cause: An IPMI event has failed. Action:
None
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 233
- Severity: FATAL
- Event Summary: Trap taken
- Event Class: System
- Problem Description:
Data: IVT Offset
- Cause / Action:
Cause: This will follow other events indicating
some type of IVT error. Action: This event is for debugging the address, other
events will determine the user action.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 234
- Severity: MAJOR
- Event Summary: LDB State bad on entry
- Event Class: System
- Problem Description:
LDB state bad
- Cause / Action:
Action: None required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 235
- Severity: FATAL
- Event Summary: Interrupt with ic bit clear
- Event Class: System
- Problem Description:
Interrupt context was lost Data: interrupt
number.
- Cause / Action:
Cause: Interrupt context was lost. Action:
none
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 236
- Severity: FATAL
- Event Summary: Min-state registration failure
- Event Class: System
- Problem Description:
Registering of the processor min state
save area with PAL has failed.
- Cause / Action:
Cause: Registering of the processor min state
save area with PAL has failed. Action: Replace processor, if problem persists
contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 238
- Severity: MAJOR
- Event Summary: Boot monarch timed out
- Event Class: System
- Problem Description:
SFW has determined the monarch has timed
out Data: Local CPU Number
- Cause / Action:
Cause: The monarch has timed out. Action: None,
Replace CPU if problem persists, system will reboot after this event.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 239
- Severity: FATAL
- Event Summary: PAL_B not in FIT table
- Event Class: System
- Problem Description:
A PAL_B FIT error has occurred
- Cause / Action:
Cause: Internal Error or ROM is corrupted.
Action: Reflash ROM if applicable, if problem persists contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 240
- Severity: FATAL
- Event Summary: SAL_B not in FIT table
- Event Class: System
- Problem Description:
A SAL_B FIT error has occurred
- Cause / Action:
Cause: Internal Error or ROM is corrupted.
Action: Reflash ROM if applicable, if problem persists contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 241
- Severity: FATAL
- Event Summary: NVRAM test fail
- Event Class: System
- Problem Description:
NVM has failed test. The system will halt
- Cause / Action:
Cause: NVM is corrupt or bad. Action: Reboot,
if problem persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 242
- Severity: FATAL
- Event Summary: Interrupt vector out of range
- Event Class: System
- Problem Description:
A interrupt vector has been requested out
of the acceptable range. Data: Vector Number.
- Cause / Action:
Cause: An internal error has occurred
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 243
- Severity: FATAL
- Event Summary: Pal proc error getting pal copy info
- Event Class: System
- Problem Description:
The PAL Copy Info call has failed
- Cause / Action:
Cause: An internal error has occurred.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 244
- Severity: FATAL
- Event Summary: Pal proc error copying pal to memory
- Event Class: System
- Problem Description:
Error coping PAL to memory
- Cause / Action:
Cause: There has been an error copying PAL to
memory. Action: Reboot, if problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 245
- Severity: MAJOR
- Event Summary: Boot pal proc failure
- Event Class: System
- Problem Description:
A PAL Proc has failed. This will halt the
processor. Data: Local CPU Number
- Cause / Action:
Cause: Internal PAL Error. Action: Reboot, if
problem persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 246
- Severity: MAJOR
- Event Summary: Console device failure
- Event Class: System
- Problem Description:
A console device has failed. Data:
Physical Addr of device that failed.
- Cause / Action:
Cause: A console device has failed. Action:
Reset console device/system.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 247
- Severity: MAJOR
- Event Summary: Platform interface device failure
- Event Class: System
- Problem Description:
A console device has failed. Data:
Physical Addr of device that failed.
- Cause / Action:
Cause: A console device has failed. Action:
Reset console device/system.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 248
- Severity: MAJOR
- Event Summary: platform scratch RAM test failed
- Event Class: System
- Problem Description:
Platform Scratch RAM has failed the test.
- Cause / Action:
Cause: Bad or corrupt Scratch RAM. Action:
Reboot, if problem persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 249
- Severity: MAJOR
- Event Summary: CPU rendezvous failure
- Event Class: System
- Problem Description:
A CPU has failed to meet rendezvous. Data:
Local CPU Number
- Cause / Action:
Cause: Bad or slow CPU. Action: Replace
CPU.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 250
- Severity: FATAL
- Event Summary: Error extracting sal_b from rom
- Event Class: System
- Problem Description:
SFW could not extract SAL_B from the ROM
- Cause / Action:
Cause: ROM Corrupt or unreadable. Action:
Reflash ROM if applicable, if problem persists contact your HP representative
for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 251
- Severity: FATAL
- Event Summary: Scratch RAM bad
- Event Class: System
- Problem Description:
Platform Scratch RAM has failed test.
- Cause / Action:
Cause: Bad or corrupt Scratch RAM. Action:
Reboot, if problem persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 252
- Severity: MAJOR
- Event Summary: IPMI System Event Log (SEL) is full
- Event Class: System
- Problem Description:
IPMI SEL full
- Cause / Action:
Cause: IPMI SEL full. Action: Clear SEL through
BMC or MP.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 253
- Severity: MAJOR
- Event Summary: Slave wakeup before vector registered
- Event Class: System
- Problem Description:
No wakeup vector registered for processor
Data: Local CPU Number
- Cause / Action:
Cause: No wakeup vector registered for
processor. Action: Reboot, if problem persists contact your HP representative
for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 254
- Severity: MAJOR
- Event Summary: CPU failed rendezvous handler
- Event Class: System
- Problem Description:
Slave Rendezvous handler has failed. Data:
Local CPU Number.
- Cause / Action:
Cause: Internal Error. Action: Reboot, if
problem persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 255
- Severity: FATAL
- Event Summary: Error building SMBIOS Tables
- Event Class: System
- Problem Description:
SFW failed to build the SMBIOS tables
- Cause / Action:
Cause: SFW failed to build the SMBIOS tables.
Action: None, if SMBIOS is preventing functionality, reboot. If problem
persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 256
- Severity: FATAL
- Event Summary: Trap nest limit exceeded
- Event Class: System
- Problem Description:
The trap nesting limit has been exceeded.
Data: Vector Number
- Cause / Action:
Cause: The trap nesting limit has been
exceeded. Action: Reboot if necessary, if problem persists contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 257
- Severity: FATAL
- Event Summary: Trap not serviced
- Event Class: System
- Problem Description:
A trap has been requested and not
serviced. Data: Vector Number
- Cause / Action:
Cause: A invalid trap has been requested or a
trap has not been installed. Action: Reboot if necessary, if problem persists
contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 258
- Severity: FATAL
- Event Summary: Trap taken
- Event Class: System
- Problem Description:
A trap has been taken. Data: Number of the
vector taken.
- Cause / Action:
Cause: A trap has been taken Action: None
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 259
- Severity: MAJOR
- Event Summary: Uncleared interrupt
- Event Class: System
- Problem Description:
At least one interrupt was not cleared.
Data: The highest pending interrupt number
- Cause / Action:
Cause: At least one interrupt was not cleared.
Action: None.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 260
- Severity: FATAL
- Event Summary: Unexpected external interrupt
- Event Class: System
- Problem Description:
An unexpected external interrupt has
occurred. Data: External Interrupt Number
- Cause / Action:
Cause: An unexpected external interrupt has
occurred. Action: None.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 261
- Severity: FATAL
- Event Summary: Interrupt before redirection table set up
- Event Class: System
- Problem Description:
An interrupt has occurred before setting
up the IVT. Data: Interrupt Number
- Cause / Action:
Cause: An interrupt has occurred before setting
up the IVT. Action: None.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 262
- Severity: FATAL
- Event Summary: CPU unexpected MCA
- Event Class: System
- Problem Description:
An unexpected MCA has occurred before
MCA's are unmasked. Data: Local CPU Number.
- Cause / Action:
Cause: Unexpected MCA Action: None
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 263
- Severity: FATAL
- Event Summary: Unexpected trap
- Event Class: System
- Problem Description:
An unexpected trap has occurred. The trap
number is either invalid or the requested trap has not been registered. Data:
Trap Number
- Cause / Action:
Cause: An unexpected trap has occurred. During
System Firmware boot time this indicates the system has requested a trap that
firmware has not registered. During OS run time it indicates the system has
requested a trap that is not recognized in the OS trap table. Action: If at OS
run time, verify that the OS has properly installed its trap handler, and that
only valid traps are caused. Investigate what could cause the trap that is
signaled by the event or why the OS has not properly installed the trap
handler.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 264
- Severity: FATAL
- Event Summary: CPU unknown boot error
- Event Class: System
- Problem Description:
SFW has detected an unknown error.
- Cause / Action:
Cause: unknown error. Action: None, if problem
persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 265
- Severity: MAJOR
- Event Summary: CC errors PAL failure
- Event Class: System
- Problem Description:
SFW has detected a PAL Failure
- Cause / Action:
Cause: SFW has detected a PAL Failure. Action:
Reboot if necessary, if problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 266
- Severity: MAJOR
- Event Summary: Expected MC vector unregistered
- Event Class: System
- Problem Description:
Expected Machine Check Vector not
registered
- Cause / Action:
Cause: Expected Machine Check Vector not
registered at the time of an Expected Machine Check
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 267
- Severity: FATAL
- Event Summary: INIT initiated
- Event Class: System
- Problem Description:
This is the equivalent of a TOC event in
the PA RISC Architecture. On IPF systems, this event is called an INIT. This
event can be triggered by the "tc" command from the MP, or from the button
labeled "TOC" :wor "Transfer of Control" on the Management card or bezel of
the system. There are also other causes of an INIT generated by software.
Data: Local CPU Number
- Cause / Action:
Cause: Software has requested an INIT or the
INIT button has been pressed. Action: None
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 268
- Severity: MAJOR
- Event Summary: Expected I/O host bridge is missing
- Event Class: System
- Problem Description:
An I/O host bridge is missing. Firmware
will continue boot and display the following EFI warning, "Unexpected hardware
I/O configuration." Data Field: Physical location of the missing I/O host
bridge.
- Cause / Action:
Cause: I/O host bridge failure. An incorrect
I/O backplane is installed. Action: Contact your HP representative to check
the I/O host bridge and the I/O backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 269
- Severity: MAJOR
- Event Summary: LBA has unexpected number of I/O slots
- Event Class: System
- Problem Description:
Firmware detected an unexpected number of
I/O slots connected to an I/O host bridge. Firmware display the following EFI
warning message, "Unexpected hardware I/O configuration." Data Field: Physical
location of the I/O host bridge.
- Cause / Action:
Cause: The firmware needs to be updated. An
incorrect I/O backplane is installed. Action: Contact your HP representative
to check the firmware and the I/O backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 270
- Severity: MAJOR
- Event Summary: I/O rope width does not match expected value
- Event Class: System
- Problem Description:
Firmware found an I/O controller rope of
unexpected width. Firmware will configure the I/O host bridge connected to the
rope and display the following EFI warning message, "Unexpected hardware I/O
configuration." Data Field: Physical location of the I/O host bridge connected
to the rope.
- Cause / Action:
Cause: The firmware needs to be updated. An
incorrect I/O backplane is installed. Action: Contact your HP representative
to check the firmware and the I/O backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 271
- Severity: MAJOR
- Event Summary: Found unexpected I/O host bridge
- Event Class: System
- Problem Description:
Firmware found an unexpected I/O host
bridge. Firmware will configure the I/O host bridge and display the following
EFI warning message, "Unexpected hardware I/O configuration." Data Field:
Physical location of the unexpected I/O host bridge.
- Cause / Action:
Cause: The firmware needs to be updated. An
incorrect I/O backplane is installed. Action: Contact your HP representative
to check the firmware and the I/O backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 272
- Severity: MAJOR
- Event Summary: PCI clock DLL error
- Event Class: System
- Problem Description:
An I/O host bridge's bus frequency DLL
circuit failed. Firmware will deconfigure the failed I/O host bridge and
display the following EFI warning message, "Failed I/O slot(s) deconfigured."
Data Field: Physical location of the I/O host bridge.
- Cause / Action:
Cause: Failed or improperly inserted I/O card.
Action: Remove or reseat the I/O card. Cause: Failed I/O chipset. Failed I/O
backplane. Action: Contact your HP representative to check the I/O chipset and
backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 273
- Severity: MAJOR
- Event Summary: PCI hot plug controller failed
- Event Class: System
- Problem Description:
An I/O host bridge's hot-plug controller
has failed. Firmware will deconfigure the I/O host bridge and display the
following EFI warning message, "Failed I/O slot(s) deconfigured." Data Field:
Physical location of the I/O hostbridge.
- Cause / Action:
Cause: Hot-plug controller failure. I/O host
bridge failure. Action: Contact your HP representative to check the hot-plug
controller and the I/O host bridge.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 274
- Severity: MAJOR
- Event Summary: Found unknown I/O rope width
- Event Class: System
- Problem Description:
Firmware attempts to configure an I/O
controller rope to an unsupported width. Firmware will deconfigure any I/O
host bridge connected to the rope. Data Field: Physical location of the failed
rope.
- Cause / Action:
Cause: Internal firmware error. Action: Contact
your HP representative to check the firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 275
- Severity: MAJOR
- Event Summary: I/O LBA clear error failed
- Event Class: System
- Problem Description:
During I/O host bridge configuration,
firmware found a persistent error condition. Firmware will deconfigure the I/O
host bridge and display the following EFI warning message, "Failed I/O slot(s)
deconfigured." Data Field: Physical location of the I/O hostbridge.
- Cause / Action:
Cause: A failed or improperly seated I/O card
is present. Action: Replace or reseat the I/O card(s). Cause: I/O host bridge
failure. Action: Contact your HP representative to check the I/O host
bridge.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 276
- Severity: MAJOR
- Event Summary: I/O host bridge inaccessible because rope reset
failed to complete
- Event Class: System
- Problem Description:
An I/O host bridge is inaccessible because
an I/O controller rope reset failed to complete. Firmware will deconfigure the
I/O host bridge and display the following EFI warning message, "Failed I/O
slot(s) deconfigured." Data Field: Physical location of the I/O host bridge.
- Cause / Action:
Cause: I/O chipset failure. Action: Contact
your HP representative to check the I/O chipset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 277
- Severity: MAJOR
- Event Summary: Insufficient power to turn on PCI slot
- Event Class: System
- Problem Description:
There is insufficient power. Firmware will
not power on a hot-plug I/O slot. In addition, firmware will display the
following EFI warning message, "Failed I/O slot(s) deconfigured." Date Field:
Physical location of the I/O slot.
- Cause / Action:
Cause: The power budget is exceeded. Action:
Install an additional power supply on the system.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 278
- Severity: MAJOR
- Event Summary: PCI bus walk unknown error
- Event Class: System
- Problem Description:
Firmware encountered an unexpected error
while attempting to configure an I/O host bridge's I/O devices. Firmware will
continue boot but will not configure the I/O devices connected to the
specified I/O host bridge. Such I/O devices will not be usable as console nor
boot devices but might be usable by the O/S. Data Field: Physical location of
the I/O host bridge.
- Cause / Action:
Cause: Internal firmware error. Action: Contact
your HP representative to check the firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 279
- Severity: MAJOR
- Event Summary: PCI bus walk resources exceeded
- Event Class: System
- Problem Description:
The total resource requirement from the
I/O devices connected to an I/O host bridge exceeds the resource limit of the
I/O host bridge. Firmware will continue boot but will not configure the I/O
devices connected to the specified I/O host bridge. In addition, firmware will
display the following EFI warning message, "Insufficient resources to assign
to one or more I/O devices." Such I/O devices will not be usable as console
nor boot devices but might be usable by the O/S. Data Field: Physical location
of the I/O host bridge.
- Cause / Action:
Cause: Unsupported I/O configuration. Action:
Remove any unsupported I/O cards. Move the I/O card to another slot.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 280
- Severity: MAJOR
- Event Summary: PCI bus unmap unknown error
- Event Class: System
- Problem Description:
Firmware encountered an unexpected error
while attempting to clear resource allocations on an I/O host bridge's I/O
devices. Data Field: Physical location of the I/O host bridge.
- Cause / Action:
Cause: Internal firmware error. Action: Contact
your HP representative to check the firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 281
- Severity: MAJOR
- Event Summary: PCIXCAP sampling error
- Event Class: System
- Problem Description:
An I/O host bridge failed to determine the
appropriate PCI[X] mode and frequency (PCI, PCI-X 66 MHz, PCI-X 133 MHz, etc.)
for its bus. Firmware will deconfigure the I/O host bridge and display the
following EFI warning message, "Failed I/O slot(s) deconfigured." Data Field:
Physical location of the failed I/O host bridge.
- Cause / Action:
Cause: I/O host bridge failure. Action: Contact
your HP representative to check the I/O host bridge.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 282
- Severity: MAJOR
- Event Summary: Power monitor failed to respond
- Event Class: System
- Problem Description:
Firmware is unable to access the power
monitor. Firmware will assume that there is sufficient power and proceed to
power on an I/O slot. Data Field: Physical location of the I/O slot.
- Cause / Action:
Cause: BMC failure. Action: Contact your HP
representative to check the BMC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 283
- Severity: MAJOR
- Event Summary: I/O rope reset failed to complete
- Event Class: System
- Problem Description:
An I/O controller rope reset did not
complete within the expected time limit. Firmware will deconfigure the I/O
host bridge attached to the rope. Data Field: Physical location of the
deconfigured I/O host bridge.
- Cause / Action:
Cause: I/O chipset failure. Action: Contact
your HP representative to check the I/O controller.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 284
- Severity: MAJOR
- Event Summary: I/O SBA clear error failed
- Event Class: System
- Problem Description:
During I/O chipset configuration, firmware
found a persistent error condition. Firmware will attempt to continue the
boot. Data Field: Physical location of the I/O chipset.
- Cause / Action:
Cause: I/O chipset failure. Action: Contact
your HP representative to check the I/O chipset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 285
- Severity: MAJOR
- Event Summary: PCI slot has incorrect default power state
- Event Class: System
- Problem Description:
During boot, firmware has found a hot-plug
I/O slot with an incorrect default power state. The slot power should be off
by default. Data Field: Physical location of the I/O slot.
- Cause / Action:
Cause: A non-compliant PCI[X] card is inserted
in the slot. Such cards leaks power to the PCI[X] bus, which violates the PCI
Bus Specification. Action: Replace the card with a compliant card. Cause: The
hot-plug controller has failed. Action: Contact your HP representative to
check the hot-plug slot.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 286
- Severity: MAJOR
- Event Summary: PCI slot power on error
- Event Class: System
- Problem Description:
Firmware encountered an error while
attempting to power on an I/O slot. Firmware will deconfigure the I/O slot and
display the following EFI warning message, "Failed I/O slot(s) deconfigured."
Data Field: Physical location of the I/O slot.
- Cause / Action:
Cause: The I/O card is damaged or improperly
inserted. Action: Replace or reseat the I/O card. Cause: The hot-plug
controller has failed. Action: Contact your HP representative to check the
hot-plug slot.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 287
- Severity: MAJOR
- Event Summary: PCI slot's standby power failed
- Event Class: System
- Problem Description:
An I/O slot's standby (Vaux) power has
failed. Firmware will deconfigure the I/O slot and display the following EFI
warning message, "Failed I/O slot(s) deconfigured." Data Field: Physical
location of the failed I/O slot.
- Cause / Action:
Cause: I/O slot failure. Action: Contact your
HP representative to check the I/O slot.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 288
- Severity: MAJOR
- Event Summary: Found invalid PCIXCAP value
- Event Class: System
- Problem Description:
An I/O host bridge or hot-plug controller
reported an illegal PCI[X] bus mode for its bus or slot, respectively.
Firmware will deconfigure the I/O host bridge or I/O slot and display the
following EFI warning, "Failed I/O slot(s) deconfigured." Data Field: Physical
location of the failed I/O host bridge or the failed I/O slot.
- Cause / Action:
Cause: The I/O card is damaged or improperly
inserted. Action: Replace or reseat the I/O card. Cause: I/O host bridge
failure. Hot-plug controller failure. Action: Contact your HP representative
to check the I/O host bridge or the hot-plug controller.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 289
- Severity: MAJOR
- Event Summary: Unsupported rope frequency
- Event Class: System
- Problem Description:
Firmware attempted to configure an I/O
controller rope to an unsupported frequency. Firmware will deconfigure any I/O
host bridge connected to the rope and display the following EFI warning
message, "Failed I/O slot(s) deconfigured." Data Field: Physical location of
the failed rope.
- Cause / Action:
Cause: Internal firmware error. Action: Contact
your HP representative to check the firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 290
- Severity: MAJOR
- Event Summary: Unsupported host bridge type
- Event Class: System
- Problem Description:
Firmware has found an unsupported I/O host
bridge type. Firmware will deconfigure the I/O host bridge and display the
following EFI warning message, "Failed I/O slot(s) deconfigured." Data Field:
Physical location of the I/O host bridge.
- Cause / Action:
Cause: Firmware needs to be updated. An
incorrect I/O backplane is installed. Action: Contact your HP representative
to check the firmware and the I/O backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 292
- Severity: FATAL
- Event Summary: Machine Check initiated
- Event Class: System
- Problem Description:
A Machine Check has been initiated
- Cause / Action:
Cause: A Machine Check has occurred. Action:
Analyze cause of Machine Check using diag's and EFI tools.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 293
- Severity: FATAL
- Event Summary: Error in temporary mdt area
- Event Class: System
- Problem Description:
There has been a problem building the MDT
table.
- Cause / Action:
Cause: MDT table bad. Action: Reboot if
necessary, if problem persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 294
- Severity: FATAL
- Event Summary: Failed to find lmmio entry in mdt
- Event Class: System
- Problem Description:
There has been a problem building the MDT.
- Cause / Action:
Cause: MDT table bad. Action: Reboot if
necessary, if problem persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 295
- Severity: FATAL
- Event Summary: Memory page zero bad
- Event Class: System
- Problem Description:
Memory page 0 was slated for deallocation
in the PDT. EFI cannot launch with page 0 bad, so the system will halt.
- Cause / Action:
Cause: Memory page 0 was slated for
deallocation in the PDT. Action: FW is written such that this event should
never be generated. If the user sees this event, please contact HP
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 296
- Severity: FATAL
- Event Summary: Failed to find space in mdt
- Event Class: System
- Problem Description:
There has been a problem building the MDT.
- Cause / Action:
Cause: MDT table bad. Action: Reboot if
necessary, if problem persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 297
- Severity: MAJOR
- Event Summary: Media failure: info was not retrieved/logged
- Event Class: System
- Problem Description:
There has been a media failure.
- Cause / Action:
Cause: The Error handler has failed to retrieve
or log data due to a media failure. Action: Reboot if necessary, if problem
persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 298
- Severity: MAJOR
- Event Summary: Bus interface register test failed
- Event Class: System
- Problem Description:
Indicates that the chipset register test
has failed. The data field contains the physical address of the failing
register.
- Cause / Action:
Cause: The chipset failed the register test.
Action: Contact HP support to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 299
- Severity: MAJOR
- Event Summary: Memory ECC normal write/read test failed
- Event Class: System
- Problem Description:
After FW's first access to main memory, FW
detected that the CEC logged an error after reading back what was just
written.
- Cause / Action:
Cause: The DIMM that maps to cache line 0 is in
a chipspare condition Action: Contact HP support Cause: The DIMM that maps to
address 0 is not seated properly Action: Check all of the DIMMs in the system
and make sure that they are inserted fully into the slot with the retention
mechanism in place Cause: System may be running at the wrong frequency.
Action: Verify the system bus frequency and the memory bus frequency.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 300
- Severity: MAJOR
- Event Summary: DIMM loading order error: DIMM deallocated
- Event Class: System
- Problem Description:
A DIMM that is required to be loaded in
order for this DIMM to function properly is not loaded, so FW will deallocate
this DIMM. Currently, none of the platforms require any DIMMs to be loaded in
order for this DIMM to work properly.
- Cause / Action:
Cause: A required DIMM is not loaded in order
to allow for proper operation of the DIMM specified in the physical location.
Action: Refer to the user's manual for Memory loading instructions.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 301
- Severity: MAJOR
- Event Summary: DIMM SPD checksum failed
- Event Class: System
- Problem Description:
The DIMM specified by the physical
location has an SPD EEPROM that has a bad checksum. The Data field is the
physical location of the DIMM.
- Cause / Action:
Cause: The DIMMs SPD EEPROM got corrupted.
Action: Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 302
- Severity: MAJOR
- Event Summary: DIMM SPD fatal error
- Event Class: System
- Problem Description:
Detected a fatal error in DIMM SPD
- Cause / Action:
Cause: Detection of SPD fatal error type -
various types Action: Contact HP Support personnel to troubleshoot the
problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 303
- Severity: MAJOR
- Event Summary: Unsupported memory DIMM type
- Event Class: System
- Problem Description:
A DIMM was installed whose DIMM type is
not compatible with the current set of supported DIMMs for this platform.
- Cause / Action:
Cause: A DIMM with an invalid DIMM type was
found Action: Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 304
- Severity: MAJOR
- Event Summary: The DIMM type of this DIMM doesn't match with others
in the DIMM group
- Event Class: System
- Problem Description:
The DIMM type of this DIMM is not the same
as the other DIMMs in the same group. The group of DIMMs is deallocated. If
this is the last active group of DIMMs in the system, the system is halted.
- Cause / Action:
Cause: The DIMMs in the rank do not have the
same DIMM type Action: Contact HP Support personnel to troubleshoot the
problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 305
- Severity: MAJOR
- Event Summary: The DIMM type table is full. New DIMM type cannot be
added.
- Event Class: System
- Problem Description:
The DIMM type table is full
- Cause / Action:
Cause: Too many different types of DIMMs in
system Action: Reduce the number of different types of DIMMs in the
system.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 306
- Severity: MAJOR
- Event Summary: DIMM number not found in DMT Table
- Event Class: System
- Problem Description:
An entry for the DIMM was not found in the
DMT table. The data field contains the DMT entry that the caller wanted to
find (in DIMM number format, which is 2 bytes, upper byte is the extender
number, lower byte is the chipselect of the rank caller is looking for.)
- Cause / Action:
Cause: Probable internal FW error Action:
Reload System Firmware Action: Contact HP Support personnel to troubleshoot
the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 307
- Severity: MAJOR
- Event Summary: Memory ECC multiple-bit data error detection failed
- Event Class: System
- Problem Description:
The FW selftest of CEC multi-bit error
(MBE) detection has failed. The upper 32 bits of the data field contain the
Dword offset within the cacheline of the failed MBE detection. The lower 32
bits are split in two, and they contain the bit numbers within the Dword that
were flipped in order to casue an MBE.
- Cause / Action:
Cause: The CEC failed MBE detection. Action:
Contact HP support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 308
- Severity: MAJOR
- Event Summary: Memory ECC multiple-bit ECC error signalling failed
- Event Class: System
- Problem Description:
The FW selftest of CEC multi-bit error
(MBE) signalling has failed. The upper 32 bits of the data field contain the
Dword offset within the cacheline of the failed MBE detection. The lower 32
bits are split in two, and they contain the bit numbers within the Dword that
were flipped in order to casue an MBE.
- Cause / Action:
Cause: The CEC failed MBE detection. Action:
Contact HP support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 309
- Severity: MAJOR
- Event Summary: Memory ECC single-bit data error detection failed
- Event Class: System
- Problem Description:
The FW selftest of CEC single-bit error
(SBE) detection has failed. The data field contains the bit within the Dword
that was flipped that caused the CEC to not see an SBE.
- Cause / Action:
Cause: The CEC failed SBE detection. Action:
Contact HP support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 310
- Severity: MAJOR
- Event Summary: Memory ECC single-bit ECC error detection failed
- Event Class: System
- Problem Description:
The FW selftest of CEC single-bit error
(SBE) detection has failed. The data field contains the bit within the Dword
that was flipped that caused the CEC to not see an SBE.
- Cause / Action:
Cause: The CEC failed SBE detection. Action:
Contact HP support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 311
- Severity: MAJOR
- Event Summary: Insufficient memory for operation
- Event Class: System
- Problem Description:
Memory FW detected errors below 1MB. FW
will not allow boot in this case, so memory FW will reinterleave and retest.
- Cause / Action:
Cause: FW detected memory errors below 1MB.
Action: None needed if FW recovers. If system will not boot, contact HP
support to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 312
- Severity: MAJOR
- Event Summary: Memory address not found in MBAT
- Event Class: System
- Problem Description:
Memory FW could not figure out which rank
maps to the physical address specified in the data field maps to.
- Cause / Action:
Cause: The address logged in the CEC doesn't
map to a memory rank, possibly due to a software error or NVM corruption
Action: Contact HP support to trouble shoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 313
- Severity: MAJOR
- Event Summary: Memory Error Information not cleared
- Event Class: System
- Problem Description:
Memory FW was unable to clear the platform
error logs on the CEC. The data field contains the error status of the CEC.
- Cause / Action:
Cause: Software Error or CEC error Action:
Contact HP support to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 314
- Severity: MAJOR
- Event Summary: Couldn't clear memory error logs
- Event Class: System
- Problem Description:
Memory FW was unable to clear the platform
error logs on the CEC. The data field contains the error status of the CEC.
- Cause / Action:
Cause Software Error or CEC error Action:
Contact HP support to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 315
- Severity: MAJOR
- Event Summary: Memory error clear failed
- Event Class: System
- Problem Description:
The Error registers in the CEC have failed
to clear. The data field contains the error status of the CEC after the
attempted clear.
- Cause / Action:
Cause Software error or CEC error Action:
Contact HP support to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 316
- Severity: MAJOR
- Event Summary: DIMM loading order error: DIMM deallocated
- Event Class: System
- Problem Description:
A DIMM that is required to be loaded in
order for this DIMM to function properly is not loaded, so FW will deallocate
this DIMM. Currently, none of the platforms require any DIMMs to be loaded in
order for this DIMM to work properly.
- Cause / Action:
Cause A required DIMM is not loaded in order to
allow for proper operation of the DIMM specified in the physical location.
Action: Refer to the user's manual for Memory loading instructions.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 317
- Severity: MAJOR
- Event Summary: Generic memory firmware error
- Event Class: System
- Problem Description:
An error occurred that memory FW does not
know how to handle.
- Cause / Action:
Cause Corrupt NVM or System firmware failure
Action: Contact HP support to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 318
- Severity: FATAL
- Event Summary: Memory interleave generation failed
- Event Class: System
- Problem Description:
FW was unable to create a memory
configuration with no errors in low memory to hand off to EFI.
- Cause / Action:
Cause1: DIMM(s) that map into low memory have
errors on them. Action1: Contact HP support to troubleshoot the problem.
Cause2: SFW is outdated. Action2: Update SFW.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 319
- Severity: MAJOR
- Event Summary: Memory register test failed
- Event Class: System
- Problem Description:
The chipset's memory controller failed the
register test. The data field contains the address of the register that failed
selftest.
- Cause / Action:
Cause1: The register within the chipset went
bad. Action1: Contact HP support to troubleshoot the problem Cause2: Internal
SFW error. Action2: Update to most recent SFW.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 320
- Severity: FATAL
- Event Summary: SPD found no memory DIMMs
- Event Class: System
- Problem Description:
Memory Discovery could not detect any
DIMMs installed.
- Cause / Action:
Cause: No DIMMs were detected Action: Install
DIMMs or Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 321
- Severity: FATAL
- Event Summary: No memory found
- Event Class: System
- Problem Description:
FW could not continue because there are no
valid memory ranks loaded.
- Cause / Action:
Cause FW found memory, but it could not find a
correctly loaded rank. Action: Before this event is sent, FW will output which
ranks it is deallocating and why. Review the preceding events and refer to
the users manual to correct the memory loading.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 322
- Severity: FATAL
- Event Summary: Cannot log memory error because PDT is disabled
- Event Class: System
- Problem Description:
The PDT has been disabled, and FW found
memory errors during selftest. This is a stopboot condition. Also, the PDT
will never be disabled in customer systems, so this event should never be seen
in the field.
- Cause / Action:
Cause FW found memory errors during selftest,
but could not deallocate the page because the PDT is disabled. Action:
Reenable the PDT by clearing NVM
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 323
- Severity: MAJOR
- Event Summary: PDT is disabled
- Event Class: System
- Problem Description:
An event indicating that the user has the
PDT disabled on this boot. The PDT will never be disabled in customer systems,
so this event should never be seen in the field.
- Cause / Action:
Cause Informational event indicating that FW
will not use the PDT this boot. Action: None if user does not want to use the
PDT, otherwise, clear NVM
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 324
- Severity: MAJOR
- Event Summary: Error adding entry to PDT
- Event Class: System
- Problem Description:
Error writing entry into the PDT.
- Cause / Action:
Cause NVM write error. Action: Contact HP
support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 325
- Severity: CRITICAL
- Event Summary: Cannot add PDT entry--PDT full
- Event Class: System
- Problem Description:
The memory page deallocation table (PDT)
is full.
- Cause / Action:
Cause Excessive memory errors Action: Contact
HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 326
- Severity: MAJOR
- Event Summary: Memory platform data update failure
- Event Class: System
- Problem Description:
Memory FW was unable to save or restore
the original error configuration (including CEC error log and signal enable
and CPU ECC detection). This event should never be seen in the field unless
there is a FW problem
- Cause / Action:
Cause Memory FW was unable to save or restore
the original error configuration. Action: If this is seen, update SFW.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 327
- Severity: MAJOR
- Event Summary: Can't find memory rank entry
- Event Class: System
- Problem Description:
The rank structure that corresponds to the
rankID in the data field could not be found in the Rank table. The Data field
is the rankID of the structure it is looking for. This error event should
never be seen.
- Cause / Action:
Cause The rank structure that corresponds to
the rankID in the data field could not be found in the Rank table, possibly
due to NVM corruption. Action: Contact HP support to troubleshoot the
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 329
- Severity: MAJOR
- Event Summary: Memory error overflow:
- Event Class: System
- Problem Description:
More than one error type was detected when
only one error type was expected.
- Cause / Action:
Cause: An error other than a memory error
occurred during the memory test Action: Contact HP support to troubleshoot the
problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 330
- Severity: MAJOR
- Event Summary: Memory forward progress code invalid
- Event Class: System
- Problem Description:
The forward progress bits that memory FW
uses to track state are invalid. The data field is the fwd progress field.
- Cause / Action:
Cause: The forward progress bits are invalid.
Action: Upgrade to latest system firmware, or contact HP support to
troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 331
- Severity: MAJOR
- Event Summary: Memory error status invalid
- Event Class: System
- Problem Description:
The memory error status has bits set in it
that indicate another non-memory error occurred. The data field contains the
chipset's error status.
- Cause / Action:
Cause: Non-memory errors were detected during
the memory test that FW doesn't know how to handle. Action: Update to the
latest SFW Action: Contact HP support to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 332
- Severity: MAJOR
- Event Summary: Memory error summary bits invalid
- Event Class: System
- Problem Description:
The memory test summary bits are invalid.
The data field is the test summary bits.
- Cause / Action:
Cause: The memory test summary word is invalid
Action: Update to the latest SFW. Action: Contact HP support to troubleshoot
the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 333
- Severity: MAJOR
- Event Summary: The DIMM distribution check was bypassed
- Event Class: System
- Problem Description:
The control bit to skip the DIMM
distribution check is set and the DIMM distribution check was skipped. This
bit should only be done in the factory and not in the field.
- Cause / Action:
Cause: Control bit to skip DIMM distribution
check is set. Action: Clear NVM Action: Update PDC Action: Contact HP support
to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 334
- Severity: MAJOR
- Event Summary: The DIMM Loading Order check was bypassed
- Event Class: System
- Problem Description:
The control bit to skip the DIMM loading
order check is set and the DIMM loading order check was skipped. This bit
should only be done in the factory and not in the field.
- Cause / Action:
Cause: Control bit to skip DIMM loading order
check is set. Action: Clear NVM Action: Update PDC Action: Contact HP support
to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 335
- Severity: MAJOR
- Event Summary: Looping on destructive memory tests
- Event Class: System
- Problem Description:
The control bit to loop on destructive
memory test is set and the destructive memory tests are run continuously. This
bit should only be done in the factory and not in the field.
- Cause / Action:
Cause: Control bit to loop on destructive
memory test is set. Action: Clear NVM Action: Update PDC Action: Contact HP
support to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 336
- Severity: MAJOR
- Event Summary: DIMM Set Check has been skipped
- Event Class: System
- Problem Description:
The control bit to skip the DIMM set check
is set and the DIMM set check was skipped. This bit should only be done in the
factory and not in the field.
- Cause / Action:
Cause: Control bit to skip DIMM set check is
set. Action: Clear NVM Action: Update PDC Action: Contact HP support to
troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 337
- Severity: MAJOR
- Event Summary: Serial Presence Detect (SPD) has been skipped
- Event Class: System
- Problem Description:
The control bit to skip the DIMM SPD check
is set and the checking of the DIMM SPD was skipped. This bit should only be
done in the factory and not in the field.
- Cause / Action:
Cause: Control bit to skip DIMM SPD check is
set. Action: Clear NVM Action: Update PDC Action: Contact HP support to
troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 340
- Severity: MAJOR
- Event Summary: OS INIT address not registered
- Event Class: System
- Problem Description:
The OS_INIT vector has not been registered
- Cause / Action:
Cause: The OS has not registered an OS_INIT
vector. Action: None, the OS has failed to register the vector or has chosen
not to.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 341
- Severity: MAJOR
- Event Summary: OS MCA address not registered
- Event Class: System
- Problem Description:
The OS_MCA vector has not been registered
- Cause / Action:
Cause: The OS has not registered an OS_MCA
vector. Action: None, the OS has failed to register the vector or has chosen
not to.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 342
- Severity: MAJOR
- Event Summary: OS MCA did not correct the Machine Check
- Event Class: System
- Problem Description:
An Uncorrected Machine Check has occurred
- Cause / Action:
Cause: Uncorrected Machine Check. Action:
Analyze cause of Machine Check using diagnostic and EFI tools.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 343
- Severity: FATAL
- Event Summary: Found bad miscellaneous register
- Event Class: System
- Problem Description:
A PDH register has failed.
- Cause / Action:
Cause: A PDH register has failed. Action:
Reboot if necessary, if problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 344
- Severity: MAJOR
- Event Summary: SAL_CHECK failed for an unknown reason
- Event Class: System
- Problem Description:
The handler for SAL_CHECK has failed for
an unknown reason.
- Cause / Action:
Cause: The handler for SAL_CHECK has failed for
an unknown reason. Action: Reboot if necessary, if problem persists contact
your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 345
- Severity: MAJOR
- Event Summary: SAL_INIT failed for an unknown reason
- Event Class: System
- Problem Description:
The handler for SAL_INIT has failed for an
unknown reason.
- Cause / Action:
Cause: The handler for SAL_INIT has failed for
an unknown reason. Action: Reboot if necessary, if problem persists contact
your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 347
- Severity: MAJOR
- Event Summary: Unexpected return to SAL_CHECK
- Event Class: System
- Problem Description:
SAL_CHECK has been unexpectedly returned
to.
- Cause / Action:
Cause: SAL_CHECK has been unexpectedly returned
to. Action: Reboot if necessary, if problem persists contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 348
- Severity: MAJOR
- Event Summary: Unexpected return to SAL_INIT
- Event Class: System
- Problem Description:
SAL_CHECK has been unexpectedly returned
to.
- Cause / Action:
Cause: SAL_CHECK has been unexpectedly returned
to. Action: Reboot if necessary, if problem persists contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 349
- Severity: CRITICAL
- Event Summary: Firmware is adding a DEGRADED cpu node to the device
tree.
- Event Class: System
- Problem Description:
Firmware is adding a device tree node for
a CPU that is degraded in functionality. The cpu should not be trussed.
- Cause / Action:
A CPU that is not fully functional is installed
in the cell board. Replace.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 350
- Severity: CRITICAL
- Event Summary: PD rendez will fail do to a Firmware Tree error
- Event Class: System
- Problem Description:
Firmware was unable to locate a required
element in the device tree and cannot create a partition. The resource that
cannot be located is listed as an ANSI string in the data field.
- Cause / Action:
Decode the ASCII string in the data field to
determine what resource is missing. Examine earlier chassis codes to determine
why that resource is unavailable.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 351
- Severity: CRITICAL
- Event Summary: The current cell is not configured as part of the
expected set
- Event Class: System
- Problem Description:
The currently executing cell is not
configured to be part of the cell set it is attempting to rendezvous with.
- Cause / Action:
A bad complex profile exists. Correct and
redistribute.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 352
- Severity: CRITICAL
- Event Summary: A remote CSR could not be read
- Event Class: System
- Problem Description:
The current cell could not read a remote
cells CSR. The remote cell number is displayed in the data field. These cells
will not be able to rendezvous.
- Cause / Action:
Either a hardware connection problem exists, or
fabric was unable to be routed. Verify hardware and reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 353
- Severity: CRITICAL
- Event Summary: The current cell is too late to rendezvous with
other cells
- Event Class: System
- Problem Description:
The currently executing cell arrived too
late to rendezvous with the other cells described in the complex profile as
cells it should rendezvous with.
- Cause / Action:
This cell took to long completing previous
steps to rendezvous. A bad complex profile could also cause this problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 354
- Severity: FATAL
- Event Summary: The current cell detected incompatible CPUs on
another cell
- Event Class: System
- Problem Description:
The currently executing cell detected CPUs
that are incompatible with it to be installed on a cell that the current cell
is trying to rendezvous with.
- Cause / Action:
Mixed CPU types are installed in the same
partition. Remove them.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 355
- Severity: CRITICAL
- Event Summary: Current cell was too slow creating the local
rendezvous set
- Event Class: System
- Problem Description:
The current cell was too slow creating the
local rendezvous set and the other cells have left it behind. It will not be
able to participate in the remainder of the rendezvous.
- Cause / Action:
Cell too slow. Could be bad hardware. Check for
other errors and reset
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 356
- Severity: CRITICAL
- Event Summary: Reporting cell was not included in the global cell
set
- Event Class: System
- Problem Description:
The reporting cell was not included in the
final global set that was agreed upon. This means that another cell either
could not reach the reporting cell or the reporting cell was too late arriving
to a required state.
- Cause / Action:
Fabric problem, Connection problem or timing
problem. Reset the PD.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 357
- Severity: FATAL
- Event Summary: No Core Cell can be selected in the PD.
- Event Class: System
- Problem Description:
No cells in the PD can be a core cell.
This is fatal.
- Cause / Action:
No cells have a functioning core IO card. Add a
core IO card to a cell in the PD and reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 358
- Severity: CRITICAL
- Event Summary: Firmware was unable to notify utilities of the core
cell number
- Event Class: System
- Problem Description:
System Firmware was unable to notify
utilities of the selected core cell number.
- Cause / Action:
Communication with utilities is broken. Check
for earlier errors or NVRAM problems.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 359
- Severity: CRITICAL
- Event Summary: Fabric code unable to find a needed service
provider.
- Event Class: System
- Problem Description:
The fabric code is unable to find a
service provider for a required banyan service.
- Cause / Action:
The registry is corrupt or the ROM is
incomplete.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 360
- Severity: CRITICAL
- Event Summary: Error in a fabric Port
- Event Class: System
- Problem Description:
The fabric port specified in the data
field had an error.
- Cause / Action:
Reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 361
- Severity: CRITICAL
- Event Summary: Parity error detected on read from fabric
- Event Class: System
- Problem Description:
An error occurred reading a CSR. The CSR
address is displayed in the data field.
- Cause / Action:
Hardware problem. Check connections and
reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 362
- Severity: CRITICAL
- Event Summary: Error writing to Fabric
- Event Class: System
- Problem Description:
Error writing to Fabric. CSR data in data
field.
- Cause / Action:
Bad hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 363
- Severity: FATAL
- Event Summary: Crossbar slices are out of rev with each other.
- Event Class: System
- Problem Description:
Incompatible crossbar slices are installed
The data field is the two revisions reported by slice1 and slice0 of the CSR
data.
- Cause / Action:
Bad hardware configuration. Replace the
crossbar.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 364
- Severity: FATAL
- Event Summary: Crossbar slices are configured poorly
- Event Class: System
- Problem Description:
Crossbar slices are in different
locations. The data field is the two locations reported by slice1 and slice0
of the CSR data.
- Cause / Action:
Fatal configuration. Reconfigure the
hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 365
- Severity: CRITICAL
- Event Summary: A CPU has taken over for the monarch CPU
- Event Class: System
- Problem Description:
A CPU has taken over as the monarch CPU.
- Cause / Action:
The previous monarch may be suspect.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 366
- Severity: FATAL
- Event Summary: SRAM cannot be used on the cell
- Event Class: System
- Problem Description:
SRAM cannot be accessed on the cell board.
Execution cannot continue.
- Cause / Action:
SRAM cannot be located or used on the cell
board. Replace the cell board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 367
- Severity: FATAL
- Event Summary: The dillon hardware cannot be located.
- Event Class: System
- Problem Description:
The dillon component/chip cannot be
located or used.
- Cause / Action:
ROM is corrupt. Replace the rom or reprogram
flash.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 368
- Severity: CRITICAL
- Event Summary: A required piece of PDH bus hardware cannot be
contacted.
- Event Class: System
- Problem Description:
A required piece of PDH bus hardware
cannot be contacted.
- Cause / Action:
Verify all connections of PDH bus components or
replace the cell board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 372
- Severity: MAJOR
- Event Summary: IO Link software error was corrected.
- Event Class: System
- Problem Description:
IO Link Software error was corrected.
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 373
- Severity: CRITICAL
- Event Summary: Bad parity data from RD Rtn FIFO on PIO Read (UNC)
- Event Class: System
- Problem Description:
Bad parity data from RD Rtn FIFO on PIO
Read (UNC).
- Cause / Action:
Replace bad hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 374
- Severity: CRITICAL
- Event Summary: Parity error in Reg FIFO Internal parity error.
- Event Class: System
- Problem Description:
Parity error in Reg FIFO Internal parity
error.
- Cause / Action:
Replace bad hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 375
- Severity: CRITICAL
- Event Summary: TLB Fetch timeout
- Event Class: System
- Problem Description:
TLB Fetch timeout.
- Cause / Action:
Replace bad hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 376
- Severity: FATAL
- Event Summary: Link presence goes away, FE
- Event Class: System
- Problem Description:
Link presence goes away, FE.
- Cause / Action:
Replace the link.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 377
- Severity: FATAL
- Event Summary: LBA to SBA parity error on command, rope will go
fatal
- Event Class: System
- Problem Description:
LBA to SBA parity error on command, rope
will go fatal.
- Cause / Action:
Bad hardware.
Replace I/O chassis.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 378
- Severity: FATAL
- Event Summary: Access to invalid TLB entry Requesting rope fatal
- Event Class: System
- Problem Description:
Access to invalid TLB entry Requesting
rope fatal.
- Cause / Action:
Replace bad hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 379
- Severity: FATAL
- Event Summary: Memory fetch timeout
- Event Class: System
- Problem Description:
Memory Fetch Timeout.
- Cause / Action:
Replace bad hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 380
- Severity: CRITICAL
- Event Summary: Error was encountered when initializing the LBA.
- Event Class: System
- Problem Description:
An error was encountered when initiating
the rope number specified in the data field.
- Cause / Action:
Replace the bad hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 381
- Severity: MAJOR
- Event Summary: LBA correctable Timeout Error was encountered.
- Event Class: System
- Problem Description:
LBA correctable timeout error was
encountered.
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 382
- Severity: CRITICAL
- Event Summary: LBA uncorrectable Function Error was encountered.
- Event Class: System
- Problem Description:
LBA uncorrectable Function Error was
encountered.
- Cause / Action:
Replace the damaged hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 383
- Severity: CRITICAL
- Event Summary: LBA uncorrectable Timeout Error was encountered.
- Event Class: System
- Problem Description:
LBA uncorrectable Timeout Error was
encountered.
- Cause / Action:
Replace the damaged hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 384
- Severity: CRITICAL
- Event Summary: Misc. uncorrectable error discovered on LBA.
- Event Class: System
- Problem Description:
Misc uncorrectable error discovered on
LBA.
- Cause / Action:
Replace damaged hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 385
- Severity: FATAL
- Event Summary: LBA encountered an uncorrectable parity error.
- Event Class: System
- Problem Description:
LBA encountered an uncorrectable parity
error.
- Cause / Action:
Replace the damaged hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 386
- Severity: FATAL
- Event Summary: LBA Misc. Fatal Error encountered.
- Event Class: System
- Problem Description:
LBA misc. Fatal Error encountered.
- Cause / Action:
Replace damaged hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 387
- Severity: FATAL
- Event Summary: LBA Fatal function error encountered.
- Event Class: System
- Problem Description:
LBA Fatal function error encountered.
- Cause / Action:
Replace damaged hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 388
- Severity: FATAL
- Event Summary: LBA Fatal Parity error encountered.
- Event Class: System
- Problem Description:
LBA Fatal Parity error encountered.
- Cause / Action:
Replace hardware, either PCI card or IO
backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 389
- Severity: FATAL
- Event Summary: LBA Fatal Timeout Error Encountered.
- Event Class: System
- Problem Description:
LBA Fatal timeout error encountered.
- Cause / Action:
Replace damaged hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 392
- Severity: CRITICAL
- Event Summary: DIMM SPD Extended Checksum Failure
- Event Class: System
- Problem Description:
The calculated and compared Checksums of
the SPD EEPROM don't match.
- Cause / Action:
Replace any bad DIMMs.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 393
- Severity: MAJOR
- Event Summary: Options header checksum error encountered.
- Event Class: System
- Problem Description:
The Options component encountered a header
checksum error. The actual data is in the data field of the chassis code.
- Cause / Action:
Reinitialize the options data.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 394
- Severity: MAJOR
- Event Summary: Options data checksum error was encountered.
- Event Class: System
- Problem Description:
The Options service data had a bad
checksum. Actual data is in the data field.
- Cause / Action:
Verify options data and reinitialize if
necessary.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 395
- Severity: CRITICAL
- Event Summary: Internal inconsistency in the interleave tables.
- Event Class: System
- Problem Description:
Internal inconsistency in the interleave
tables.
- Cause / Action:
Reconfigure and Reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 396
- Severity: MAJOR
- Event Summary: CellInfoList is not NULL.
- Event Class: System
- Problem Description:
The CellInfoList is not null and was
expected to be. There has been an error in interleaving.
- Cause / Action:
Reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 397
- Severity: CRITICAL
- Event Summary: Error in constructing the Memory Descriptor.
- Event Class: System
- Problem Description:
Error in constructing the Memory
Descriptor.
- Cause / Action:
Reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 398
- Severity: CRITICAL
- Event Summary: Unable to update the local memory layout
- Event Class: System
- Problem Description:
Unable to update the local memory layout.
- Cause / Action:
Reset
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 399
- Severity: CRITICAL
- Event Summary: A required address was not found within a mapped
address.
- Event Class: System
- Problem Description:
A required address was not found within a
mapped address in the PDT.
- Cause / Action:
Reset
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 400
- Severity: CRITICAL
- Event Summary: Failure to install a Partition level PDT.
- Event Class: System
- Problem Description:
Failure to install a partition level PDT.
Errors prevented it.
- Cause / Action:
Reset
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 401
- Severity: CRITICAL
- Event Summary: A FATAL resource could not be found or is unusable
- Event Class: System
- Problem Description:
A FATAL resource that is required early in
the initialization process either could not be found, or was unusable. The
specific resource is specified in the data field as follows: Platform
Parameters Component not found in FIT: 0xdead0001; SRAM_BASE not found in
platform parms: 0xdead0002; SRAM_SIZE not found in Platform Parms: 0xdead0003;
firmware framework not found in the fit: 0xdead0004; Framework Segmant not
usable: 0xdead0005; bad NVRAM: 0xdead0006; Dillon unusable: 0xdead0007; SRAM
unusable: 0xdead0008; CPU unusable: 0xdead0009; Options Component Unusable:
0xdead000a; Real Time Clock unusable: segment; Unknown: 0xdead0086
- Cause / Action:
Determine the failing component or hardware
from the data field as described and replace.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 402
- Severity: FATAL
- Event Summary: Internal firmware programming error.
- Event Class: System
- Problem Description:
An internal firmware error was
encountered. This is usually caused by a bad parameter passed to a function,
corrupt memory, corrupt malloc tables or something similar. The data field
contains the IP address of the function that encountered the error.
- Cause / Action:
Report the IP to the firmware team. Reset the
system. This cannot be worked around in the field.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 405
- Severity: CRITICAL
- Event Summary: A semaphore could not be obtained
- Event Class: System
- Problem Description:
The required semaphore could not be
obtained due to errors. The data field contains the IP of the routine trying
to obtain the semaphore. A request was placed for more NVRAM to be allocated
but NVRAM was full.
- Cause / Action:
Cause: Action: Reset system to clear the
semaphore Try reinitializing NVRAM. If problem persists, contact engineering.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 407
- Severity: MAJOR
- Event Summary: The requested NVRAM block was not found.
- Event Class: System
- Problem Description:
The requested NVRAM block was not found.
The ID that was not found is displayed in the data field.
- Cause / Action:
No Action Required. Firmware can allocated
space for the block.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 408
- Severity: MAJOR
- Event Summary: The requested NVRAM block is locked.
- Event Class: System
- Problem Description:
The block id specified in the data field
is locked.
- Cause / Action:
Retry the operation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 409
- Severity: MAJOR
- Event Summary: Firmware tried to unlock a NVRAM block that was
already unlocked.
- Event Class: System
- Problem Description:
Firmware tried to unlock a NVRAM block
that was already unlocked. Data field contains the block ID.v
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 410
- Severity: CRITICAL
- Event Summary: The Header in NVRAM was not found
- Event Class: System
- Problem Description:
The header in the NVRAM space was not
found.
- Cause / Action:
NVRAM cannot be used. It must be initialized
first. Firmware will attempt the initialization.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 411
- Severity: CRITICAL
- Event Summary: The Freelist used for NVM block allocation is
corrupt.
- Event Class: System
- Problem Description:
The Freelist used vor Non-Volatile Memory
allocation is corrupt.
- Cause / Action:
Band NVRAM/ reinitialize.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 412
- Severity: CRITICAL
- Event Summary: Firmware is preparing to reset for reconfiguration.
- Event Class: System
- Problem Description:
System firmware has detected a condition
that requires the cell to be reset for reconfiguration. The function has been
called and is now executing. Data field contains the cell number being reset.
- Cause / Action:
This can be caused by many conditions including
a bad complex profile, a bad hardware configuration, a cell arriving late to
the rendezvous point. A cell not being able to rendezvous. Reconfiguration
from partition manager is recommended.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 413
- Severity: CRITICAL
- Event Summary: An error was encountered communicating with
utilities during PD render.
- Event Class: System
- Problem Description:
During PD rendezvous, system firmware
encountered a problem sending commands to the utilities system. This will
prevent a fully functional PD from being created.
- Cause / Action:
Verify communications with the utilities
system.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 414
- Severity: FATAL
- Event Summary: Forward Progress is stopping. The Cell or System
will not boot further.
- Event Class: System
- Problem Description:
System Firmware has determined that cell
or system progress must be halted. The data field contains the Instruction
Pointer of the function that called for the halt. The second instance of this
code being emitted indicates the major state in system change. This code must
be emitted in pairs.
- Cause / Action:
An error occurred which triggered system
firmware to cease making forward progress. The CPU is put into a spin loop so
that external debugging can take place. See earlier event ids to help
determine the cause of the error. Also note that the Error Response Mode is
likely to have directed firmware to HALT.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 415
- Severity: MAJOR
- Event Summary: No console is available for the DUI to use.
- Event Class: System
- Problem Description:
The DUI (Developers User Interface) was
entered, but there is no console available for the interface.
- Cause / Action:
DUI was entered before the console is
available. DUI will exit and processing will continue.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 416
- Severity: CRITICAL
- Event Summary: Error Processing encountered an unrecoverable error
- Event Class: System
- Problem Description:
During Error processing and reporting, an
error was detected that prevented further processing of errors. The data field
contains an ASCII message indicating the problem.
- Cause / Action:
Decode the ASCII message and correct the
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 417
- Severity: CRITICAL
- Event Summary: System is unable to complete the Reset For
Reconfiguration request.
- Event Class: System
- Problem Description:
System firmware is unable to complete the
request to reset the cell for reconfiguration. Typically, are required step
has not been performed yet or a needed resource is unavailable.
- Cause / Action:
Delay the request for reconfiguration until
after the PD has been released from SINC BIB.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 418
- Severity: CRITICAL
- Event Summary: The cell is not able to reach all requested cells
through the fabric.
- Event Class: System
- Problem Description:
The cell was not able to reach all the
other cells in its configured set through the fabric. The data field contains
the bitmask of actual cells that were reached.
- Cause / Action:
Fabric wasn't able to route to all cells
described in the complex profile correctly due to a hardware problem. Some of
the cells are unreachable. Update the complex profile or correct the hardware
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 419
- Severity: MAJOR
- Event Summary: LBA has unexpected number of I/O Slots.
- Event Class: System
- Problem Description:
Firmware detected a PCI-to-PCI bridge that
exceeds the maximum supported bridge depth. Firmware will not configure I/O
devices below the maximum bridge depth. Such I/O devices will not be usable as
console nor boot devices but might be usable by the O/S. Data Field: PCI
function address of the bridge that exceeded the maximum depth limit. Bits
24..31: segment number Bits 16..23: bus number Bits 11..15: device number Bits
8..10: function number Bits 0..7: reserved (0)
- Cause / Action:
Cause: Unsupported I/O configuration. Action:
Remove the I/O cards below the specified PCI-to-PCI bridge.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 420
- Severity: CRITICAL
- Event Summary: Console device failed to connect.
- Event Class: System
- Problem Description:
Debugging event, not for release. This
event is no longer used on Everest/xPeak systems but its event ID is still
contained in the code base.
- Cause / Action:
Debugging event, not for release.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 421
- Severity: MAJOR
- Event Summary: Copying memory test code failed.
- Event Class: System
- Problem Description:
This event is unused
- Cause / Action:
Cause: Memory test code located in main memory
has been corrupted Action: Contact HP support personnel to troubleshoot the
problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 423
- Severity: CRITICAL
- Event Summary: Multiple Core Cells have been discovered in the same
PD
- Event Class: System
- Problem Description:
The reporting Cell thinks that it should
be the core cell but has discovered another cell in the same PD that thinks it
should be the core cell. This is a CRITICAL problem.
- Cause / Action:
Verify that the complex profile is correct and
reset the partition.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 424
- Severity: CRITICAL
- Event Summary: The utilities component encountered an error when
sending a command to the MP
- Event Class: System
- Problem Description:
The utilities system firmware component
received an error response from the SINC in response to a command being sent.
The exact error is displayed in the data field. Typically, this can occur
when the SINC cannot talk to the MP.
- Cause / Action:
Verify the utilities system is connected
correctly and reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 426
- Severity: CRITICAL
- Event Summary: This indicates that all the CPUs in the cell did not
rendezvous during the MCA.
- Event Class: System
- Problem Description:
This denotes the fact that all the CPUs in
the cell did not rendezvous.
- Cause / Action:
When this happens the cell will step through
some of the error logging code on its own and then reset itself.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 427
- Severity: CRITICAL
- Event Summary: This indicates that it does not have any access to
the PD.
- Event Class: System
- Problem Description:
This chassis code indicates that the cell
does not have any access to a PD.
- Cause / Action:
Forward Progress indicator; the cell will
independently step through the error logging steps before it resets
itself.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 428
- Severity: CRITICAL
- Event Summary: This indicates the loss of lockstep during the MCA
path.
- Event Class: System
- Problem Description:
This indicates the cell would not be able
to join the other cells in the PD level rendezvous. The data portion
represents the cell id of the cell that incurred the loss of lockstep.
- Cause / Action:
The cell will take up a few more error logging
steps independently before resetting itself.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 429
- Severity: CRITICAL
- Event Summary: The PD level cell rendezvous failed.
- Event Class: System
- Problem Description:
This indicates that some of the cells did
not show up during the PD level rendezvous.
- Cause / Action:
This means that the cells will independently
step through some of the error logging code and then reset themselves.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 434
- Severity: CRITICAL
- Event Summary: The reporting cell is not configured to be in a PD.
- Event Class: System
- Problem Description:
The Reporting Cell is not configured to be
in a PD, according to Complex Profile Group A.
- Cause / Action:
Run parmgr to configure the cell into a PD and
reset the PD or add the cell.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 437
- Severity: FATAL
- Event Summary: The PD cannot boot, a majority of cells did not
arrive at Rendezvous
- Event Class: System
- Problem Description:
Not enough cells made the Rendezvous for
boot to continue. The rules are listed in the cause action section.
- Cause / Action:
PD Rendezvous Boot Rules: If greater than 50%
of the assigned cells are rendezvoused, we will boot. If less than 50% of the
assigned cells are rendezvoused, don't boot. If exactly 50% of the assigned
cells are rendezvoused, including all of the preferred core cells, we will
boot. If exactly 50% have rendezvoused, and there is a specified preferred
core cell not rendezvoused, don't boot. If exactly 50% have rendezvoused, and
there are no preferred core cells, don't boot. If any of the above apply in
preventing the boot. Reconfigure the PD and reboot.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 439
- Severity: MAJOR
- Event Summary: INIT: Monarch failed in slave rendezvous
- Event Class: System
- Problem Description:
SFW's INIT handler has failed to
rendezvoused the processors.
- Cause / Action:
Cause: A processor has failed rendezvous.
Action: Reboot if necessary, if problem persists contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 440
- Severity: MAJOR
- Event Summary: MC: I/O error log/clear error
- Event Class: System
- Problem Description:
SFW's Machine Check Handler was unable to
log or clear I/O error records.
- Cause / Action:
Cause: SFW's Machine Check Handler was unable
to log or clear I/O error records. Action: Reboot if necessary, if problem
persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 441
- Severity: MAJOR
- Event Summary: MC: MCA to BERR escalation not supported by PAL
- Event Class: System
- Problem Description:
Cannot escalate an MCA to BERR
- Cause / Action:
Cause: Cannot escalate an MCA to BERR. Action:
Analyze Machine Check Logs using diagnostic tools and EFI tools.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 442
- Severity: MAJOR
- Event Summary: MC: MCA to BINIT escalation not supported by PAL
- Event Class: System
- Problem Description:
Cannot escalate an MCA to BINIT.
- Cause / Action:
Cause: Cannot escalate an MCA to BINIT. Action:
Analyze Machine Check Logs using diagnostic tools and EFI tools.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 443
- Severity: MAJOR
- Event Summary: MC: Get PAL features failed
- Event Class: System
- Problem Description:
SFW failed to get the feature set from
PAL.
- Cause / Action:
Cause: SFW failed to get the feature set from
PAL. Action: None
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 444
- Severity: MAJOR
- Event Summary: MC: Previous PAL rendezvous failed; rebooting
- Event Class: System
- Problem Description:
PAL Failed to rendezvous the processors
during a MCA.
- Cause / Action:
Cause: PAL Failed to rendezvous the processors
during a MCA.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 445
- Severity: MAJOR
- Event Summary: MC: Set PAL features failed
- Event Class: System
- Problem Description:
SFW failed to get the feature set from
PAL.
- Cause / Action:
Cause: SFW failed to get the feature set from
PAL. Action: None
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 446
- Severity: MAJOR
- Event Summary: MC: Monarch failed in slave rendezvous
- Event Class: System
- Problem Description:
SFW's MCA Handler has failed to rendezvous
all the slaves Data: Return from the rendezvous call.
- Cause / Action:
Cause: A slave failed to rendezvous. Action:
Reboot if necessary, if problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 447
- Severity: MAJOR
- Event Summary: MC_RENDEZVOUS: Rendezvous vector out of range
- Event Class: System
- Problem Description:
A bad rendezvous vector has been
registered.
- Cause / Action:
Cause: A bad rendezvous vector has been
registered. Action: Reboot if necessary to re-register vector, if problem
persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 448
- Severity: MAJOR
- Event Summary: MC_RENDEZVOUS: No MC monarch
- Event Class: System
- Problem Description:
No Machine Check Monarch exists, exiting
MC Rendezvous.
- Cause / Action:
Forward progress, no action required
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 449
- Severity: MAJOR
- Event Summary: MC_RENDEZVOUS: No wakeup registered
- Event Class: System
- Problem Description:
The OS has not registered a wake-up
mechanism for rendezvous.
- Cause / Action:
Forward progress, no action required
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 450
- Severity: MAJOR
- Event Summary: MC_RENDEZVOUS: MCA escalation not supported by PAL
- Event Class: System
- Problem Description:
PAL call failed to set the BINIT
escalation bit
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 451
- Severity: MAJOR
- Event Summary: MC_RENDEZVOUS: Get PAL features failed
- Event Class: System
- Problem Description:
The PAL call PAL_GET_FEATURES has failed.
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 452
- Severity: MAJOR
- Event Summary: MC_RENDEZVOUS: Set PAL features failed
- Event Class: System
- Problem Description:
The PAL call PAL_SET_FEATURES has failed.
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 453
- Severity: FATAL
- Event Summary: Internal Firmware Programming Error from the EFI
portion of the firmware
- Event Class: System
- Problem Description:
An internal SAL_ABI firmware error was
encountered. This is usually caused by a bad parameter passed to a function,
corrupt memory, corrupt malloc, corrupt firmware tree or something similar.
The data field contains the IP address of the function that encountered the
error.
- Cause / Action:
Report the IP to the firmware team. Reset the
system. This cannot be worked around in the field.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 455
- Severity: FATAL
- Event Summary: Inconsistency in the length of the ESI table
- Event Class: System
- Problem Description:
The length field within the ESI
(Extensible SAL Interface) table does not agree with the product of the
entry_count field and the size of each entry. Data Field: computed value of
the length based on entry_count and size of the entries.
- Cause / Action:
Cause: Table entries corrupted. Action: Reboot
system. Cause: New table entry types added by SAL not understood by EFI.
Action: Upgrade system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 456
- Severity: FATAL
- Event Summary: The computed checksum for ESI Table incorrect.
- Event Class: System
- Problem Description:
The computed checksum for the ESI
(Extensible SAL Interface) table is not zero as expected. EFI is halting. Data
Field: the computed checksum.
- Cause / Action:
Cause: Table corrupted. Action: Reboot the
system. Cause: Table's checksum miscomputed. Action: Upgrade system
firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 457
- Severity: MAJOR
- Event Summary: ESI Table contains an unsupported entry type.
- Event Class: System
- Problem Description:
EFI found an unsupported entry type within
the ESI (Extensible SAL Interface) Table. Data Field: unknown type.
- Cause / Action:
Cause: Corrupted table. Action: Reboot system.
Cause: Mismatch between SAL and EFI. Action: Upgrade system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 458
- Severity: MAJOR
- Event Summary: A GUID was larger than the expected 128 bits.
- Event Class: System
- Problem Description:
EFI was attempting to output a GUID in the
EFI_GUID_HALF1 and EFI_GUID_HALF2 events which was larger than 128 bits. The
data field contains the actual length of the GUID in bytes.
- Cause / Action:
Cause: Inconsistency in EFI firmware. Action:
Upgrade system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 459
- Severity: FATAL
- Event Summary: EFI is halting
- Event Class: System
- Problem Description:
EFI is halting. Look for the cause of the
halt in preceding events. Data Field: the "halt" (0x0F) major change in system
state code.
- Cause / Action:
Cause: Unknown. Action: examine preceding
events for problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 461
- Severity: MAJOR
- Event Summary: EFI internal error detected resulting in execution
of ASSERT macro
- Event Class: System
- Problem Description:
EFI has detected an internal error. The
actual error is unspecified by this event. Examine previous events and console
output for possible explanations.
- Cause / Action:
The cause is unknown. See previous events and
console output for causes.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 462
- Severity: FATAL
- Event Summary: EFI has executed the "break" shell command.
- Event Class: System
- Problem Description:
- Cause / Action:
Cause: Executing the "break command. Action:
Check for user entering "break" command. Check for shell scripts using the
"break" command.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 463
- Severity: FATAL
- Event Summary: EFI USB HCD interrupt service has detected the host
controller is hung
- Event Class: System
- Problem Description:
The EFI USB HCD interrupt service has
detected the host controller is hung. EFI is halting.
- Cause / Action:
Cause: Problem with USB controller. Action:
Reset the card containing the USB interface to restart the controller. Contact
your HP representative to check the USB interface.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 464
- Severity: FATAL
- Event Summary: The EFI/SAL handoff structure's version does not
match EFI expectations
- Event Class: System
- Problem Description:
The EFI/SAL handoff structure's version
does not match EFI expectations. EFI is halting. Look for
EFI_SAL_HANDOFF_VER_EXPECTED to provide EFI's expected value. Data Field:
Actual value of the version in the structure.
- Cause / Action:
Cause: EFI/SAL firmware mismatch. Action:
Upgrade System Firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 465
- Severity: FATAL
- Event Summary: Unable to obtain access to all RTC SAL services
- Event Class: System
- Problem Description:
EFI is unable to obtain access to all the
RTC (Real Time Clock) SAL services. This means that EFI is unable to fully
interact with the RTC. EFI is halting. Data Field: Return status from internal
EFI function.
- Cause / Action:
Cause: Not all expected services are available.
Mismatch between EFI and SAL versions. Internal EFI error.
Action: Upgrade
system firmware. Cause: EFI unable to create internal event. EFI out of
resources. Action: Reset system.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 466
- Severity: FATAL
- Event Summary: Unable to obtain access to all SAL timer services
- Event Class: System
- Problem Description:
EFI is unable to obtain access to all the
SAL timer services. This means that EFI is unable to fully interact with the
timer. EFI is halting. Data Field: Return status from internal EFI function.
- Cause / Action:
Cause: Not all expected services are available.
Mismatch between EFI and SAL versions. Internal EFI error. Action: Upgrade
system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 467
- Severity: FATAL
- Event Summary: EFI unable to start the periodic timer
- Event Class: System
- Problem Description:
EFI is unable to start the periodic timer.
This timer interrupts EFI periodically to process time sensitive events. EFI
is halting. Data Field: Return status for internal EFI function.
- Cause / Action:
Cause: Internal system firmware error. Action:
Reset the system. Cause: Mismatch between EFI and SAL versions Action: Upgrade
system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 468
- Severity: FATAL
- Event Summary: No I/O port space region found in the MDT
- Event Class: System
- Problem Description:
EFI did not find an I/O port space region
in the MDT. EFI is halting.
- Cause / Action:
Cause: EFI/SAL handoff structure corrupted.
Action: Determine source of corruption and reboot. Cause: EFI/SAL mismatch.
Action: Check system firmware versions and upgrade if necessary.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 469
- Severity: FATAL
- Event Summary: EFI reached an unimplemented section of code
- Event Class: System
- Problem Description:
EFI reached an unimplemented section of
code. EFI is halting. Data Field: Unique identifier indicating the location
reached within the code.
- Cause / Action:
Cause: Reached unimplemented firmware. Action:
Upgrade system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 470
- Severity: MAJOR
- Event Summary: EFI unable to read current speedy boot settings
- Event Class: System
- Problem Description:
EFI was unable to read the current speedy
boot settings. The speedy boot settings are stored within the BMC. EFI will
use a default value of 0 and continue booting. The speedy boot functionality
is also accessed via the boot test EFI shell command and via the OS. These
other accesses will likely fail. Data Field: Return status from internal EFI
function.
- Cause / Action:
Cause: BMC not functioning. Action: Reset the
BMC. Contact your HP representative to check the BMC. Cause: BMC/SAL firmware
mismatch. Action: Upgrade system firmware and/or BMC firmware. Cause: EFI/SAL
version mismatch. Action: Upgrade system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 471
- Severity: FATAL
- Event Summary: Unpermitted SAL callback attempted
- Event Class: System
- Problem Description:
A SAL Callback was attempted. This is not
permitted. EFI is halting. Data Field: index of the function that was being
called.
- Cause / Action:
Cause: Internal EFI error. Action: Upgrade
system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 472
- Severity: MAJOR
- Event Summary: EFI unable to determine frequency base of the CPU
interval timer
- Event Class: System
- Problem Description:
EFI is unable to determine the frequency
base for the Interval Timer within the CPU. The SAL procedure EFI uses to get
this information returned an error. EFI uses this information to create delays
within EFI based on the interval timer. EFI will assume 800 MIPS. Data Field:
return status from the SAL procedure.
- Cause / Action:
Cause: Invalid timer ratio. Action: Reset
system. Cause: Internal system firmware error. Action: Upgrade system
firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 473
- Severity: MAJOR
- Event Summary: EFI system events already initialized
- Event Class: System
- Problem Description:
The EFI system events have already been
initialized. This is unexpected. EFI is continuing. Data Field: the current
value of the system event entry point.
- Cause / Action:
Cause: Multiple attempts to initialize system
events, EFI internal error. Action: Upgrade system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 474
- Severity: MAJOR
- Event Summary: Unable to create internal virtualization event while
initializing IPMI events
- Event Class: System
- Problem Description:
EFI was unable to create an internal
virtualization event while initializing EFI's System Events (IPMI events).
This internal event is not an IPMI event; rather it serves as a trigger for
EFI to virtualize the System Event facility when going virtual. EFI will
likely halt. Data Field: return status from internal EFI function.
- Cause / Action:
Cause: Out of resources. Internal EFI error.
Action: Reboot system. Upgrade system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 476
- Severity: CRITICAL
- Event Summary: There was an error creating or initializing the FPGA
node in firmware
- Event Class: System
- Problem Description:
An error was detected while initializing
the FPGA node and services associated with the PDH.
- Cause / Action:
Cause: Unable to properly initialize a system
firmware node Action: Check for other errors in the system first. Invalidate
NVM and retry to boot. Get the latest firmware release.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 481
- Severity: FATAL
- Event Summary: some processors not compatible
- Event Class: System
- Problem Description:
Installed processors are not of compatible
models or families
- Cause / Action:
Replace processors with compatible ones if all
processors are to be used.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 482
- Severity: FATAL
- Event Summary: caches sizes are inconsistent
- Event Class: System
- Problem Description:
Processors with different cache sizes are
installed
- Cause / Action:
Replace processors with compatible ones if all
processors are to be used.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 483
- Severity: MAJOR
- Event Summary: processor steppings are not equal
- Event Class: System
- Problem Description:
Processors with different steppings are
installed
- Cause / Action:
If desired, replace processors with equal
stepping ones, this is a warning only.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 484
- Severity: MAJOR
- Event Summary: selecting new monarch
- Event Class: System
- Problem Description:
SFW is selecting a new processor due to
compatibility problems.
- Cause / Action:
Replace incompatible processor if desired.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 485
- Severity: FATAL
- Event Summary: monarch not lowest stepping
- Event Class: System
- Problem Description:
The monarch stepping is not equal to the
lowest installed CPU stepping.
- Cause / Action:
Replace the processor with one that has an
equal stepping to the others.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 487
- Severity: MAJOR
- Event Summary: processors are over clocked
- Event Class: System
- Problem Description:
A CPU's FSB frequency is overclocked.
Data: Local CPU Number.
- Cause / Action:
Change FSB frequency.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 488
- Severity: MAJOR
- Event Summary: cpu access error on processor info area
- Event Class: System
- Problem Description:
There was an error reading the info ROM
area of the CPU. Data: Local CPU Number
- Cause / Action:
Cause: An early version of CPU or a bad info
ROM. Action: Replace CPU.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 489
- Severity: MAJOR
- Event Summary: PAL A was not executed - HALT
- Event Class: System
- Problem Description:
PAL_A has not been executed and control is
being transferred back to SAL_B.
- Cause / Action:
No Action.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 490
- Severity: FATAL
- Event Summary: PAL B was not executed - HALT
- Event Class: System
- Problem Description:
PAL_B has not been executed and control is
being transferred back to SAL_B.
- Cause / Action:
No Action.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 491
- Severity: MAJOR
- Event Summary: Prototype CPU installed
- Event Class: System
- Problem Description:
Data: Lower 32 bits have Local CPU Number
- Cause / Action:
Cause: A Prototype CPU is installed. Action:
Replace CPU with a production CPU.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 492
- Severity: MAJOR
- Event Summary: final boot rendezvous monarch watchdog timeout
- Event Class: System
- Problem Description:
Data: Monarch's Local CPU Number
- Cause / Action:
Cause: A watchdog timer has expired and
determined that a monarch is dead. Action: Reboot, if problem persists,
replace CPU.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 493
- Severity: MAJOR
- Event Summary: A multi-bit error was found while reading a XBC CSR
- Event Class: System
- Problem Description:
While reading a XBC CSR, a multi-bit error
was found.
- Cause / Action:
None.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 494
- Severity: MAJOR
- Event Summary: The return value from a function was an unknown
value.
- Event Class: System
- Problem Description:
The return value from a function was an
unknown value. Data field is the unknown status that was returned.
- Cause / Action:
None.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 495
- Severity: MAJOR
- Event Summary: Cannot get system ID status from BMC
- Event Class: System
- Problem Description:
EFI queries the BMC on the system board
for the status of a system ID. The BMC could not complete the request
successfully or on time. Data Field: Internal EFI function status.
- Cause / Action:
Cause: The communication with the system ID is
lost Action: Unplug power from the system for 10 seconds and try rebooting the
system. Cause: Inaccessible FRU EPROM on system board and/or I/O backplane.
Failure in IPMI messaging path on system board and/or I/O backplane Action:
Check FRU EPROM content and accessibility on system and I/O backplane using
ifru. If BMC communication is not working (no answer from BMC), flash BMC
firmware. If it cannot be done or doesn't solve the problem, replace system
board. If system board FRU EPROM cannot be accessed, replace system board If
I/O backplane FRU EPROM cannot be accessed, replace I/O backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 496
- Severity: MAJOR
- Event Summary: Cannot read a system ID
- Event Class: System
- Problem Description:
BMC reported a system ID status as
inaccessible, reported invalid status or cannot return the current value of a
system ID. Data Field: uuid status or internal EFI function status. System ID
status: a 1 byte value 0 extended to 64bits: 0x00 -> primary and secondary
values are valid 0x01 -> primary and secondary values are magic 0x02 ->
primary and secondary values are inaccessible 0x04 -> primary and secondary
values are invalid 0x08 -> primary and secondary values are null (UUID
only) 0x10 -> primary and secondary values are different, value (primary or
secondary) is valid 0x11 -> primary and secondary values are different,
value (primary or secondary) is magic 0x12 -> primary and secondary values
are different, value (primary or secondary) is inaccessible 0x14 -> primary
and secondary values are different, value (primary or secondary) is invalid
0x18 -> primary and secondary values are different, value (primary or
secondary) is null (UUID only)
- Cause / Action:
Cause: BMC failure Action: Unplug power from
the system for 10 seconds and try rebooting the system. Cause:
Inaccessible/corrupted FRU EPROM on system board and/or I/O backplane. Action:
Check content of FRU EPROM of the system board and I/O backplane using ifru.
If FRU EPROM content can be accessed on both board flash BMC firmware. If
content cannot be accessed on system board replace system board. If content
cannot be accessed on I/O backplane, replace I/O backplane If this cannot be
done or doesn't solve the issue replace system board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 497
- Severity: MAJOR
- Event Summary: Failed to write new system ID. BMC reported an error
- Event Class: System
- Problem Description:
Firmware tried to write a primary or
secondary system ID as requested by the user during the boot sequence. The
write failed. Data Field: Internal EFI function status.
- Cause / Action:
Cause: Communication failure with the BMC.
Action: Unplug power from the system for 10 seconds and try rebooting the
system. Cause: Inaccessible/corrupted FRU EPROM on system board and/or I/O
backplane. Inaccessible/corrupted FRU EPROM on system board and/or I/O
backplane. Action: Check content of FRU EPROM of the system board and I/O
backplane using ifru. If FRU EPROM content can be accessed on both board flash
BMC firmware. If content cannot be accessed on system board replace system
board. If content cannot be accessed on I/O backplane, replace I/O backplane
If it cannot be done or doesn't solve the issue replace system board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 498
- Severity: MAJOR
- Event Summary: The system ID(s) currently in the system is invalid
- Event Class: System
- Problem Description:
The system ID(s) currently in the system
is either invalid or, if the EFI_SYSID_BMC_WARNING, EFI_SYSID_BMC_READ_ERROR
or EFI_SYSID_BMC_WRITE_ERROR events are also present, inaccessible to the
system firmware. A stop boot condition will be generated and software license
will probably be invalid. Data Field: uuid: 2 byte value. If preceded by
0xbad00000000000 the following valid values are possible: 0000 -> valid
(should never see his one) 0001 -> magic 0002 -> inaccessible If zero
extended: 1st byte refers to primary UUID, 2nd byte to secondary 00 ->
valid 10 / 01 -> magic 11 / 02 -> inaccessible 12 /
- Cause / Action:
Cause: The system ID(s) is invalid and the user
did not elect to fix the problem. Action: Reboot the system and follow the
prompts to fix the issue. Cause: The system ID(s) cannot be accessed or the
BMC is not providing the requested information. One of the following events
will also be present: EFI_SYSID_BMC_WARNING, EFI_SYSID_BMC_READ_ERROR or
EFI_SYSID_BMC_WRITE_ERROR Action: Fix the error indicated by the other system
ID event.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 499
- Severity: FATAL
- Event Summary: EFI unable to find the SAL services for installing
interrupt handlers
- Event Class: System
- Problem Description:
EFI is unable to find the SAL services for
installing interrupt handlers. EFI was trying to install the run-time handlers
that are required for normal EFI booting. EFI will be halting. Data Field:
internal EFI function status.
- Cause / Action:
Cause: Mismatch between EFI and SAL. Action:
Upgrade system firmware. Cause: Corrupted ESI table. Action: Reboot
system.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 500
- Severity: FATAL
- Event Summary: EFI unable to find the SAL service to install
run-time interrupt handlers
- Event Class: System
- Problem Description:
EFI is unable to find the SAL service to
install run-time interrupt handlers. These handlers are required for normal
EFI booting. EFI will be halting. Data Field: internal EFI function status.
- Cause / Action:
Cause: Mismatch between EFI and SAL. Action:
Upgrade system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 501
- Severity: FATAL
- Event Summary: EFI unable to find the SAL services for installing
interrupt handlers
- Event Class: System
- Problem Description:
EFI is unable to find the SAL services for
installing interrupt handlers. EFI was trying to install the boot-time
handlers that are required for normal EFI booting. EFI will be halting. Data
Field: internal EFI function status.
- Cause / Action:
Cause: Mismatch between EFI and SAL. Action:
Upgrade system firmware. Cause: Corrupted firmware table. Action: Find source
of corruption and reboot.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 502
- Severity: FATAL
- Event Summary: EFI unable to find the SAL service to install
boot-time interrupt handlers
- Event Class: System
- Problem Description:
EFI is unable to find the SAL service to
install boot-time interrupt handlers. These handlers are required for normal
EFI booting. EFI will be halting. Data Field: internal EFI function status.
- Cause / Action:
Cause: Mismatch between EFI and SAL. Action:
Upgrade system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 503
- Severity: MAJOR
- Event Summary: Too many parameters were passed to the utilities
system
- Event Class: System
- Problem Description:
Too many parameters were passed in a
request for the utilities system to perform an operation. No more data is
provided.
- Cause / Action:
This is a firmware error. Contact FW
engineering.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 504
- Severity: CRITICAL
- Event Summary: A crossbar port is unexpectedly not present.
- Event Class: System
- Problem Description:
A crossbar port is expected to be present,
but its presence detect bit is not set. Data field bits 32:43 contain the
crossbar ID, bits 44:55 contain the port number for which the error occurred,
and bits 0:31 contain the port status information.
- Cause / Action:
Cause: An XBC is indicating a port failure
Action: Validate all of the cells connectivity to the PD Check the TOGO chips
seating reset the system replace either cells/system backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 505
- Severity: CRITICAL
- Event Summary: A crossbar port unexpectedly has its HW_LINK_OK bit
not set.
- Event Class: System
- Problem Description:
A crossbar port is expected to have its
HW_LINK_OK bit set, but it is not. Data field bits 32:43 contain the crossbar
ID, bits 44:55 contain the port number for which the error occurred, and bits
0:31 contain the port status information.
- Cause / Action:
Cause: An XBC is indicating a port failure
Action: Validate all of the cells connectivity to the PD Check the TOGO chips
seating reset the system replace either cells/system backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 506
- Severity: CRITICAL
- Event Summary: A connected port was found to be in FE
- Event Class: System
- Problem Description:
A connected crossbar port was found to
have its FE bit set. Data field bits 32:43 contain the crossbar ID, bits 44:55
contain the port number for which the error occurred, and bits 0:31 contain
the port status information.
- Cause / Action:
Cause: An XBC is indicating a port failure
Action: Validate all of the cells connectivity to the PD Check the TOGO chips
seating reset the system replace either cells/system backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 507
- Severity: CRITICAL
- Event Summary: There was an error while initializing the
Concorde-Xbc interface.
- Event Class: System
- Problem Description:
There was an error while initializing the
Concorde-Xbc interface. The data field contains the address of the Concorde
CSR for which the error occurred.
- Cause / Action:
Cause: An XBC is indicating a port failure
Action: Validate all of the cells connectivity to the PD Check the TOGO chips
seating reset the system replace either cells/system backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 508
- Severity: FATAL
- Event Summary: The CC - XBC link failed to initialize.
- Event Class: System
- Problem Description:
The CC - XBC link failed to initialize.
- Cause / Action:
Cause: An XBC is indicating a port failure
Action: Validate all of the cells connectivity to the PD Check the TOGO chips
seating reset the system replace either cells/system backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 509
- Severity: MAJOR
- Event Summary: Unable to determine system mode because EFI/SAL
interface not initialized
- Event Class: System
- Problem Description:
EFI is unable to determine current system
mode. The EFI/SAL interface is not initialized. This interface should have
been initialized before now. This event indicates an internal EFI error. EFI
will continue executing.
- Cause / Action:
Cause: Internal EFI error. Action: Upgrade
system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 510
- Severity: MAJOR
- Event Summary: BMC returned an invalid system mode
- Event Class: System
- Problem Description:
The BMC has returned an invalid system
mode. Data Field: the invalid mode. Expected values are 0 or 1.
- Cause / Action:
Cause: Mismatch between BMC and EFI firmware.
Action: Upgrade system firmware or BMC firmware as necessary.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 511
- Severity: MAJOR
- Event Summary: EFI unable to specify system mode because EFI/SAL
interface not initialized
- Event Class: System
- Problem Description:
EFI is unable to specify a new system
mode. The EFI/SAL interface point is not initialized. This interface should
have been initialized before now. This event indicates an internal EFI error.
EFI will continue executing in the current mode.
- Cause / Action:
Cause: Internal EFI error. Action: Upgrade
system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 512
- Severity: MAJOR
- Event Summary: Unable to enter normal system mode because EFI/SAL
interface not initialized
- Event Class: System
- Problem Description:
EFI is unable to enter normal system mode.
The EFI/SAL interface is not initialized. This interface should have been
initialized before now. This event indicates an internal EFI error. EFI will
continue executing in the current mode.
- Cause / Action:
Cause: Internal EFI error. Action: Upgrade
system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 513
- Severity: FATAL
- Event Summary: Unable to initialize part of the SAL/EFI interface
- Event Class: System
- Problem Description:
EFI is unable to initialize part of the
SAL/EFI interface. This crucial service provides access to certain BMC
functionality such as the security system. EFI will halt. Data Field: Return
status from internal EFI function.
- Cause / Action:
Cause: Incompatible versions of EFI and SAL
Internal EFI error. Action: Upgrade system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 515
- Severity: CRITICAL
- Event Summary: An expected tree node was not found
- Event Class: System
- Problem Description:
A needed tree node was not found. The data
field contains the ASCII name of the tree node that was not found.
- Cause / Action:
This is a bug. Contact engineering.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 516
- Severity: MAJOR
- Event Summary: EFI unable to modify system state to "running"
- Event Class: System
- Problem Description:
- Cause / Action:
Cause: BMC malfunctioning. Action: Reset BMC.
Cause: BMC non functional. Action: Contact your HP representative to check the
BMC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 518
- Severity: MAJOR
- Event Summary: The Get Processor Bus Dependent Configuration
Features PAL call failed.
- Event Class: System
- Problem Description:
Firmware was unable to correctly issue the
Get Processor Bus Dependent Configuration Features command.
- Cause / Action:
Contact engineering. There is a PAL
compatibility problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 520
- Severity: FATAL
- Event Summary: EFI unable to initialize internal library
- Event Class: System
- Problem Description:
EFI is unable to initialize internal
library. This collection of internal services is required for much of EFI's
functionality. EFI is halting.
- Cause / Action:
Cause: Internal EFI error. Action: Upgrade
system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 521
- Severity: CRITICAL
- Event Summary: EFI unable to initialize security system
- Event Class: System
- Problem Description:
EFI is unable to initialize the security
system. The privilege level of the system may or may not be Admin. It is
likely certain EFI facilities will be unavailable. EFI will continue booting
but security may be compromised. Data Field: Return status from internal EFI
function.
- Cause / Action:
Cause: EFI out of resources. Action: Reboot
system. Cause: SAL or EFI mismatch/failure. Action: Upgrade system firmware.
Cause: BMC not responding properly. Action: Reset BMC. Contact your HP
representative to check the BMC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 522
- Severity: MAJOR
- Event Summary: EFI detected invalid internal privilege level
- Event Class: System
- Problem Description:
EFI detected an invalid value for its
internal privilege level. This value is stored within SAL. EFI will continue
but system security may be compromised. Data Field: The invalid privilege
level.
- Cause / Action:
Cause: SAL storage corrupted. Action: Reboot
system. Cause: Invalid argument with EFI. Action: Upgrade system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 523
- Severity: MAJOR
- Event Summary: EFI detected invalid privilege level when setting
password
- Event Class: System
- Problem Description:
EFI detected an invalid privilege level
when setting a BMC password. Only the levels of Admin (0x30) and User (0x20)
are permitted. Data Field: the invalid privilege level.
- Cause / Action:
Cause: Internal EFI error. Action: Upgrade
system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 524
- Severity: FATAL
- Event Summary: EFI MDT table is bad
- Event Class: System
- Problem Description:
SFW has determined that the MDT table is
invalid.
- Cause / Action:
Cause: SFW has determined that the MDT table is
invalid. Action: Reboot if necessary, if problem persists contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 525
- Severity: MAJOR
- Event Summary: Processor has incompatible fixed core ratio
- Event Class: System
- Problem Description:
Data: Local CPU Number.
- Cause / Action:
Cause: A CPU has a different fixed ration than
the FSB frequency set in the chipset. Action: Replace CPU
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 526
- Severity: FATAL
- Event Summary: All processors slated for compatibility
deconfiguration
- Event Class: System
- Problem Description:
Data: A bitmask for which CPUs are slated
to be deconfigured
- Cause / Action:
Cause: The user or SFW has set all CPUs to be
deconfigured. Action: Replace bad processors, if problem persists contact your
HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 527
- Severity: CRITICAL
- Event Summary: An unexpected or invalid value was read from a
crossbar remote route table.
- Event Class: System
- Problem Description:
An error occurred while reading a crossbar
remote route table, or an unexpected/invalid value was read from the table.
The data field consists of the crossbar ID (32:43), the port number of which
the table was read (44:55), and the return status of the read call (0:32).
- Cause / Action:
Cause: An XBC is indicating a port failure
Action: Validate all of the cells connectivity to the PD Check the TOGO chips
seating reset the system replace either cells/system backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 528
- Severity: CRITICAL
- Event Summary: Error reading the PORT[n]_NEIGHBOR_INFO XBC CSR.
- Event Class: System
- Problem Description:
An error occurred while trying to read the
PORT[n]_NEIGHBOR_INFO crossbar CSR. The data field consists of the crossbar ID
(32:43) and port number (44:55) for which the CSR was read.
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 530
- Severity: MAJOR
- Event Summary: Firmware detected excessive errors on the DIMM.
- Event Class: System
- Problem Description:
The DIMM at the physical location given by
the data field had excessive errors and has been marked as "FAILED" by
firmware.
- Cause / Action:
Firmware detected excessive errors on the DIMM
/ Replace the specified DIMM
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 531
- Severity: CRITICAL
- Event Summary: The OE (output enable) bit was not set for a XBC
port.
- Event Class: System
- Problem Description:
A XBC port was expected to be functional,
but its OE bit was not set. The data field consists of the contents of the
port_status CSR (0:31), the XBC number (32:43), and the port number (44:55).
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 532
- Severity: CRITICAL
- Event Summary: An error occurred while trying to read the
PORT_STATUS CSR for a XBC port.
- Event Class: System
- Problem Description:
Unable to read the PORT_STATUS CSR for a
XBC port. The data field consists of the contents of the PORT_STATUS CSR
(0:31), the XBC number (32:43), and the port number (44:55).
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 533
- Severity: CRITICAL
- Event Summary: A XBC port was unexpectedly found to be landmined.
- Event Class: System
- Problem Description:
A XBC port was unexpectedly found to be
landmined. The data field consists of the XBC number (32:43) and the port
number (44:55).
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 535
- Severity: CRITICAL
- Event Summary: The link between the local CC and the local XBC is
unexpectedly not initialized.
- Event Class: System
- Problem Description:
The link between the local CC and the
local XBC is unexpectedly not initialized. The data field is the
XIN_LINK_STATE CC CSR value.
- Cause / Action:
Cause: An error initializing fabric Action: A
previously reported event may provide exact details Reboot, if failure
persists, then either replace the CC chip or the system backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 536
- Severity: CRITICAL
- Event Summary: An invalid XBC number was given.
- Event Class: System
- Problem Description:
A value that was expected to be a XBC
number was found to be an invalid XBC number. The data field is the invalid
XBC number.
- Cause / Action:
A bad value was passed in as a parameter to
fabric traversability functions. No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 537
- Severity: CRITICAL
- Event Summary: An invalid XBC port number was given.
- Event Class: System
- Problem Description:
A value that was expected to be a valid
XBC port number was found to be invalid. The data field is the XBC number
(33:44) and the invalid XBC port number (44:55).
- Cause / Action:
A bad value was passed in as a parameter to
fabric traversability functions. No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 539
- Severity: CRITICAL
- Event Summary: An unexpected neighbor type was read from a XBC
PORT_NEIGHBOR_INFO CSR.
- Event Class: System
- Problem Description:
A neighbor type read from a XBC
PORT_NEIGHBOR_INFO CSR was different than the expected neighbor type. The data
field contains the expected type (32:63) and the actual neighbor type (0:31).
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 540
- Severity: CRITICAL
- Event Summary: A given XBC port is not a valid XBC-CC port.
- Event Class: System
- Problem Description:
A XBC port number was unexpectedly found
to not be a valid XBC-CC port. The data field consists of the XBC number
(32:43) and the port number (44:55).
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 541
- Severity: CRITICAL
- Event Summary: A XBC port was unexpectedly found to be an invalid
XBC-XBC port.
- Event Class: System
- Problem Description:
A XBC port was unexpectedly found to be an
invalid XBC-XBC port. The data field consists of the XBC number (32:43) and
the port number (44:55).
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 542
- Severity: CRITICAL
- Event Summary: The XBC neighbor chip number does not match the
expected value for this topology
- Event Class: System
- Problem Description:
The XBC neighbor chip number does not
match the expected value for this topology. The data field contains the
expected neighbor chip number (32:63) and the actual neighbor chip number
(0:31).
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 543
- Severity: CRITICAL
- Event Summary: The XBC neighbor port number does not match the
expected value for this topology
- Event Class: System
- Problem Description:
The XBC neighbor port number does not
match the expected value for this topology. The data field contains the
expected neighbor port number (32:63) and the actual neighbor port number
(0:31).
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 544
- Severity: FATAL
- Event Summary: Write through to BMC token failed
- Event Class: System
- Problem Description:
Data: Upper 32 bits, BMC failure return
value. This is a stop boot condition. Lower 32 bits, BMC token number that
failed.
- Cause / Action:
Cause: Problem accessing the BMC. Action: Reset
BMC or reboot, if problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 546
- Severity: CRITICAL
- Event Summary: Duplicate CPU Ids were detected within a cell.
- Event Class: System
- Problem Description:
2 CPUs think that they have the same ID
within the cell. Typically this would mean that PAL reported the same cpu id
for more than 1 cpu on a bus. The cpuid is in the data field.
- Cause / Action:
Most likely cause is a bad cpu module
connection on the cell board. Replace the cell board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 547
- Severity: MAJOR
- Event Summary: OS crashdump started (D700)
- Event Class: System
- Problem Description:
OS crashdump started (D700)
- Cause / Action:
panic occurred
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 548
- Severity: CRITICAL
- Event Summary: OS legacy PA hex fault code (Bxxx)
- Event Class: System
- Problem Description:
OS legacy PA hex fault code (Bxxx).
Possible I/O error or system panic
- Cause / Action:
fault/panic
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 549
- Severity: MAJOR
- Event Summary: OS dump status (EFxx)
- Event Class: System
- Problem Description:
OS dump status (EFxx). Report on the
success/failure of the writing of the dump. EF00 = success (followed by either
EF0A = successful dump with sync, or EF09 = successful dump without sync),
EFFF = a general error, EFFE = dump path assertion failure, EFFD = no dump was
taken by default, choice or failure, EFFC = dump was aborted by user.
- Cause / Action:
panic path: attempt to write out the dump is
complete
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 550
- Severity: MAJOR
- Event Summary: Setting processor response timeout failed
- Event Class: System
- Problem Description:
SFW has failed to set the processor
timeout value via a PAL call. Data: PAL call return value.
- Cause / Action:
Cause: A PAL call made by SFW has failed.
Action: Reboot if necessary, if problem persists contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 551
- Severity: MAJOR
- Event Summary: Unable to validate blank password during EFI
security initialization
- Event Class: System
- Problem Description:
During EFI security initialization, the
attempt to determine what privilege level a blank password provides, failed.
Most likely this indicates the BMC has failed. EFI assumes that the BMC has
failed and will attempt to continue booting. Some EFI functionality may be
unavailable. Data Field: Return status from internal EFI function.
- Cause / Action:
Cause: SAL failed. Action: Reset the system.
Upgrade system firmware. Cause: BMC failed. Action: Reset the BMC. Contact
your HP representative to check the BMC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 552
- Severity: MAJOR
- Event Summary: Unable to enter Guest mode during EFI security
initialization
- Event Class: System
- Problem Description:
As part of normal security initialization,
EFI attempted to issue a close session to the BMC (I.e. force the BMC to GUEST
mode). This attempt failed. EFI is unable to initialize the security system.
EFI will continue but security may be compromised. Data Field: Return status
from internal EFI function.
- Cause / Action:
Cause: SAL failure. Action: Reset the system.
Upgrade system firmware. Cause: BMC failure. Action: Reset the BMC. Contact
your HP representative to check the BMC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 553
- Severity: MAJOR
- Event Summary: Unable to increase privilege during EFI security
initialization
- Event Class: System
- Problem Description:
As part of normal security initialization,
EFI attempted to issue an open session to the BMC in order to raise the
privilege level to the highest permitted by a blank password. This attempt
failed. EFI is unable to initialize the security system. Data Field: Return
status from internal EFI function.
- Cause / Action:
Cause: SAL failure. Action: Reset the system.
Upgrade system firmware. Cause: BMC failure. Action: Reset the BMC. Contact
your HP representative concerning the BMC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 554
- Severity: MAJOR
- Event Summary: EFI unable to write privilege level during security
initialization
- Event Class: System
- Problem Description:
As part of normal security initialization,
EFI attempted to record the current privilege level. This attempt failed. EFI
is unable to initialize the security system. Data Field: Return status from
internal EFI function.
- Cause / Action:
Cause: SAL failure. Action: Reboot the system.
Upgrade system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 555
- Severity: MAJOR
- Event Summary: EFI was denied permission to write the privilege
level during security init
- Event Class: System
- Problem Description:
As part of normal security initialization,
EFI attempted to record the current privilege level. This attempt failed with
a privilege violation error. EFI is unable to initialize the security system.
Data Field: Return status from internal EFI function.
- Cause / Action:
Cause: SAL is not in ADMIN or USER mode.
Action: Reboot the system. Upgrade system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 556
- Severity: MAJOR
- Event Summary: OS dump, error writing image area to disk (E055)
- Event Class: System
- Problem Description:
OS dump, error writing image area to disk
(E055)
- Cause / Action:
panic path forward progress
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 557
- Severity: CRITICAL
- Event Summary: It stands for diagnosis of catastrophic errors in
the PIN block of concorde.
- Event Class: System
- Problem Description:
This indicates that catastrophic errors
have been found in the PIN block of the concorde. The cell needs to be reset/
halt.
- Cause / Action:
This means that the cell will be reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 559
- Severity: CRITICAL
- Event Summary: This indicates that the cell missed the rendezvous
at the partition level.
- Event Class: System
- Problem Description:
This indicates that the cell is too late
for the PD level rendezvous. And hence it will not join the other PD cells.
- Cause / Action:
The cell will independently step through some
of the error logging steps and then finally reset itself.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 560
- Severity: CRITICAL
- Event Summary: This means that the PD monarch timed out.
- Event Class: System
- Problem Description:
This indicates the state where the PD
monarch was not able to complete the task within a certain time. It failed.
- Cause / Action:
The cell will be reset ; also the partition
will be reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 563
- Severity: CRITICAL
- Event Summary: This indicates the failure in collecting the Complex
profile info.
- Event Class: System
- Problem Description:
This chassis code reports the failure in
collecting the ICM parameters needed for the cell interleaving.
- Cause / Action:
The partition level memory interleaving cannot
continue without the appropriate information.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 564
- Severity: CRITICAL
- Event Summary: This chassis code indicates the failure in
collecting the cell info.
- Event Class: System
- Problem Description:
This chassis code indicates that the cell
interleaving routine could not get the information on the cell memory.
- Cause / Action:
The partition level memory will fail.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 565
- Severity: CRITICAL
- Event Summary: This indicates the failure in updating the GNI info
of the cell with CLM.
- Event Class: System
- Problem Description:
This chassis code is used to represent the
failure in updating the GNI information of the cell with the CLM ( cell local
memory) information obtained from the Complex Profile.
- Cause / Action:
The partition level memory will fail at this
point.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 566
- Severity: CRITICAL
- Event Summary: This indicates the failure in adjusting the mem info
with Minimum ZI req.
- Event Class: System
- Problem Description:
This represents the failure in adjusting
the memory information with the minimum ZI requirements.
- Cause / Action:
This will cause the partition level memory to
exit cell interleaving.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 570
- Severity: FATAL
- Event Summary: Internal Firmware Programming Error from the EFI
portion of the firmware
- Event Class: System
- Problem Description:
An internal EFI firmware error was
encountered. This is usually caused by a bad parameter passed to a function,
corrupt memory, corrupt malloc, corrupt firmware tree or something similar.
The data field contains the IP address of the function that encountered the
error.
- Cause / Action:
Report the IPF to the firmware team. Reset the
system. This cannot be worked around in the field.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 573
- Severity: MAJOR
- Event Summary: Could not obtain the crossbar port semaphore
- Event Class: System
- Problem Description:
Tried to obtain the port semaphore but
GetPortSemaphore returned an ERROR. Could be a failed write to the port
semaphore crossbar CSR or another cell owned the semaphore. Data field bits
32:63 contain the crossbar ID and bits 0:31 contain the port number for which
the semaphore was being obtained.
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 574
- Severity: MAJOR
- Event Summary: Could not release the crossbar port semaphore.
- Event Class: System
- Problem Description:
Currently owned the port semaphore but
could not release the semaphore. Data field bits 32:63 contain the crossbar ID
and bits 0:32 contain the port number.
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 576
- Severity: FATAL
- Event Summary: BMC token upload failure
- Event Class: System
- Problem Description:
There was an error reading from the BMC
token when attempting to write to SAL NVM. This is a stop boot condition.
Data: BMC Token Number.
- Cause / Action:
Cause: A read from the BMC failed. Action: AC
power cycle if necessary, if problem persists contact your HP representative
for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 577
- Severity: MAJOR
- Event Summary: NVM token access failure
- Event Class: System
- Problem Description:
The read from SAL NVM has failed. This is
a stop boot condition. Data: The token number on which the write failed
- Cause / Action:
Cause: NVM Error, or incorrect permissions to
read token. Action: Retry, AC power cycle if necessary, if problem persists
contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 578
- Severity: FATAL
- Event Summary: BMC token download failure
- Event Class: System
- Problem Description:
There was an error when trying to write to
the BMC Tokens. This is a stop boot condition Data: lower 32 bits are BMC
token number, upper 32 bits is the status return from the BMC.
- Cause / Action:
Cause: BMC Error. Action: AC power cycle if
necessary, if problem persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 579
- Severity: FATAL
- Event Summary: Error Writing BMC first boot token
- Event Class: System
- Problem Description:
There has been an error writing the
BMC_FIRST_BOOT token. This is a stop boot condition.
- Cause / Action:
Cause: BMC Error. Action: AC power cycle if
necessary, if problem persists contact your HP representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 580
- Severity: MAJOR
- Event Summary: Fru Id read error
- Event Class: System
- Problem Description:
The read of the motherboard FRU has
failed. Data: Device ID of device that failed the FRU read.
- Cause / Action:
Cause: Error reading the motherboard FRU.
Action: Reboot if necessary, if problem persists contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 581
- Severity: MAJOR
- Event Summary: FRU Id checksum error
- Event Class: System
- Problem Description:
The read of the motherboard FRU has failed
a checksum. Data: Device ID of device that failed the FRU read.
- Cause / Action:
Cause: Error reading the motherboard FRU.
Action: Reboot if necessary, if problem persists contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 582
- Severity: MAJOR
- Event Summary: FRU Id version error
- Event Class: System
- Problem Description:
The read of the motherboard FRU has failed
due to a version problem. Data: Device ID of device that failed the FRU read.
- Cause / Action:
Cause: Error reading the motherboard FRU.
Action: Reboot if necessary, if problem persists contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 583
- Severity: MAJOR
- Event Summary: Rom revision not equal to FIT revision
- Event Class: System
- Problem Description:
A ROM Rev and FIT Rev do not match. Data:
Code for what didn't match: 0x1 = PAL_A, 0x2 = PAL_B, 0x4 = SAL_A, 0x8 = ACPI,
0xA = EFI
- Cause / Action:
Cause: A ROM Rev and FIT Rev do not match.
Action: Update ROM, , if problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 584
- Severity: MAJOR
- Event Summary: ROM revision not equal to Rev block
- Event Class: System
- Problem Description:
A ROM Rev and Rev Block do not match.
Data: Code for what didn't match: 0x3 = PAL, 0x5 = SAL_A, 0x7 = SAL_B, 0x9 =
ACPI, 0xB = EFI, 0xC = BMC
- Cause / Action:
Cause: A ROM Rev and Rev Block do not match.
Action: Update ROM, , if problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 585
- Severity: MAJOR
- Event Summary: Primary Fit bad
- Event Class: System
- Problem Description:
The FIT is bad.
- Cause / Action:
Cause: The FIT is bad. Action: Reboot, update
ROM if necessary, if problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 586
- Severity: MAJOR
- Event Summary: Secondary Fit bad
- Event Class: System
- Problem Description:
The FIT is bad.
- Cause / Action:
Cause: The FIT is bad. Action: Reboot, update
ROM if necessary, if problem persists contact your HP representative for
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 587
- Severity: MAJOR
- Event Summary: PAL A execution rom warning
- Event Class: System
- Problem Description:
PAL_A_ROM has generated a warning.
- Cause / Action:
Cause: PAL_A_ROM has generated a warning.
Action: Reboot, update ROM if necessary, if problem persists contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 588
- Severity: MAJOR
- Event Summary: PAL B execution ROM warning
- Event Class: System
- Problem Description:
PAL_B_ROM has generated a warning.
- Cause / Action:
Cause: PAL_B_ROM has generated a warning.
Action: Reboot, update ROM if necessary, if problem persists contact your HP
representative for support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 589
- Severity: CRITICAL
- Event Summary: An error was encountered when firmware tried to
update the Group B Profile
- Event Class: System
- Problem Description:
Firmware tried to default the Dynamic
(Group B) complex profile and encountered an error.
- Cause / Action:
Manageability may be unavailable to update the
profiles. Check the connections are reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 590
- Severity: CRITICAL
- Event Summary: A DIMM loading order error has occurred
- Event Class: System
- Problem Description:
The loading order of the DIMMs is
incorrect. The cell is halted.
- Cause / Action:
Cause: Incorrect loading of the DIMMs on the
cell Action: Install the DIMMs in the correct order. DIMMs are installed in
ranks of DIMMs , starting with DIMM 0A, 0B, etc. Subsequent ranks are loaded
in ascending order , i.e., rank 1, 2, 3, 4, 5, 6 and 7.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 591
- Severity: MAJOR
- Event Summary: Refresh Control Error Timeout
- Event Class: System
- Problem Description:
Timeout Waiting for SDRAM parts to become
ready - mem_status[0] Refresh Control Register
- Cause / Action:
Cause: At start of memory refresh, timing out
waiting for ready bit to be set Action: Contact HP Support personnel to
troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 592
- Severity: MAJOR
- Event Summary: memory extender/baseboard FRU mismatch
- Event Class: System
- Problem Description:
The version of Memory extender installed
in the system has not been qualified to work with the version of the baseboard
installed in the system.
- Cause / Action:
Cause: Memory extender and baseboard are
incompatible Action: Contact HP support to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 593
- Severity: FATAL
- Event Summary: Fabric topology mismatch with XBCs in complex
- Event Class: System
- Problem Description:
There is a fabric topology mismatch with
the XBCs in the complex. Data Field: (Topology of XBC << 32) | Topology
of destination XBC 0x00 Topology not yet determined 0x30 Domelight 0x40 U-Turn
(Left cabinet) 0x41 U-Turn (Right cabinet) 0x42 Cross-Flex 0x43 U-Turn
- Cause / Action:
There is a fabric topology mismatch with XBC in
complex.
Contact HP Support personnel to analyze the cell, XBC flex cables,
system backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 594
- Severity: CRITICAL
- Event Summary: An invalid XBC to XBC port was found.
- Event Class: System
- Problem Description:
While routing the XBC to XBC ports, an
invalid port was encountered. The data field is the crossbar number (32:43)
and the port number (44:55).
- Cause / Action:
Cause: Loss of Lockstep Action: Reset
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 595
- Severity: MAJOR
- Event Summary: Could not get neighbor information.
- Event Class: System
- Problem Description:
The XBC could not get neighbor
information. Data Field: XBC # << 32 | internal port attempting to
access neighbor
- Cause / Action:
Cause: Defective XBC link Defective XBC Action:
Check XBC link connections Reset the system backplane Contact HP Support
personnel to troubleshoot problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 596
- Severity: CRITICAL
- Event Summary: The XBC's routing state was marked as in ERROR
- Event Class: System
- Problem Description:
For the XBC being routed, routing has
already been attempted, but an error occurred. Inspect chassis codes from
other cells for more details regarding the nature of the problem. The data
field consists of the XBC number (32:63)
- Cause / Action:
Another cell already attempted routing for the
XBC and found an error. Action: Check for hardware failure: flex cables,
crossbar chip, etc.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 597
- Severity: MAJOR
- Event Summary: It indicates that there is no NVM error space left
for logging an Error Event.
- Event Class: System
- Problem Description:
This means that the error event log cannot
be logged to the persistent storage. The data field gives the event type that
was supposed to be logged.
- Cause / Action:
The error event will not be logged.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 598
- Severity: CRITICAL
- Event Summary: An XBC port found to have an unexpected error.
- Event Class: System
- Problem Description:
An XBC port was found to have an
unexpected error. The data field consists of the crossbar number (32:63) and
the current port errors (0:31)
- Cause / Action:
Cause: A port was landmined so it had to be
routed around. Action: Check flex cables
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 599
- Severity: CRITICAL
- Event Summary: A XBC port route around has occurred
- Event Class: System
- Problem Description:
During fabric routing a port on a XBC was
found in error or had been previously marked as in error. PDC will route
around this XBC port. Data Field: XBC number (32:63) and external XBC port
number (0:31)
- Cause / Action:
Cause: During routing, when a XBC to XBC port
is found to be in error, or was previously marked in error, it is routed
around. This chassis code indicates that which XBC port was routed around.
Action: Reset the system backplane to clear the error If the suspect XBC port
uses a flex cable, check / replace the flex cable and then the system
backplane(s) involved. If the suspect XBC port uses the hardwire link built
into the system backplane, replace the system backplane involved.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 600
- Severity: MAJOR
- Event Summary: During routing a crossbar is found to be in an
unexpected routing state.
- Event Class: System
- Problem Description:
Data field: the unexpected forward
progress state (0:31) XBC number (32:44) Cell number (56:63)
- Cause / Action:
Cause: An XBC is indicating a port failure
Action: Validate all of the cells connectivity to the PD Check the TOGO chips
seating reset the system replace either cells/system backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 601
- Severity: MAJOR
- Event Summary: An unexpected XBC forward progress state was
continually found until timing out.
- Event Class: System
- Problem Description:
A crossbar was found to be in an
unexpected forward progress state during fabric routing. This crossbar stayed
in the unexpected state until Fabric Discovery timed out. Data field:
unexpected forward progress (0:31) XBC number (32:44) Cell number (56:63)
- Cause / Action:
Cause: An XBC is indicating a port failure
Action: Validate all of the cells connectivity to the PD Check the TOGO chips
seating reset the system replace either cells/system backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 602
- Severity: FATAL
- Event Summary: During remote routing, the current port's neighbor
is not healthy.
- Event Class: System
- Problem Description:
An XBC port was found that is not healthy.
This indicates at least one of the following about the port: - Hardware link
is not okay - Presence detect is false - Fatal error detected - SBE detected -
LPE detected - Port landmined The data field of the chassis code indicates
which port is unhealthy, as well as the fabric routing state before the
problem was encountered.
- Cause / Action:
An XBC port is not healthy. Action: Check for
hardware failure: flex cables, crossbar chips, etc.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 603
- Severity: FATAL
- Event Summary: The CC to XBC link is not viable.
- Event Class: System
- Problem Description:
The CC to XBC link is not viable.
- Cause / Action:
Cause: The CC to XBC link is not operational.
Action: Reset the cell Reset the system backplane Contact HP Support personnel
to troubleshoot problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 604
- Severity: FATAL
- Event Summary: Remote routing a crossbar failed.
- Event Class: System
- Problem Description:
There was a problem performing remote
routing on the local XBC. Chassis codes sent before this one may provide more
details about the exact nature of the problem. The data field consists of the
XBC number that failed routing (32:63)
- Cause / Action:
A failure was encountered while performing
remote routing on an XBC, most likely due to a problem with the system
backplane or local cell. Action: Check for hardware failure: CC, XBC to CC
link, flex cables, crossbar chip, etc.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 605
- Severity: FATAL
- Event Summary: Too many XBC-to-XBC were broken in the complex.
- Event Class: System
- Problem Description:
Two or more XBC-XBC links were found to be
broken. The data field is the XBC number (32:63) and a bit map of the ports
broken (0:31)
- Cause / Action:
Port status indicated that two or more ports on
a XBC had errors. Action: Check for hardware failure: flex cables, crossbar
chip, etc.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 606
- Severity: CRITICAL
- Event Summary: This cell did not get the XBC Global Semaphore.
- Event Class: System
- Problem Description:
After unlocking the XBC Global Semaphore
for a takeover, this cell did not get the semaphore.
- Cause / Action:
C1: Another cell won the race and got the
semaphore before this cell. This would be apparent in chassis codes. A1: None.
C2: XBC write or read failure. A2: check XBC, check link, check CC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 607
- Severity: FATAL
- Event Summary: Attempted an XBC SM4 takeover and timed out trying
to unlock the SM4.
- Event Class: System
- Problem Description:
When a cell holds an XBC semaphore for an
extended period of time, fabric will attempt to takeover the semaphore so that
the rest of the cells will have access to it. Fabric will attempt to take the
SM4 for a period of time. If it is unable to unlock the SM4 within the timeout
period, it will send this chassis code and halt the cell. Data field: XBC
number (32:63) and current owner (cell) of the semaphore (0:31)
- Cause / Action:
Cannot takeover an XBC semaphore that has been
held for a long time. Try forcing firmware to reroute the fabric by cycling
48V power on the cabinets. Look for other fabric chassis codes that explain
why the current owner of the SM4 was unable to release it. Look for fabric
problems on the backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 608
- Severity: FATAL
- Event Summary: Waiting for the XBC Global Semaphore has timed out.
- Event Class: System
- Problem Description:
During Fabric Discovery, the cell will
wait until it gets the XBC's Global Semaphore. It waits for a very long time.
This chassis code indicates that the wait has timed out.
- Cause / Action:
XBC Key Contention. Hardware Failure Action:
Look for other chassis codes that indicate XBC Key contention Check XBC Check
Links/Flex Cables
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 609
- Severity: FATAL
- Event Summary: A timeout occurred while attempting to release the
XBC semaphore.
- Event Class: System
- Problem Description:
The XBC Release Semaphore timeout is
designed to fail last. The semaphore could not be released. Any other cell
(even outside the PD) may be blocked because the XBC is a global resource.
Data field: current semaphore owner (0:31) XBC number (32:43) port number
(44:55) cell number (56:63)
- Cause / Action:
XBC Key Contention. Hardware Failure Action:
Look for additional chassis codes that would explain the failure Check XBC
Check Link/Flex Cables
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 610
- Severity: MAJOR
- Event Summary: Management Processor Firmware Battery Failure or
NVRAM change
- Event Class: System
- Problem Description:
Management Processor Firmware detected
improper data in NVRAM (bad checksums.) Either the NVRAM layout changed, or
the Management Processor Battery may not be maintaining the data through A/C
power cycles.
- Cause / Action:
Determine if the firmware was recently
upgraded. This is often the reason for the NVRAM to change. If not, and the
A/C power has been removed, than it's possible the battery is indeed going bad
and would need to be replaced.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 611
- Severity: MAJOR
- Event Summary: Management Processor Firmware Software Error
- Event Class: System
- Problem Description:
Management Processor Firmware detected a
software error and is logging an event. The data represents data associated
with the error seen.
- Cause / Action:
A software error was detected and is being
logged. The internal data is connected to the location and module where the
error occurred. The Forward Progress Log will receive additional (lower alert
level) event entries with more data associated with this event.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 612
- Severity: MAJOR
- Event Summary: Management Processor detected an I2C Communication
Error with BMC.
- Event Class: System
- Problem Description:
An I2C Communication failure with the
Baseboard Management Controller was detected. Without I2C communication, the
system cannot be powered on/off or reset.
- Cause / Action:
An I2C Communication failure with the Baseboard
Management Controller was detected. Without I2C communication, the system
cannot be powered on/off or reset. Check the I2C communication via the 'SR'
command or the 'PS' command. If it is indeed down, look for hardware reasons.
It's possible resetting the Management Processor firmware ("XD" command option
'r') or completely cycling AC power of the system will restore the
communication.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 613
- Severity: CRITICAL
- Event Summary: A CRC error was discovered when verifying the ROM
- Event Class: System
- Problem Description:
A stored CRC value did not match the
calculated CRC value for the specified address.
- Cause / Action:
Either the ROM was programmed incorrectly or
has gone bad. Reprogram the Flash on the cell board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 614
- Severity: MAJOR
- Event Summary: An error was encountered when executing a PAL_PROC
- Event Class: System
- Problem Description:
An error was encountered when executing a
PAL_PROC. This code will be emitted in pairs. The Proc INDEX will be in the
data of the first chassis code. The status is in the second data field.
- Cause / Action:
PAL was unable to be successfully called. See
other event ids to determine if action needs to be taken.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 615
- Severity: FATAL
- Event Summary: CPUs (and or termination) loaded in wrong order
- Event Class: System
- Problem Description:
CPUs not loaded in correct order. Correct
loading order is CPU 0, 1, 2, 3.
- Cause / Action:
Cause: CPUs not loaded in correct order.
Action: Load CPUs in order 0, 1, 2, 3.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 616
- Severity: CRITICAL
- Event Summary: Error Reading a platform storage variable from the
PDHC/MP
- Event Class: System
- Problem Description:
System firmware was unable to complete a
platform storage read command from the utilities system. The exact status
printed in the data field.
- Cause / Action:
Either the MP is not present, or the requested
information does not exist. Ensure that the MP is functioning and that the
proper data is being requested.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 617
- Severity: CRITICAL
- Event Summary: An error was returned on a Platform Storage Write
Command to the PDHC/MP
- Event Class: System
- Problem Description:
System firmware was unable to complete a
platform storage write command. The actual status is returned in the data
field.
- Cause / Action:
The MP is not present, may be out of space, or
the command was badly formatted. Ensure that the MP has enough space and try
again. If the problem persists, contact engineering.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 618
- Severity: CRITICAL
- Event Summary: The Sequencer was unable to find/use a needed tree
node
- Event Class: System
- Problem Description:
The Sequencer was unable to find the tree
node it needed to complete an operation. The tree node is in the ascii in the
data field.
- Cause / Action:
This is a bug, contact engineering
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 619
- Severity: CRITICAL
- Event Summary: Firmware encountered an error in processing the
partition variables
- Event Class: System
- Problem Description:
System firmware attempted to read a
partition variable from the GSP and store it in options. An error was
encountered during this process. The data field contains the partition
variable element ID that was being processed.
- Cause / Action:
Either the GSP was not present or there was a
resource problem storing the variable. There should be other clues in the
event id log to indicate which is the case. Restore the GSP.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 620
- Severity: CRITICAL
- Event Summary: A non-FATAL cell power fault has occurred
- Event Class: System
- Problem Description:
One or more power converter on the Cell or
Cell Power Board has reported a fault. However, because of redundancy in the
power system, the power to the Cell is still good. The data field contains
detailed power fault location information (see Cell ERS for more information).
Data Byte[0]: bit0 - Power_Fault status, bit1 - Power_Good status Data
Byte[1]: Contents of Power Board Converter Status register. Data Byte[2]:
Contents of Cell Converter Status register. Data Byte[3]: Contents of CPU
Module Power Status register.
- Cause / Action:
Cause(1): A power converter has failed.
Cause(2): A CPU Power Module has been disabled following a thermal warning
reported by that CPU Module.
Action: Contact HP Support personnel to
troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 622
- Severity: MAJOR
- Event Summary: Firmware was unable to determine the Processor
Dependent Features
- Event Class: System
- Problem Description:
System firmware was unable to successfully
issue the PAL_GET_PROC_FEATURES PAL proc. The data field is unused
- Cause / Action:
Contact Engineering, This is a bug.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 624
- Severity: CRITICAL
- Event Summary: The CLU has encountered an undEFIned case
- Event Class: System
- Problem Description:
The CLU has encountered an undEFIned case
in its control flow.
- Cause / Action:
Cause: CLU firmware on the UGUY has gotten into
an unexpected execution path, most likely due to a hardware issue on the UGUY.
Action: Check revision of CLU firmware. If out of date, or known bad revision,
use FWUU to update CLU firmware. Contact HP Support personnel to troubleshoot
problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 625
- Severity: MAJOR
- Event Summary: An unknown Cell voltage margin has been detected.
- Event Class: System
- Problem Description:
The Cell voltage margin settings do not
match the Normal, +5%, or -5% values.
- Cause / Action:
Cause: A user has manually, using back-door
debugging methods, altered the voltage margin setting of one or more Cell
Board or Cell Power Board converters.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 626
- Severity: MAJOR
- Event Summary: The run-time verification of a programming
assumption has failed.
- Event Class: System
- Problem Description:
For debug purposes, many assumptions made
by the PDHC developer(s) are checked at run-time. If this event log is seen,
it will either indicate that the hardware is in a unknown state that is not
handled by the PDHC, or that a programming bug has been found. For developer
debug purposes, the data field describes where in the code that the error was
detected. Data Bytes[0-1]: The line number within the source code file where
the error was detected. Data Bytes[2-7]: The first 6 characters of the source
code file name.
- Cause / Action:
Cause: Hardware in unknown state, or
programming bug found.
Action: Upgrade PDHC firmware to latest revision. If
already at current revision, contact HP Support personnel to troubleshoot the
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 627
- Severity: MAJOR
- Event Summary: An unknown error has been detected by the PDHC
firmware.
- Event Class: System
- Problem Description:
An unknown error has been detected by the
PDHC firmware. For developer debug purposes, the data field describes where in
the code that the error was detected. Data Bytes[0-1]: The line number within
the source code file where the error was detected. Data Bytes[2-7]: The first
6 characters of the source code file name.
- Cause / Action:
Cause: Hardware in unknown state, or
programming bug found.
Action: Upgrade PDHC firmware to latest revision. If
already at current revision, contact HP support personnel to troubleshoot the
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 628
- Severity: MAJOR
- Event Summary: An attempt to write to a device on the PDHCs I2C bus
has failed.
- Event Class: System
- Problem Description:
An attempt to write to a device on the
PDHC's I2C bus has failed. The devices on the I2C bus are the Cell's FRU
EEPROM, the Cell Power Board's FRU EEPROM, the voltage margining D-to-A
converters, and, if they are accessible, the CPU Module Power Pods' FRU
EEPROMs. The Data field information contains information that can identify the
exact device that has failed. Refer to the Cell ERS for a mapping of I2C
device addresses to devices. Data Bytes[0-1]: Reserved Data Bytes[2-3]: I2C
Device Address Data Bytes[4-5]: Starting Word Address Data Bytes[6-7]: Size of
attempted access (in bytes).
- Cause / Action:
Cause: A hardware fault has
occurred.
Action: Contact HP Support personnel to troubleshoot the Cell
Board, Cell Power Board, and/or PDH Daughtercard.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 629
- Severity: MAJOR
- Event Summary: An attempt to read from a device on the PDHC's I2C
bus has failed.
- Event Class: System
- Problem Description:
An attempt to read from a device on the
PDHC's I2C bus has failed. The devices on the I2C bus are the Cell's FRU
EEPROM, the Cell Power Board's FRU EEPROM, the voltage margining D-to-A
converters, and, if they are accessible, the CPU Module Power Pods' FRU
EEPROMs. The Data field information contains information that can identify the
exact device that has failed. Refer to the Cell ERS for a mapping of I2C
device addresses to devices. Data Bytes[0-1]: Reserved Data Bytes[2-3]: I2C
Device Address Data Bytes[4-5]: Starting Word Address Data Bytes[6-7]: Size of
attempted access (in bytes).
- Cause / Action:
Cause: A hardware fault has
occurred.
Action: Contact HP Support personnel to troubleshoot the Cell
Board, Cell Power Board, and/or PDH Daughtercard.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 630
- Severity: MAJOR
- Event Summary: An attempt to write to a device on the PDHC's SM bus
has failed.
- Event Class: System
- Problem Description:
An attempt to write to a device on the
PDHC's SM bus has failed. The devices on the SM bus are the CPU modules' FRU
EEPROMs, the CPU modules' Processor Information ROMs, and the CPU modules'
thermal sensors. The Data field information contains information that can
identify the exact device that has failed. Refer to the Cell ERS for a mapping
of SM Bus device addresses to devices. Data Bytes[0-1]: Reserved Data
Bytes[2-3]: SM bus Device Address Data Bytes[4-5]: Starting Word Address Data
Bytes[6-7]: Size of attempted access (in bytes).
- Cause / Action:
Cause: A hardware fault has
occurred.
Action: Contact HP Support personnel to troubleshoot the Cell
Board, Cell Power Board, and/or PDH Daughtercard.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 631
- Severity: MAJOR
- Event Summary: An attempt to read from a device on the PDHC's SM
bus has failed.
- Event Class: System
- Problem Description:
An attempt to read from a device on the
PDHC's SM bus has failed. The devices on the SM bus are the CPU modules' FRU
EEPROMs, the CPU modules' Processor Information ROMs, and the CPU modules'
thermal sensors. The Data field information contains information that can
identify the exact device that has failed. Refer to the Cell ERS for a mapping
of SM Bus device addresses to devices. Data Bytes[0-1]: Reserved Data
Bytes[2-3]: SM bus Device Address Data Bytes[4-5]: Starting Word Address Data
Bytes[6-7]: Size of attempted access (in bytes).
- Cause / Action:
Cause: A hardware fault has
occurred.
Action: Contact HP Support personnel to troubleshoot the Cell
Board, Cell Power Board, and PDH Daughtercard.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 632
- Severity: CRITICAL
- Event Summary: Cell boot has been disabled due to a failure setting
the frequency registers.
- Event Class: System
- Problem Description:
The PDHC did not read valid frequency
information from the CPU modules' or Cell's FRU EEPROMs, or the frequency
registers would not update properly. Following this event, the Cell will not
boot until the problem is corrected and Cell Power has been turned off, then
on again, using the PE command.
- Cause / Action:
Cause(1, probable): Invalid data programmed in
the Cell's FRU EEPROM or a CPU module's Scratch/FRU EEPROM. Action (1): If in
manufacturing, program correct data in partition specific field of the Cell or
CPU Module's FRU EEPROM. Otherwise, contact HP support personnel to
troubleshoot the problem. Cause(2): A hardware fault has occurred. Action(2):
Contact HP Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 633
- Severity: MAJOR
- Event Summary: An error has occurred while updating System FW.
- Event Class: System
- Problem Description:
An error has occurred while updating
System FW. More details about the update failure may be available as displayed
by the Firmware Update Utility (FWUU).
- Cause / Action:
Cause(1): Obsolete version of FWUU. Action(1):
If you are not using the latest revision of FWUU, obtain and use the latest
version of FWUU to retry the update. Cause(2): MP firmware not at a revision
that supports the current version of PDHC FW or System FW. Action(2): If MP is
not at a compatible revision, update the MP firmware to a compatible revision
and repeat the firmware update. Cause(3): Other error indicated by FWUU.
Action(3): Exit from FWUU, reset the MP using the XD command, then attempt to
update System FW. If repeated attempts to update the System FW fail, contact HP
support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 634
- Severity: MAJOR
- Event Summary: The PDHC firmware was reset for some unknown reason.
- Event Class: System
- Problem Description:
The PDHC firmware was reset for some
unknown reason.
- Cause / Action:
Cause(1): System FW has reset the PDHC because
it suspects the PDHC of corrupting shared memory. Cause(2): A PDHC watchdog
timer timeout has occurred because the PDHC was stuck in some unknown state.
Cause(3): An unknown hardware fault has caused the PDHC to reset.
Action:
Upgrade PDHC firmware to the latest revision. If the error continues, contact
HP support personnel to troubleshoot the PDH Daughtercard and/or Cell Board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 635
- Severity: CRITICAL
- Event Summary: Cell boot has been disabled because setup of a CPU
thermal sensor failed.
- Event Class: System
- Problem Description:
A hardware fault prevented the PDHC from
configuring the thermal sensor(s) on one or more of the CPU modules. Following
detection of this fault condition, the Cell will be prevented from booting
until the Cell is powered "off", then "on", using the PE command.
- Cause / Action:
Cause(1): A hardware fault exists in the
communication path to a CPU module's thermal sensor, or in the thermal sensor
itself. Cause(2): A hardware fault prevents access to a CPU module's Processor
Information ROM.
Action: Contact HP support personnel to troubleshoot the
Cell Board, the PDH Daughtercard, and/or the offending CPU module.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 636
- Severity: CRITICAL
- Event Summary: A CPU module has reported overtemp, so will be
powered off in 2 minutes.
- Event Class: System
- Problem Description:
A CPU module's temperature has exceed the
high temperature threshold. As a result of this event, an irrevocable 2 minute
timer will begin. At the end of 2 minutes, the offending CPU module will be
powered off by the Cell hardware. The Cell must be powered off then on using
the MP's PE command before the CPU module will be powered again.
- Cause / Action:
Cause(1): Excessive heat in the data center has
caused the CPU module to heat up beyond the programmed temperature threshold.
Action(1): Resolve the environmental problem, shut down the partition, then PE
the Cell off, then on again. Cause(2): A hardware fault has caused the CPU
module to heat up beyond the programmed temperature threshold. Cause(3): The
Processor Information ROM on the processor module is unprogrammed or
programmed with invalid temperature thresholds. Action(2,3): Contact HP
support personnel to troubleshoot the problem.
Cause(1): Excessive heat in
the data center has caused the CPU module to heat up beyond the programmed
temperature threshold. Action(1): Resolve the environmental problem, shut down
the partition, then PE the Cell off, then on again. Cause(2): A hardware fault
has caused the CPU module to heat up beyond the programmed temperature
threshold. Cause(3): The Processor Information ROM on the processor module is
unprogrammed or programmed ! with invalid temperature thresholds. Action(2,3):
Contact HP support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 637
- Severity: MAJOR
- Event Summary: An error occurred while updating the PDHC firmware.
- Event Class: System
- Problem Description:
An error occurred while updating the PDHC
firmware. More specific details of the update error may be displayed by the
Firmware Update utility running on the MP.
- Cause / Action:
Cause(1): MP firmware not at a revision that
supports that version of PDHC firmware. Action(1): If MP is not at a
compatible revision, update the MP firmware to a compatible revision and
repeat PDHC firmware update. Cause(2): Other error indicated by Firmware
Update. Action(2): Exit from Firmware Update, reset the MP using the XD
command, then attempt to update PDHC firmware again. If repeated attempts to
update the PDHC firmware fail, contact HP support personnel to troubleshoot
the problem
Cause(1): MP firmware not at a revision that supports that
version of PDHC firmware. Action(1): If MP is not at a compatible revision,
update the MP firmware to a compatible revision and repeat PDHC firmware
update. Cause(2): Other error indicated by Firmware Update. Action(2): Exit
from Firmware Update, reset the MP using the XD command, then attempt to
update PDHC firmware again. If repeated attempts to update the PDHC firmware
fail, contact HP support personnel to troubleshoot ! the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 638
- Severity: CRITICAL
- Event Summary: CPU Revisions did not match
- Event Class: System
- Problem Description:
2 CPUs in the system are reporting
different revisions. This event will be emitted in groups of 3 with the two
revisions reported in the first 2 data fields and the CPU number in the 3rd
data field.
- Cause / Action:
2 CPUs are at different revisions. Replace
incompatible CPU.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 639
- Severity: CRITICAL
- Event Summary: 2 CPUs are running at mismatched frequencies.
- Event Class: System
- Problem Description:
This chassis code will be emitted in
pairs. 2 CPUs are reporting that they are running at different frequencies.
The two frequencies are reported in the data fields.
- Cause / Action:
There is a CPU or Cell compatibility problem.
Verify that all CPUs are clocked at the same frequency and have the same
ratios set.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 640
- Severity: CRITICAL
- Event Summary: A cpu is being over clocked
- Event Class: System
- Problem Description:
The rating for the cpu and the actual
speed will be emitted in 2 sequential event data fields.
- Cause / Action:
A cpu is being clocked at a rate higher than it
is rated for. Replace the cpu or cell board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 641
- Severity: FATAL
- Event Summary: Copy of complex profile on sub and cells don't match
- Event Class: System
- Problem Description:
The complex profile is stored in NVRAM on
the MP and each cell. All copies must match. For this error to be generated,
not only is the MP's copy of the complex profile invalid, but not all of the
cell's copies match.
- Cause / Action:
Cause: MP NVRAM was erased by removing MP from
system without setting "NVRAM SAVE" switch to on. MP was replaced with
cabinet's AC Breakers "off". Either of first two causes and replacing or
installing a cell board with cabinet's AC Breakers "off". Action: Remove cell
board causing problem. Power complex on and allow cells to distribute their
copy of complex profile to MP, then add new cell following proper OLA
procedures. Remove improper cell board. Execute MP Handler "CC" command and
choose "Last Profile". This will load the sub with what should be the same
copy as the cells. Then add new cell board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 642
- Severity: FATAL
- Event Summary: Duplicate cabinet number detected
- Event Class: System
- Problem Description:
The MP detected 2 or more cabinets with
the same cabinet number.
- Cause / Action:
Cause: When adding a new cabinet to the complex
or replacing the UGUY, the cabinet number switch was set to a number already
in use. Action: Turn off AC breakers to cabinet with duplicate number. Check
all other cabinet numbers in the complex for validity. Set cabinet number
switch on UGUY-PCB in new cabinet (s) to proper cabinet number. Turn on AC
breakers for cabinet(s).
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 643
- Severity: FATAL
- Event Summary: MP ID command must be run
- Event Class: System
- Problem Description:
The complex identification information in
group A of the complex profile is invalid. The MP (Manageability Processor)
command "ID" must be run. The SSKEY hardware is required.
- Cause / Action:
Cause: This is the first time the machine has
been powered on and there is no valid complex profile anywhere. Action: Run
"CC" command and generate genesis profile. Cause: MP lost its profile by being
replaced with power off ,or, "NVRAM save" switch was not enabled and MP was
removed and replaced. Also, at the same time, a cell was replaced or added
while power was off. Both scenarios are violations of OL* Rules. A
complex_profile_incoherent code was issued. The "cc" command was run and
genesis profile was selected. Action: If "cc" command is selected, choose
"last good profile" instead of genesis profile, or remove illegal cell(s),
power up and follow OL* Rules.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 645
- Severity: MAJOR
- Event Summary: MP Battery is low
- Event Class: System
- Problem Description:
The battery on the SBCH is below the safe
threshold. The battery can be replaced online.
- Cause / Action:
Cause: MP was running on battery for too long.
Someone didn't set "NVRAM Save" switch to "off". Action: Replace battery as
per MP Battery Remove and Replace procedures.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 646
- Severity: FATAL
- Event Summary: Partition being reset due to watchdog timeout
expiring
- Event Class: System
- Problem Description:
The partition is being reset because its
watchdog timer expired and automatic restart is enabled.
- Cause / Action:
Cause: There are 2 watchdog mechanisms, both of
which trigger the MP to reset a partition if its OS becomes unresponsive. An
unresponsive OS is detected when the OS fails to refresh the watchdog timer
before it expires. PA systems refresh the watchdog timer by emitting an event
with data field set to activity level/timeout, and the timeout fields
specifies the desired timeout. This timer can be disabled with the MP AR
command. IPF systems refresh the watchdog timer using the IPMI clear watchdog
command. The AR command does not affect the IPMI watchdog timer. Regardless of
which timer was in use, the MP emits this event when timer expiration triggers
resetting the partition. Action: Find out why the partition's OS had hung. The
cause could be bad HW that crashed the partition, or in rare cases, a
combination of events that caused the OS to be unable to refresh the watchdog
timer. Look for other events preceding the timeout for clues to the root
cause of the partition being unresponsive.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 647
- Severity: MAJOR
- Event Summary: PDHC FW was reset by hardware due to firmware
inactivity.
- Event Class: System
- Problem Description:
The processor dependent hardware
controller (PDHC) on the cell board had its watchdog timer expire. The PDHC
will reset the watchdog as the main program runs. If the watchdog does not get
reset within 7 seconds the timer will expire, resetting the PDHC.
- Cause / Action:
Cause: Processor dependent hardware controller
(PDHC) Hardware Failed; causing inactivity. PDHC Firmware hung; causing
inactivity.
Action: Even though the PDHC will reset itself without
interrupting the cell, HP Support personnel should be contacted to
troubleshoot the PDH daughtercard and/or cell board as soon as possible.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 649
- Severity: MAJOR
- Event Summary: Power Up Aborted, Over Temp
- Event Class: System
- Problem Description:
The Cabinet Power Up request was aborted
due to ambient air over temperature.
- Cause / Action:
Cause: Computer Room over temp Action: Cool
Computer Room Cause: Environment immediately surrounding cabinet. Action:
Correct local environmental problem Cause: Reporting Error Action:
Troubleshoot ambient air sensor/cable/PM3.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 651
- Severity: FATAL
- Event Summary: No Cabinet Start, Insufficient Blowers
- Event Class: System
- Problem Description:
When given a power up request, the cabinet
had to abort the start up due to less than the required number of Cabinet
Blowers installed.
- Cause / Action:
Cause: The number of blowers required is a hard
number. It is not dependent upon the number of entities installed in a
Cabinet. The Utilities Subsystem is not allowing the Cabinet to power up due
to an insufficient number of installed blowers. Action: Install missing
Cabinet Blowers. If proper number of blowers are installed, troubleshoot
blower presence detection.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 652
- Severity: FATAL
- Event Summary: No Cabinet Start, Insufficient IO Fans
- Event Class: System
- Problem Description:
When given a power up request, the cabinet
had to abort the start up due to less than the required number of IO fans
present.
- Cause / Action:
Cause: The number of IO fans required is a hard
number. It is not dependent upon the number of entities installed in a
Cabinet. The Utilities Subsystem is not allowing the cabinet to power up due
to an insufficient number of installed IO fans. Action: Install missing IO
fans, or if proper number installed, troubleshoot IO fan presence
detection.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 653
- Severity: MAJOR
- Event Summary: AC power to the PDCA was removed. Data Byte 3
specifies PDCA number.
- Event Class: System
- Problem Description:
The AC power connected to the PDCA (Power
Distribution Control Assembly) was removed. The data field contains the
physical location of the PDCA. The PDCA source that was deleted can be
identified by the implementation dependent field (data byte 3) of the physical
location: data byte[3]: 0 for PDCA 0, 1 for PDCA 1.
- Cause / Action:
Cause: Circuit breakers on the PDCA are open.
Action: Close the PDCA circuit breakers. Cause: Power source supplying AC to
the PDCA has failed. Action: Troubleshoot AC power problem. Cause: PDCA (Power
Distribution Control Assembly) has failed. Action: Replace the PDCA with
proper type (4-wire or 5-wire) PDCA following power distribution control
assembly Remove and Replace procedures. Cause: AC Detection and monitoring
circuitry failed. Action: Troubleshoot and replace failed Field Replaceable
Units.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 654
- Severity: MAJOR
- Event Summary: Cabinet Main Blower Failed
- Event Class: System
- Problem Description:
A cabinet main blower has failed.
Depending on the number of blowers still operating, the cabinet may or may not
shut down. View the Error Log entries to determine if the cabinet is
operating. If many log entries call out entities powering off during the same
time frame as this BLOWR_FAIL, the cabinet has probably shutdown. Carefully
review the log for the first few events within the same time frame for the
root cause of the problem. The GSP command, PS, will show a detailed power
status for a cabinet. If the +48V LED on the Front Panel Board is not lit,
power is not enabled to the cabinet. This is an indication the cabinet blowers
have probably gone from N to N - 1 status requiring an immediate cabinet
shutdown.
- Cause / Action:
Cause: Cabinet Blower Failed Action: Replace
failed blower module as soon as possible following the Blower Module Remove
and Replace Procedures.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 655
- Severity: MAJOR
- Event Summary: 48 Volt Converter Failed. Data Byte 3 specifies PDCA
number.
- Event Class: System
- Problem Description:
A 48 Volt DC Converter powered by the
specified PDCA failed on the designated Bulk Power Supply. The PDCA powering
the converter on the BPS that failed can be identified by the implementation
dependent field (data byte 3) of the BPS' physical location: data byte[3]: 0
for PDCA 0, 1 for PDCA 1.
- Cause / Action:
Cause: The 48 Volt DC Converter powered by the
PDCA identified failed in the named Bulk Power Supply. Action: Contact HP
Support personnel to troubleshoot problem Cause: The PDCA identified has
failed. This will be evident by many BPS_FAIL codes and probably a AC_DELETED
code in the Event Log. Action: Contact HP Support personnel to troubleshoot
problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 657
- Severity: MAJOR
- Event Summary: Fan failed in designated Bulk Power Supply
- Event Class: System
- Problem Description:
The designated Bulk Power Supply is
reporting its fan has failed.
- Cause / Action:
Cause: Fan failure or fan obstructed Action: If
fan is obstructed, remove obstruction. If no obstruction, Contact HP Support
personnel to troubleshoot problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 659
- Severity: MAJOR
- Event Summary: Bulk Power Supplies are not Redundant.
- Event Class: System
- Problem Description:
The number of functioning Bulk Power
Supplies has decreased to where the Cabinet Power supplied (number of
available Bulk Power Supplies times power output per each) minus the estimated
Cabinet Power consumed is greater than 0, but less than the output of one Bulk
Power Supply.
- Cause / Action:
Cause: Entities were added to the cabinet,
increasing the estimated Power Consumption. Or, a non-functional GSP bus
entity has become functional, providing previously missing power consumption
information. Action: Purchase and install a Bulk Power Supply, if redundancy
is desired. Cause: Bulk Power Supply failed. Action: Contact HP Support
personnel to troubleshoot problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 660
- Severity: FATAL
- Event Summary: +48V DC has exceeded its upper limit
- Event Class: System
- Problem Description:
The PM has detected the value of +48V
power, as measured on the UGUY board, has exceeded an upper threshold.
- Cause / Action:
Cause: The cabinet's 48V power has exceeded an
acceptable upper threshold. Action: Contact HP Support personnel to
troubleshoot problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 661
- Severity: FATAL
- Event Summary: +48V DC has fallen below its lower limit
- Event Class: System
- Problem Description:
The PM has detected the value of +48V
power, as measured on the UGUY board, has fallen below a lower threshold.
- Cause / Action:
Cause: The cabinet's 48V power has fallen below
an acceptable lower threshold. Action: Contact HP Support personnel to
troubleshoot problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 662
- Severity: MAJOR
- Event Summary: Cabinet Fan Failed
- Event Class: System
- Problem Description:
A cabinet fan has failed. Depending on the
number of cabinet fans still operating, the cabinet may or may not shut down.
View the Error Log entries to determine if the cabinet is operating.
- Cause / Action:
Cause: Cabinet Fan Failed Action: Replace
failed cabinet fan module as soon as possible following the Cabinet Fan Module
Remove and Replace Procedures.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 670
- Severity: FATAL
- Event Summary: Housekeeping power has exceeded expected levels.
- Event Class: System
- Problem Description:
Housekeeping power has exceeded expected
levels.
- Cause / Action:
Cause: The cabinet's housekeeping power has
risen above an acceptable upper threshold. Action: Contact HP Support
personnel to troubleshoot problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 671
- Severity: FATAL
- Event Summary: Housekeeping power has fallen below expected levels.
- Event Class: System
- Problem Description:
Housekeeping power has fallen below
expected levels.
- Cause / Action:
Cause: The cabinet's housekeeping power has
fallen below an acceptable upper threshold. Action: Contact HP Support
personnel to troubleshoot problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 672
- Severity: MAJOR
- Event Summary: The BPSs for the cabinet are illegally configured.
Data Byte 3 = PDCA number.
- Event Class: System
- Problem Description:
Through failures or reconfiguration, the
BPS for the cabinet named are illegally configured. There must be a BPS
connected to each phase of the power. Phase 1 feeds BPS slots 0 & 1, phase
2 feeds slots 2 & 3, and phase 3 feeds 4 & 5. There must be a BPS
connected to each phase. If 4 BPS are installed in a cabinet in slots 0 - 3
and 4 & 5 were empty, this would be an illegal configuration. They should
be installed in 0,1,2,and 4 or 0,1,3,and 5 or some combination thereof. The
PDCA physical location determines which phase is configured incorrectly. Data
Byte 3 (implementation dependent field) indicates the PDCA number used when
the configuration error occurred:
- Cause / Action:
Cause: The BPS are installed in an illegal
configuration. Action: Re-configure the BPS in a manner consistent with the
explanation in the Problem Description statement
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 673
- Severity: MAJOR
- Event Summary: BPS ID received from installed Bulk Power Supply was
unknown
- Event Class: System
- Problem Description:
A Bulk Power Supply is reporting an
unknown BPS ID. The Bulk Power Supply will not be powered up and added to the
Power Available tally. If cabinet is not powered up, it will refuse to power
up until this fault is corrected.
- Cause / Action:
Cause: The designated power supply is
responding with an illegal BPS ID. It could be a faulty supply, a different
revision, or a wrong supply in the wrong box. Action: Replace this Bulk Power
Supply with a proper one. Cause: A new revision of Power Supply that requires
a PM3 firmware upgrade was attempting install. Action: Check service notes for
firmware revisions and compatibility charts.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 675
- Severity: FATAL
- Event Summary: Ambient Air Sensor Overtemp Warning
- Event Class: System
- Problem Description:
The cabinet's Ambient Air Sensor detected
a change in air temperature entering the over-temp-high range. The Cabinet
will be shutting itself down to prevent component damage.
- Cause / Action:
Cause: Room Temperature has risen to a FATAL
level. Action: Shutdown and power off the system. Correct air temperature
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 676
- Severity: MAJOR
- Event Summary: Ambient Air Sensor Overtemp Warning
- Event Class: System
- Problem Description:
The cabinet's Ambient Air Sensor detected
a change in air temperature crossing to the low range. The air temperature may
be rising or falling. This is just a reporting of entering the over-temp-low
range.
- Cause / Action:
Cause: Room Temperature is rising or falling.
Action: Check the error log's previous entries within a logical time frame. If
temperature is rising, prepare for system shutdown. If temperature is
dropping, then problem is probably resolved.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 677
- Severity: MAJOR
- Event Summary: Ambient Air Sensor Overtemp Warning
- Event Class: System
- Problem Description:
The cabinet's Ambient Air Sensor detected
a change in air temperature crossing to the mid range. The air temperature may
be rising or falling. This is just a reporting of entering the over-temp-mid
range.
- Cause / Action:
Cause: Room Temperature is rising or falling.
Action: Check the error log's previous entries within a logical time frame. If
temperature is rising, prepare for system shutdown. If temperature is
dropping, then problem is probably resolved.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 678
- Severity: MAJOR
- Event Summary: IO Fan Failed
- Event Class: System
- Problem Description:
An IO Chassis cooling fan has failed.
Depending on the number of fans still operating, the cabinet may or may not
shut down. View Error Log entries to determine if the cabinet is operating. If
many log entries call out entities powering off during the same time frame as
this IOFAN_FAIL, the cabinet has probably shutdown. Carefully review the log
for the first few events within the same time frame for the root cause of the
problem. The Guardian Service Processor command, PS, will show a detailed
power status for a cabinet. The +48V LED on the Front Panel Board not lit,
power is not enabled to the cabinet, indicating the cabinet IO Chassis fans
have probably gone from N to N - 1 status requiring an immediate cabinet
shutdown.
- Cause / Action:
Cause: IO Cooling Fan Failed Action: Replace IO
Fan Module as soon as possible following the IO Fan Module Remove and Replace
Procedures.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 680
- Severity: MAJOR
- Event Summary: Cabinet Power System is in overload.
- Event Class: System
- Problem Description:
This code is issued when the Cabinet Power
supplied (number of Bulk Power Supplies times power output per each) minus the
estimated Cabinet Power consumed drops below 0. Utilities firmware will not
allow a cabinet in this state to power up (see ABORT_PWRUP_BPS). Utilities
firmware will not shut down a cabinet in this state. However, there is a
possibility of a cabinet brownout, making the cabinet unreliable.
- Cause / Action:
Cause: A Bulk Power Supply has failed, or,
entities were added. Look for one or more BPS_Fail Chassis Codes preceding
this one for the actual failures. This code is a warning of possible cabinet
unreliability. Action: Contact HP Support personnel to troubleshoot the
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 681
- Severity: FATAL
- Event Summary: Cabinet Shutdown - Insufficient Blowers
- Event Class: System
- Problem Description:
After a BLOWR_FAIL, there were N-1 blowers
functioning. This is an illegal condition causing immediate cabinet shutdown
to prevent component damage.
- Cause / Action:
Cause: One blower has failed creating condition
N. Before condition N was corrected, another blower in the same cabinet was
declared failed. This created the illegal condition of N-1. Action: Contact HP
Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 682
- Severity: FATAL
- Event Summary: Cabinet Shutdown - Insufficient IO Fans
- Event Class: System
- Problem Description:
After a IOFAN_FAIL, there were N-1 fans
functioning. This is an illegal condition causing immediate cabinet shutdown
to prevent component damage.
- Cause / Action:
Cause: One IO fan has failed creating condition
N. Before condition N was corrected, another IO fan in the same cabinet
failed. This created the illegal condition of N-1. Action: Contact HP Support
personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 683
- Severity: MAJOR
- Event Summary: IO Expansion Utility Cabinet Fan Failed
- Event Class: System
- Problem Description:
One of two fans in the Utility chassis of
the IO Expansion Cabinet has failed.
- Cause / Action:
Cause: IO Expansion Utility Fan or Fan sensor
failure PM failure Action: Contact HP Support personnel to troubleshoot the
problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 684
- Severity: FATAL
- Event Summary: Watchdog Timer Expired
- Event Class: System
- Problem Description:
The Watchdog Timer checks for inactivity,
or hung state, of the Cabinet Level Utilities (CLU) portion of the UGUY.
During activity, the timer is continually reset. If the timer expires, it will
automatically reset the CLU microprocessor. This will not affect running
partitions.
- Cause / Action:
Cause: CLU has been reset after a firmware
update. Action: None. Cause: The CLU firmware has been reset by the MFG MP
command RU. Action: None. Cause: Hardware or firmware failure on the UGUY.
Action: Check revision of CLU firmware. If out of date, or known bad revision,
use FWUU to update CLU firmware. Contact HP Support personnel to troubleshoot
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 685
- Severity: FATAL
- Event Summary: Invalid checksum from EEPROM
- Event Class: System
- Problem Description:
An invalid checksum was received when
reading the FRUID EEPROM for the device named in the chassis code. If this is
a single error, the fault lies with the named FRU. If there are many
INVALID_CKSM entries in the Event Log, there is probably a problem with the
I2C bus.
- Cause / Action:
Cause: Data corrupted in the named EEPROM.
Action: If this is a single entry, replace the FRU. Cause: Problem with I2C
bus. Action: If every entity with a FRUID logs an error, the problem is
probably with the CLU portion of the Utilities Board. Replace the Utilities
Board following the Utilities Board Remove and Replace Procedures. If there
are a few entities reporting checksum errors, but several have reported in
properly, chances are one device is causing the problem with the I2C bus. This
will take a more concerted effort to find and correct that problem. Probably
wish to take the bus to a minimum configuration and test, add, test until the
failure is verified.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 686
- Severity: MAJOR
- Event Summary: System Backplane Power Board Fault
- Event Class: System
- Problem Description:
One or more of the System Backplane Power
Boards is reporting a DC Fault through the System Backplane Local Power
Monitor. The physical location of the failing power board is in the Data Field
of the event.
- Cause / Action:
Cause: A DC-DC converter on the named power
board failed. Action: Contact HP Support personnel to troubleshoot the problem
Caution: The 1.8 volt converters are N+1. The 3.3 volt converters are N+2. If
there is a situation where a 1.8 fails at the same time as a 3.3 on a
different power board, replace the failed 1.8 board first.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 687
- Severity: MAJOR
- Event Summary: Read of EEPROM failed
- Event Class: System
- Problem Description:
An attempt to read the EEPROM (FRUID) on
the IO Backplane Board failed.
- Cause / Action:
Cause: The I2C controller on the Utilities
Board (CLU section) is bad. This will be shown by many I2C failure codes in
the Error Log. These codes should identify entities on both the System
Backplane and the Master IO Backplane. Action: Contact HP Support personnel to
troubleshoot the problem. Cause: The cable from the Utilities Backplane to the
Master IO Backplane is bad, or is not properly connected. Action: Check and
reseat the Master IO Backplane Utilities cable. If no help, contact HP Support
personnel to troubleshoot the problem. Cause: The I2C bus into the IO
Backplane EEPROM is bad. Action: Could possibly be a bent pin on the Master IO
Backplane Utilities cable connectors. Check the connectors at each end of the
cable for bent or broken pins. If the connectors and cable are good, contact
HP Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 688
- Severity: MAJOR
- Event Summary: Read of EEPROM failed
- Event Class: System
- Problem Description:
An attempt to read the EEPROM (FRUID) on
the IO Backplane Power Board failed.
- Cause / Action:
Cause: The I2C controller on the Utilities
Board (CLU section) is bad. This will be shown by many I2C failure codes in
the Error Log. These codes should identify entities on both the System
Backplane and the Master IO Backplane. Action: Contact HP Support personnel to
troubleshoot the problem. Cause: The cable from the Utilities Backplane to the
Master IO Backplane is bad, or is not properly connected. Action: Check and
reseat the Master IO Backplane Utilities cable. If no help, contact HP Support
personnel to troubleshoot the problem. Cause: The I2C bus into the IO Power
Board EEPROM is bad. Action: Could possibly be a bent pin on the Master IO
Backplane Utilities cable connectors. Check the connectors at each end of the
cable for bent or broken pins. Or, it could be a bent pin on the Master IO
Backplane where the PCI Cardcage connects. If the MIOB, connectors and cable
are good, contact HP Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 689
- Severity: MAJOR
- Event Summary: Read of LPM Fault failed
- Event Class: System
- Problem Description:
An attempt to read the Local Power Monitor
Fault register on the IO Backplane Power Board failed.
- Cause / Action:
Cause: The I2C controller on the Utilities
Board (CLU section) is bad. This will be shown by many I2C failure codes in
the Error Log. These codes should identify entities on both the System
Backplane and the Master IO Backplane. Action: Contact HP Support personnel to
troubleshoot the problem. Cause: The cable from the Utilities Backplane to the
Master IO Backplane is bad, or is not properly connected. Action: Check and
reseat the Master IO Backplane Utilities cable. If no help, contact HP Support
personnel to troubleshoot the problem. Cause: The IO Backplane Power Board is
bad. Action: Contact HP Support personnel to troubleshoot the problem. Cause:
The I2C bus into the IO Power Board EEPROM is bad. Action: Could possibly be a
bent pin on the Master IO Backplane Utilities cable connectors. Check the
connectors at each end of the cable for bent or broken pins. Or, it could be a
bent pin on the Master IO Backplane where the PCI Cardcage connects. If the
MIOB, connectors ! and cable are good, contact HP Support personnel to
troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 690
- Severity: FATAL
- Event Summary: IO Power Board Over temperature
- Event Class: System
- Problem Description:
The Local Power Monitor of the named IO
Chassis is reporting a Power Brick over temperature condition.
- Cause / Action:
Cause: The ambient air is too warm. Action:
Check the Error Log for other Over temperature Warnings to confirm the
environmental problem. Cause: The specified Power Brick, or the Local Power
Monitor, has failed in such a manner as to report this error. Action: Contact
HP Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 691
- Severity: FATAL
- Event Summary: IO Power Board Fault
- Event Class: System
- Problem Description:
The Local Power Monitor on the named IO
Chassis has reported a power fault condition.
- Cause / Action:
Cause: The named power brick on the named IO
Chassis has failed. Action: Contact HP Support personnel to troubleshoot the
problem. Cause: Input power has created some fault conditions. This will be
evident by the presence of several chassis codes in the Error Log within the
same time frame. Action: The Error Log must be reviewed carefully for the root
cause of the errors. There is almost always a single cause, even if many
events are reported.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 692
- Severity: MAJOR
- Event Summary: Voltage Margin on IO Power Board failed
- Event Class: System
- Problem Description:
The Local Power Monitor on the named IO
Power Board failed to properly margin the power as commanded.
- Cause / Action:
Cause: The IO Power Board LPM is not
communicating with the CLU. Action: Some troubleshooting will be involved
here. Is it the IO Power Board LPM, or the CLU. You'll have to check the Error
Log for other entries related to either CLU communications problems or the IO
Power Board LPM. If there are messages about other HIOPB_VOLT_MRGN_FAIL
entries as well as SYS_BKP_VOLT_MRGN_FAIL, it is pointing to the CLU. Cause:
The MP is not communicating with the CLU. Action: The MP bus (USB) is not
functioning. There should be many entries in the Error Log with the same type
of error message. They will point to MP bus errors. Also, try the GSP "PS"
command. This will display status of entities within a cabinet.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 693
- Severity: MAJOR
- Event Summary: Failure to read data from a FRUID EEPROM
- Event Class: System
- Problem Description:
Either by command or as part of
initialization, the data from a FRUID EEPROM failed a read command. This does
not necessarily mean the FRU has failed, just that the FRUID can't be read.
The specific FRU Handle of the failing FRUID is embedded in the two uppermost
bytes of the data field.
- Cause / Action:
Cause: The CLU can't read the data from a FRUID
EEPROM. Action: Contact HP Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 694
- Severity: MAJOR
- Event Summary: Failure to read data from a SBCH FRUID EEPROM
- Event Class: System
- Problem Description:
Either by command or as part of
initialization, the data from a FRUID EEPROM failed a read command. This does
not necessarily mean the FRU has failed, just that the FRUID data cannot be
read.
- Cause / Action:
Cause: The CLU cannot read the data contained
in the EEPROM on the SBCH board in the same cabinet. Action: Contact HP
Support personnel to troubleshoot the problem. If this is the only READ
failure in this timeframe, replace the SBCH board following the SBCH Board
Remove and Replace Procedures as soon as possible. If there are other READ
failures in this same cabinet, replace the Utilities Board following the
Utilities Board Remove and Replace Procedures.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 695
- Severity: MAJOR
- Event Summary: Failure to read data from a UGUY FRUID EEPROM
- Event Class: System
- Problem Description:
Either by command or as part of
initialization, the data from a FRUID EEPROM failed a read command. This does
not necessarily mean the FRU has failed, just that the FRUID can't be read.
- Cause / Action:
Cause: Attempted access to read the UGUY FRUID
EEPROM failed. Action: If there is only one FRUID that can't be read, replace
that FRU as soon as possible. If there are a lot of log entries for different
FRUs, suspect the Utilities Board or the Utilities cable to those FRUs. For
example, if the failures are all associated with a Master IO Backplane, the
failing FRU is probably the Utilities cable to that backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 696
- Severity: MAJOR
- Event Summary: Read EEPROM failed
- Event Class: System
- Problem Description:
An attempt to read the EEPROM (FRUID) on
the System Backplane failed
- Cause / Action:
Cause: The I2C controller on the Utilities
Board (CLU section) is bad. This will be shown by many I2C failure codes in
the Error Log. These codes should indentify entities on both the System
Backplane and the Master IO Backplane. Action: Replace the Utilities board
(UGUY) following the Utilities Board Remove and Replace procedures. Cause: The
100 pin cable from the Utilities Backplane to the System Backplane is bad, or
is not properly connected. Action: Check and reseat the System Backplane
Utilities cable. If this does not resolve the issue, replace the System
Backplane utilities cable following the Backplane Utilities Cable Remove and
Replace procedures. Cause: The I2C bus into the System Backplane EEPROM is
bad. Action: Could possibly be a bent pin on the System Backplane Utilities
cable connectors. Check the connectors at each end of the cable for bent or
broken pins. If the connectors and cable are good, replace the System
Backplane following the System Backplane Re! move and Replace procedures.
NOTE: System Backplane replacement is a major undertaking. Ensure all other
possibilities have been explored before replacing the backplane. You should
have WTEC approval before replacing the backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 697
- Severity: MAJOR
- Event Summary: Read command on System Backplane I2C bus failed
- Event Class: System
- Problem Description:
A read command on the system backplane I2C
bus failed.
- Cause / Action:
Cause: The I2C controller on the Utilities
Board (CLU section) is bad. This will be shown by many I2C failure codes in
the Error Log. These codes should indentify entities on both the System
Backplane and the Master IO Backplane. Action: Replace the Utilities board
(UGUY) following the Utilities Board Remove and Replace procedures. Cause: The
100 pin cable from the Utilities Backplane to the System Backplane is bad, or
is not properly connected. Action: Check and reseat the System Backplane
Utilities cable. If no help, replace the System Backplane utilities cable
following the Backplane Utilities Cable Remove and Replace procedures. Cause:
The I2C bus into the System Backplane EEPROM is bad. Action: Could possibly be
a bent pin on the System Backplane Utilities cable connectors. Check the
connectors at each end of the cable for bent or broken pins. If the connectors
and cable are good, replace the System Backplane following the System
Backplane Remove and Replace procedures. NOTE: System Backplane replacement is
a major undertaking. Ensure all other possibilities have been explored before
replacing the backplane. You should have WTEC approval before replacing the
backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 698
- Severity: MAJOR
- Event Summary: Write command on System Backplane I2C bus failed
- Event Class: System
- Problem Description:
A write command on the system backplane
I2C bus failed. The type of command that failed can be identified by the
activity status field (last byte) of the encoded field. B = RC Cable
Configuration Register write C = Backplane Voltage Margin Register write 9 =
Flex circuit configuration register write
- Cause / Action:
Cause: The I2C controller on the Utilities
Board (CLU section) is bad. This will be shown by many I2C failure codes in
the Error Log. These codes should identify entities on both the System
Backplane and the Master IO Backplane. Action: Replace the Utilities board
(UGUY) following the Utilities Board Remove and Replace procedures. Cause: The
100 pin cable from the Utilities Backplane to the System Backplane is bad, or
is not properly connected. Action: Check and reseat the System Backplane
Utilities cable. If no help, replace the System Backplane utilities cable
following the Backplane Utilities Cable Remove and Replace procedures. Cause:
The I2C bus into the System Backplane EEPROM is bad. Action: Could possibly be
a bent pin on the System Backplane Utilities cable connectors. Check the
connectors at each end of the cable for bent or broken pins. If the connectors
and cable are good, replace the System Backplane following the System
Backplane Remove and Replace procedure. NOTE: System Backplane replacement is
a major undertaking. Ensure all other possibilities have been explored before
replacing the backplane. You should have WTEC approval before replacing the
backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 699
- Severity: FATAL
- Event Summary: System Backplane Power Fault
- Event Class: System
- Problem Description:
The Local Power Monitor on the named
System Backplane has detected a power fault. The failing Backplane Power Board
status is read from the Backplane LPM I2C interface register and the value is
placed in the data field of the event (bits 15-8).
- Cause / Action:
Cause: While running normally, the CLU
microcontroller detected a fault on the I2C Bus from the system Backplane LPM.
Action: Check other log entries around this time for other events. If there
are other events, analyze for best troubleshooting approach. Check the log
carefully as a shorted ASIC could cause many errors to occur. These errors
will not necessarily point to the ASIC. If none, replace failed Backplane
Power Board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 700
- Severity: CRITICAL
- Event Summary: System Backplane voltage margin failed
- Event Class: System
- Problem Description:
Margining voltage to the System Backplane
has failed.
- Cause / Action:
Cause: The CLU was unable to write to the
voltage margin register on the System backplane. Action: Try re-margining the
system backplane and check connections. If many I2C access events are
occurring inspect the UGUY utilities board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 701
- Severity: MAJOR
- Event Summary: Failure to write data to FRUID EEPROM
- Event Class: System
- Problem Description:
An attempt to write data to the FRUID
EEPROM by the MFG level MP command WF failed. The FRU handle of the failing
FRUID is embedded in the two uppermost bytes of the data field.
- Cause / Action:
Cause: The entity being written to is not
powered up. Action: Power the entity with the PE command. Cause: The entity
being written to has failed. Action: Replace the entity with the failed FRUID.
Cause: The I2C bus has failed. Look for other entries in the Error Log to
confirm this. If there are a lot of entries in this timeframe about I2C
failures, analyze errors the errors to see if they are all within a cabinet,
or the entire complex. Action: Each cabinet's Utilities Board (CLU and PM) is
responsible for the query over I2C for the FRUID, LPM status, and other
information. If there are other entries in the Error Log and they are all
within a cabinet, replace the Utilities Board following the Utilities Board
Remove and Replace Procedures.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 707
- Severity: FATAL
- Event Summary: PDH Controller firmware version is not supported
with this version of MP FW
- Event Class: System
- Problem Description:
The MP checked the FW revision of the PDHC
identified in the physical location data field and discovered that it did not
recognize the revision as one that it has been qualified with. This is an
unsupported configuration.
- Cause / Action:
Update PDHC or MP FW
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 708
- Severity: CRITICAL
- Event Summary: Power fault on cell board
- Event Class: System
- Problem Description:
The local Power Monitor is reporting a
fault with the named Cell Power Board.
- Cause / Action:
Cause: One or more of the DC to DC power
converters on the Cell Power Board is displaying a fault condition. Action:
Contact HP Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 710
- Severity: MAJOR
- Event Summary: The ExecuteCommand function failed on a CPU.
- Event Class: System
- Problem Description:
ExecuteCommand issues commands that
execute on remote CPUs via IPI interrupts. If the command failed to execute,
this event is printed and the data field contains the status.
- Cause / Action:
Inter-Processor-Interrupts may not be working,
or the command may have timed out. This could be a firmware bug or hardware
problem. Look for other clues in the event log.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 711
- Severity: MAJOR
- Event Summary: A remote CPU is not prepared to receive a command
- Event Class: System
- Problem Description:
A remote CPU is in a state where it cannot
receive and execute a new command. The current status of the CPU is provided
in the data field.
- Cause / Action:
The CPU may be stuck waiting for a previous
command or may not be healthy. This could also be caused by a system resource
contention problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 712
- Severity: CRITICAL
- Event Summary: Boot is disabled because the cell type does not
match the System FW ROM type.
- Event Class: System
- Problem Description:
The cell type (IPF or PA) does not match
System FW type. The cell type is detected based on information stored in CPU
modules' FRUID EEPROMs. The System FW type is determined based on data that is
embedded in the System FW ROM image. This is checked each time Cell power
transitions from off to on, and each time the System FW is updated. Following
the detection of this mismatch, the Cell will not be allowed to boot until the
problem has been resolved.
- Cause / Action:
Cause(1): The System FW ROM in unprogrammed, or
an invalid System FW ROM image is programmed in the System FW flash.
Action(1): Update the System FW using Firmware Update from the MP. Cause(2):
The Cell's installed CPU modules do not all have the same type, frequency and
partition compatibility, so the Cell type cannot be accurately determined. In
this case, a CPU_MOD_COMPAT_MISMATCH event should also be emitted. Action(2):
Contact HP support personnel to troubleshoot the mismatched CPU module
Cause(3): A CPU module's FRU data is programmed incorrectly. Action(3): If
this is in manufacturing, re-program the FRU specific field of the FRU data
for the CPU module. Otherwise, contact HP support personnel to troubleshoot
the mismatched CPU module..
Cause(1): The System FW ROM in unprogrammed, or
an invalid System FW ROM image is programmed in the System FW flash.
Action(1): Update the System FW using Firmware Update from the MP. Cause(2):
The Cell's installed CPU modules d! o not all have the same type, frequency
and partition compatibility, so the Cell type cannot be accurately determined.
In this case, a CPU_MOD_COMPAT_MISMATCH event should also be emitted.
Action(2): Contact HP support personnel to troubleshoot the mismatched CPU
module. Cause(3): A CPU module's FRU data is programmed incorrectly.
Action(3): If this is in manufacturing, re-program the FRU specific field of
the FRU data for the CPU module. Otherwise, contact HP support personnel to
troubleshoot the mismatched CPU module.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 713
- Severity: MAJOR
- Event Summary: The PDHC has waited an abnormally long time for PDH
bus access.
- Event Class: System
- Problem Description:
This event is emitted after the PDHC has
waited longer than a maximum expected time for the PDH arbiter to grant it
control of the PDH bus. The PDHC will continue waiting for control of the PDH
bus until the arbiter grants it control, or the Cell is powered off using the
MP's PE command. While waiting for the PDH bus, the PDHC will NOT perform its
normal duties such as monitoring the Cell status, and passing messages from
the system to the MP, and the PDHC heartbeat will not blink.
- Cause / Action:
Cause (probable): A hardware fault is
preventing the PDH arbiter from granting the PDHC control of the bus. Action:
Contact HP support personnel to troubleshoot the cell board and/or PDH
daughtercard. Cause: Bad connection on UGUY clock cable. Action: Check UGUY
clock cable connection.
Cause (probable): A hardware fault is preventing
the PDH arbiter from granting the PDHC control of the bus. Action: Contact HP
support personnel to troubleshoot the Cell Board and/or PDH Daughtercard.
Cause: Bad connection on UGUY clock cable. Action: Check UGUY clock cable
connection.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 714
- Severity: MAJOR
- Event Summary: The PDHC has waited an abnormally long time to
obtain the PDH semaphore.
- Event Class: System
- Problem Description:
This event is emitted after the PDHC has
waited longer than a maximum expected time to obtain control of the PDH bus
semaphore. The PDHC will continue waiting for control of the PDH bus semaphore
until System FW relinquishes control of the semaphore, or the Cell is powered
off using the MP's PE command. While waiting for the PDH bus semaphore, the
PDHC will NOT perform its normal duties such as monitoring the Cell status,
and passing messages from the system to the MP, and the PDHC heartbeat will
not blink. The data field contains debug data that may be useful for
developers. Data_byte[0] = last value read from PDHC's address for the
microSemaphore register. Data_byte[1] = boolean indicator (1=set,0=not_set) of
whether the PDHC's flag is set. Data_byte[2] = boolean indicator
(1=set,0=not_set) of whether the System FW's flag is set.
- Cause / Action:
Cause(1): System FW has control of the PDH bus
semaphore, and has failed to relinquish control of it. Action(1): Update the
System FW revision to the latest version of System FW using the Firmware
Update Utility. Cause(2): A hardware fault is preventing the PDH bus semaphore
from being taken/released as expected. Action(2): Contact HP support personnel
to troubleshoot the Cell Board and/or PDH Daughtercard
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 715
- Severity: MAJOR
- Event Summary: An error occurred while transmitting an IPMI message
in the BMC2HOST direction.
- Event Class: System
- Problem Description:
This event indicates that an error
occurred while transmitting an IPMI message in the BMC2HOST direction. The
data field contains more detailed information about the source of the error.
Data Bytes 0 & 1 form a 16-bit IPMI error indicator that has the following
values and meanings: 1 - IPMI_HOST_BUSY_TIMEOUT - The PDHC could not put a
message in the BMC2HOST hardware message queue for over 10 seconds, so the
pending message(s) were dropped. 2 - IPMI_INVALID_MSG_SIZE - The MP sent an
IPMI message response that has an embedded size indicator that is less than 4
bytes or greater than the size of the message data. The poorly formed message
response will be dropped. 3 - IPMI_BMC2HOST_Q_FULL - The BMC2HOST message
queue in the PDHC is full, so a message response from the MP has been dropped.
- Cause / Action:
Cause(1): An unknown OS IPMI driver or
Utilities FW bug has occurred. Action(1): Update PDHC FW, MP FW, System FW and
the OS IPMI driver to the latest revisions. Cause(2): A hardware fault is
preventing the BMC2HOST queue from working. Action(2): Contact HP support
personnel to troubleshoot the PDH Daughtercard.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 716
- Severity: MAJOR
- Event Summary: EFI unable to read initial debug level from the BMC
- Event Class: System
- Problem Description:
EFI was unable to read the initial debug
level from the BMC token. EFI will continue with an unknown value for the
debug level. Data Field: Return status from internal EFI function.
- Cause / Action:
Cause: BMC not functioning properly. Action:
Reset the BMC. Contact your HP representative to check the BMC. Cause: SAL
service to read tokens not functioning properly. Action: Reset the system.
Clear NVM. Upgrade system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 717
- Severity: CRITICAL
- Event Summary: A XBC port was unexpectedly found to not be
landmined.
- Event Class: System
- Problem Description:
A XBC port was unexpectedly found to not
be landmined. The data field consists of the XBC number (32:43) and the port
number (44:55).
- Cause / Action:
Cause: An XBC is indicating a port failure
Action: Validate all of the cells connectivity to the PD Check the TOGO chips
seating reset the system replace either cells/system backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 718
- Severity: FATAL
- Event Summary: An invalid number of XBC ports were landmined in the
system.
- Event Class: System
- Problem Description:
The number of landmined XBC ports was not
within the allowable range. There is a minimum number of landmined ports
because some ports are always unused. There is a maximum number of landmined
ports because there is a limit to the number of broken links allowed in a
system. The data field shows the number of landmined ports found
- Cause / Action:
Check for hardware failures: crossbar chips,
etc.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 719
- Severity: FATAL
- Event Summary: The backplane was not recognized as one that
contains fabric
- Event Class: System
- Problem Description:
Data field contains the backplane type
found. During Intra SKD Routing, the backplane type detected was either a
Medel backplane or was unrecognized. The backplane could therefore not be
routed. This is a firmware sanity check. Data Field: system type
- Cause / Action:
Cause: An unrecognized backplane is installed.
Action: Contact HP Support Personnel to determine why the backplane was
unrecognized.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 720
- Severity: MAJOR
- Event Summary: Writing the XIN Error Mask Register to zero failed
- Event Class: System
- Problem Description:
Prior to initializing the CC to XBC link,
the XIN error mask should be zeroed out to prevent spurious errors from
interfering with the link initialization. This write to zero out the error
mask failed. Data Field: (cell << 56) | return status
- Cause / Action:
CC Write Failure.
Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 722
- Severity: CRITICAL
- Event Summary: Data read from the CC Primary Mode CSR
- Event Class: System
- Problem Description:
The Coherency Controller's (CC) XIN link
did not initialize properly. The data field contains the data read from the CC
Primary Error Mode CSR.
- Cause / Action:
CC to XBC link init failure. Contact your HP
service representative to check the CC to XBC link
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 723
- Severity: CRITICAL
- Event Summary: Dumping error info. Read status of the CC Error Mask
Register
- Event Class: System
- Problem Description:
The Coherency Controller's (CC) XIN link
did not initialize properly. The data field contains the return status from an
attempted read of the CC Primary Error Mode CSR. (0 = SUCCESS)
- Cause / Action:
CC to XBC link init failure. Contact your HP
service representative to check the CC to XBC link
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 724
- Severity: CRITICAL
- Event Summary: Data read from the CC Error Mask CSR
- Event Class: System
- Problem Description:
The Coherency Controller's (CC) XIN link
did not initialize properly. The data field contains the data read from the CC
Error Mask CSR.
- Cause / Action:
CC to XBC link init failure. Contact your HP
service representative to check the CC to XBC link
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 725
- Severity: MAJOR
- Event Summary: The link could not be crossed upon first attempt
- Event Class: System
- Problem Description:
The neighbor's port connected to the link
being crossed is not routable. This was the first attempt to cross the link,
PDC will now look for another link it can cross. DATA: (xbcNum << 32 ) |
(port << 44)
- Cause / Action:
The neighbor port is not routable. The port is
either: not connected, landmined, in FE, or contains an SBE or LPE.
Contact
HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 726
- Severity: CRITICAL
- Event Summary: Failed reading an XBC forward progress register
- Event Class: System
- Problem Description:
Fabric read error. Data field: (XBC number
<< 32 | return status)
- Cause / Action:
Fabric access error
Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 727
- Severity: CRITICAL
- Event Summary: Could not find an adjacent XBC due to broken fabric
links
- Event Class: System
- Problem Description:
Too many crossbar links are broken. Cell
cannot boot, halting. Data field: XBC number << 32
- Cause / Action:
Possible crossbar failure
Contact HP Support
personnel to analyze the crossbar.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 728
- Severity: MAJOR
- Event Summary: The run-time verification of a programming
assumption has failed.
- Event Class: System
- Problem Description:
For debug purposes, many assumptions made
by the PM developer(s) are checked at run-time. If this event log is seen, it
will either indicate that the hardware is in a unknown state that is not
handled by the PM, or that a programming bug has been found. For developer
debug purposes, the data field describes where in the code that the error was
detected. Data Bytes[0-1]: The line number within the source code file where
the error was detected. Data Bytes[2-7]: The first 6 characters of the source
code file name.
- Cause / Action:
Cause: Hardware in unknown state, or
programming bug found. Action: Upgrade PM firmware to latest revision. If
already at current revision, replace UGUY board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 729
- Severity: MAJOR
- Event Summary: An unknown error has been detected by the PDHC
firmware.
- Event Class: System
- Problem Description:
An unknown error has been detected by the
PM firmware. For developer debug purposes, the data field describes where in
the code that the error was detected. Data Bytes[0-1]: The line number within
the source code file where the error was detected. Data Bytes[2-7]: The first
6 characters of the source code file name.
- Cause / Action:
Cause: Hardware in unknown state, or
programming bug found. Action: Upgrade PM firmware to latest revision. If
already at current revision, replace UGUY board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 731
- Severity: MAJOR
- Event Summary: Testing of correctable errors injected from the CC
has failed
- Event Class: System
- Problem Description:
Failed link testing to ensure that SBE and
LPE errors are detected properly by the XBC. The XBC did not detect any
errors. Data field indicates the return status: (1 = err detected, 0 = no err
detected, -1 = XBC accesses failed)
- Cause / Action:
Cause: Either the CC failed to inject the
errors, the XBC failed to detect them, or PDC could not access the XBC CSR.
Action: Check results from other cells connected to the same XBC. Check CC,
Check XBC, Contact HP Support Personnel.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 732
- Severity: FATAL
- Event Summary: A cabinet has been configured using an invalid
cabinet number
- Event Class: System
- Problem Description:
The data field contains the cabinet number
that is invalid
- Cause / Action:
Re-configure cabinet to use a valid cabinet
number
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 733
- Severity: CRITICAL
- Event Summary: Cells trying to join a PD are at incompatible
firmware revisions
- Event Class: System
- Problem Description:
The cell indicated in the data field is at
a different firmware revision than the reporting cell. This is determined by
evaluating the checksums of the 2 ROM images.
- Cause / Action:
The reporting cell is at a different firmware
revision than the cell reported in the data field. A PD cannot be established.
Please reprogram the 2 cells to the same firmware revision.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 734
- Severity: MAJOR
- Event Summary: An attempt to write to a device on the PM's I2C bus
has failed.
- Event Class: System
- Problem Description:
An attempt to write to a device on the
PM's I2C bus has failed. The Data field contains information that can identify
the exact device that has failed. Refer to the UGUY ERS for a mapping of I2C
device addresses to devices. Data Bytes[0-1]: Reserved Data Bytes[2-3]: I2C
Device Address Data Bytes[4-5]: Starting Word Address Data Bytes[6-7]: Size of
attempted access (in bytes).
- Cause / Action:
Cause: A hardware error has occurred. Action:
Replace the UGUY board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 735
- Severity: MAJOR
- Event Summary: An attempt to read from a device on the PM's I2C bus
has failed.
- Event Class: System
- Problem Description:
An attempt to read from a device on the
PM's I2C bus has failed. The Data field contains information that can identify
the exact device that has failed. Refer to the UGUY ERS for a mapping of I2C
device addresses to devices. Data Bytes[0-1]: Reserved Data Bytes[2-3]: I2C
Device Address Data Bytes[4-5]: Starting Word Address Data Bytes[6-7]: Size of
attempted access (in bytes).
- Cause / Action:
Cause: A hardware error has occurred. Action:
Replace the UGUY board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 736
- Severity: MAJOR
- Event Summary: An error was encountered updating the cell info
structure in ICM
- Event Class: System
- Problem Description:
An error was encountered trying to obtain
the data required for the cell information structure in ICM. The data field is
an ASCII message that indicates the information that was not found.
- Cause / Action:
This should not happen. Contact engineering to
diagnose the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 737
- Severity: MAJOR
- Event Summary: An error was encountered pointing the slave cell
consoles to the diva
- Event Class: System
- Problem Description:
An error was encountered establishing the
slave cells use of the diva console.
- Cause / Action:
A CPU on the slave cell could not process an
interrupt in time or establish the diva console.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 738
- Severity: CRITICAL
- Event Summary: An error was encountered trying to relocate a slave
cells registry
- Event Class: System
- Problem Description:
An error was encountered trying to
relocate the registry on a slave cell to point to the core cells main memory
structures.
- Cause / Action:
There could be a PD rendezvous error or a
processor on the slave cell failed to respond to an interrupt in time.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 742
- Severity: MAJOR
- Event Summary: Unexpected fabric firmware error
- Event Class: System
- Problem Description:
An unexpected error occurred while
initializing the fabric. The firmware is not able to analyze this error. Clues
to the cause of this error may be found in the IPMI forward progress log (FPL)
either shortly before or after this log entry occurred. The FPL is available
from the management processor using the "sl" command.
- Cause / Action:
An unanticipated error occurred. Contact HP
Support personnel to analyze the IPMI FPL log.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 743
- Severity: FATAL
- Event Summary: Internal firmware programming error in the PMI
handler.
- Event Class: System
- Problem Description:
An internal firmware error was
encountered. This is usually caused by a bad parameter passed to a function,
corrupt memory, corrupt malloc tables or something similar. The data field
contains the IP address of the function that encountered the error.
- Cause / Action:
Report the IP to the firmware team. Reset the
system. This cannot be worked around in the field.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 744
- Severity: CRITICAL
- Event Summary: During a Cell On Line Add inconsistent number of
cells discovered
- Event Class: System
- Problem Description:
During the on line addition of a cell the
partition adding the cell has determined inconsistent data as to which cell is
being added. The cell addition will be aborted and the partition will resume
execution without the new cell.
- Cause / Action:
This can be caused by inconsistent profile
information. This can also occur when an expected cell did not make the
original boot of the partition. Update the complex profile to all the cells
with a correct view of the system and try to add the cell again.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 745
- Severity: MAJOR
- Event Summary: Error reading source cell port on XBC during data
traversability test
- Event Class: System
- Problem Description:
An error occurred while reading the
routing from the source cell's port on the source XBC. Data Field: (source
cell << 56 | source XBC << 32)
- Cause / Action:
A read error most likely occurred. Look for
preceding chassis codes to determine exact cause.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 753
- Severity: MAJOR
- Event Summary: CPUs of different maximum core frequencies are
installed
- Event Class: System
- Problem Description:
CPU's of mixed maximum core frequencies
are installed
- Cause / Action:
Cause: CPU's of mixed maximum core frequencies
are installed. Action: If operating at the slowest of the maximum core
frequency of installed CPU's is acceptable, no action is necessary. If not,
replace the slower core frequency CPU's to match the faster CPU's. This will
enable all CPU's to work at their maximum frequency.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 754
- Severity: FATAL
- Event Summary: The RVL CC-Togo link initialization workaround
(PS221) failed
- Event Class: System
- Problem Description:
The Concorde-Togo link initialization is
having an intermittent failure. The data field contains the number of
initialization sequences that failed before being successful.
- Cause / Action:
Cause: The link initialization failed at least
once and then subsequently was successful.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 756
- Severity: CRITICAL
- Event Summary: Fabric Discovery could not initialize the local
cell's XBC link
- Event Class: System
- Problem Description:
Fabric Discovery's final attempt to
initialize the local cell's CC to Crossbar Chip (XBC) link has failed. This
cell cannot talk to the fabric. Data: link init state bit read from the CC
Link State register
- Cause / Action:
Cause: CC to XBC link init failure. Action:
check CC, XBC, reset cell, reset backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 760
- Severity: FATAL
- Event Summary: Internal firmware programming error
- Event Class: System
- Problem Description:
An internal firmware error was
encountered. This is usually caused by a bad parameter passed to a function,
corrupt memory, corrupt malloc tables or something similar. The data field
contains the physical address that failed mapping to a virtual address
- Cause / Action:
Report the IP to the firmware team. Reset the
system. This cannot be worked around in the field.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 771
- Severity: CRITICAL
- Event Summary: Error writing the XIN init disable register.
- Event Class: System
- Problem Description:
Failure while writing the XBC CSR
containing the link status
- Cause / Action:
Check XBC, CC, backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 772
- Severity: CRITICAL
- Event Summary: Error reading the XIN init state register.
- Event Class: System
- Problem Description:
Failure while reading the XBC CSR
containing the link status
- Cause / Action:
Check XBC, CC, backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 773
- Severity: CRITICAL
- Event Summary: intermittent failure while retrying the CC to XBC
link init
- Event Class: System
- Problem Description:
Fabric Discovery's attempt to initialize
the local cell's CC to XBC link has failed. The link initialization sequence
has an intermittent problem.
- Cause / Action:
contact your HP service representative
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 774
- Severity: MAJOR
- Event Summary: Initialization of a PCI node in the firmware device
tree failed
- Event Class: System
- Problem Description:
error code
- Cause / Action:
Cause: A firmware error setting up data storage
to allow PCI bus bridge processing to occur. Action: Correct any previous
errors reset the system clear NVM and reset the system Update to the latest
recipe Replace the cell board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 775
- Severity: CRITICAL
- Event Summary: An error was encountered while scanning the PCI bus.
- Event Class: System
- Problem Description:
error code
- Cause / Action:
Cause: A firmware error setting up data storage
to allow PCI bus scanning to occur. Action: Correct any previous errors reset
the system clear NVM and reset the system Update to the latest recipe Replace
the cell board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 776
- Severity: MAJOR
- Event Summary: An error was encountered initializing the PCI bridge
- Event Class: System
- Problem Description:
error code
- Cause / Action:
Cause: A firmware error setting up data storage
to allow PCI bus bridge processing to occur. Action: Correct any previous
errors reset the system clear NVM and reset the system Update to the latest
recipe Replace the cell board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 777
- Severity: MAJOR
- Event Summary: An error was encountered initializing the PCI IO
map.
- Event Class: System
- Problem Description:
pfa
- Cause / Action:
Cause: PCI requested I/O port size larger than
system can handle Action: Correct any previous errors Remove cards that are
requesting too much memory space or move a card to a dual rope slot (PCI slots
1-7).
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 778
- Severity: MAJOR
- Event Summary: An error was encountered creating the PCI MMIO map
- Event Class: System
- Problem Description:
pfa
- Cause / Action:
Cause: PCI requested memory map size larger
than system can handle Action: Correct any previous errors Remove cards that
are requesting too much memory space or move a card to a dual rope slot (PCI
slots 1-7).
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 779
- Severity: CRITICAL
- Event Summary: There was an error initializing the SBA node
- Event Class: System
- Problem Description:
error code
- Cause / Action:
Cause: An error was while initializing the SBA
firmware structures Action: Correct any previous errors Invalidate NVM and
reset replace the cell board
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 780
- Severity: CRITICAL
- Event Summary: There was an error discovering the SBA
- Event Class: System
- Problem Description:
error code
- Cause / Action:
Cause: An error was discovered with the SBA
during discovery Action: Correct any previous errors Replace the I/O
backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 781
- Severity: CRITICAL
- Event Summary: An error was encountered while resetting the SBA
- Event Class: System
- Problem Description:
error code
- Cause / Action:
Cause: An error was detected while resetting
the ropes Action: replace the I/O backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 782
- Severity: MAJOR
- Event Summary: There was an error initializing the IO link
- Event Class: System
- Problem Description:
An error was detected in the link between
the CC and the I/O controller.
- Cause / Action:
Cause: Unable to establish the link between the
CC and IOC. Action: Validate power to the I/O chassis Reset the system A/C
power cycle Replace the I/O backplane, cell, and system backplane to resolve
the issue.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 783
- Severity: MAJOR
- Event Summary: There is a problem initializing the REO cable
- Event Class: System
- Problem Description:
cable status
- Cause / Action:
Check the REO cable connection
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 784
- Severity: CRITICAL
- Event Summary: The IO chassis discovered was powered off
- Event Class: System
- Problem Description:
Identified the cell number that is
connected to the chassis.
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 785
- Severity: MAJOR
- Event Summary: There was an error initializing the LBA
- Event Class: System
- Problem Description:
error code
- Cause / Action:
Cause: Error initializing the LBA node and
services Action: Validate that there is not another error causing this error
invalidate NVM and reset or replace the cell board
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 786
- Severity: CRITICAL
- Event Summary: There was an error querying the LBA width
- Event Class: System
- Problem Description:
error code
- Cause / Action:
Cause: Error while writing the LBA phase data
Action: Replace the I/O backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 787
- Severity: MAJOR
- Event Summary: There was an error with the LBA phase
- Event Class: System
- Problem Description:
error code
- Cause / Action:
Cause: Error while writing the LBA phase data
Action: Replace the I/O backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 788
- Severity: MAJOR
- Event Summary: There was an error clearing the LBA
- Event Class: System
- Problem Description:
error code
- Cause / Action:
Cause: Unable to clear an error in the LBA
Action: Check other events for the error being generated replace either the
PCI card or the I/O backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 789
- Severity: CRITICAL
- Event Summary: There was an error with the LBA log
- Event Class: System
- Problem Description:
error code
- Cause / Action:
Cause: Error log is corrupt Action: Clear
errors and continue
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 790
- Severity: CRITICAL
- Event Summary: There was an error discovering the LBA
- Event Class: System
- Problem Description:
error code
- Cause / Action:
Cause: The wrong backplane type was detected
Action: replace I/O backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 791
- Severity: MAJOR
- Event Summary: There was an error configuring the LBA
- Event Class: System
- Problem Description:
error code
- Cause / Action:
Cause: Unable to configure the LBA Action:
replace I/O backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 792
- Severity: CRITICAL
- Event Summary: There was an error scanning the PCI bus
- Event Class: System
- Problem Description:
An error was encountered while attempting
to scan the PCI bus
- Cause / Action:
Cause: ld not scan the card in a populated
slot. Typically caused by an improperly installed or faulty PCI
card.
Action: Reseat or replace the faulty card.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 793
- Severity: CRITICAL
- Event Summary: There was an error configuring PCI space through the
LBA
- Event Class: System
- Problem Description:
error code
- Cause / Action:
Cause: Unable to obtain semaphore Action: reset
Update to latest recipe
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 808
- Severity: CRITICAL
- Event Summary: The Options service received an NVRAM allocation
error.
- Event Class: System
- Problem Description:
The Options service received an error when
attempting to allocate an NVRAM storage block. Either an error was returned
from the call, or the call returned successfully yet an invalid address was
returned.
- Cause / Action:
Invalidate NVRAM and reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 810
- Severity: MAJOR
- Event Summary: SAL errlog access timeout
- Event Class: System
- Problem Description:
Access to SAL error log procedure timed
out because the log facility was busy processing a request from another CPU.
Data field indicates the SAL procedure ID.
- Cause / Action:
Firmware is taking too long to process
requests.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 816
- Severity: MAJOR
- Event Summary: The echelon given in the data field is not fully
populated.
- Event Class: System
- Problem Description:
One or more DIMMs are missing from the
echelon given in the data field. The DIMMs may not be installed or firmware
was not able to detect the DIMMs.
- Cause / Action:
cause - the specified echelon is not fully
populated and is not usable action - add or replace DIMMs in the specified
echelon
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 817
- Severity: MAJOR
- Event Summary: Attempted to read the port state from an illegal
port number
- Event Class: System
- Problem Description:
The code that reads the port state
(landmine vs. healthy) expects a XBC internal port number, it received bogus
data. The port state cannot be read. Data Field: (port << 44) | (xbc num
<< 32)
- Cause / Action:
An invalid port number has been provided. The
port number will be converted to an internal port and processing should
continue.
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 818
- Severity: MAJOR
- Event Summary: Attempted to write the port state for an illegal
port
- Event Class: System
- Problem Description:
The code that writes the port state
(landmine vs. healthy) expects a XBC internal port number, it received bogus
data. The port state cannot be read. Data Field: (port << 44) | (xbc num
<< 32)
- Cause / Action:
An invalid port number has been provided. The
port number will be converted to an internal port and processing should
continue.
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 822
- Severity: CRITICAL
- Event Summary: System firmware was unable to default the complex
profile
- Event Class: System
- Problem Description:
System firmware was unable to default the
complex profile
- Cause / Action:
Needed information could not be obtained. Reset
the MP.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 824
- Severity: MAJOR
- Event Summary: Means that the error log space in the NVRAM has not
been allocated.
- Event Class: System
- Problem Description:
This chassis code shows that the error log
space in the NVRAM has not been allocated for the current error event. This
will be emitted out whenever a error section is attempted to be logged
without allocation of log space in NVRAM
- Cause / Action:
This happens because of the NVRAM is full with
unconsumed error logs. Clear the error logs.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 825
- Severity: MAJOR
- Event Summary: This indicates the maximum number of logs for the
event.
- Event Class: System
- Problem Description:
This indicates that the error logs for a
particular event type have reached the maximum allowed to be stored in the
NVRAM. The event type is indicated in the data field.
- Cause / Action:
This shouldn't be occur. But in case it does
than clear the error logs of this event type from the NVRAM.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 826
- Severity: MAJOR
- Event Summary: On Line Delete operation was begun but firmware
could not find a cell that can be deleted.
- Event Class: System
- Problem Description:
System firmware has been invoked to
perform a cell delete operation but no cell in the system appears to be ready
for deletion.
- Cause / Action:
This can occur if the OS has not returned all
the CPUs to firmware or if a cell is not marked correctly in the complex
profile to allow its deletion.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 827
- Severity: FATAL
- Event Summary: The bulk power system is above its current capacity.
- Event Class: System
- Problem Description:
The bulk power supply is over current
- Cause / Action:
N/A
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 828
- Severity: MAJOR
- Event Summary: The bulk specified is warning of a potential thermal
problem.
- Event Class: System
- Problem Description:
Data: Bulk location.
- Cause / Action:
The bulk power supply is warning of an over
temperature condition
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 829
- Severity: CRITICAL
- Event Summary: Malloc failed while trying to process and ERM
- Event Class: System
- Problem Description:
Error Response Mode code attempted a
malloc of heap space that failed.
- Cause / Action:
Heap space is completely used or corrupt.
Contact Product Engineering.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 830
- Severity: MAJOR
- Event Summary: DIMM at physical location in data field is not
supported on this platform.
- Event Class: System
- Problem Description:
The DIMM in the physical location given by
the data field is not supported on this platform. The DIMM may not be
supported by the hardware, or the DIMM may not have been properly qualified
for this platform.
- Cause / Action:
Cause: Unsupported DIMM in specified slot
Action: Replace DIMM with supported DIMM.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 831
- Severity: CRITICAL
- Event Summary: The OPTIONS component received a memory allocation
error.
- Event Class: System
- Problem Description:
The OPTIONS component was unable to
allocate NVRAM memory in order to store a non-volatile variable. The storage
area for NVRAM options may be full, or there may be undetected corruption.
- Cause / Action:
Invalidate NVRAM and reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 832
- Severity: MAJOR
- Event Summary: A DIMM or CPU has is deconfigured or failed testing
- Event Class: System
- Problem Description:
A DIMM or CPU has failed and is not
operational for the system. This event is emitted prior to determining if the
cell should be integrated into the Partition.
- Cause / Action:
A deconfigured DIMM or cpu has been detected.
Examine earlier events to isolate the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 833
- Severity: CRITICAL
- Event Summary: The cell will not join the PD
- Event Class: System
- Problem Description:
A cpu or DIMM error has been detected, and
the Complex Profile, Cell Integration Table, Cell integration policy says to
not integrate the cell into the PD.
- Cause / Action:
Broken hardware was detected and the cell
integration policy combined to cause the cell to not join the PD. Fix the
broken hardware or change the policy using parmgr.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 834
- Severity: MAJOR
- Event Summary: The error context in NVM was corrupt
- Event Class: System
- Problem Description:
The IO error context is corrupt. This will
impair IO error reporting.
- Cause / Action:
NVM is corrupted.
Check for other errors in
the system first. Invalidate NVM and retry boot. Get the latest firmware
release.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 837
- Severity: CRITICAL
- Event Summary: Firmware encountered a problem trying to initialize
- Event Class: System
- Problem Description:
System firmware encountered an error while
trying to perform an operation during system initialization. This event ID
will always be emitted before an event ID that describes the status of the
operation that failed.
- Cause / Action:
Examine the related event that failed and
correct that problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 838
- Severity: MAJOR
- Event Summary: This means that all the CPUs in the cell did not
show up.
- Event Class: System
- Problem Description:
This means that all the CPUs in the cell
did not show up.
- Cause / Action:
This will result in the cell stepping
independently to collect its logs and resetting itself.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 839
- Severity: MAJOR
- Event Summary: This means that all the cells did not rendezvous
during the PD rendezvous.
- Event Class: System
- Problem Description:
This means that all the cells did not
rendezvous during the PD rendezvous. The data part will contain the Expected
data and the actual mask of the cells that rendezvoused.
- Cause / Action:
The cells will reset themselves.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 840
- Severity: MAJOR
- Event Summary: The FW tree sanity check failed during the MCA error
processing.
- Event Class: System
- Problem Description:
The FW tree sanity check failed during the
MCA error processing.
- Cause / Action:
The cells will independently log errors and
reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 841
- Severity: MAJOR
- Event Summary: This means that the registry sanity check failed
during MCA error handling.
- Event Class: System
- Problem Description:
This means that the registry sanity check
failed during MCA error handling.
- Cause / Action:
The cells will independently log errors and
reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 842
- Severity: MAJOR
- Event Summary: This means that MCA occurred while OS_MCA was
performing error recovery.
- Event Class: System
- Problem Description:
This means that MCA occurred while OS_MCA
was performing error recovery.
- Cause / Action:
The cells will log information and reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 843
- Severity: MAJOR
- Event Summary: One of the BT errors occurred that results in
abandoning memory dump.
- Event Class: System
- Problem Description:
This means that memory dump will be
abandoned due to work-around for CN2272. This happens when one of the Blocking
timeout in the Processor input block of the concorde occurs.
- Cause / Action:
Cause: A machine check has occurred and cells
have not rendezvoused. Action: Cells will reset themselves.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 844
- Severity: MAJOR
- Event Summary: The firmware tree is not complete and hence there
will be no PD rendezvous.
- Event Class: System
- Problem Description:
The firmware tree is not complete and
hence there will be no PD rendezvous.
- Cause / Action:
The cell will log errors and reset
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 845
- Severity: CRITICAL
- Event Summary: ACPI configuration mismatch across cells in the
partition
- Event Class: System
- Problem Description:
The firmware parameter that dEFInes the
ACPI configuration is inconsistent in at least one of the cells in the
partition.
- Cause / Action:
Set the ACPI configuration parameter again to
ensure that all cells have a consistent value.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 846
- Severity: CRITICAL
- Event Summary: Failed clearing of the XIN_ERR_ORDER_STATUS CSR
- Event Class: System
- Problem Description:
Writing the XIN_ERR_ORDER_STATUS register
of the CC failed. This is some sort of a hardware failure. Data Field: return
status
- Cause / Action:
Failure to access the register or the write did
not work.
Contact HP Support personnel to check the CC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 848
- Severity: MAJOR
- Event Summary: Unexpected fabric firmware error
- Event Class: System
- Problem Description:
An unexpected error occurred while
initializing the fabric. The firmware is not able to analyze this error. Clues
to the cause of this error may be found in the IPMI forward progress log (FPL)
either shortly before or after this log entry occurred. The FPL is available
from the management processor using the "sl" command.
- Cause / Action:
An unanticipated error occurred. Contact HP
Support personnel to analyze the IPMI FPL log.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 849
- Severity: MAJOR
- Event Summary: Invalid data read from a CPU module's Processor
Information ROM.
- Event Class: System
- Problem Description:
A value read by the PDHC from a CPU
module's Processor Information ROM was not within acceptable limits.
- Cause / Action:
Cause (probable): The CPU module's Processor
Information ROM is unprogrammed. Action: Contact HP support personnel to
troubleshoot the CPU module pointed to by the physical location portion of
this event. Cause: The CPU module's Processor Information ROM contains invalid
data. Action: Contact HP support personnel to troubleshoot the CPU module
pointed to by the physical location portion of this event.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 851
- Severity: MAJOR
- Event Summary: Option block in NVRAM has a checksum error
- Event Class: System
- Problem Description:
The overhead structure of the OPTIONS
block in NVRAM has a checksum error.
- Cause / Action:
Clear NVRAM.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 852
- Severity: MAJOR
- Event Summary: CC to CC link did not initialize on the local cell
- Event Class: System
- Problem Description:
During a cell OLA, the link on the local
cell failed to initialize. Data Field: (my cell << 32) | XIN Link State
- Cause / Action:
link failure between the XBC and the
CC
Check CC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 853
- Severity: MAJOR
- Event Summary: Failed to write the CC link disable register
- Event Class: System
- Problem Description:
An attempt to disable the fabric link
failed because writing the CC CSR failed. Data Field: (cell << 56) |
return status
- Cause / Action:
Fabric Access Failure.
Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 854
- Severity: MAJOR
- Event Summary: An unknown backplane type was found
- Event Class: System
- Problem Description:
Could not determine the system type in
order to write the appropriate error mask for the fabric link. Data Field:
system type
- Cause / Action:
CSR Read/Write error
Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 855
- Severity: MAJOR
- Event Summary: Error writing the CC link error mask
- Event Class: System
- Problem Description:
Failed writing the XIN error mask for CC's
fabric link. Data Field: (cell << 56) | return status
- Cause / Action:
Fabric Access Error.
Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 856
- Severity: MAJOR
- Event Summary: Failed to read the CC's fabric link error mask
- Event Class: System
- Problem Description:
Could not read the XIN Link error mask
register. Data Field: (cell << 56) | return status
- Cause / Action:
CC CSR access failure.
Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 857
- Severity: CRITICAL
- Event Summary: Could not initialize the CC to CC link upon boot.
- Event Class: System
- Problem Description:
The CC to CC link initialization sequence
has failed. Data Field: link init status
- Cause / Action:
CC CSR Access Failure.
Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 858
- Severity: MAJOR
- Event Summary: An Error occurred trying to notify the MP of the
attempted reset.
- Event Class: System
- Problem Description:
An error occurred while trying to notify
the MP that a reset is about to occur (QPartitionReleaseBIB command). The
status is in the data field.
- Cause / Action:
The MP is not functioning or the PDHC cannot
communicate with it. Reset the MP.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 860
- Severity: MAJOR
- Event Summary: Failed disabling the XIN link for a single cell
medel
- Event Class: System
- Problem Description:
A fabric access error occurred while
trying to disable the CC to CC link on a single cell Medel system. This cell
will halt. Data field: error status
- Cause / Action:
Fabric Access Error.
Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 861
- Severity: CRITICAL
- Event Summary: Error while getting the XBC Semaphore
- Event Class: System
- Problem Description:
While updating the Port State register,
the cell could not get the XBC semaphore. Data field is: (Port Num << 44
| XBC num << 32 | return status). Where return status is: (0 Success; -1
Access Failure; -2 Semaphore Owned By Another, -3 Semaphore Already Owned; -4
XBC Key Contention)
- Cause / Action:
Most likely a hardware problem, but confirm the
cause by looking at the return status. Action: Check XBC, Backplane, Flex
Cables, Contact HP Support Personnel for further troubleshooting.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 862
- Severity: MAJOR
- Event Summary: Error releasing the XBC Semaphore
- Event Class: System
- Problem Description:
While updating the Port State register,
the cell could not get the XBC semaphore. Data field is: (Port Num << 44
| XBC num << 32 | return status). Where return status is: (0 Success; -1
Generic Failure)
- Cause / Action:
Cause: Fabric Access problem. Either an error
reading the hardware or XBC Key contention. Action: Look for additional
chassis codes to provide detail. Check XBC, Backplane, Flex Cables, Contact HP
Support Personnel.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 864
- Severity: MAJOR
- Event Summary: Unexpected fabric firmware error
- Event Class: System
- Problem Description:
An unexpected error occurred while
initializing the fabric. The firmware is not able to analyze this error. Clues
to the cause of this error may be found in the IPMI forward progress log (FPL)
either shortly before or after this log entry occurred. The FPL is available
from the management processor using the "sl" command.
- Cause / Action:
An unanticipated error occurred. Contact HP
Support personnel to analyze the IPMI FPL log.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 865
- Severity: MAJOR
- Event Summary: The CC's XIN link was found to be already
initialized
- Event Class: System
- Problem Description:
While attempting to initialize the XIN
link, it was found to already be initialized. A firmware assertion has failed.
The link will not be re-initialized and processing should continue as normal.
However, the system could be confused at this point.
- Cause / Action:
Firmware problem. Contact HP Support
Personnel.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 866
- Severity: CRITICAL
- Event Summary: Cell has been disabled by the PDHC because no CPU
modules were found.
- Event Class: System
- Problem Description:
The PDHC FW could not detect any CPU
modules on its Cell board, so it is holding the Cell in reset.
- Cause / Action:
Cause(1, probable): No CPU modules are
installed. Action(1): Install CPU modules on the Cell. Cause(2): A Cell or PDH
Daughtercard error is causing the presence of CPU modules to be reported
incorrectly to the PDHC. Action(2): Contact HP support personnel to
troubleshoot the PDH Daughtercard and/or Cell board. Cause(3): The CPU
module(s) that are installed have invalid data stored in the partition
specific field of the FRU EEPROM. Action(3): If in manufacturing, reprogram
the partition specific field of the CPU module(s) FRU EEPROM. Otherwise,
contact HP support personnel to troubleshoot the unreported CPU module.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 867
- Severity: CRITICAL
- Event Summary: Cell has been disabled by PDHC FW because the CPU
modules are not compatible.
- Event Class: System
- Problem Description:
The Cell has been disabled by PDHC FW
because the CPU modules are not compatible. Compatibility is determined based
on data stored in the Scratch/FRUID EEPROM on each CPU module. The CPU module
partition compatibility byte for each CPU module must be identical.
- Cause / Action:
Cause(1): At least one of the installed CPU
modules are incompatible with at least one other CPU module. Action(1):
Contact HP support personnel to troubleshoot the CPU modules on the Cell.
Cause(2): The FRUID data stored in a CPU Module's Scratch/FRUID EEPROM is
incorrectly programmed. Action(1): Reprogram the FRUID data (manufacturing
only) or contact HP support personnel to troubleshoot the CPU module on the
Cell.
Cause(1): At least one of the installed CPU modules are incompatible
with at least one other CPU module. Action(1): Contact HP support personnel to
troubleshoot one or more CPU modules on the Cell. Cause(2): The FRUID data
stored in a CPU Module's Scratch/FRUID EEPROM is incorrectly programmed.
Action(1): Reprogram the FRUID data (manufacturing only) or contact HP support
personnel to troubleshoot the CPU module on the Cell.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 868
- Severity: CRITICAL
- Event Summary: Cell has been disabled because of invalid data in a
CPU module Scratch EEPROM.
- Event Class: System
- Problem Description:
The Cell has been disabled because of
invalid data in a CPU module Scratch EEPROM. PDHC FW checksums the FRUID data
stored in each CPU module's Scratch EEPROM. If a checksum fails, the Cell is
held in reset and will not boot. The data field identifies the CPU module that
failed.
- Cause / Action:
Cause: The CPU module is not an HP CPU module,
or the FRUID data for this CPU module has not been programmed.
Action:
Contact HP support personnel to troubleshoot the CPU module.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 869
- Severity: MAJOR
- Event Summary: The Cell Battery voltage level low warning
- Event Class: System
- Problem Description:
The battery voltage level is low for the
cell. This indicates that the NVRAM will not be saved if the power is removed.
- Cause / Action:
Cause1: The Cell Battery is low. Action1: It
needed to be replaced.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 870
- Severity: CRITICAL
- Event Summary: Error while copying the XBC routing to the local
port
- Event Class: System
- Problem Description:
There was an error while copying the
routing for the XBC to the local XBC port. The cell will reset. Data: (XBC
port << 44) | (XBC num << 32) | return status
- Cause / Action:
Error accessing XBC CSRs.
Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 871
- Severity: CRITICAL
- Event Summary: A read after write of a XBC CSR failed
- Event Class: System
- Problem Description:
The read immediately after a write while
copying routing registers failed. Data: whether or not the XBC Key was enabled
- Cause / Action:
Fabric Access Error, XBC Key Disabled. Check
XBC, links, backplane, Contact HP Support Personnel for
further troubleshooting.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 872
- Severity: MAJOR
- Event Summary: Couldn't release the Semaphore while writing routing
states.
- Event Class: System
- Problem Description:
Failed to release a XBC Semaphore while
marking each XBC in the complex to indicate that routing has completed. Data:
(XBC num << 32) | return value
- Cause / Action:
Fabric Access Error. Check XBC, Check
links.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 873
- Severity: CRITICAL
- Event Summary: Couldn't write the XBC's forward progress register
- Event Class: System
- Problem Description:
Writing this XBC's forward progress
register failed. Data: (XBC num << 32) | return value
- Cause / Action:
Fabric Access Error. Couldn't write this
XBC.
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 874
- Severity: CRITICAL
- Event Summary: Couldn't access the XBC semaphore registers.
- Event Class: System
- Problem Description:
Failed to get a XBC Semaphore while
marking each XBC in the complex to indicate that routing has completed.
Skipping this XBC. Data: (XBC num << 32) | return value
- Cause / Action:
Fabric Access Error. Couldn't read or write
this XBC.
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 875
- Severity: CRITICAL
- Event Summary: Couldn't determine the complex fabric topology
- Event Class: System
- Problem Description:
Reading this XBC's topology register
failed. Data Field: (xbc num << 32) | return status
- Cause / Action:
Fabric Access Error. Couldn't write this
XBC.
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 876
- Severity: MAJOR
- Event Summary: Error checking a cell to cell link during
traversability tests
- Event Class: System
- Problem Description:
Could not check the traversability between
two cells on an XBCless platform. Data field: return status (1 = SUCCESS, 0 =
FALSE, -1 = FAILURE)
- Cause / Action:
Probably an error reading the XIN. Look for
additional descriptive chassis codes.
Contact HP Support personnel to check
the CC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 877
- Severity: MAJOR
- Event Summary: An error occurred while traversing the cell to cell
link.
- Event Class: System
- Problem Description:
Could not check the traversability between
two cells on an XBCless platform. Data field: return status (1 = SUCCESS, 0 =
FALSE, -1 = FAILURE)
- Cause / Action:
Probably an error reading the XIN. Look for
additional descriptive chassis codes.
Contact HP Support personnel to check
the CC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 878
- Severity: MAJOR
- Event Summary: Error reading the local cell's XIN link state
- Event Class: System
- Problem Description:
While checking traversability of a 2 cell
back to back system, there was an error reading the local cell's XIN block.
Data Field: return status (1 or -1)
- Cause / Action:
Hardware Access Error. Have your HP support
representative check the Coherency Controller (CC).
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 879
- Severity: MAJOR
- Event Summary: Error reading the remote cell's XIN link state
register
- Event Class: System
- Problem Description:
While checking traversability of a 2 cell
back to back system, there was an error reading the local cell's XIN block.
Data Field: return status (1 or -1)
- Cause / Action:
Hardware Access Error. Have your HP support
representative check the backplane and Coherency Controller (CC).
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 880
- Severity: MAJOR
- Event Summary: The XIN link is not connected to the target cell.
- Event Class: System
- Problem Description:
Could not traverse to the target cell. The
XIN link is either not initialized, or is not connected to the target cell.
However, the target cell is designated to be within the partition. Data Field:
target cell << 56 | XIN link state register
- Cause / Action:
Ensure the cells are connected. Check
historical chassis codes from most recent boot to see if the link had ever
initialized. Have your HP support representative check the backplane and
Coherency Controller (CC).
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 881
- Severity: MAJOR
- Event Summary: The XIN link is not connected to the target cell.
- Event Class: System
- Problem Description:
Could not traverse to the target cell. The
XIN link is either not initialized, or is not connected to the target cell.
However, the target cell is designated to be within the partition. Data Field:
target cell << 56 | XIN link state register
- Cause / Action:
Ensure the cells are connected. Check
historical chassis codes from most recent boot to see if the link had ever
initialized. Have your HP support representative check the backplane and
Coherency Controller (CC).
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 882
- Severity: MAJOR
- Event Summary: Error reading the XIN_LINK_STATE register while
disabling the link
- Event Class: System
- Problem Description:
Error reading the XIN_LINK_STATE register
of the CC. This occurred while verifying that the link had been disabled. Data
Field: cell being read << 56 | return status from the CSR read.
- Cause / Action:
Hardware Access Error.
Contact HP Support
personnel to analyze the fabric, CC, Backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 883
- Severity: CRITICAL
- Event Summary: Error reading the XIN_LINK_STATE register
- Event Class: System
- Problem Description:
Failure while reading the XBC CSR
containing the link status. This occurred while attempting the retry process
to get XBC to CC link initialized. Data Field: link init status
- Cause / Action:
link init problem
Contact HP Support
personnel to check the XBC, CC, backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 884
- Severity: MAJOR
- Event Summary: Unexpected fabric firmware error
- Event Class: System
- Problem Description:
An unexpected error occurred while
initializing the fabric. The firmware is not able to analyze this error. Clues
to the cause of this error may be found in the IPMI forward progress log (FPL)
either shortly before or after this log entry occurred. The FPL is available
from the management processor using the "sl" command.
- Cause / Action:
An unanticipated error occurred. Contact HP
Support personnel to analyze the IPMI FPL log.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 885
- Severity: MAJOR
- Event Summary: The CPU is performance or functionally restricted
- Event Class: System
- Problem Description:
The CPU that just completed self tests is
functionally or performance restricted. The data field contains the self-test
state word.
- Cause / Action:
A CPU is broken. Replace it.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 886
- Severity: MAJOR
- Event Summary: The RTC was found to be invalid and has been cleared
- Event Class: System
- Problem Description:
The RTC was found to be invalid and has
been cleared
- Cause / Action:
Cause: The RTC was invalid Action: None, the
problem has been corrected by SFW.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 887
- Severity: MAJOR
- Event Summary: Status indicates that the Late Self Tests did not
actually run
- Event Class: System
- Problem Description:
System firmware requested that Late Self
Tests be run by PAL, but PAL returned that the tests did not actually run on
the processor. The data field indicates the status word returned by PAL.
- Cause / Action:
This could be caused by an incompatibility
problem between PAL and the CPUs. Check that PAL supports all the CPUs
installed on the system.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 888
- Severity: CRITICAL
- Event Summary: A fabric walk failed while updating the cell state
- Event Class: System
- Problem Description:
An attempt to update the cell state has
failed due to a fabric crossbar failure. The cell number being updated in in
bits 63:56, while the traversable cell set (those cells connected to the
fabric) is returned in bits 31:0
- Cause / Action:
Look for adjacent chassis codes to determine
the cause of FabricWalk failure. Check the backplane and fabric connectivity.
Contact the HP Support Personnel for further troubleshooting.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 889
- Severity: CRITICAL
- Event Summary: Could not reset the cell due to failure updating
cell state
- Event Class: System
- Problem Description:
Failed to reset a cell due to an error
setting the cell's state. The cell will not be reset with the other cells in
the PD. The cell number is reported in the data field.
- Cause / Action:
Most likely a failure on the fabric or on the
CC. Fabric failures should produce additional chassis codes. If no additional
chassis codes indicate the cause of the failure, then contact the HP Support
Personnel for further troubleshooting.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 890
- Severity: MAJOR
- Event Summary: DRAM failure on DIMM XX, deallocate rank
- Event Class: System
- Problem Description:
SFW has detected that a DRAM is failing on
the DIMM specified by the physical location. The rank the failing DIMM is part
of will be deallocated.
- Cause / Action:
Cause: SFW detected a failing DIMM Action:
Replace the DIMM flagged by SFW
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 891
- Severity: CRITICAL
- Event Summary: System Clocks are not valid
- Event Class: System
- Problem Description:
Internal CPU clocks are not valid when
compared with the real time clock. The data field contains the hex value of
the elapsed time. If this value is off a small percentage from the expected
value (which is given in the next chassis code), the event is emitted.
- Cause / Action:
The Cell board has a problem. Either the Real
Time Clock is not working properly or the system is not being clocked at the
value it thinks it is.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 892
- Severity: CRITICAL
- Event Summary: Cell Online Addition failed due to fabric access
error
- Event Class: System
- Problem Description:
Could not traverse the fabric to the cell
being added. Data field: (chosen cell << 56) | return status, where -1 =
failure
- Cause / Action:
Cause: Fabric Access Failure, Action: Check CC
to CC link. Look for additional failure chassis codes to provide more
detail.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 893
- Severity: CRITICAL
- Event Summary: Fabric found a bad XBC port on a reboot. Attempting
to route around it.
- Event Class: System
- Problem Description:
A XBC port was found to be unhealthy on
this reboot. This cell will attempt to route around it. Data field: (local
Cell << 56) | (local internal Port << 44) | (local XBC <<
32) | XBC internal port number being routed around.
- Cause / Action:
Cause: link errors. Action: Run DC Connectivity
test. Check flex cables, XBCs, and CCs.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 894
- Severity: MAJOR
- Event Summary: Could not access an internal firmware table while
rerouting XBC port
- Event Class: System
- Problem Description:
Error getting the XBC port's expected
neighbor from a firmware table. Data field: 0 (SUCCESS) or -1 (FAILURE)
- Cause / Action:
Cause: Firmware Error. Action: Capture chassis
codes and contact HP Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 895
- Severity: MAJOR
- Event Summary: Cell/Partition to be reset because PDC couldn't read
PDH memory
- Event Class: System
- Problem Description:
Either PDC is going to halt the cell or
reset the partition because PDC was unable to access the PDH memory of either
its local cell or another cell in the partition. The data field contains the
error return value from PDC function IsHCellCpuDeconfig().
- Cause / Action:
Cause1: Cell hardware problem like PDH memory
itself, the coherency controller, the executing CPU or interaction between any
of these cell components. Action1: Contact HP Support to troubleshoot the cell
and either fix it or replace it. Cause2: PDC bug in which PDC thinks it was
unable to safely access PDH memory when maybe it really could have. Action2:
Contact HP Support to see if a new PDC image is available.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 896
- Severity: MAJOR
- Event Summary: Cell/Partition to be reset because PDC couldn't read
PDH memory
- Event Class: System
- Problem Description:
Either PDC is going to halt the cell or
reset the partition because PDC was unable to access the PDH memory of either
its local cell or another cell in the partition. The data field contains the
error return value from PDC function SleepAndWakeupCountersGet().
- Cause / Action:
Cause1: Cell hardware problem like PDH memory
itself, the Concorde chip, the executing Mako or interaction between any of
these cell components. Action1: Troubleshoot the cell and either fix it or
replace it. Cause2: PDC bug in which PDC has passed an invalid argument from
one PDC function to another. Action2: Upgrade PDC if this is found to be the
problem and a new PDC image is available.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 897
- Severity: MAJOR
- Event Summary: Cell/Partition to be reset because PDC couldn't read
PDH memory
- Event Class: System
- Problem Description:
Either PDC is going to halt the cell or
reset the partition because PDC was unable to access the PDH memory of either
its local cell or another cell in the partition. The data field contains the
error return value from PDC function PdhGetHCellStructAddr().
- Cause / Action:
Cause1: Cell hardware problem like PDH memory
itself, the Concorde chip, the executing Mako or interaction between any of
these cell components. Action1: Troubleshoot the cell and either fix it or
replace it. Cause2: PDC bug in which PDC thinks it was unable to safely access
PDH memory when maybe it really could have. Action2: Upgrade PDC if this is
found to be the problem and a new PDC image is available.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 898
- Severity: MAJOR
- Event Summary: Cell/Partition to be reset because PDC couldn't read
PDH memory
- Event Class: System
- Problem Description:
Either PDC is going to halt the cell or
reset the partition because PDC was unable to access the PDH memory of either
its local cell or another cell in the partition. The data field contains the
error return value from PDC function HasCpuCompletedWakeupTask().
- Cause / Action:
Cause1: Cell hardware problem like PDH memory
itself, the Concorde chip, the executing Mako or interaction between any of
these cell components. Action1: Troubleshoot the cell and either fix it or
replace it. Cause2: PDC bug in which PDC thinks it was unable to safely access
PDH memory when maybe it really could have. Action2: Upgrade PDC if this is
found to be the problem and a new PDC image is available.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 899
- Severity: MAJOR
- Event Summary: Cell/Partition to be reset because PDC couldn't read
PDH memory
- Event Class: System
- Problem Description:
Either PDC is going to halt the cell or
reset the partition because PDC was unable to access the PDH memory of either
its local cell or another cell in the partition. The data field contains the
error return value from PDC function PdhGetHCellStructAddr().
- Cause / Action:
Cause1: Cell hardware problem like PDH memory
itself, the Concorde chip, the executing Mako or interaction between any of
these cell components. Action1: Troubleshoot the cell and either fix it or
replace it. Cause2: PDC bug in which PDC thinks it was unable to safely access
PDH memory when maybe it really could have. Action2: Upgrade PDC if this is
found to be the problem and a new PDC image is available.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 900
- Severity: MAJOR
- Event Summary: A reset for reconfiguration will be performed soon
on the cell.
- Event Class: System
- Problem Description:
There is a need to reset the cell for
reconfiguration, but it cannot be done yet because the cell has not reported
at BIB. The Reset is being scheduled to be performed later.
- Cause / Action:
An error during cell initialization occurred
and the cell will not be able to join the partition. Look for other errors in
the event log that articulate the exact problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 901
- Severity: MAJOR
- Event Summary: The Partition Profile specifies the wrong
architecture type
- Event Class: System
- Problem Description:
When processing the complex profile, the
an unexpected "Architecture Type" was specified in the PA/IA Arch field. The
actual data found is displayed.
- Cause / Action:
This is caused by the wrong type of complex
profile being loaded. System firmware will default a new partition profile and
continue on.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 902
- Severity: MAJOR
- Event Summary: Cell/Partition is about to be reset because PDC is
unable to access CPU data
- Event Class: System
- Problem Description:
While trying to determine whether or not a
particular processor has completed the task for which it was awakened, PDC was
unable to access the deconfig byte information about the target processor. A
processor should always be able to access this data in PDH memory for any
processor on its own cell and for any processor on a cell that is alive in the
partition. Therefore, PDC is either going to halt the cell or reset the
partition because of this problem. The data field contains the PDC error
return status from IsHCellCpuDeconfig().
- Cause / Action:
Cause1: Cell hardware problem, like a problem
with PDH registers or PDH memory, or a problem with the concorde or Mako
chips. Action1: Troubleshoot the cell and either fix cell or replace the cell
board. Cause2: PDC problem such that PDC is passing bad data from one function
to another. Action2: Upgrade PDC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 903
- Severity: MAJOR
- Event Summary: Cell/Partition is about to be reset because PDC is
unable to access CPU data
- Event Class: System
- Problem Description:
While trying to determine whether or not a
particular processor has completed the task for which it was awakened, PDC was
unable to access the CPU's sleep and wakeup counters for the target processor.
A processor should always be able to access this data in PDH memory for any
processor on its own cell and for any processor on a cell that is alive in the
partition. Therefore, PDC is either going to halt the cell or reset the
partition because of this problem. The data field contains the PDC error
return status from SleepAndWakeupCountersGet().
- Cause / Action:
Cause1: Cell hardware problem, like a problem
with PDH registers or PDH memory, or a problem with the concorde or Mako
chips. Action1: Troubleshoot the cell and either fix cell or replace the cell
board. Cause2: PDC problem such that PDC is passing bad data from one function
to another. Action2: Upgrade PDC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 904
- Severity: MAJOR
- Event Summary: Cell/Partition about to be reset because PDC is
unable to access CPU data
- Event Class: System
- Problem Description:
While trying to determine whether or not a
particular processor has completed the task for which it was awakened, PDC was
unable to access the CPU's forward progress state (ie PST state) for the
target processor. A processor should always be able to access this data in PDH
memory for any processor on its own cell and for any processor on a cell that
is alive in the partition. Therefore, PDC is either going to halt the cell or
reset the partition because of this problem. The data field contains the PDC
error return status from CpuFpSet().
- Cause / Action:
Cause1: Cell hardware problem, like a problem
with PDH registers or PDH memory, or a problem with the concorde or Mako
chips. Action1: Troubleshoot the cell and either fix cell or replace the cell
board. Cause2: PDC problem such that PDC is passing bad data from one function
to another. Action2: Upgrade PDC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 905
- Severity: MAJOR
- Event Summary: Cell/Partition is about to be reset because PDC is
unable to access CPU data
- Event Class: System
- Problem Description:
While trying to determine whether or not a
particular processor has completed the task for which it was awakened, PDC was
unable to access the CPU's Forward Progress State (ie PST state) for the
target processor. A processor should always be able to access this data in PDH
memory for any processor on its own cell and for any processor on a cell that
is alive in the partition. Therefore, PDC is either going to halt the cell or
reset the partition because of this problem. The data field contains the PDC
error return status from CpuFpSet().
- Cause / Action:
Cause1: Cell hardware problem, like a problem
with PDH registers or PDH memory, or a problem with the concorde or Mako
chips. Action1: Troubleshoot the cell and either fix cell or replace the cell
board. Cause2: PDC problem such that PDC is passing bad data from one function
to another. Action2: Upgrade PDC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 906
- Severity: MAJOR
- Event Summary: PDC is unable to branch to other software via the
Page Zero location
- Event Class: System
- Problem Description:
At a certain point in PDC boot, all of the
processors in the partition except the PD monarch are put into a sleep, and
they remain there until they are awakened by the PD monarch, at which time
they read an architected location in Page Zero to find out where to branch to.
This gives the OS a mechanism by which to bring processors under its control
and have it executing OS code. This chassis log is sent if and when a problem
is detected by PDC regarding the contents in the Page Zero location. This
means that PDC cannot branch to the location logged in the Page Zero location.
So, PDC sends this chassis log and then the processor returns to sleep. The
data field is unused.
- Cause / Action:
Cause1: The MEM_RENDEZ fields of Page Zero were
programmed incorrectly. Action1: Upgrade or patch the OS. Cause2: Cell
Hardware or memory problem that PDC didn't catch. Action2: Troubleshoot the
cell to find out if page zero contents are screwed up or if hardware is just
failed to do the OS write or failed to do the PDC read. Verify that memory is
properly written and holds contents at the page zero locations. Perhaps
replace the cell board or replace the memory. Cause3: PDC is not doing the
appropriate verification of the page zero contents and is treating it like its
invalid even though maybe its not. Action3: Upgrade PDC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 908
- Severity: MAJOR
- Event Summary: PDC couldn't access a data structure in PDH memory
- Event Class: System
- Problem Description:
While trying to get the sleep counter and
the wakeup counter for a particular processor, which is kept in a data
structure in PDH memory, PDC was unable to determine the address to the data
structure on the remote cell. PDC is supposed to be able to calculate
addresses to anything in PDH memory on other cells in the partition. The data
field contains the PDC error return status from a function called
PdhGetHCellStructAddr().
- Cause / Action:
Cause1: Cell hardware problem with the PDH
memory, the Concorde chip, or the Mako processor itself. Action1:
Troubleshoot/Replace the cell. Cause2: PDC bug in which PDC is trying to
access PDH memory of a cell not in its partition. Action2: Upgrade PDC if
there is a version of PDC that fixes such a problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 909
- Severity: MAJOR
- Event Summary: Error occurred accessing a PDC data structure
- Event Class: System
- Problem Description:
Error occurred accessing a PDC data
structure. Depending upon the situation the cell or entire partition will be
reset. The data field contains the return status for the function that
encountered the error.
- Cause / Action:
Cause1: Hardware problem with the PDH riser
card. Action1: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause2: Hardware problem with the CPU or cell board. Action2:
Contact HP Support to confirm the CPUs and cell board are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 910
- Severity: MAJOR
- Event Summary: Cell about to be halted because PDC couldn't
determine relocated address of code
- Event Class: System
- Problem Description:
PDC is about to halt the cell because PDC
was unable to determine the GNI address of the SlaveDispatcher function of PDC
relocated to memory by PDC. The data field contains the error return value
from the function GetGniCodeAddrFromRomCodeAddr().
- Cause / Action:
Cause1: Hardware connecting cells in the
partition experienced a problem such that cells in the partition together can
no longer communicate. Action1: Troubleshoot the fabric and reseat/replace the
cells or cables or backplane if necessary. Cause2: Cell was unable to access
its own PDH memory. Action2: Troubleshoot the cell board and replace it if
necessary. Cause3: PDC bug such that PDC didn't log the relocation address.
Action3: Check for PDC upgrade
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 911
- Severity: MAJOR
- Event Summary: Halting cell because a CPU didn't complete the task
for which it was awakened
- Event Class: System
- Problem Description:
PDC is about to halt the cell because at
least one of the processors didn't complete the task for which they were
awakened and then return to sleep. The data field contains an error return
status.
- Cause / Action:
Cause1: Hardware problem with the CPU, CC, or
PDH flash. Action1: Troubleshoot the cell and/or replace it.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 912
- Severity: MAJOR
- Event Summary: Cell about to be halted because PDC couldn't
determine relocated address of code
- Event Class: System
- Problem Description:
PDC is about to halt the cell because PDC
was unable to determine the GNI address of the CpuFpSet() function of PDC
relocated to memory by PDC. The data field contains the error return value
from the function GetGniCodeAddrFromRomCodeAddr().
- Cause / Action:
Cause1: Hardware connecting cells in the
partition experienced a problem such that cells in the partition together can
no longer communicate. Action1: Troubleshoot the fabric and reseat/replace the
cells or cables or backplane if necessary. Cause2: Cell was unable to access
its own PDH memory. Action2: Troubleshoot the cell board and replace it if
necessary. Cause3: PDC bug such that PDC didn't log the relocation address.
Action3: Check for PDC upgrade
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 913
- Severity: MAJOR
- Event Summary: Cell about to be halted because CPU couldn't change
its CPU FP (PST) state
- Event Class: System
- Problem Description:
PDC is about to halt the cell because one
or more of the slaves were unable to change their CPU FP state in PDH memory
on the local cell. The data field contains an error return status.
- Cause / Action:
Cause1: Hardware problem with the cell (like
PDH memory) or the CC or CPU. Action1: Contact HP support to troubleshoot or
replace the cell board. Cause2: PDC bug. Action2: Contact HP Support to check
for PDC upgrade.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 914
- Severity: CRITICAL
- Event Summary: Partition about to be reset because PDC couldn't get
address to a structure
- Event Class: System
- Problem Description:
PDC was trying to move the cell monarchs
on each of the non-core cells into the Dispatcher, but in order to do that,
the PD monarch needs to be able to read the CPU number of the cell monarch on
each of the non-core cells, which is kept in a data structure on each of the
cells. PDC was unable to get the address to the CELL_CPU_STATE structure in
PDH memory in a cell in the partition. The data field is the error return
status from the PDC function called PdhGetHCellStructAddr().
- Cause / Action:
Cause1: Hardware connecting cells in the
partition experienced a problem such that cells in the partition together can
no longer communicate. Action1: Troubleshoot the fabric and replace backplane
or cells. Cause2: Cell was unable to access its own PDH memory. Action2:
Troubleshoot the cell board and replace it if necessary. Cause3: PDC bug such
that PDC passed invalid arguments to try to get the address to the data
structure. Action3: Upgrade PDC if there is a fix for this problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 915
- Severity: CRITICAL
- Event Summary: Resetting a partition because a CPU didn't complete
the task it was awakened for
- Event Class: System
- Problem Description:
PDC is about to reset the partition
because at least one of the processors didn't complete the task for which they
were awakened and then return to sleep. The data field contains the error
return status from the PDC function CheckSingleSlave().
- Cause / Action:
Cause1: Hardware problem with the Mako chip,
Concorde chip, or PDH flash. Action1: Troubleshoot the cell and/or replace
it.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 916
- Severity: CRITICAL
- Event Summary: Resetting partition because PDC couldn't determine
relocated address of code
- Event Class: System
- Problem Description:
PDC is about to reset the partition
because it is unable to determine the GNI address for the CpuFpSet() function
for one or more of the cells in the partition. The data field contains the
error return status from GetGniCodeAddrFromRomCodeAddr().
- Cause / Action:
Cause1: Hardware connecting cells in the
partition experienced a problem such that cells in the partition together can
no longer communicate. Action1: Troubleshoot the fabric and replace backplane
or cells. Cause2: Cell was unable to access its own PDH memory. Action2:
Troubleshoot the cell board and replace it if necessary. Cause3: PDC bug such
that PDC didn't log the relocation address. Action3: Upgrade PDC if there is a
fix for this problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 917
- Severity: CRITICAL
- Event Summary: Resetting partition because a CPU was unable to
change its CPU FP state
- Event Class: System
- Problem Description:
PDC is about to reset the partition
because one or more of the processors were unable to successfully modify their
CPU FP State (aka their PST state). The data field contains the error return
status from the CpuFpSet() function.
- Cause / Action:
Cause1: Hardware problem with PDH memory,
Concorde chip, or the Mako chip. Action1: Troubleshoot the cell and/or replace
it. Cause2: PDC bug in which passed invalid arguments. Action2: Upgrade PDC if
there is a fix.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 918
- Severity: MAJOR
- Event Summary: CPU Dual Core Initialization Failed
- Event Class: System
- Problem Description:
CPU Dual Core Initialization Failed
- Cause / Action:
Attempt Reboot, Replace Processor
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 919
- Severity: MAJOR
- Event Summary: Second CPU in Pair has been disabled
- Event Class: System
- Problem Description:
None
- Cause / Action:
The second CPU in the Dual Core has been
deconfigured as a result of the first core being deconfigured. Investigate the
cause of the first core being deconfigured
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 920
- Severity: MAJOR
- Event Summary: Virtualzing Dual Core Registers Failed
- Event Class: System
- Problem Description:
None
- Cause / Action:
Reboot, if problem continues, replace CPU
Module.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 921
- Severity: MAJOR
- Event Summary: Virtualizing Dual Core Interposer has failed
- Event Class: System
- Problem Description:
None
- Cause / Action:
Virtualizing the Dual Core Interposer has
failed. Reboot, if problem continues, Replace CPU module.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 922
- Severity: MAJOR
- Event Summary: Install PMI Handler Failed
- Event Class: System
- Problem Description:
None
- Cause / Action:
Reboot, if problem continues replace CPU
Module
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 923
- Severity: FATAL
- Event Summary: Cell failed compatibility checks.
- Event Class: System
- Problem Description:
Cell and or CPUs have failed compatibility
checks.
- Cause / Action:
Cause - CPUs are incompatible with each other,
or the cell front side bus frequency is incompatible with the CPUs. Action -
Correct the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 924
- Severity: FATAL
- Event Summary: PDH space not available after release from reset.
- Event Class: System
- Problem Description:
PDH space not available after release from
reset.
- Cause / Action:
Cause - Hardware failure. Action - Fix the
hardware, cell or PDH riser.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 925
- Severity: FATAL
- Event Summary: MPON failed to release.
- Event Class: System
- Problem Description:
MPON failed to release.
- Cause / Action:
Cause - Hardware failure. Action - Fix the
hardware, cell or pdh riser.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 926
- Severity: FATAL
- Event Summary: Dillon failed to reset.
- Event Class: System
- Problem Description:
Dillon failed to reset.
- Cause / Action:
Cause - Hardware failure. Action - Fix the
hardware, pdh riser or cell.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 927
- Severity: FATAL
- Event Summary: DMD clock is not running.
- Event Class: System
- Problem Description:
DMD clock is not running.
- Cause / Action:
Cause - Hardware problem Action - Fix the
hardware, pdh riser or cell board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 928
- Severity: CRITICAL
- Event Summary: All CPUs on the Cell are scheduled to be
deconfigured
- Event Class: System
- Problem Description:
All possible CPUs on a cell have been
scheduled for deconfiguration.
- Cause / Action:
All CPUs on the cell have been scheduled for
deconfiguration. On the next reset, the cell will no longer be operational;
system firmware will deconfigure all the CPUs and this cell will not be part
of a partition. This action is not recommended. To recover, the NVRAM on the
PDH card must be cleared, the cell power cycled, and defaults restored from
disk.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 929
- Severity: CRITICAL
- Event Summary: A read error occurred while dumping the routing
registers
- Event Class: System
- Problem Description:
A read error occurred while dumping the
XBC port routing registers during boot. This cell will attempt fabricless
boot. Data field: (XBC port << 48) | (XBC num << 32) | error
status reg
- Cause / Action:
Cause: Fabric Read Error. Action: Check XBC,
CC, links, etc.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 930
- Severity: FATAL
- Event Summary: Failed to disable the CC to CC link
- Event Class: System
- Problem Description:
After cell rendezvous for a 2 cell Medel,
only one cell made it into the partition. Disabling the link failed. The cell
will reset for reconfig. Data Field: return status
- Cause / Action:
Failure to read or write Concorde
CSRs.
Contact HP Support personnel to check the Check CC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 931
- Severity: MAJOR
- Event Summary: Power has been removed from AC input A0.
- Event Class: System
- Problem Description:
Power is no longer being supplied to input
A0 on the cabinet specified in the data field.
- Cause / Action:
A power source has been removed from the
chassis.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 932
- Severity: MAJOR
- Event Summary: Power has been removed from AC input A1.
- Event Class: System
- Problem Description:
Power is no longer being supplied to input
A1 on the cabinet specified in the data field.
- Cause / Action:
A power source has been removed from the
chassis.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 933
- Severity: MAJOR
- Event Summary: Power has been removed from AC input B0.
- Event Class: System
- Problem Description:
Power is no longer being supplied to input
B0 on the cabinet specified in the data field.
- Cause / Action:
A power source has been removed from the
chassis.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 934
- Severity: MAJOR
- Event Summary: Power has been removed from AC input B1.
- Event Class: System
- Problem Description:
Power is no longer being supplied to input
B1 on the cabinet specified in the data field.
- Cause / Action:
A power source has been removed from the
chassis.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 935
- Severity: MAJOR
- Event Summary: Failed to disable the XIN link during a failed link
init
- Event Class: System
- Problem Description:
Failed to disable the XIN link init CSR on
a XBCless system. Cell will halt. Data field: return status (0 = SUCCESS, -1 =
FAILURE), -1 is expected for this event.
- Cause / Action:
Have your HP Support Representative check the
Coherency Controller
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 937
- Severity: MAJOR
- Event Summary: Error while reading the remote CC's XIN Error Mask
register
- Event Class: System
- Problem Description:
Could not read the XIN error mask
register on the CC. Data Field: cell number and return status
- Cause / Action:
CC access failure.
Contact HP Support
personnel to check the CC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 938
- Severity: MAJOR
- Event Summary: Error clearing the init packet received bit in the
XIN error mask
- Event Class: System
- Problem Description:
Could not write the XIN error mask
register on the CC. Data Field: cell number and return status
- Cause / Action:
Cause: CC access failure.
PDC Reviewed alert
level for SR - 9/6/03 CC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 939
- Severity: MAJOR
- Event Summary: Failed to read the XBC's Port Status register
- Event Class: System
- Problem Description:
While testing link traversability, a XBC
CSR could not be read. Data Field: Port Number << 44 | XBC Number
<< 32 | return value
- Cause / Action:
Cause: fabric access failure Action: Check XBC,
Check CC, Check backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 941
- Severity: CRITICAL
- Event Summary: FW will not handoff to the OS_MCA handler for this
MCA event
- Event Class: System
- Problem Description:
This means that the system FW MCA handler
is not going to handoff to the OS_MCA handler.
- Cause / Action:
The error logs should be retrieved from the EFI
shell prompt.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 942
- Severity: CRITICAL
- Event Summary: The NVRAM block table maintained by System Firmware
is corrupt
- Event Class: System
- Problem Description:
Unused
- Cause / Action:
The NVRAM-based descriptor for System Firmware
NVRAM blocks is corrupt.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 943
- Severity: MAJOR
- Event Summary: All CPUs were deconfigured and have now been
reconfigured.
- Event Class: System
- Problem Description:
All CPUs have been determined to be
manually deconfigured in NVM during boot. This may only happen when switching
from single core CPU deconfiguration to multi-core CPU deconfiguration in
product qualification testing. As a recovery, NVM settings have been changed
to reconfigure all CPUs.
- Cause / Action:
Cause: User test operational error. Action:
Reboot system and update CPU configuration as desired.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 944
- Severity: MAJOR
- Event Summary: A failure has occurred trying to determine the
number of CPU cores per module.
- Event Class: System
- Problem Description:
A failure has occurred trying to determine
the number of CPU cores per module. Depending upon the situation, either the
cell will be halted or the entire partition will be reset.
- Cause / Action:
C1: Hardware failure with CPU, CC or cell
board. A1: Contact HP Support to confirm the CPUs, CC, and cell board are
functioning properly. Update PDC if a version is available to fix this
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 945
- Severity: MAJOR
- Event Summary: Couldn't read the topology from the XBC register
- Event Class: System
- Problem Description:
While writing the remote routing, the
local XBC could not be accessed to determine the topology. Look for additional
chassis codes to determine what will happen as a result of this failure. Data
field: return status, either SUCCESS (0) or (-1)
- Cause / Action:
Fabric Access Error
Contact HP Support
personnel to check the XBC, Backplane, CC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 947
- Severity: MAJOR
- Event Summary: Failed to read the XBC CSR that contains the number
of failed links
- Event Class: System
- Problem Description:
Could not read the XBC register that
contains the number of links that are currently broken on the complex. Data
Field: (XBC Num << 32) | PDC return status
- Cause / Action:
Fabric Access Failure
Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 948
- Severity: MAJOR
- Event Summary: Failed to read the XBC CSR that contains the number
of failed links
- Event Class: System
- Problem Description:
Could not read the XBC register that
contains the number of links that are currently broken on the complex. Data
Field: (XBC Num << 32) | PDC return status
- Cause / Action:
Fabric Access Failure
Contact HP Support
personnel to check XBC, Backplane, CC, look for additional chassis codes to
describe the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 949
- Severity: MAJOR
- Event Summary: Failed to write the XBC CSR that contains the number
of failed links
- Event Class: System
- Problem Description:
Could not write the XBC register that
contains the number of links that are currently broken on the complex. Data
Field: (XBC Num << 32) | PDC return status
- Cause / Action:
Fabric Access Failure
Contact HP Support
personnel to check XBC, Backplane, CC, look for additional chassis codes to
describe the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 952
- Severity: CRITICAL
- Event Summary: This cell encountered too many broken crossbar links
- Event Class: System
- Problem Description:
Too many broken crossbar links were found.
This cell will have no connectivity to other cells in the complex. It will
attempt a fabricless boot, except in a few configurations. Data Field: (XBC
Num << 32) | number of broken links
- Cause / Action:
Cause: Broken fabric links, Action: Check XBC,
Backplane, Flex Cables, look for additional chassis codes to describe the
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 958
- Severity: MAJOR
- Event Summary: Failed to do a broadcast write to the XBC Remote
Routing registers
- Event Class: System
- Problem Description:
Failed to complete a broadcast write to an
XBC. Data Field: (XBC Num << 32) | PDC return status
- Cause / Action:
Fabric Access Failure
Contact HP Support
personnel to check the XBC, Backplane, CC. Look for additional chassis codes
to describe the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 959
- Severity: MAJOR
- Event Summary: Failed to read a XBC Remote Routing register
- Event Class: System
- Problem Description:
Failed to complete a read to the built-in
port of a XBC. Data Field: (XBC Num << 32) | PDC return status
- Cause / Action:
Fabric Access Failure
Contact HP Support
personnel to check the XBC, Backplane, CC. Look for additional chassis codes
to describe the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 960
- Severity: MAJOR
- Event Summary: Failed to write a XBC Remote Routing register
- Event Class: System
- Problem Description:
Failed to complete a write to the local
cell's port of the XBC. Data Field: (XBC Port << 44) | (XBC Num <<
32) | PDC return status
- Cause / Action:
Cause: Fabric Access Failure, Action: Check
XBC, Backplane, CC. Look for additional chassis codes to describe the
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 961
- Severity: CRITICAL
- Event Summary: The link between the CC and SBA failed
- Event Class: System
- Problem Description:
The link between the CC and SBA failed
meaning that I/O is not available to the reporting cell.
- Cause / Action:
See other associated events for the root cause
of the failure.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 962
- Severity: CRITICAL
- Event Summary: The SBA failed and the cell has no I/O
- Event Class: System
- Problem Description:
An error was detected in the SBA and the
reporting cell has no I/O.
- Cause / Action:
See other associated events that describe the
root cause.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 963
- Severity: CRITICAL
- Event Summary: The system firmware had an error with the structured
error handling mechanism.
- Event Class: System
- Problem Description:
The structured exception handling within
the system firmware failed during I/O initialization.
- Cause / Action:
Cause: Either there is an error in the system
firmware or the system firmware has exhausted all resources. Action:
Invalidate NVM or check for newer version of system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 964
- Severity: CRITICAL
- Event Summary: Not enough malloc resources for I/O structure error
handling.
- Event Class: System
- Problem Description:
There is not enough malloc resources for
the I/O structure exception handling. I/O on the reported cell is not
available.
- Cause / Action:
Either invalidate NVM or check for a new
version of system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 965
- Severity: CRITICAL
- Event Summary: Unable to create entry for I/O structure error
handling.
- Event Class: System
- Problem Description:
Error creating the structure for housing
the I/O structured exception handling services and data. I/O is lost on the
reporting cell.
- Cause / Action:
This is a system firmware error, either
invalidate NVM or check for a newer version of system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 966
- Severity: CRITICAL
- Event Summary: Unable to bind services for I/O structure exception
handling.
- Event Class: System
- Problem Description:
Unable to bind the I/O structure exception
handling to the internal data structures.
- Cause / Action:
This is a system firmware error. Either reset
NVM or check for a newer version of system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 967
- Severity: CRITICAL
- Event Summary: Error initializing the I/O structure exception
handling services.
- Event Class: System
- Problem Description:
Error detected while initializing the I/O
structure exception handling services.
- Cause / Action:
This is a system firmware error. Either reset
NVM or check for a newer version of system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 968
- Severity: CRITICAL
- Event Summary: Error initializing structured I/O exception data
structures.
- Event Class: System
- Problem Description:
Error initializing the I/O structure
exception handling data structures.
- Cause / Action:
This is a system firmware error, there is a
conflict with system resources. Either reset NVM or check for a newer version
of system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 969
- Severity: CRITICAL
- Event Summary: The I/O exception context has an error.
- Event Class: System
- Problem Description:
The structured I/O exception handling data
structures have an error. All I/O on the reporting cell is not available.
- Cause / Action:
This is a system firmware error. Reset the
system, invalidate NVM and reset the system, or check for a newer version of
the system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 970
- Severity: CRITICAL
- Event Summary: Error creating the internal data and services for
the SBA.
- Event Class: System
- Problem Description:
While setting up the internal SBA data and
service an error was detected. All I/O for the reporting cell is not
available.
- Cause / Action:
This is a system firmware error. Reset the
system; invalidate NVM and reset the system; or check for a newer version of
system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 971
- Severity: CRITICAL
- Event Summary: Error attaching the series to the SBA internal data
structures.
- Event Class: System
- Problem Description:
An error attaching firmware services to
the internal structures was detected. All I/O on the reporting cell is not
available.
- Cause / Action:
This is a system firmware error. Reset the
partition; invalidate NVM on the reporting cell and reset the system; or check
for a newer version of system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 972
- Severity: CRITICAL
- Event Summary: Error initializing the internal SBA data and
services.
- Event Class: System
- Problem Description:
System firmware detected an error
initializing internal SBA data structures and services. This is usually an
error with unavailable resources.
- Cause / Action:
This is a system firmware error. Reset the
partition; invalidate NVM on the reporting cell and reset the partition; or
check for newer system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 973
- Severity: CRITICAL
- Event Summary: The SBA type is unknown to the system firmware
- Event Class: System
- Problem Description:
The SBA type is unknown to the system
firmware. The I/O on the reporting cell is not available.
- Cause / Action:
This is either a system firmware error, or the
wrong I/O is connected to the system. Validate the system recipe both firmware
and hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 974
- Severity: MAJOR
- Event Summary: An embedded I/O device is missing.
- Event Class: System
- Problem Description:
An expected I/O device cannot be detected
by the system firmware.
- Cause / Action:
Replaces the I/O card specified by the physical
location.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 975
- Severity: MAJOR
- Event Summary: Fabric link route around failed because the route
around port was bad
- Event Class: System
- Problem Description:
Too many broken links! The XBC port route
around failed because the route-around port was bad too. Data field: (XBC port
<< 44) | (XBC num << 32) | port state
- Cause / Action:
Cause: 2 or more XBC links are not
routable.
Contact HP Support personnel to check the XBC, Flex Cables,
Backplane, CCs, etc
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 977
- Severity: MAJOR
- Event Summary: (warning) Outputted in MFG, when Memory SBE Seeding
is enabled
- Event Class: System
- Problem Description:
This is a warning that the system is
running in a segregated mode. It will only be emitted in MFG mode when Memory
SBE Seeding is enabled. This is only for testing of SBE seeding for LAB and
possibly MFG use ONLY. It should NEVER be seen in the field.
- Cause / Action:
Cause: In MFG with Memory SBE Seeding control
Flag (26) Enabled. Should never be seen at a customer's machine.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 978
- Severity: CRITICAL
- Event Summary: Failed to read the fabric topology information from
the XBC
- Event Class: System
- Problem Description:
Read failure while writing the number of
failed links to the XBC. Data Field: Return Status (SUCCESS = 0, FAILURE = -1)
- Cause / Action:
Fabric Access Failure.
Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 981
- Severity: CRITICAL
- Event Summary: Could not disable the XIN link before a fabricless
boot
- Event Class: System
- Problem Description:
Before attempting a fabricless boot, the
cell's link to the fabric should be disabled to provide isolation and
stability. The link could not be disabled, so the cell will halt.
- Cause / Action:
Fabric Access Error.
Contact HP Support
personnel to check the CC, Check XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 985
- Severity: CRITICAL
- Event Summary: Manual override of fatal stop boot condition
- Event Class: System
- Problem Description:
The user has manually bypassed a stop boot
condition (caused by a fatal error during boot) and continued to boot an O/S.
The system might experience unpredictable failures.
- Cause / Action:
Cause: The user has initiated manual O/S boot
despite the existence of a fatal error. Action: Correct the fatal error
condition (see output of "INFO WARNING" EFI shell command), reboot the system,
and then initiate O/S boot.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 986
- Severity: MAJOR
- Event Summary: Firmware unable to relocate VGA BIOS
- Event Class: System
- Problem Description:
Firmware was unable to relocate the VGA
BIOS to the hardcoded VGA BIOS region in main memory (physical address range
0xc0000 - 0xdffff). VGA routing has been disabled by firmware. No VGA device
will be accessible on this boot.
- Cause / Action:
Cause: Most likely there is a permanent memory
error in the VGA BIOS region (physical address 0xc0000 - 0xdffff). Action:
Replace the DIMM causing the permanent memory error in the VGA BIOS region.
The PDT reports which DIMM is causing errors in the physical address range
0xc0000 - 0xdffff.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 993
- Severity: MAJOR
- Event Summary: PDC could not access a complex profile
- Event Class: System
- Problem Description:
PDC could not access a complex profile.
The cell will be reset. Data field contains the return status from the
function that encountered the error.
- Cause / Action:
Cause1: An error occurred which prevented the
complex profiles from being distributed properly. Action1: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause2: A hardware problem exists with MP or PDHC hardware.
Action2: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 994
- Severity: MAJOR
- Event Summary: No possible core cells were found in the configured
set
- Event Class: System
- Problem Description:
Could not find a potential core cell for
the partition in the configured set. This cell will reset for reconfiguration.
Data Field: return status from failing function
- Cause / Action:
Cause: most likely a configuration problem,
Action: check to ensure a valid core cell is configured to be in the
partition.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 995
- Severity: MAJOR
- Event Summary: Could not find a viable core cell in the partition
- Event Class: System
- Problem Description:
The potential core cell was not viable
(ie. no core I/O, etc). This cell will reset for reconfiguration. Data Field:
bit mask of cells that made the rendezvous set
- Cause / Action:
Cause: Configuration error, fabric failure; the
intended core cell failed during boot. Action: check partition configuration,
check for failed cells.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 996
- Severity: MAJOR
- Event Summary: Could not find a viable core cell in the partition
- Event Class: System
- Problem Description:
The potential core cell was not viable
(ie. no core I/O, etc). This cell will reset for reconfiguration. Data Field:
bit mask of cells that made the rendezvous set
- Cause / Action:
Cause: Configuration error, Mainbackplane
failure, The intended core cell failed during boot. Action: Check partition
configuration, Check for failed cells, as indicated by high-alert level IPMI
events earlier in the boot.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 997
- Severity: MAJOR
- Event Summary: The core cell selected is not in the rendezvoused
partition
- Event Class: System
- Problem Description:
The potential core cell did not rendezvous
with the rest of the partition. This cell cannot talk to the selected core
cell. This cell will reset for reconfiguration. Data Field: bit mask of cells
that made the rendezvous set
- Cause / Action:
Cause: Configuration error, main backplane
failure; the intended core cell failed during boot. Action: check partition
configuration, check for failed cells, check for additional chassis codes
indicating more failure detail.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 998
- Severity: MAJOR
- Event Summary: The local cell is not viable
- Event Class: System
- Problem Description:
The local cell is disconnected from the
rest of the system due to the main backplane configuration. While the
partition is only configured to contain a single cell, the local cell is not a
viable core cell. The cell will reset for reconfiguration. Data Field: bit
mask of cells that made the rendezvous set
- Cause / Action:
Cause: Configuration error, main backplane
failure; no viable core cell. Action: check partition configuration, attach
core I/O to local cell, make sure a viable core cell is configured within the
partition.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 999
- Severity: MAJOR
- Event Summary: cell cannot reach the fabric, partition contains 3
or more cells
- Event Class: System
- Problem Description:
This cell has booted without the main
backplane, probably due to prior main backplane errors. The partition it is in
is configured with 3 or more cells. The combination of these two
configurations is not allowed. The cell will reset for reconfiguration. Data
Field: configured set
- Cause / Action:
Cause: configuration combined with main
backplane problems. Action: Contact HP Support to confirm the main backplane
is functioning properly. Change the partition configuration to only contain 1
or 2 cells.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1002
- Severity: CRITICAL
- Event Summary: The buffer size is too small for the XBC error log
- Event Class: System
- Problem Description:
The buffer size passed in to the XBC error
logging routine through SAL_GET_STATE_INFO, SAL_CLEAR_STATE_INFO, or MCA
logging is too small for the XBC error log Data field consists of: XBC number
(32:43)
- Cause / Action:
Caller of SAL state info calls did not
correctly set up the buffer for the error logs
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1003
- Severity: CRITICAL
- Event Summary: System firmware was unable to clear an XBC error
- Event Class: System
- Problem Description:
System firmware was unable to clear an XBC
error. The data field contains: XBC number (32:43) port number (44:55) error
type (0:31)
- Cause / Action:
The particular XBC and port could have a
persistent error. Check flex cables
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1004
- Severity: MAJOR
- Event Summary: Firmware detected a possible Cabinet Power Timeout
- Event Class: System
- Problem Description:
System Firmware detected a possible
timeout waiting for the other cabinet to power on. System firmware queried the
utilities system to see what cells are installed. This indicated that cells
are installed in the other cabinet but none are powered on. Firmware delayed
fabric routing, waiting for the other cabinet cells to power on, but
eventually timed out and went on.
- Cause / Action:
Cells exist in both cabinets, but one of the
cabinets has no cells powered on. If a 2 cabinet configuration is desired,
shutdown any active partitions and power off both cabinets and then power them
both on, including at least 1 cell in each cabinet. (Note: it is possible to
get this event ID and have both cabinets powered on. In this event, no action
is required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1005
- Severity: CRITICAL
- Event Summary: Fabric is unable to route the crossbar after
multiple retry attempts
- Event Class: System
- Problem Description:
During fabric initialization, if a
crossbar is found to be in an unexpected state, the number of retries is
incremented. If the number of retries exceeds the maximum, then something is
wrong and there is no way to initialize the fabric. Data field: number of
retries (0:31) crossbar number (32:63)
- Cause / Action:
Hardware problem. Possible bad XBC or
Concorde
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event Details:
Examples:
Event 1007
- Severity: MAJOR
- Event Summary: Error received after issuing the Retrieve Cell Slot
State command
- Event Class: System
- Problem Description:
System Firmware issued the Retrieve Cell
Slot State command to the Sync and got an error back. See related chassis code
or the specifics of the error.
- Cause / Action:
Cause: Make sure the GSP is connected and
reset. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1008
- Severity: CRITICAL
- Event Summary: Firmware was unable to publish the Partition Profile
- Event Class: System
- Problem Description:
Firmware tried to default the Partition
(Group C) complex profile and encountered an error.
- Cause / Action:
Cause: Utilities may be unavailable to update
the profiles. Check the connections are reset. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1009
- Severity: CRITICAL
- Event Summary: Error creating the pdh ioconfig node or attaching
the service to it.
- Event Class: System
- Problem Description:
Firmware encountered an error when creating
the ioconfig node as a child of the pdh node.
- Cause / Action:
Cause: This is likely to be a symptom of an
earlier problem, or the system is out of malloc space. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1010
- Severity: CRITICAL
- Event Summary: Error encountered setting up the dillon_pdh node or
service.
- Event Class: System
- Problem Description:
System firmware was unable to correctly set
up the dill_pdh node as a child of the pdh node, or was unable to locate and
attach the dillon_pdh service to the node. The status is returned in the data
field.
- Cause / Action:
Cause: This is usually a symptom of an earlier
problem. Check to be sure the pdh node was initialized into the tree
correctly. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1016
- Severity: CRITICAL
- Event Summary: CPUs running at different speeds were detected
during rendezvous
- Event Class: System
- Problem Description:
Reporting cell tried to rendezvous with a
cell with processors that are running at a different speed. The data field
lists the offending cell
- Cause / Action:
Cause: Reconfigure the PD so that all cells
have processors running at the same speed. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1019
- Severity: FATAL
- Event Summary: Coherency controller (CC) registers indicate a
Deadlock Recovery Reset
- Event Class: System
- Problem Description:
Early in bootstrap, the coherency
controller (CC) registers are checked for Deadlock Recovery Reset. This
chassis code indicates that CC logs will be stored to NVRAM.
- Cause / Action:
Cause: Coherency controller (CC) resources are
deadlocked and the CC is resetting the cell. Action: Analyze the Deadlock
Recovery logs (like MCA logs) to determine the cause of the failure.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1020
- Severity: CRITICAL
- Event Summary: The cell monarch cpu has failed.
- Event Class: System
- Problem Description:
This means that the cell monarch cpu has not
completed the assigned task within the timeout and hence it will be
deconfigured.
- Cause / Action:
Cause: The monarch cpu will be deconfigured.
Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1030
- Severity: CRITICAL
- Event Summary: An error was encountered when firmware tried to
update the Group B Profile
- Event Class: System
- Problem Description:
Firmware tried to default the Dynamic
(Group B) complex profile and encountered an error.
- Cause / Action:
Cause: Utilities may be unavailable to update
the profiles. Check the connections are reset. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1031
- Severity: CRITICAL
- Event Summary: The XBC SBE and LPE errors were not cleared properly
- Event Class: System
- Problem Description:
The XBC logged a SBE or LPE after they
should have been cleared. Either the clear failed, or a new error was logged
immediately. Data field: XBC number (32:43), port number (44:55), port status
information (0:31)
- Cause / Action:
Cause: the link generated a new error Action:
check CC, check link Check logs for other errors. If error is persistent,
replace cell board
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1034
- Severity: CRITICAL
- Event Summary: Failure to identify a core cell during Global MCA.
- Event Class: System
- Problem Description:
Not able to find a core cell in the PD
during a global MCA error processing.
- Cause / Action:
Cause: This will lead to a system reset.
Action:-
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1036
- Severity: CRITICAL
- Event Summary: Firmware was unable to find a suitable block of main
memory to relocate ROM
- Event Class: System
- Problem Description:
A Firmware tries to find a main memory
block large enough meeting alignment requirements.
- Cause / Action:
Cause: Probably caused by lots of PDT entries,
or no main memory present. Action:-
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1037
- Severity: MAJOR
- Event Summary: PDHC has detected the PDH battery low warning.
- Event Class: System
- Problem Description:
The Battery-Low interrupt was signaled in
the Interrupt Pending Register in Dillon (PDH) by the hardware. PDHC is merely
reporting the problem.
- Cause / Action: Cause: PDH battery power is low. Action: Replaced
the PDH battery.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1042
- Severity: MAJOR
- Event Summary: Unexpected software path has been taken
- Event Class: System
- Problem Description:
A software error has occurred. Data field
consists of file number and line number. Lab involvement is indicated.
- Cause / Action:
Cause: System Firmware design or code bug is
likely. Action: Contact the Response Center to report defect Upgrade PDC
firmware
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1043
- Severity: FATAL
- Event Summary: An HPMC has been encountered.
- Event Class: System
- Problem Description:
Each CPU will send this code early in the
PDC HPMC handler, as soon as the cause of the machine check is determined to
be HPMC. The data field contains the interrupt instruction address offset.
- Cause / Action: Cause: HPMC has occurred. Action: Contact HP
Support to analyze the HPMC PIM and Error Logs to determine the cause of the
failure
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1044
- Severity: MAJOR
- Event Summary: The OS_HPMC vector in the IVA table is misaligned.
- Event Class: System
- Problem Description:
PDC performs a number of checks on the OS
HPMC handler before branching to it. In this case, an IVA table has been
installed, but the OS_HPMC vector address is misaligned. The partition will
reboot rather than branch to OS_HPMC for crash-dump
- Cause / Action:
Cause: IVA table has been incorrectly
constructed or corrupted. Action: There will be no OS crash-dump. Contact HP
Support for analysis of HPMC PIM and ErrorLogs Report event to the Response
Center
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1045
- Severity: MAJOR
- Event Summary: The OS HPMC handler checksum is bad.
- Event Class: System
- Problem Description:
PDC performs a number of checks on the OS
HPMC handler before branching to it. In this case, an IVA table has been
installed but the OS HPMC handler checksum is bad. The partition will reboot
rather than branch to OS_HPMC for crash-dump
- Cause / Action:
Cause: There will be no OS crash-dump. Contact
HP Support for analysis of HPMC PIM and ErrorLogs Report event to the Response
Center Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1046
- Severity: MAJOR
- Event Summary: The HPMC handler length field in the IVA is zero.
- Event Class: System
- Problem Description:
PDC performs a number of checks on the OS
HPMC handler before branching to it. In this case, the HPMC handler length
field in the IVA determines the length for the checksum test. The length must
be a multiple of 4 to cover complete code instructions. This check has failed.
The partition will reboot rather than branch to OS_HPMC for crash-dump Cause /
Action:
Cause: IVA table has been incorrectly constructed or corrupted.
Action: There will be no OS crash-dump. Contact HP Support for analysis of
HPMC PIM and ErrorLogs Report event to the Response Center
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1047
- Severity: OTHER
- Event Summary: Attempt to branch to OS HPMC handler failed.
- Event Class: System
- Problem Description:
Cannot branch to OS HPMC handler. Cause /
Action:
Cause: Specific reason for this failure will be identified by
another chassis code. There will be no OS crash-dump. Contact HP Support for
analysis of HPMC PIM and ErrorLogs Action: Review previous chassis codes to
determine reason for branch failure
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1048
- Severity: MAJOR
- Event Summary: Attempt to reset the cell from within the HPMC
handler has failed.
- Event Class: System
- Problem Description:
It should not be possible for the reset to
fail. Lab involvement is indicated. Cause / Action:
Cause: Indicates
CRITICAL software or hardware error. Escalate. Action: Report this to the
Response Center Contact HP Support personnel to troubleshoot problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1049
- Severity: UNKNOWN
- Event Summary: PDC has detected a nested HPMC
- Event Class: System
- Problem Description:
After PDC_HPMC completes and branches to OS
HPMC, OS_HPMC will unmask HPMCs. If the OS_HPMC encounters an HPMC, the second
entry to PDC_HPMC be caught before the original PIM and ErrorLogs are
overwritten. PDC will restart the partition.
- Cause / Action:
Cause: HPMC within OS_HPMC handler. Action:
There may be no crash-dump, or an incomplete crash-dump. Contact HP Support to
Analyze the HPMCs
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1050
- Severity: MAJOR
- Event Summary: NVM flag has been used to halt the cell during HPMC
- Event Class: System
- Problem Description:
This chassis code should only be enabled at
the direction of the lab. If it is seen inadvertently, it is Equivalent to
ERR_ASSERT. The lab must be notified.
- Cause / Action: Cause: Cell halt during HPMC has been enabled from
the BCH debug menu. Action: Contact the Response Center to have flag cleared
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1051
- Severity: MAJOR
- Event Summary: NVM flag has been used to halt the cell during HPMC
- Event Class: System
- Problem Description:
This chassis code should only be enabled at
the direction of the lab. If it is seen inadvertently, it is Equivalent
toERR_ASSERT. Lab must be notified.
- Cause / Action: Cause: Cell halt during HPMC has been enabled from
the BCH debug menu. Action: Contact the Response Center to have flag cleared
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1052
- Severity: MAJOR
- Event Summary: An IVA table has been installed, but the HPMC vector
is zero.
- Event Class: System
- Problem Description:
The IVA table is expected to provide an OS
HPMC handler. This event is sent if the first instruction of the handler is
NULL. PDC will reboot the partition instead of branching to the OS HPMC
handler.
- Cause / Action: Cause: IVA table has been incorrectly constructed
or corrupted. Action: There will be no OS crash-dump. Contact HP Support to
Analyze HPMC Report event to the Response Center
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1131
- Severity: MAJOR
- Event Summary: ECC parity error
- Event Class: System
- Problem Description:
The path between the Coherency Controller
(CC) and the Crossbar Chip (XBC) has failed the ECC and Parity testing. Data
Field: bit index of failed bit
- Cause / Action:
Cause: Link or Hardware Failure Action: Have
your HP support representative heck the CC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1251
- Severity: CRITICAL
- Event Summary: Failed to read from the XBC.
- Event Class: System
- Problem Description:
After an attempt to takeover the XBC Global
Semaphore, a read of the same register failed. This indicates a connectivity
failure.
- Cause / Action: Cause: Fabric Access Error Action: Check XBC. Check
Links/Flex Cables.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1467
- Severity: MAJOR
- Event Summary: PDC is unable to determine HVERSION for the CPU
module
- Event Class: System
- Problem Description:
PDC is unable to determine HVERSION for the
CPU module and is, therefore, about to halt the cell.
- Cause / Action:
Cause: Cell board or PDH riser hardware problem
preventing PDC from accessing PDH memory or registers. Action: Contact HP
Support to confirm the cell board and PDH riser card are functioning properly.
Cause: PDC bug in which implementation has changed such that is no longer
follows original design. Action: Find out if this is a known problem, and
upgrade PDC if it is a problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1468
- Severity: MAJOR
- Event Summary: PDC is unable to determine HVERSION for the CPU
module
- Event Class: System
- Problem Description:
PDC is unable to determine HVERSION for the
CPU module and is, therefore, about to halt the cell.
- Cause / Action:
Cause: Cell board or PDH riser hardware problem
preventing PDC from accessing PDH memory or registers. Action: Contact HP
Support to confirm the cell board and PDH riser card are functioning properly.
Cause: PDC bug in which implementation has changed such that is no longer
follows original design. Action: Find out if this is a known problem, and
upgrade PDC if it is a problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1469
- Severity: MAJOR
- Event Summary: PDC is unable to determine HVERSION for the CPU
module
- Event Class: System
- Problem Description:
PDC is unable to determine HVERSION for the
CPU module and is, therefore, about to halt the cell.
- Cause / Action:
Cause: Cell board or PDH riser hardware problem
preventing PDC from accessing PDH memory or registers. Action: Contact HP
Support to confirm the cell board and PDH riser card are functioning properly.
Cause: PDC bug in which implementation has changed such that is no longer
follows original design. Action: Find out if this is a known problem, and
upgrade PDC if it is a problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1470
- Severity: MAJOR
- Event Summary: PDC was unable to determine an HVERSION for this CPU
even in MFG mode
- Event Class: System
- Problem Description:
An entry that directly corresponds to the
data associated with the executing CPU was not found in the HVERSION table in
PDC. However, PDC then tries to see if the complex is in MFG mode. In this
case, the complex was found to be in MFG mode. Therefore, PDC proceeded to
find out if there was at least a close configuration whose HVERSION could be
used instead of halting.
- Cause / Action: Cause: The complex was in Normal Mode and the cell
or CPUs were either not part of an expected shippable configuration or the
complex was in MFG Mode and the cell or CPUs were so far from an expected
shippable configuration that PDC could not even find a close configuration
from which to derive the HVERSION. Action: Figure out what's wrong with the
configuration either through chassis logs or by verifying the configuration
you have and cross-checking it against documented supported configurations.
Change your hardware so that it is a supported configuration.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1471
- Severity: MAJOR
- Event Summary: PDC was unable to find out if the complex is in MFG
Mode
- Event Class: System
- Problem Description:
An entry that directly corresponds to the
data associated with the executing CPU was not found in the HVERSION table in
PDC. However, PDC then tries to see if the complex is in MFG mode. This
chassis log is sent to indicate that PDC was unable to find out what the
operating mode was for the complex.
- Cause / Action: Cause: Cell board or PDH riser hardware problem
preventing PDC from accessing PDH memory or registers. Action: Contact HP
Support to confirm the cell board and PDH riser card are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1487
- Severity: MAJOR
- Event Summary: PDC could not determine the CPU module's HVERSION
- Event Class: System
- Problem Description:
The cell is about to be halted.
- Cause / Action:
Cause: The complex was in Normal Mode and the
cell or CPUs were either not part of an expected shippable configuration or
the complex was in MFG Mode and the cell or CPUs were so far from an expected
shippable configuration that PDC could not even find a close configuration
from which to derive the HVERSION. Action: Figure out what's wrong with the
configuration either through chassis logs or by verifying the configuration
you have and cross-checking it against documented supported configurations.
Change your hardware so that it is a supported configuration.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1544
- Severity: FATAL
- Event Summary: Partition monarch CPU cannot obtain data from a cell
in its partition.
- Event Class: System
- Problem Description:
User attempted to return to BCH from ISL,
but the PD monarch could not access data in PDH memory of a cell in its
partition, so reset PD.
- Cause / Action:
Cause: CPU is on unreachable cell Defective CPU
Action: Contact HP Support personnel to troubleshoot cell board Investigate
for fabric problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1550
- Severity: MAJOR
- Event Summary: Error occurred accessing a PDC data structure
- Event Class: System
- Problem Description:
Error occurred accessing a PDC data
structure. The cell will be halted. The data field contains the return status
for the function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser
card. Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1551
- Severity: MAJOR
- Event Summary: Error occurred accessing a PDC data structure
- Event Class: System
- Problem Description:
Error occurred accessing a PDC data
structure. The cell will be halted. The data field contains the return status
for the function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser
card. Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1553
- Severity: MAJOR
- Event Summary: Error occurred accessing a PDC data structure
- Event Class: System
- Problem Description:
Error occurred accessing a PDC data
structure. The cell will be halted. The data field contains the return status
for the function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser
card. Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1554
- Severity: MAJOR
- Event Summary: Error occurred accessing a PDC data structure
- Event Class: System
- Problem Description:
Error occurred accessing a PDC data
structure. The cell will be halted. The data field contains the return status
for the function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser card.
Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1632
- Severity: MAJOR
- Event Summary: PDC was unable to check the compatibility of the
CPUs
- Event Class: System
- Problem Description:
Around the time of early CPU self tests, PDC
checks that all the CPUs within the cell are compatible with one another.
However, PDC was unable to perform this check. Therefore, the cell is about to
be halted.
- Cause / Action:
Cause: Something is wrong with the cell that
either prevents PDC from accessing PDH memory or causes PDC not to fetch and
execute code properly. Action: Troubleshoot the cell hardware to determine if
this is the case Cause: There is a PDC bug in which the PDC implementation has
changed over time and no longer abides by the original design of CPU
compatibility checking. Action: Find out if PDC has found any problems with
this part of the code and if there is a new PDC image, and if so, upgrade PDC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1633
- Severity: MAJOR
- Event Summary: PDC was unable to successfully complete the CPU
homogeneity check
- Event Class: System
- Problem Description:
PDC was unable to complete the task of
verifying that the processors satisfy their homogeneity requirements. The cell
is about to be halted.
- Cause / Action:
Cause: Cell hardware failure preventing PDC
from being able to complete some homogeneity check. See high-alert level
chassis logs sent just prior to this one to find out exactly what data could
not be accessed. Action: Contact HP Support personnel to troubleshoot problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1634
- Severity: MAJOR
- Event Summary: PDC has determined that the CPUs on the cell are not
compatible.
- Event Class: System
- Problem Description:
PDC now allows for different revisions of
the same processor to run together on a single cell and within a partition.
PDC checks to make sure that all the CPUs on a cell are compatible with one
another. In this case, the function that checks for compatibility has returned
to its caller indicating the CPUs are not compatible. This chassis log
tells us that the cell is about to be halted and why.
- Cause / Action:
Cause: Check the chassis logs sent following
CC_BOOT_CPUs_ARE_INCOMPATIBLE to figure out why the CPUs are incompatible.
There will be chassis logs with the physical location of each of the CPUs that
were checked for compatibility, along with CPU type and CPU revision. Action:
Figure out which CPU(s) didn't belong in the cell and replace CPUs within the
cell accordingly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1635
- Severity: MAJOR
- Event Summary: PDC was unable to determine its operating mode
(Normal or MFG)
- Event Class: System
- Problem Description:
PDC was unable to complete the task of
determining system mode (mfg or normal). The cell is about to be halted.
- Cause / Action: Cause: Cell hardware failure preventing PDC access
to data in PDH memory Action: Contact HP Support personnel to troubleshoot
problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1636
- Severity: MAJOR
- Event Summary: PDC was unable to determine whether or not
processors are overclocked.
- Event Class: System
- Problem Description:
PDC was unable to complete the task of
checking for overclocked CPUs. The cell is about to be halted.
- Cause / Action:
Cause: Cell hardware failure preventing PDC from
being able to complete some homogeneity check. See high-alert level chassis
logs sent just prior to this one to find out exactly what data could not be
accessed. Action: Contact HP Support personnel to troubleshoot problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1653
- Severity: MAJOR
- Event Summary: CPU frequency is greater than the maximum reliable
speed for this processor
- Event Class: System
- Problem Description:
PDC detected that the processor is being
clocked at a rate that is more than the maximum speed at which the processor
is expected to run reliably. If the complex is in Normal Mode, this processor
will soon be deconfigured. If the complex is in MFG Mode, this high alert
level chassis log will be sent as a warning but the processor will be allowed
to boot anyway. To know which processor is being over clocked, find the local
CPU number for this processor in the data field of a chassis log sent just
prior, called CC_BOOT_MISMATCH_CPU_CAP_SPEEDS.
- Cause / Action:
Cause: The cell is programmed incorrectly such
that it is lying to PDC about the frequency at which the processor is running.
Action: Perform an update to the cell board so that it accurately reports the
rate at which processors are being clocked Cause2: One or more CPUs are being
over clocked. Action1: Contact HP Support personnel to troubleshoot the cell
board Cause3: PDC error in which PDC is incorrectly calculating the CPU speed.
Action3: Upgrade PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1654
- Severity: MAJOR
- Event Summary: CPU frequency is greater than the maximum reliable
speed for this processor
- Event Class: System
- Problem Description:
PDC detected that the processor is being
clocked at a rate that is more than the maximum speed at which the processor
is expected to run reliably. If the complex is in Normal Mode, this processor
will soon be deconfigured.
- Cause / Action:
Cause: The cell is programmed incorrectly such
that it is lying to PDC about the frequency at which the processor is running.
Action: Perform an update to the cell board so that it accurately reports the
rate at which processors are being clocked Cause2: One or more CPUs are being
over clocked. Action1: Contact HP Support personnel to troubleshoot the cell
board Cause3: PDC error in which PDC is incorrectly calculating the CPU speed.
Action3: Upgrade PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1655
- Severity: MAJOR
- Event Summary: PDC is was unable to read the CPU speed rating
and/or the actual CPU speed
- Event Class: System
- Problem Description:
PDC is was unable to read the CPU speed
rating and/or the actual CPU speed. When either case fails, both are sent out.
- Cause / Action:
Cause: Cell hardware failure preventing PDC
from reading speed rating data from PDH. Action: Contact HP Support personnel
to troubleshoot the cell board
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1656
- Severity: MAJOR
- Event Summary: PDC was unable to read the CPU speed rating and/or
the actual CPU speed.
- Event Class: System
- Problem Description:
PDC was unable to read the CPU speed rating
and/or the actual CPU speed. When either case fails, both are sent out.
- Cause / Action:
Cause: Cell hardware failure preventing PDC
from reading data from PDH memory Action: Contact HP Support personnel to
troubleshoot the cell board
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1696
- Severity: UNKNOWN
- Event Summary: Errors signaled in coherency controller Secondary
Error Mode Regs
- Event Class: System
- Problem Description:
PDC periodically polls the coherency
controller (CC) Secondary Error Mode registers to check for errors logged
there. During portions of bootstrap when some CC errors are masked, this
polling can illuminate transient errors that may otherwise be missed, or can
help to root cause a problem that appears later as an HPMC. These chassis
codes are also used to report CC register contents during HPMC handling. The
data field contains the CC block address and register address in the most
significant byte (each field occupying a nibble). The remainder of the data
field contains the Secondary Error Mode register contents. When errors are
masked (via clear bits in the Error Enable Mask), they are still recorded in
the Secondary Error Mask. This chassis code will not be output when error
overflow occurs from Primary to Secondary Error Mode. In that case only the
ERR_DNA_PRI_HEALTH chassis code will be output.
- Cause / Action:
Cause: Errors have been detected by CC, while
errors are masked from Primary Error Mode. Action: Analysis data field to
determine block, register and error status
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1698
- Severity: FATAL
- Event Summary: Machine check type could not be determined.
- Event Class: System
- Problem Description:
The Reporting Entity CPU experienced a trap
that has caused an asynchronous branch to the machine check handler, but CPU
logs do not indicate that an HPMC, LPMC or TOC has occurred. The data field
will contain the CPU Check Summary. This Check Summary is described in the
return value description for CpuProcessMachineCheck in PA-8800 CPU Library
Application
- Cause / Action:
Cause: Contact HP Support. Save event list and
Processor HPMC PIM for analysis by lab. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1702
- Severity: MAJOR
- Event Summary: Blocking Timeout detected by Concorde PIN block.
- Event Class: System
- Problem Description:
A blocking timeout has been detected by
Concorde PIN block. This will normally preclude branching to OS_HPMC and
collection of crash dump. PIM and ErrorLogs are collected to NVM by firmware.
Tombstones may be analyzed after reboot. Data field contains the physical
location of the affected CC.
- Cause / Action:
Cause: Blocking timeout in CC PIN block. OS
crash-dump will not occur. Contact HP Support to Analyze HPMC Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1705
- Severity: MAJOR
- Event Summary: During HPMC handling, a latent HPMC has been logged
while HPMCs are masked.
- Event Class: System
- Problem Description:
Data field contains the physical location
of the CPU that has logged a latent HPMC while HPMCs are masked. This is a
FATAL error, which precludes branch to OS_HPMC for dump.
- Cause / Action:
Cause: PD will reboot. No operator intervention
is required. Analyze HPMC cause using PIM/ErrorLogs (tombstones). There will
be no OS crashdump. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1706
- Severity: MAJOR
- Event Summary: An HPMC crash slave CPU has detected a latent HPMC
while HPMCs are masked.
- Event Class: System
- Problem Description:
The data field specifies the physical
location of the CPU that has detected a latent HPMC while HPMCs are masked.
This will prevent the HPMC crash monarch from branching to OS_HPMC for dump.
- Cause / Action:
Cause: The PD will reset rather than branch to
OS_HPMC. There will be no OS crash-dump. The HPMC cause should be determined
by analysis of PIM and ErrorLogs. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1708
- Severity: MAJOR
- Event Summary: An HPMC has occurred during an cell Online Add or
Online Delete FATAL section
- Event Class: System
- Problem Description:
If an HPMC occurs during Cell OL*
operations, there is a short FATAL section which PDC will not be sure of
Partition membership. If this section is interrupted, PDC will not branch to
OS_HPMC for crash dump. The PD will be reset at completion of PDC_HPMC.
- Cause / Action:
Cause: Partition will reboot rather than branch
to OS_HPMC for dump. No operator action is required. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1709
- Severity: MAJOR
- Event Summary: OAQ block of Concorde has hung due to multiple fatal
timeouts.
- Event Class: System
- Problem Description:
OAQ block in CC has experienced multiple
fatal timeouts, and has hung. The data field contains the physical location of
the CC.
- Cause / Action: Cause: A part of CC has hung. PDC_HPMC cannot
safely branch to OS_HPMC. PDC will reset the partition, precluding memory
crash dump. Action: Contact HP Support to Analyze HPMC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1713
- Severity: FATAL
- Event Summary: PDC was unable to access a cell board hardware
register
- Event Class: System
- Problem Description:
PDC was unable to access a cell board
hardware register or a cell board hardware register did not behave as
expected. The partition will be reset. The data field is the return status
from the function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser card.
Action: Contact HP support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action1: Contact
HP support to confirm the CPUs and cell board are functioning properly.
Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1714
- Severity: FATAL
- Event Summary: PD is doing a reset for reconfiguration because of
IPR clearing error
- Event Class: System
- Problem Description:
PDC is required to clear the IPR of all
cells in the partition prior to handing off the system to the OS. To meet this
requirement, the PD monarch clears the IPR on all cells in the partition as it
boots to ISL. This chassis log is thrown when there are one or more cells
whose IPR the PD monarch was unable to clear. This can happen for a couple of
reasons, but PDC is now doing a reset for reconfiguration of the partition to
get the cells to SINC_BIB so that the user/CE can address the problem. The
partition will be reset.
- Cause / Action:
Cause: Look for chassis logs
BOOT_ERROR_CLEARING_IPR_ON_CELL and BOOT_ERROR_CLEARING_IPR_AT_LAUNCH and
follow their cause actions. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1715
- Severity: FATAL
- Event Summary: PDC was unable to access a cell board hardware
register
- Event Class: System
- Problem Description:
PDC was unable to access a cell board
hardware register or a cell board hardware register did not behave as
expected. The partition will be reset. The data field is the return status
from the function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser card.
Action: Contact HP support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action1: Contact
HP support to confirm the CPUs and cell board are functioning properly.
Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1716
- Severity: FATAL
- Event Summary: PDC was unable to clear the IPR in Dillon for
unknown reason
- Event Class: System
- Problem Description:
Indicates an unexpected return status from
a PDC function. This is a PDC bug. Data field contains the unexpected return
value. The partition will be reset.
- Cause / Action:
Cause: PDC bug. Action: Contact HP Support to
check for PDC Upgrade.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1717
- Severity: MAJOR
- Event Summary: PDC has detected the PDH battery low warning
- Event Class: System
- Problem Description:
The Battery-Low interrupt was signaled in
the Interrupt Pending Register in Dillon (PDH) by the hardware. PDC is merely
reporting the problem.
- Cause / Action:
Cause: PDH battery power is low. Action:
Replace the PDH battery.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1718
- Severity: FATAL
- Event Summary: PDC is about to reset the partition due to one or
more PDH events
- Event Class: System
- Problem Description:
Due to PDH events PDC found pending on one
or more cells in the partition, PDC is about to do a reset for reconfiguration
of the partition. The data field contains the value of the flag that is used
in a PDC function. This flag value is what controls whether or not PDC enters
this section of code that is now going to reset the partition.
- Cause / Action:
Cause: One or more CRITICAL PDH events were
found pending in one or more of the cells' Interrupt Pending Registers.
Action: Look for other chassis logs sent shortly before that would indicate
what PDH events were found and on which cells, and handle those PDH events
according to the cause-action statements for the BOOT_PDH_EVENTS_PENDING and
BOOT_PDH_BATTERY_POWER_LOW chassis logs.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1719
- Severity: MAJOR
- Event Summary: There is at least one CPU module for which SPIROM
data is unavailable
- Event Class: System
- Problem Description:
While trying to gather the SPIROM data for
each CPU module present, PDC was unable to get SPIROM data from the Utilities
for at least one CPU module and PDC does not already have cached SPIROM data
for the CPU module to enable boot. So, PDC is about to deconfigure the modules
for which there is no SPIROM data available.
- Cause / Action:
Cause: EEPROM on the System Management Bus
(accessible to Utilities) that contains the SPIROM data has invalid data or
bad checksums. Action: Fix the contents of the EEPROM(s) to have valid SPIROM
data for the CPU module(s). Cause: Internal PDC problem. Action: Upgrade PDC
if there is a PDC ROM that fixes this particular problem. Cause: There could
be a problem in USB that is preventing the PDHC and the MP from communicating
to gather the SPIROM data. Action: Troubleshoot to find out if the problem is
that USB was just not functioning at the time PDC requested the SPIROM data.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1720
- Severity: MAJOR
- Event Summary: PDC received an error while communicating with the
PDHC
- Event Class: System
- Problem Description:
PDC received an error while communicating
with the PDHC. The cell will be halted. The data field contains the cell
number.
- Cause / Action:
Cause: Hardware problem with the MP or PDHC.
Action: Contact HP Support to confirm the manageability subsystem is
functioning properly. Cause: PDHC, MP, and/or PDC firmware are not compatible.
Action: Upgrade PDHC, MP, and/or PDC firmware to supported and compatible
revisions.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1721
- Severity: MAJOR
- Event Summary: PDC received an unexpected return status from an
internal function
- Event Class: System
- Problem Description:
PDC received an unexpected return status
from an internal function. The cell will be halted. Data field contains the
cell number.
- Cause / Action:
Cause: Hardware failure with CPU, CC or cell
board. Action: Contact HP Support to confirm the CPUs, CC, and cell board are
functioning properly. Update PDC if a version is available to fix this
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1724
- Severity: MAJOR
- Event Summary: TOC has interrupted a FATAL section of OLA/D
operation.
- Event Class: System
- Problem Description:
If a TOC occurs during Cell OL* operations,
there is a short FATAL section in which PDC will not be sure of Partition
membership. If this section is interrupted, PDC will not branch to OS_TOC. The
partition will reset instead.
- Cause / Action:
Cause: Identify cause of TOC. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1726
- Severity: MAJOR
- Event Summary: A write access to a scratch RAM structure failed.
- Event Class: System
- Problem Description:
PDC attempted to perform a write access to
the data structure that contains the CPU module HVersion, but a failure
occurred. The data field contains the return status from the function that
writes to the data structure. The cell will be halted.
- Cause / Action:
Cause: Cell hardware failure. Cause2: PDC
runtime error. Action: Contact HP support to troubleshoot the cell Action:
Check hardware for failures: cell Upgrade PDC if a newer version is available.
Contact response center. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1727
- Severity: CRITICAL
- Event Summary: PDC detected a failure in creating and/or passing
around an argument
- Event Class: System
- Problem Description:
An invalid CPU number was passed into an
internal PDC function. The data field contains the invalid parameter.
- Cause / Action:
Cause: Hardware failure with CPU, CC or cell
board. Action: Contact HP Support to confirm the CPUs, CC, and cell board are
functioning properly. Update PDC if a version is available to fix this
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1728
- Severity: CRITICAL
- Event Summary: PDC detected an illegal argument passed between
functions
- Event Class: System
- Problem Description:
PDC passed a bad value between functions,
specifically, an invalid number for a local CPU number. This is an internal
PDC error for which the cell will be halted. Data field is the invalid local
CPU number passed to a PDC function.
- Cause / Action: Cause: Cell hardware failure preventing PDC from
getting valid data from the hardware. Action1: Contact HP Support personnel to
troubleshoot problem Cause 2: Some internal PDC error where PDC incorrectly
determines the local CPU number Action2: Upgrade PDC Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1729
- Severity: MAJOR
- Event Summary: PDC detected an illegal CPU number while trying to
deconfigure a CPU
- Event Class: System
- Problem Description:
Cell is about to be halted because an
invalid CPU number was passed into an internal PDC function. The data field
contains the invalid parameter.
- Cause / Action:
Cause: Hardware failure with CPU, CC or cell
board. Action: Contact HP Support to confirm the CPUs, CC, and cell board are
functioning properly. Update PDC if a version is available to fix this
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1730
- Severity: MAJOR
- Event Summary: PDC detected an illegal CPU number while trying to
deconfigure a CPU
- Event Class: System
- Problem Description:
Cell is about to be halted because an
invalid CPU number was passed into an internal PDC function. The data field
contains the invalid parameter.
- Cause / Action:
Cause: Hardware failure with CPU, CC or cell
board. Action: Contact HP Support to confirm the CPUs, CC, and cell board are
functioning properly. Update PDC if a version is available to fix this
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1732
- Severity: MAJOR
- Event Summary: PDC was unable to access a local data structure.
- Event Class: System
- Problem Description:
PDC could not access one of its own data
structures on the local cell. The cell will be halted or if a partition has
been formed, the partition will be reset. The data field contains the return
status from the PDC function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser card.
Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1733
- Severity: MAJOR
- Event Summary: PDC was unable to access a local data structure.
- Event Class: System
- Problem Description:
PDC could not access one of its own data
structures on the local cell. The cell will be halted or if a partition has
been formed, the partition will be reset. The data field contains the return
status from the PDC function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser card.
Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1734
- Severity: MAJOR
- Event Summary: PDC was unable to access a local data structure.
- Event Class: System
- Problem Description:
PDC could not access one of its own data
structures on the local cell. The cell will be halted or if a partition has
been formed, the partition will be reset. The data field contains the return
status from the PDC function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser card.
Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1735
- Severity: MAJOR
- Event Summary: PDC was unable to access a local data structure.
- Event Class: System
- Problem Description:
PDC could not access one of its own data
structures on the local cell. The cell will be halted or if a partition has
been formed, the partition will be reset. The data field contains the return
status from the PDC function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser card.
Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1756
- Severity: FATAL
- Event Summary: A hardware failure with a PDH Raiser Card's
"hardware" semaphore register
- Event Class: System
- Problem Description:
- Cause / Action:
Cause: A hardware failure with a PDH
Raiser Card's "hardware" semaphore register. Action: Contact HP Support
personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1758
- Severity: CRITICAL
- Event Summary: Dumping error info. Read status of the Primary Mode
Register
- Event Class: System
- Problem Description:
The Coherency Controller's (CC) XIN link
did not initialize properly. The data field contains the return status from an
attempted read of the CC Primary Error Mode CSR. (0 = SUCCESS)
- Cause / Action:
CC to XBC link init failure. Contact your HP
service representative to check the CC to XBC link
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1760
- Severity: MAJOR
- Event Summary: The main backplane is reporting the LPM status as
fault.
- Event Class: System
- Problem Description:
The main backplane is reporting the LPM
status as fault.
- Cause / Action:
Many possible causes, repair / replace the
appropriate part.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1761
- Severity: MAJOR
- Event Summary: PDC failed clearing the OLA steering bit in the
Dillon microstatus reg.
- Event Class: System
- Problem Description:
PDC failed clearing the OLA steering bit
in the Dillon microstatus register. Data field contains the physical location
of the cell with the failure. This can only happen on an OLA cell and will
cause that cell to reset and not to join the existing partition.
- Cause / Action:
Cause1: Probably something wrong with the cell
hardware. Action1: Try OLAing a different cell. Contact HP Support personnel
to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1762
- Severity: MAJOR
- Event Summary: The IO backplane is reporting a LPM status as fault.
- Event Class: System
- Problem Description:
The IO backplane has reported a local
power monitor fault.
- Cause / Action:
Service / replace the appropriate part of, or
the entire backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1764
- Severity: MAJOR
- Event Summary: The System Flash Write Enable bit is incorrectly set
and now clearing by PDC
- Event Class: System
- Problem Description:
The System Flash Write Enable bit is
incorrectly set and now clearing by PDC. The Data field contains the value of
PDH Miscellaneous Signal Register read before System Flash bit is cleared.
- Cause / Action:
Cause: The System Flash Write Enable bit
is incorrectly set by hardware and now cleared by PDC. Action: If this chassis
code occurs in every boot then contact HP Support personnel to troubleshoot
the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1765
- Severity: CRITICAL
- Event Summary: Error copying the routing registers to the local
port
- Event Class: System
- Problem Description:
Error writing the XBC port's routing
registers. The cell will reboot. Data Field: XBC port << 44 | XBC num
<< 32 | return status
- Cause / Action:
Cause: XBC access failure. Action: Check
XBC, check links, check backplane, check CC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1766
- Severity: FATAL
- Event Summary: Resetting the partition because couldn't access PDH
memory
- Event Class: System
- Problem Description:
When returning from other software, like
when returning from ISL, PDC is trying to make sure that all of the slave
processors in the partition are asleep; however, this event ID indicates that
we were unable to access PDH memory of a cell that is supposed to be part of
our partition. The data field contains the error return status from a function
called SleepAndWakeupCountersGet().
- Cause / Action:
Cause: CPU is on unreachable cell
Defective CPU Action: Contact HP Support personnel to troubleshoot cell board
Investigate for fabric problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1767
- Severity: FATAL
- Event Summary: Resetting the partition because a processor was not
in expected state
- Event Class: System
- Problem Description:
When returning to PDC from other software,
like returning from ISL, PDC tries to verify that all the slave processors are
in the expected state (ie that all slave processors are asleep); however, this
event ID indicates that at least one processor active in the partition was not
asleep. So, PDC is going to reset the partition. The data field of this Event
ID is the global CPU number of the first CPU in the partition not found in the
expected state.
- Cause / Action:
Cause1: Software has not correctly returned all
CPUs to sleep state Action1: Reset would clear this issue Cause2: CPU did not
properly receive/execute sleep command Action2: Contact HP Support to
troubleshoot cell board
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1768
- Severity: MAJOR
- Event Summary: PDC could not access a data structure on the local
cell
- Event Class: System
- Problem Description:
PDC could not access one of its own data
structures on the local cell. The cell will be halted. The data field contains
the return status from the PDC function that encountered the error.
- Cause / Action:
Cause1: Hardware problem with the PDH riser
card. Action1: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause2: Hardware problem with the CPU or cell board. Action2:
Contact HP Support to confirm the CPUs and cell board are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1769
- Severity: MAJOR
- Event Summary: Resetting cell because processor couldn't access
it's own PDH memory
- Event Class: System
- Problem Description:
The cell will be reset because it was
unable to access PDH memory on its own cell. While trying to move all the
slave processors on the cell to the "late boot sleep", the monarch tried to
write the sleep timeout to PDH memory on its own cell, but encountered an
error in doing so. The data field contains a PDC return status.
- Cause / Action:
Cause1: Hardware problem with the cell (like
PDH memory) or the CC or CPU. Action1: Contact HP support to troubleshoot or
replace the cell board. Cause2: PDC bug. Action2: Contact HP Support to check
for PDC upgrade.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1770
- Severity: MAJOR
- Event Summary: Halting cell because bad parameter passing was
discovered
- Event Class: System
- Problem Description:
PDC attempted to tell a slave CPU to
execute from an unknown location. Data field contains the location id that PDC
attempted to move the slave to.
- Cause / Action:
Cause: PDC passed bad parameter Action:
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1771
- Severity: MAJOR
- Event Summary: Halting cell because PDC was unable to determine GNI
address of a function
- Event Class: System
- Problem Description:
Halting cell/PD because PDC was unable to
determine GNI address of a function. Data field contains a status return
indicating type of failure.
- Cause / Action:
Cause1: If data field of chassis code = -102 or
-103, cell failed in getting the address Action1: Check fabric connections
Contact HP support to troubleshoot cell board(s) Cause2: if data field
contains -104, PDC successfully read the address, but that address was
invalid, likely was not initialized Action2: Contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1772
- Severity: MAJOR
- Event Summary: Could not access a data structure on a cell in the
partition.
- Event Class: System
- Problem Description:
A write attempt to a data structure on the
executing cell board failed. The cell will be reset. The data field contains
the return value from PDC function that detected the error.
- Cause / Action:
Cause1: Hardware problem with the cell board,
CPU, or PDH riser card. Action1: Contact HP Support to confirm the cell board,
CPUs, and PDH riser card are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1773
- Severity: FATAL
- Event Summary: Error occurred accessing a PDC data structure
- Event Class: System
- Problem Description:
Error occurred accessing a PDC data
structure. Depending upon the situation the cell or entire partition will be
reset. The data field contains the return status for the function that
encountered the error
- Cause / Action:
Cause1: Hardware problem with the PDH riser
card. Action1: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause2: Hardware problem with the CPU or cell board. Action2:
Contact HP Support to confirm the CPUs and cell board are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1774
- Severity: MAJOR
- Event Summary: Error Data tied to a previous Error Assert event
- Event Class: System
- Problem Description:
A software error has occurred. Data field
consists of data pertinent to the error. Lab involvement is indicated.
- Cause / Action:
Cause: System Firmware design or code bug
is likely. Action: Contact the Response Center to report defect Upgrade PDC
firmware
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1775
- Severity: MAJOR
- Event Summary: The read of the Memory extender FRU failed.
- Event Class: System
- Problem Description:
The read of the Memory extender FRU
failed.
- Cause / Action:
Cause: The FRU EEPROM for the memory extender
is corrupted or the EEPROM was not able to be accessed. Action: Contact HP
Support to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1776
- Severity: CRITICAL
- Event Summary: Attempt to update Cell Static Routing has failed
- Event Class: System
- Problem Description:
Failed to route around a broken link on
cell reboot. Data Field: PDC return status
- Cause / Action:
Fabric Access Error
Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1777
- Severity: CRITICAL
- Event Summary: Failed to read the XBC CSR that contains the number
of failed links
- Event Class: System
- Problem Description:
Could not read the XBC register that
contains the number of links that are currently broken on the complex. Data
Field: (XBC Num << 32) | PDC return status
- Cause / Action:
Fabric Access Failure
Contact HP Support
personnel to check the XBC, Backplane, CC, look for additional chassis codes
to describe the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1778
- Severity: CRITICAL
- Event Summary: Failed to write the XBC CSR that contains the
number of failed links
- Event Class: System
- Problem Description:
Could not write the XBC register that
contains the number of links that are currently broken on the complex. Data
Field: (XBC Num << 32) | PDC return status
- Cause / Action:
Fabric Access Failure
Contact HP Support
personnel to check the XBC, Backplane, CC, look for additional chassis codes
to describe the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1779
- Severity: CRITICAL
- Event Summary: Failed to read the XBC CSR that contains the number
of failed links
- Event Class: System
- Problem Description:
Could not read the XBC register that
contains the number of links that are currently broken on the complex. Data
Field: PDC return status
- Cause / Action:
Fabric Access Failure
Contact HP Support
personnel to check the XBC, Backplane, CC, look for additional chassis codes
to describe the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1780
- Severity: CRITICAL
- Event Summary: Failed to write the XBC CSR that contains the
number of failed links
- Event Class: System
- Problem Description:
Could not write the XBC register that
contains the number of links that are currently broken on the complex. Data
Field: (XBC Num << 32) | PDC return status
- Cause / Action:
Fabric Access Failure
Contact HP Support
personnel to check the XBC, Backplane, CC, look for additional chassis codes
to describe the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1781
- Severity: CRITICAL
- Event Summary: Failed to read the XBC CSR that marks the port route
around
- Event Class: System
- Problem Description:
Could not read the XBC register that marks
the ports that have been routed around. Data Field: (XBC Num << 32) |
PDC return status
- Cause / Action:
Fabric Access Failure
Contact HP Support
personnel to check the XBC, Backplane, CC, look for additional chassis codes
to describe the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1782
- Severity: CRITICAL
- Event Summary: Could not traverse the PIOB route to the remote XBC
- Event Class: System
- Problem Description:
The PIOB route to the remote XBC was not
traversable. The cell will halt. Data Field: (XBC Num << 32) | PDC
return status
- Cause / Action:
Broken Crossbar Link
Contact HP Support
personnel to check the XBC, Backplane, Flex Cables. Look for additional
chassis codes to provide more detail.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1783
- Severity: MAJOR
- Event Summary: Failed to release the XBC semaphore after land
mining
a remote XBC port
- Event Class: System
- Problem Description:
Could not release the remote XBC's
semaphore. Cell will halt. Data Field: (XBC Num << 32) | PDC return
status
- Cause / Action:
Fabric Access Failure
Contact HP Support
personnel to check the XBC, Backplane, CC, look for additional chassis codes
to describe the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1784
- Severity: MAJOR
- Event Summary: Windows IML: Temperature overheat condition warning
- Event Class: System
- Problem Description:
This error is logged when SCSI Disk
Drivers or Disk Array Drivers indicate that an overheat condition has
occurred.
- Cause / Action:
Cause: SCSI Disk Drivers or Disk Array Drivers
indicate an Overheat Condition. Action: Shutdown servers and storage box.
Check room temperature of room and air flow to storage box.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1785
- Severity: CRITICAL
- Event Summary: Windows IML: FATAL fan failure
- Event Class: System
- Problem Description:
This error is logged when SCSI Disk
Drivers of Disk Array driver detect a FATAL Fan Failure.
- Cause / Action:
Cause: SCSI Disk Drivers or Disk Array Drivers
indicate a FATAL Fan Failure. This alert occurs when redundant fans have
failed and the FATAL Fan Failure is imminent. Action: Replace Fan Modules as
soon as possible following the Fan Module Remove and Replace Procedures.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1786
- Severity: MAJOR
- Event Summary: Windows IML: Fan failure warning condition
- Event Class: System
- Problem Description:
This error is logged when SCSI Disk
Drivers of Disk Array driver detect a Fan Failure.
- Cause / Action:
Cause: SCSI Disk Drivers or Disk Array Drivers
indicate that a redundant fan has failed or is operating in a degraded
condition. Action: Replace Fan Module as soon as possible following the Fan
Module Remove and Replace Procedures.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1787
- Severity: MAJOR
- Event Summary: Windows IML: Door open event warning
- Event Class: System
- Problem Description:
This error is logged when SCSI Disk
Drivers or Disk Array Drivers detect an open door panel.
- Cause / Action:
Cause: SCSI Disk Drivers or Disk Array Drivers
detects an open door panel. Action: Close any open panels.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1788
- Severity: MAJOR
- Event Summary: Windows IML: Fans no longer redundant warning
- Event Class: System
- Problem Description:
This event should be logged after a fan
failure that causes a fan set to be no longer redundant.
- Cause / Action:
Cause: SCSI Disk Drivers or Disk Array Drivers
has issued a Fans No Longer Redundant alert. This event should be logged after
a fan failure that causes a fan set to be no longer redundant. Action: Replace
Fan Module as soon as possible following the Fan Module Remove and Replace
Procedure.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1789
- Severity: MAJOR
- Event Summary: Windows IML: Power Supply Failure
- Event Class: System
- Problem Description:
This error indicates that either SCSI Disk
Drivers or Disk Array Drivers have issued a Power Supply Failure alert.
- Cause / Action:
Cause: SCSI Disk Drivers or Disk Array Drivers
has issued a Power Supply Failure alert. Action: Replace with a proper power
supply.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1790
- Severity: MAJOR
- Event Summary: Windows IML: Power subsystem no longer redundant
- Event Class: System
- Problem Description:
This error is logged by Disk drivers when
a loss of redundancy is detected due to a power supply failure.
- Cause / Action:
Cause: Disk drivers logged a Power SubSystem No
Longer Redundant alert. A loss of redundancy due to the power supply failure.
Action: Replace with a proper power supply.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1791
- Severity: MAJOR
- Event Summary: Windows IML: Network adapter check
- Event Class: System
- Problem Description:
Network Drivers are detecting adapter
checks possibly due to a bad adapter or a bad driver.
- Cause / Action:
Cause: Network Drivers log this event when
adapter checks are detected. This event will never be repaired due to the
possibility of a large number of adapter checks being generated by a bad
adapter or bad driver. Action: No user action is required, informational
only.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1792
- Severity: CRITICAL
- Event Summary: Windows IML: Network adapter link fault
- Event Class: System
- Problem Description:
Network Drivers or Agents log this event
when a FATAL link problem is detected.
- Cause / Action:
Cause: Network Drivers or Agents log this event
when a FATAL link problem is detected. Action: Check your cable connections
and make sure the network cables are plugged in.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1793
- Severity: MAJOR
- Event Summary: Windows IML: Network adapter transmit timeout
- Event Class: System
- Problem Description:
Network Drivers detect a transmit timeout
possibly due to a bad adapter or a bad driver.
- Cause / Action:
Cause: Network Drivers log this event when a
transmit timeout is detected. This event will never be repaired due to the
possibility of a large number of transmit timeout that may occur with a bad
adapter Action: Check your network connections. If the problem remains, please
contact your support provider for further assistance.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1794
- Severity: MAJOR
- Event Summary: Windows IML: Network adapters no longer redundant
- Event Class: System
- Problem Description:
Network Drivers can not communicate with
one of the adapters in a redundant pair due to the slot being powered off.
- Cause / Action:
Cause: Network Drivers can not communicate with
one of the adapters in a redundant pair due to the slot being powered off. If
the power is turned off on both adapters of a pair, this event is only logged
once. Action: Check if the physical adapters are connected and their network
connections are working. If the problem remains, re-configure your team
settings.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1795
- Severity: MAJOR
- Event Summary: Windows IML: Network adapter redundancy reduced
- Event Class: System
- Problem Description:
Network Drivers or Agents can not
communicate with one of the adapters in a team and at least one adapter in a
team is still active.
- Cause / Action:
Cause: Network Drivers or Agents can not
communicate with one of the adapters in a team and at least one adapter in a
team is still active. Action: Check if the physical adapters are connected and
their network connections are working. If the problem remains, re-configure
your team settings.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1796
- Severity: CRITICAL
- Event Summary: Windows IML: SCSI Controller failure
- Event Class: System
- Problem Description:
This event is logged when SCSI Disk
Drivers detect a FATAL hardware failure.
- Cause / Action:
Cause: SCSI Disk Drivers detected a FATAL
hardware failure. Acton: Possible controller failure. Replace SCSI
controller.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1797
- Severity: MAJOR
- Event Summary: Windows IML: SCSI Controller failure warning
- Event Class: System
- Problem Description:
This event is logged when SCSI Disk
Drivers detect a hardware failure.
- Cause / Action:
Cause: SCSI Disk Drivers detected a hardware
failure. Acton: Possible controller failure. Replace SCSI controller.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1798
- Severity: CRITICAL
- Event Summary: Windows IML: SCSI Device failure
- Event Class: System
- Problem Description:
This event is logged when SCSI Disk
Drivers detect a FATAL disk failure.
- Cause / Action:
Cause: SCSI Disk Drivers detected a FATAL disk
failure. This event is never logged by the mini-port driver. Action: Replace
failed drive.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1799
- Severity: MAJOR
- Event Summary: Windows IML: SCSI Controller failure in redundant
configuration
- Event Class: System
- Problem Description:
This event is logged when SCSI Disk
Drivers detect a Controller failure event in a redundant configuration.
- Cause / Action:
Cause: SCSI Disk Drivers detected a Controller
failure event in a redundant configuration. Action: Identify and repair failed
component.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1800
- Severity: CRITICAL
- Event Summary: Windows IML: Disk Array Controller failure
- Event Class: System
- Problem Description:
This event is logged when Drive Array
Subsystem Drivers detect a FATAL controller failure event.
- Cause / Action:
Cause: Drive Array Subsystem Drivers detected a
FATAL controller failure event. Action: Possible controller failure. Replace
SCSI controller.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1801
- Severity: MAJOR
- Event Summary: Windows IML: Disk Array Controller failure warning
- Event Class: System
- Problem Description:
This event is logged when Drive Array
Subsystem Drivers detect a controller failure event.
- Cause / Action:
Cause: Drive Array Subsystem Drivers detected a
controller failure event. Action: Possible controller failure. Replace SCSI
controller.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1802
- Severity: CRITICAL
- Event Summary: Windows IML: Disk Array Controller device failure
- Event Class: System
- Problem Description:
This even is logged when Drive Array
Subsystem Drivers detect a FATAL disk failure.
- Cause / Action:
Cause: Drive Array Subsystem Drivers detected a
FATAL disk failure. Action: Replace failed drive.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1803
- Severity: MAJOR
- Event Summary: Windows IML: Disk Array Controller battery failure
- Event Class: System
- Problem Description:
This event is logged by Disk Array Drivers
to indicate that an Accelerator Battery Failure has occurred.
- Cause / Action:
Cause: Disk Array Drivers logged an Accelerator
Battery Failure event. Action: Replace battery on cache module.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1804
- Severity: MAJOR
- Event Summary: Windows IML: Disk Array Controller failure in
redundant configuration
- Event Class: System
- Problem Description:
This event is logged when Disk Array
Drivers detect that a Controller No Longer Redundant failure event has
occurred in a redundant configuration.
- Cause / Action:
Cause: Array Disk Drivers detected a Controller
No Longer Redundant failure event in a redundant configuration. Action:
Identify and repair failed component.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1805
- Severity: MAJOR
- Event Summary: Windows: Predictive Failure in Memory
- Event Class: System
- Problem Description:
ECC (Error Checking and Correcting) memory
is designed to detect and correct single-bit errors that occasionally occur in
computer systems. This memory module is currently correcting many single bit
errors.
- Cause / Action:
Cause: You will receive this message if the
system is correcting a lot of ECC single bit errors. It may mean that the
module is about to fail, or environmental conditions in the server are causing
more errors than usual. Action: If you receive this message, contact your
support provider to determine if a predictive repair should be made.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1806
- Severity: MAJOR
- Event Summary: Windows: Server Agents Management data not
accessible, locked property
- Event Class: System
- Problem Description:
Server Agents SNMP branch is not
responding due to a portion of the IPMI Management Subsystem being locked by
another entity.
- Cause / Action:
Cause: The installed management software has
detected an unstable state of the underlying IPMI (Intelligent Platform
Management Interface) subsystem and has disabled all management information
from being shown by any manageability applications. The management information
will become available automatically as soon as the IPMI subsystem has
stabilized. Action: None.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1807
- Severity: FATAL
- Event Summary: PDC could not access a complex profile
- Event Class: System
- Problem Description:
PDC could not access a complex profile.
The partition will be reset because the available complex profile is not
valid. Data field contains the return status from the function that
encountered the error.
- Cause / Action:
Cause1: An error occurred which prevented the
complex profiles from being distributed properly. Action1: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause2: A hardware problem exists with MP or PDHC hardware.
Action2: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1808
- Severity: MAJOR
- Event Summary: Unexpected fabric firmware error
- Event Class: System
- Problem Description:
An unexpected error occurred while
initializing the fabric. The firmware is not able to analyze this error. Clues
to the cause of this error may be found in the IPMI forward progress log (FPL)
either shortly before or after this log entry occurred. The FPL is available
from the management processor using the "sl" command.
- Cause / Action:
An unanticipated error occurred. Contact HP
Support personnel to analyze the IPMI FPL log.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1809
- Severity: FATAL
- Event Summary: PDC could not access a complex profile
- Event Class: System
- Problem Description:
PDC could not access a complex profile.
The partition will be reset. Data field contains the return status from the
function that encountered the error.
- Cause / Action:
Cause1: An error occurred which prevented the
complex profiles from being distributed properly. Action1: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause2: A hardware problem exists with MP or PDHC hardware.
Action2: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1810
- Severity: MAJOR
- Event Summary: Configuration information on the processor was
invalid
- Event Class: System
- Problem Description:
Configuration information on the processor
was invalid. The cell will be halted. Data field contains the return value
from the function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the CPU. Action:
Contact HP Support to confirm the CPU is functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1811
- Severity: FATAL
- Event Summary: PDC could not read an internal CPU register
- Event Class: System
- Problem Description:
PDC could not read an internal CPU
register. The partition will be reset. Data field is the return status from
the function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the CPU. Action:
Contact HP Support to confirm the CPU is functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1812
- Severity: FATAL
- Event Summary: PDC could not read an internal CPU register
- Event Class: System
- Problem Description:
PDC could not read an internal CPU
register. The partition will be reset. Data field is the return status from
the function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the CPU. Action:
Contact HP Support to confirm the CPU is functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1813
- Severity: FATAL
- Event Summary: PDC failed reading a specific value from its own
copy of the internal CPU regs
- Event Class: System
- Problem Description:
PDC failed reading a value out of its own
copy of the internal CPU register settings. Data field is a status return
indicating the type of failure.
- Cause / Action:
Cause1: problem on cell wherein PDC could not
properly access memory Action1: Contact HP support to troubleshoot cell board
Cause2: a non-existent/non-accessible register was specified by software.
Action2: Contact HP support for possible PDC upgrade
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1814
- Severity: FATAL
- Event Summary: PDC failed reading a specific value from its own
copy of the internal CPU regs
- Event Class: System
- Problem Description:
PDC failed reading a value out of its own
copy of the internal CPU register settings. Data field is a status return
indicating the type of failure.
- Cause / Action:
Cause1: problem on cell wherein PDC could not
properly access memory Action1: Contact HP support to troubleshoot cell board
Cause2: a non-existent/non-accessible register was specified by software.
Action2: Contact HP support for possible PDC upgrade
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1815
- Severity: FATAL
- Event Summary: PDC failed attempting to update internal CPU
registers
- Event Class: System
- Problem Description:
PDC attempted to update CPU registers to
match their respective settings in the complex profile, but a failure was
returned from the call to accomplish the update. Data field contains the
failure.
- Cause / Action:
Cause: Could not update CPU settings Action:
Contact HP support to troubleshoot cell board and CPU.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1816
- Severity: FATAL
- Event Summary: PDC could not access a complex profile
- Event Class: System
- Problem Description:
PDC could not access a complex profile.
The partition will be reset.. Data field contains the return status from the
function that encountered the error.
- Cause / Action:
Cause1: An error occurred which prevented the
complex profiles from being distributed properly. Action1: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause2: A hardware problem exists with MP or PDHC hardware.
Action2: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1817
- Severity: FATAL
- Event Summary: Cells in the partition have different complex
profiles.
- Event Class: System
- Problem Description:
Cell boards in the same partition have
different complex profiles. The partition will be rebooted and cannot be fully
booted until the problem is resolved. The data field is a bitmap of cells
where cell 0 is the least significant bit and cell 63 is the most significant
bit. A one on a cell's bit indicated that the cell has a complex profile that
did not match that of the core cell.
- Cause / Action:
Cause1: An error occurred which prevented the
complex profiles from being distributed properly. Action1: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause2: A hardware problem exists with MP or PDHC hardware.
Action2: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1818
- Severity: FATAL
- Event Summary: Cell has different Partition Config Data CRC then
core cell
- Event Class: System
- Problem Description:
PDC checks the Complex Profile C's
Extensible Header CRC of the partition configuration data for each of the
cells in the partition. If they do not match, this means that the cells have
different complex profiles. At this point, is unable to tell which version of
the complex profile is correct. The partition cannot be booted until this
problem is resolved. This chassis code indicates all of the cells that have
complex profiles that do not match the core cell's. The data field is the CRC
of the partition configuration data for the slave cell.
- Cause / Action:
Cause: The core cell detected that a cell in
its partition has a different complex profile than it does. Action: Look for a
chassis code called, BOOT_CORE_CHECK_HCELL_PROFILE, to see which cell's complex
profile was being checked. That cell is the cell that had the inconsistent
complex profile. Make sure the utilities system is functioning and reboot the
partition. If the reboot does not solve the problem, make sure PDH tests are
enabled. Replace the cell with the inconsistent complex profile. Change core
cells to see if the core cell is the cell that has the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1819
- Severity: FATAL
- Event Summary: PDC failed to read the processor architecture for
another cell in the partition
- Event Class: System
- Problem Description:
PDC attempts to make sure that all of the
cells in a partition are installed in the same processor architecture. PDC
failed to read the architecture for another cell. PDC will reset all of the
cells in the partition when this error is detected. The data field contains
the physical location of the cell reporting the event.
- Cause / Action:
Cause: PDC was unable to read a data structure
for another cell in the partition. This should never happen unless there is an
intermittent problem with the main backplane. Action: Contact HP support to
confirm that the main backplane is functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1820
- Severity: MAJOR
- Event Summary: Windows: Predictive Failure in Memory (Warning)
- Event Class: System
- Problem Description:
ECC (Error Checking and Correcting) memory
is designed to detect and correct single-bit errors that occasionally occur in
computer systems. This memory module is currently correcting many single bit
errors.
- Cause / Action:
Cause: You will receive this message if the
system is correcting a lot of ECC single bit errors. It may mean that the
module is about to fail, or environmental conditions in the server are causing
more errors than usual. This event message will be generated for one of the
following conditions 1000 single-bit errors on the same address in a 48 hour
time period. 50 single-bit errors on the same DIMM (not the same address) in a
24 hour time period. 100 single-bit errors on the same DIMM (not the same
address) in a 1 week time period. Action: If you receive this message, contact
your support provider to determine if a predictive repair should be made.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1821
- Severity: CRITICAL
- Event Summary: Windows: Predictive Failure in Memory (FATAL)
- Event Class: System
- Problem Description:
ECC (Error Checking and Correcting) memory
is designed to detect and correct single-bit errors that occasionally occur in
computer systems. This memory module is currently correcting many single bit
errors.
- Cause / Action:
Cause: You will receive this message if the
system is correcting a lot of ECC single bit errors. It may mean that the
module is about to fail, or environmental conditions in the server are causing
more errors than usual. This event message will be generated for one of the
following conditions 1500 single-bit errors on the same address in a 72 hour
time period. 120 single-bit errors on the same DIMM (not the same address) in
a 24 hour time period. 130 single-bit errors on the same DIMM (not the same
address) in a 1 week time period. Action: If you receive this message, contact
your support provider to determine if a predictive repair should be made.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1822
- Severity: CRITICAL
- Event Summary: A rope parity error occurred
- Event Class: System
- Problem Description:
A error occurred on the bus connecting the
PCI card to the system bus.
- Cause / Action:
An unexpected but random error occurred. Reboot
the system. There is a problem with the system bus. Contact your HP
representative to check the system bus.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1823
- Severity: CRITICAL
- Event Summary: PCI card inaccessible due to bus error
- Event Class: System
- Problem Description:
A PCI card has been marked as "fatal" by
the operating system due to a bus error. The LBA has been isolated by the
operating system due to an error which occurred in a device(s) connected to
that LBA.
- Cause / Action:
Cause: An unexpected but random error occurred.
Action: Reboot the system. Cause: There is a problem with the system bus.
Contact your HP representative to check for faulty devices on the bus..
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1824
- Severity: CRITICAL
- Event Summary: PCI card inaccessible due to device error
- Event Class: System
- Problem Description:
A PCI card has been marked as "fatal" by
the operating system due to a device error.
- Cause / Action:
An unexpected but random error occurred. Reboot
the system. There is a problem with the system bus. Contact your HP
representative to check the system bus. Check the system forward progress log
(available from the Management Processor) for additional information about
this problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1825
- Severity: FATAL
- Event Summary: error reading bmc first boot token
- Event Class: System
- Problem Description:
Firmware tried to read the first boot
token an got a failure. The data field contains the token number that FW tried
to read. This is a stop boot condition
- Cause / Action:
Cause: FW tried to read the first boot token
and received a failure. Action: AC power cycle the system Action: Contact HP
support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1826
- Severity: MAJOR
- Event Summary: a rendezvousing cell is non PA architecture and thus
incompatible.
- Event Class: System
- Problem Description:
monarch PA cell has detected that a cell
it is attempting to rendezvous into its PD is not a PA cell and is thus
incompatible.
- Cause / Action:
Cause: other cell is an IA cell
Action: replace IA cell with PA cell or reconfigure partition to exclude
the IA cell.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1827
- Severity: MAJOR
- Event Summary: failed to write the XBC error log clear register
- Event Class: System
- Problem Description:
A XBC error could not be cleared due to a
write failure. The data field indicates the type of error: (XBC Port Num
<< 56) | (XBC Num << 32) | error status
- Cause / Action:
Fabric Access Failure. Could not write to the
XBC. This could indicate a hardware problem. Include the
FABRIC_ERRORS_XBC_CLEAR_WR_ADDR event log and its data in any
reports.
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1828
- Severity: MAJOR
- Event Summary: Error encountered while reading the XBC CSR Error
Status Register
- Event Class: System
- Problem Description:
Failed to read the XBC Global CSR Error
Status register. Data Field: (XBC Port Num << 56) | (XBC Num <<
32) | error status
- Cause / Action:
Fabric Access Failure. Likely hardware problem.
Look for additional chassis codes to further isolate the error.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1829
- Severity: MAJOR
- Event Summary: The XBC CSR Low Severity error was not cleared
- Event Class: System
- Problem Description:
The XBC CSR Low Severity error was not
cleared or more errors remain. Data Field: (XBC Port Num << 56) | (XBC
Num << 32) | contents of the XBC CSR Error Status Register
- Cause / Action:
This could be caused by a fabric access error
or persistent CSR Low Severity errors. Check Crossbar hardware, flex cables,
backplane
Contact HP Support personnel to check the Crossbar hardware, flex
cables, backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1830
- Severity: MAJOR
- Event Summary: The XBC CSR High Severity error was not cleared
- Event Class: System
- Problem Description:
The XBC CSR High Severity error was not
cleared or more errors remain. Data Field: (XBC Port Num << 56) | (XBC
Num << 32) | contents of the XBC CSR Error Status Register
- Cause / Action:
This could be caused by a fabric access error
or persistent CSR Low Severity errors. Check Crossbar hardware, flex cables,
backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1831
- Severity: MAJOR
- Event Summary: Error encountered while reading the XBC Port Error
Status Register
- Event Class: System
- Problem Description:
Failed to read the XBC Port Error Status
register. Data Field: (XBC Port Num << 56) | (XBC Num << 32) |
error status
- Cause / Action:
Fabric Access Failure. Likely hardware problem.
Look for additional chassis codes to further isolate the error.
Contact HP
Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1832
- Severity: MAJOR
- Event Summary: Failed to read the XBC CSR Error Status register
- Event Class: System
- Problem Description:
Failed to read the XBC Global CSR Error
Status register. Data Field: (XBC Port Num << 56) | error status
- Cause / Action:
Fabric Access Failure. Likely hardware problem.
Look for additional chassis codes to further isolate the error.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1833
- Severity: MAJOR
- Event Summary: Failed to copy the XBC CSR Error Symbol01 Block
- Event Class: System
- Problem Description:
Firmware failed to copy the XBC CSR Error
symbol 01 registers into a data structure on the stack. Data Field: address
where the register contents are being copied
- Cause / Action:
Fabric Access Failure; Possibly an invalid
destination address. Check hardware, Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1834
- Severity: MAJOR
- Event Summary: Failed to copy the XBC CSR Error Symbol23 Block
- Event Class: System
- Problem Description:
Firmware failed to copy the XBC CSR Error
symbol 23 registers into a data structure on the stack. Data Field: address
where the register contents are being copied
- Cause / Action:
Fabric Access Failure; Possibly an invalid
destination address. Check hardware, Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1835
- Severity: MAJOR
- Event Summary: Failed to reset the XBC Low Severity Error Log State
- Event Class: System
- Problem Description:
Firmware was unable to reset the XBC CSR
Low Severity error log state. Data Field: (XBC Port Num << 56) | (XBC
Num << 32) | error status
- Cause / Action:
Fabric Access Failure.
Contact HP Support
personnel to check the XBC, Flex Cables, Backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1836
- Severity: MAJOR
- Event Summary: Failed to clear the XBC Low Severity Log Symbol 01
- Event Class: System
- Problem Description:
The XBC Low Severity error logs were not
cleared. Data Field: (XBC Port Num << 56) | (XBC Num << 32) |
number of failed clear attempts
- Cause / Action:
Fabric Access Failure.
Contact HP Support
personnel to check the XBC, Backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1837
- Severity: MAJOR
- Event Summary: Could not determine if there is a new XBC CSR Low
Severity error
- Event Class: System
- Problem Description:
Reading the XBC CSR Error Status register
failed. Data field: (XBC Port Num << 56) | (XBC Num << 32) | error
status
- Cause / Action:
Fabric Access Failure.
Contact HP Support
personnel to check the XBC, Flex Cables, Backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1838
- Severity: MAJOR
- Event Summary: Failed to read the XBC CSR Low Severity Error Log
State
- Event Class: System
- Problem Description:
Failed to read a XBC Global scratch
register that indicates if new, unlogged errors have been encountered. Data
field: (XBC Port Num << 56) | (XBC Num << 32) | error status
- Cause / Action:
Fabric Access Failure.
Contact HP Support
personnel to check the XBC, Flex Cables, Backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1839
- Severity: MAJOR
- Event Summary: Failed to reset the XBC Low Severity Error Log State
- Event Class: System
- Problem Description:
Firmware was unable to reset the XBC CSR
Low Severity error log state. Data Field: (XBC Port Num << 56) | (XBC
Num << 32) | error status
- Cause / Action:
Fabric Access Failure.
Contact HP Support
personnel to check the XBC, Flex Cables, Backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1840
- Severity: MAJOR
- Event Summary: Could not determine if there is a new XBC CSR High
Severity error
- Event Class: System
- Problem Description:
Reading the XBC CSR Error Status register
failed. Data field: (XBC Port Num << 56) | (XBC Num << 32) | error
status
- Cause / Action:
Check XBC, Flex Cables, Backplane
Contact HP
Support personnel to check the XBC, Flex Cables, Backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1841
- Severity: MAJOR
- Event Summary: Failed to read the XBC CSR High Severity Error Log
State
- Event Class: System
- Problem Description:
Failed to read a XBC Global scratch
register that indicates if new, unlogged errors have been encountered. Data
field: (XBC Port Num << 56) | (XBC Num << 32) | error status
- Cause / Action:
Fabric Access Failure.
Contact HP Support
personnel to check the XBC, Flex Cables, Backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1842
- Severity: CRITICAL
- Event Summary: An error occurred while enabling hashing in the
platform cache
- Event Class: System
- Problem Description:
An error occurred while enabling hashing
in the platform cache. The data field contains the status.
- Cause / Action:
Cause: An error return status. This could
happen if the tree was corrupted or there was an error verifying the hashing
setting. Action: reset the partition
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1843
- Severity: MAJOR
- Event Summary: The XBC CSR is not a valid CSR address
- Event Class: System
- Problem Description:
A write to an invalid XBC CSR address was
attempted. The write will not be allowed. The severity of this result will be
determined by the calling function. Data Field: XBC CSR address that was
attempted
- Cause / Action:
Invalid CSR address, possible firmware
defect.
Capture complete live logs and contact HP Support representative.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1844
- Severity: MAJOR
- Event Summary: A failure has occurred with a CPU during early self
tests
- Event Class: System
- Problem Description:
An error has occurred while a CPU was
performing early self tests. The data field contains a 32-bit error number and
32-bits of additional error information. The CPU will be deconfigured.
- Cause / Action:
Cause: internal error. Action: the CPU will be
deconfigured. If the error persists after a power cycle, contact HP
Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1845
- Severity: MAJOR
- Event Summary: An error has occurred during CPU FSB interface
initialization
- Event Class: System
- Problem Description:
An error has occurred during CPU FSB
interface initialization. The data field contains a 32-bit error number and
32-bits of additional error information. The CPU will be deconfigured.
- Cause / Action:
Cause: internal error. Action: the CPU will be
deconfigured. If the error persists after a power cycle, contact HP
Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1846
- Severity: MAJOR
- Event Summary: An error has occurred while obtaining CPU parameters
- Event Class: System
- Problem Description:
An error has occurred while obtaining CPU
parameters from the CPU abstraction layer. The data field contains a 32-bit
error number and 32-bits of additional error information. The CPU will be
deconfigured.
- Cause / Action:
Cause: internal error. Action: the CPU will be
deconfigured. If the error persists after a power cycle, contact HP
Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1847
- Severity: MAJOR
- Event Summary: An error has occurred getting CPU icache parameters
- Event Class: System
- Problem Description:
An error occurred while getting CPU icache
parameters from the CPU abstraction layer. The data field contains a 32-bit
error number and 32-bits of additional error information. The CPU will be
deconfigured if this error occurs during system boot.
- Cause / Action:
Cause: internal error. Action: during system
boot the CPU will be deconfigured. If the error persists after a power cycle,
contact HP Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1848
- Severity: MAJOR
- Event Summary: An error occurred getting CPU dcache parameters
- Event Class: System
- Problem Description:
An error occurred while obtaining CPU
dcache parameters from the CPU abstraction layer. The data field contains a
32-bit error number and 32-bits of additional error information. The CPU will
be deconfigured if error occurs during system boot.
- Cause / Action:
Cause: internal error. Action: during system
boot the CPU will be deconfigured. If the error persists after a power cycle,
contact HP Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1849
- Severity: MAJOR
- Event Summary: An error has occurred while initializing the CPU
cache to a known state
- Event Class: System
- Problem Description:
An error occurred while initializing the
CPU cache to a known state. The data field contains a 32-bit error number and
32-bits of additional error information. The CPU will be deconfigured.
- Cause / Action:
Cause: internal error. Action: the CPU will be
deconfigured. If the error persists after a power cycle, contact HP
Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1850
- Severity: MAJOR
- Event Summary: An error has occurred while enabling CPU cache error
monitoring
- Event Class: System
- Problem Description:
An error occurred while enabling CPU cache
error monitoring. The data field contains a 32-bit error number and 32-bits of
additional error information. The CPU will be deconfigured.
- Cause / Action:
Cause: internal error. Action: the CPU will be
deconfigured. If the error persists after a power cycle, contact HP
Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1851
- Severity: MAJOR
- Event Summary: An error has occurred while enabling machine check
traps on a CPU
- Event Class: System
- Problem Description:
An error occurred while enabling some
machine error check traps on a CPU. The data field contains a 32-bit error
number and 32-bits of additional error information. The CPU will be
deconfigured.
- Cause / Action:
Cause: internal error. Action: the CPU will be
deconfigured. If the error persists after a power cycle, contact HP
Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1852
- Severity: MAJOR
- Event Summary: An error has occurred while disabling machine error
check traps on a CPU
- Event Class: System
- Problem Description:
An error occurred while disabling matching
error check traps on a CPU. The data field contains a 32-bit error number and
32-bits of additional error information. The CPU will be deconfigured.
- Cause / Action:
Cause: internal error. Action: the CPU will be
deconfigured. If the error persists after a power cycle, contact HP
Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1853
- Severity: MAJOR
- Event Summary: An error has occurred during CPU serialized late
self tests
- Event Class: System
- Problem Description:
An error occurred during the serialized
CPU late self tests. The data field contains a 32-bit error number and 32-bits
of additional error information. The CPU will be deconfigured.
- Cause / Action:
Cause: internal error. Action: the CPU will be
deconfigured. If the error persists after a power cycle, contact HP
Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1854
- Severity: MAJOR
- Event Summary: An error occurred while enabling CPU L2 shared cache
- Event Class: System
- Problem Description:
An error occurred while enabling the CPU
L2 shared cache. The data field contains a 32-bit error number and 32-bits of
additional error information. The CPU will be deconfigured.
- Cause / Action:
Cause: internal error. Action: the CPU will be
deconfigured. If the error persists after a power cycle, contact HP
Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1855
- Severity: MAJOR
- Event Summary: An error occurred while getting default values for
CPU internal registers
- Event Class: System
- Problem Description:
An error while getting default values for
programmable CPU internal registers from the CPU abstraction layer. The data
field contains a 32-bit error number and 32-bits of additional error
information. The CPU will be deconfigured.
- Cause / Action:
Cause: internal error. Action: the CPU will be
deconfigured. If the error persists after a power cycle, contact HP
Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1856
- Severity: MAJOR
- Event Summary: An error occurred while getting an address for a CPU
internal register
- Event Class: System
- Problem Description:
An error occurred while getting an address
for a CPU internal register within a buffer from the CPU abstraction layer.
The data field contains a 32-bit error number and 32-bits of additional error
information. The CPU will be deconfigured.
- Cause / Action:
Cause: internal error. Action: the CPU will be
deconfigured. If the error persists after a power cycle, contact HP
Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1857
- Severity: MAJOR
- Event Summary: An error occurred while programming CPU internal
registers
- Event Class: System
- Problem Description:
An error occurred while programming CPU
internal registers with final configuration values. The data field contains a
32-bit error number and 32-bits of additional error information. The CPU will
be deconfigured.
- Cause / Action:
Cause: internal error. Action: the CPU will be
deconfigured. If the error persists after a power cycle, contact HP
Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1858
- Severity: MAJOR
- Event Summary: An error has occurred while attempting to get CPU
ITLB parameters
- Event Class: System
- Problem Description:
An error occurred while getting CPU ITLB
parameters from the CPU abstraction layer. The data field contains a 32-bit
error number and 32-bits of additional error information. The CPU will be
deconfigured if this error occurs during system boot.
- Cause / Action:
Cause: internal error. Action: during system
boot the CPU will be deconfigured. If the error persists after a power cycle,
contact HP Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1859
- Severity: MAJOR
- Event Summary: An error occurred while getting CPU DTLB parameters
- Event Class: System
- Problem Description:
An error occurred while getting CPU DTLB
parameters from the CPU abstraction layer. The data field contains a 32-bit
error number and 32-bits of additional error information. The CPU will be
deconfigured if this error occurs during system boot.
- Cause / Action:
Cause: internal error. Action: during system
boot the CPU will be deconfigured. If the error persists after a power cycle,
contact HP Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1860
- Severity: CRITICAL
- Event Summary: There was not enough error free memory in the system
to run the late self tests
- Event Class: System
- Problem Description:
There was not enough error free memory in
the system to run the late self tests.
- Cause / Action:
Due to excessive memory subsystem or DIMM
errors, the late self tests could not be run. DIMMs or memory extenders have
caused excessive errors and will need to be replaced. Consult the memory test
events regarding memory errors or view the Page Deallocation Table from
BCH.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1861
- Severity: MAJOR
- Event Summary: A Checksum error was encountered in the dynamic
profile
- Event Class: System
- Problem Description:
The Dynamic Complex Profile (Group B)
stored checksum did not equal the calculated checksum. The Expected Data and
Actual date are displayed in successive chassis codes.
- Cause / Action:
Cause: Push out a new complex profile and reset.
Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1862
- Severity: MAJOR
- Event Summary: A checksum error occurred on the Partition Profile.
- Event Class: System
- Problem Description:
The stored value of the complex profile
Group C does not match the calculated value. Expected data and actual data are
stored in successive chassis codes.
- Cause / Action:
Cause: Push out a new complex profile and
reboot. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1863
- Severity: MAJOR
- Event Summary: Unable to clear error in coherency controller (CC).
- Event Class: System
- Problem Description:
An error remains in coherency controller
(CC) primary error mode register after attempt to clear it. The data field
contains the contents of the Primary Error Mode register, with the
most-significant byte over-written with the CC block address.
- Cause / Action:
Cause: During HPMC handling, when errors are
masked, this would indicate a CRITICAL failure to clear an error on the local
cell. At other times, it could indicate a recurring error. Action: Analyze
HPMC to determine the cause of the failure If during HPMC handling,
troubleshoot the cell board the cell board
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1865
- Severity: MAJOR
- Event Summary: Multiple Loss of Lockstep results in cell halt.
- Event Class: System
- Problem Description:
The cell identified by the data field
(physical location) has detected multiple loss of lockstep events the last
power-on. The cell will be halted to prevent possible spreading of fabric
errors to other partitions.
- Cause / Action:
Cause: Fabric problem. Action: Check HPMC
PIM/ErrorLogs for cause of HPMC. Check fabric and backplane connectivity.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1866
- Severity: FATAL
- Event Summary: PDC encountered an unexpected event and could not
continue.
- Event Class: System
- Problem Description:
PDC called an internal utility function and
that function unexpectedly reported an error.
- Cause / Action:
Cause: Report the incident and the Data Contents
to Hewlett-Packard. Reboot. Reinstall PDC Firmware. Contact HP Support
personnel to troubleshoot the problem. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1867
- Severity: FATAL
- Event Summary: Fatal internal error
- Event Class: System
- Problem Description:
The attempt to get GI range information
from the Stable Complex Data failed. Data Field: PDC call status return
- Cause / Action: Cause: Probable hardware error Action: Contact HP
Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1868
- Severity: FATAL
- Event Summary: Fatal internal error
- Event Class: System
- Problem Description:
The attempt to get GI resource information
from the Stable Complex Data failed. Data Field: PDC call status return
- Cause / Action:
Cause: Probable hardware error Action: Contact
HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1869
- Severity: FATAL
- Event Summary: Fatal internal error
- Event Class: System
- Problem Description:
The attempt to get the KGM value from
information from Partition Configuration Data failed. Data Field: PDC call
status return
- Cause / Action:
Cause: Probable hardware error Action: Contact
HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1870
- Severity: FATAL
- Event Summary: Fatal internal error
- Event Class: System
- Problem Description:
The attempt to get the quantity of
installed memory of a cell failed. Data Field: PDC call status return
- Cause / Action:
Cause: Probable hardware error Action: Contact
HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1871
- Severity: FATAL
- Event Summary: Fatal internal error
- Event Class: System
- Problem Description:
The attempt to read the Stable
Configuration Data failed. Data Field: PDC call status return
- Cause / Action:
Cause: Probable hardware error Action: Contact
HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1872
- Severity: FATAL
- Event Summary: Fatal internal error
- Event Class: System
- Problem Description:
The attempt to get ZI range information
from Stable Complex Data failed. Data Field: PDC call status return
- Cause / Action:
Cause: Probable hardware error Action: Contact
HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1873
- Severity: FATAL
- Event Summary: Unexpected overflow of an internal Firmware Data
Structure.
- Event Class: System
- Problem Description:
PDC detected the case where there were more
Address Map entries than space was allowed for, AND the mechanism to safely
handle this case failed.
- Cause / Action:
Cause: Record the Chassis Codes, and the exact
memory configuration, including Base and Floating Cells, and any Cell Local
Memory. Report the data to Hewlett-Packard. Reinstall PDC Firmware. Change the
memory configuration by adding or removing Floating Cells, Cell Local Memory,
deallocating DIMMs, or the cells themselves from the Partition. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1874
- Severity: MAJOR
- Event Summary: The memory configuration was adjusted to satisfy the
Minimum ZI requirement.
- Event Class: System
- Problem Description:
PDC detected the case where too much memory
was allocated to Cell Local Memory. There was not enough memory to meet the
Minimum ZI requirement. PDC reduced the amount of Cell Local Memory in order
to meet the Minimum ZI requirement, and continued.
- Cause / Action:
Cause: Review the Chassis Codes and determine
if a cell, or DIMMs within a cell, were removed from the Interleave because of
an error. If so, correct the problem and reboot. If there was no such event,
the Partition is probably misconfigured. Review the configuration and correct
it with the Parmanager tools. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1875
- Severity: FATAL
- Event Summary: Fatal internal error
- Event Class: System
- Problem Description:
The Cell Map code attempted to build a data
structure describing the memory of each cell in the partition and an error was
reported. Data Field: PDC call status return
- Cause / Action:
Cause: Probable hardware error Action: Contact
HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1876
- Severity: FATAL
- Event Summary: Chassis code not implemented
- Event Class: System
- Problem Description:
This chassis code is not used in the
current revision of PDC
- Cause / Action:
Cause: This chassis code is not used in the
current revision of PDC Action: No action is required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1877
- Severity: MAJOR
- Event Summary: The memory configuration was adjusted to satisfy the
Minimum ZI requirement.
- Event Class: System
- Problem Description:
PDC detected the case where all cells in
the Partition were configured as Floating cells, or there was not enough
memory to satisfy the Minimum ZI requirement in the available Base cell(s).
PDC converted a Floating Cell into a Base Cell in order to obtain enough
memory to satisfy the Minimum ZI requirement, and continued.
- Cause / Action:
Cause: Review the Chassis Codes and determine
if a cell, or DIMMs within a cell, were removed from the Interleave because of
an error. If so, correct the problem and reboot. If there was no such event,
the Partition is probably misconfigured. Review the configuration and correct
it with the Parmanager tools. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1878
- Severity: FATAL
- Event Summary: Internal error: unexpected internal parameter value
- Event Class: System
- Problem Description:
An internal parameter was found to be
incorrect or out of bounds.
- Cause / Action:
Cause: Probable hardware problem Action:
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1879
- Severity: FATAL
- Event Summary: Internal error: unexpected internal parameter value.
- Event Class: System
- Problem Description:
An internal parameter was found to be
incorrect or out of bounds.
- Cause / Action:
Cause: Probable hardware problem Action:
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1880
- Severity: FATAL
- Event Summary: Internal error: unexpected internal parameter value.
- Event Class: System
- Problem Description:
An internal parameter was found to be
incorrect or out of bounds.
- Cause / Action:
Cause: Probable hardware problem Action:
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1881
- Severity: FATAL
- Event Summary: Internal error: unexpected internal parameter value.
- Event Class: System
- Problem Description:
An internal parameter was found to be
incorrect or out of bounds.
- Cause / Action:
Cause: Probable hardware problem Action:
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1882
- Severity: FATAL
- Event Summary: Internal error: unexpected internal parameter value.
- Event Class: System
- Problem Description:
An internal parameter was found to be
incorrect or out of bounds.
- Cause / Action:
Cause: Probable hardware problem Action:
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1883
- Severity: FATAL
- Event Summary: Internal error: unexpected internal parameter value.
- Event Class: System
- Problem Description:
An internal parameter was found to be
incorrect or out of bounds.
- Cause / Action:
Cause: Probable hardware problem Action:
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1884
- Severity: FATAL
- Event Summary: PDC encountered an unexpected event and could not
continue.
- Event Class: System
- Problem Description:
While creating a memory-related data
structure, a PDC consistency check detected an illegal condition.
- Cause / Action:
Cause: Report the incident and the Data Contents
to Hewlett-Packard. Reboot. Reinstall PDC Firmware. Contact HP Support
personnel to troubleshoot the problem. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1885
- Severity: FATAL
- Event Summary: PDC encountered an unexpected event and could not
continue.
- Event Class: System
- Problem Description:
PDC called an internal utility function and
that function unexpectedly reported an error.
- Cause / Action:
Cause: Record the Chassis Codes, and the exact
memory configuration, including Base and Floating Cells, and any Cell Local
Memory. Report the data to Hewlett-Packard. Reinstall PDC Firmware. Change the
memory configuration by adding or removing Floating Cells, Cell Local Memory,
deallocating DIMMs, or the cells themselves from the Partition. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1886
- Severity: MAJOR
- Event Summary: Cell Map was changed to satisfy the KGM parameter
- Event Class: System
- Problem Description:
A cell was excluded from interleaving
because its memory had an uncorrectable (DBE) error that would otherwise be
interleaved to an address below the KGM (Known Good Memory) threshold. Note
that the processors of this cell are still included in the Partition, just its
memory has been excluded from interleaving. Note also that the Cell Map will
interleave memory correctly and the Partition will run properly. However, some
memory has not been interleaved and performance will probably be reduced,
possibly significantly.
- Cause / Action:
Cause: An uncorrectable (DBE) error in that
cell's memory Action: Check the chassis logs for the PDT entry(s) from that
cell, or at BCH, issue the command "PDT" and "PDT " from the Service
sub-menu. At the customer's convenience, replace the DIMM(s) containing the
uncorrectable error and reboot.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
|
Event 1887
- Severity: MAJOR
- Event Summary: Discrepancy in the number of overall Cell Map
entries available
- Event Class: System
- Problem Description:
Cell Map code is reporting a discrepancy
regarding the number of Cell Map entries available overall. The Cell Map
discovered a discrepancy regarding these parameters. The least significant 8
bits of the parameter report how many entries are available with which to
interleave the ZI region, and the next 8 bits report the total number of Cell
Map entries.
- Cause / Action:
Cause: Probable hardware problem Action:
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1888
- Severity: MAJOR
- Event Summary: One or more of the Cell Map entries is not
initialized properly
- Event Class: System
- Problem Description:
Before the Cell Map code starts calculating
the Cell Map entries, it checks the Cell Map data structure to which the
finished Cell Map entries will be written, for proper initialization values.
One or more of the Cell Map elements were not initialized properly. The
parameter is a bit mask where a "1" indicates which entry(s) were not
initialized properly. Entry 0 is represented by the least significant bit.
- Cause / Action:
Cause: Probable hardware problem Action:
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1889
- Severity: FATAL
- Event Summary: There was a fatal error reported by the Cell Map
code
- Event Class: System
- Problem Description:
The Cell Map code encountered a fatal
condition and was unable to interleave memory. The return parameter is
reported as data. Note that all possible configurations are legal and no
configuration should cause this failure. The only possibility where this could
occur is when the memory of every cell in the partition has been excluded from
the interleave due to KGM violations MEM_CMAP_INTLV_ADJUSTED_FOR_KGM) and is
extremely unlikely. If this is the case resolve the KGM problem(s.)
- Cause / Action:
Cause: Probable hardware failure KGM violation
on every cell in the Partition Action: Contact HP Support personnel to
troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1890
- Severity: FATAL
- Event Summary: Programming the Coherency Controller chip(s) with
the Cell Map failed
- Event Class: System
- Problem Description:
PDC was unable to program the Cell Map into
the Coherency Controller chip(s). The failure status is reported in the
parameter.
- Cause / Action:
Cause: Probable hardware problem. Note that the
Coherency Controller chip of every cell in the partition is written with the
Cell Map and one or more cells and/or the backplane may be defective. Action:
Replace the cell(s) or incrementally remove cell(s) from the Partition to
determine which is defective. If all cells seem good, the backplane is
probably defective.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1891
- Severity: FATAL
- Event Summary: Failure to read partition number from the Stable
Complex Profile
- Event Class: System
- Problem Description:
The call to read the Partition number from
the Complex Profile A "Cell Assignments" field failed. Note that this does not
mean the Partition number was invalid, rather that it could not be obtained at
all. The return status is reported as the parameter.
- Cause / Action:
Cause: Probable hardware error Action: Contact
HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1892
- Severity: MAJOR
- Event Summary: PDC encountered an unexpected event and could not
continue.
- Event Class: System
- Problem Description:
PDC detected a cell in the Partition that
was not a Base or Floating Cell. Possible hardware failure, corrupted
Firmware, or Firmware defect.
- Cause / Action:
Cause: Check the cell assignments with the
Parmanager tools. Try deleting and recreating the Partition in question.
Report the incident and the DataContents to Hewlett-Packard. Reboot. Reinstall
PDC Firmware. Contact HP Support personnel to troubleshoot the problem.
Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1893
- Severity: FATAL
- Event Summary: Unrecognized memory type was discovered in internal
PDC data structure
- Event Class: System
- Problem Description:
The Cell Map code, while parsing an
internal data structure in order to build the Partition Memory Map,
encountered an unrecognized memory descriptor type. The descriptor is reported
in the parameter of this chassis code.
- Cause / Action:
Cause: Probable hardware problem Action: Contact
HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1894
- Severity: MAJOR
- Event Summary: A DIMM was deallocated because the PDT was full
- Event Class: System
- Problem Description:
A DIMM was successfully deallocated from
system for the case where the PDT was full. The system is still configured
correctly and will function properly but performance my be reduced.
- Cause / Action:
Cause: Deallocation for a full PDT table Action:
Replace the DIMM(s) that were deallocated
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1895
- Severity: FATAL
- Event Summary: Error trying to retrieve CPU type - NOT USED
- Event Class: System
- Problem Description:
This chassis code is currently unused. It
was to be used to indicate a failure returning the cpu type associated with
support for DNA 3.0 processing.
- Cause / Action:
Cause: PDC error Action: Update PDC and report
error to PDC team
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1896
- Severity: FATAL
- Event Summary: Error trying to retrieve MFG Mode value- NOT USED
- Event Class: System
- Problem Description:
This chassis code is currently unused. It
was to be used to indicate a failure returning the mfg mode associated with
support for DNA 3.0 processing.
- Cause / Action:
Cause: PDC error Action: Update PDC and report
problem to PDC team
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1897
- Severity: FATAL
- Event Summary: Unsupported CPU Type detected - NOT USED
- Event Class: System
- Problem Description:
This chassis code is currently not used. It
was to indicate that an unsupported cpu type was detected for the current cell
for use in DNA 3.0 support. This is a fatal error and will result in the
halting of the cell.
- Cause / Action:
Cause: PDC error Action: Upgrade PDC and report
problem to PDC team
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1898
- Severity: FATAL
- Event Summary: Cell halted for fatal error
- Event Class: System
- Problem Description:
Cell halted when fatal error was detected
in memory
- Cause / Action:
Cause: A fatal error was detected Action: Refer
to previous chassis codes for more information on the nature of the problem
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1899
- Severity: FATAL
- Event Summary: Bank Select bits error in programming MBAT values
- Event Class: System
- Problem Description:
Bank select programming values are
incorrect. The cell is halted.
- Cause / Action:
Cause: Corrupt Interleaving table - Could be
h/w or PDC problem Action: Report event to Response Center Update PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1900
- Severity: FATAL
- Event Summary: Invalid rank number detected in cell info table
- Event Class: System
- Problem Description:
Physical Rank number not found in cell info
table. The cell is reset.
- Cause / Action:
Cause: Corrupted cell info table Action: Report
event to Response Center Update PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1901
- Severity: FATAL
- Event Summary: Rank Number input to Report Syndrome Function is
incorrect
- Event Class: System
- Problem Description:
There was an error detected when verifying
the input rank number. The rank number was invalid.
- Cause / Action:
Cause: PDC error - contact the PDC team Action:
-
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1902
- Severity: FATAL
- Event Summary: MBAT information does not match address data
- Event Class: System
- Problem Description:
Input rank, bank, row, and column input
parameters do not match Interleaving lookup parameters associated with given
GNI address.
- Cause / Action:
Cause: Reverse Interleaving or Interleaving
lookup translation error Action: Report event to the Response Center Update
PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1903
- Severity: FATAL
- Event Summary: Timeout waiting for MOQ to clear
- Event Class: System
- Problem Description:
The MOQ failed to clear within the given
time limit. The cell is halted.
- Cause / Action: Cause: Hardware failure Action: Contact HP Support
personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1904
- Severity: FATAL
- Event Summary: PDC detects no active memory on cell board - no
DIMMs or all DIMMs deallocated
- Event Class: System
- Problem Description:
PDC detects that there is no DIMMs
installed on cell board or all DIMMs on cell board have been deallocated due
to operator deallocation or because of hardware problems.
- Cause / Action:
Cause: Hardware problem Action: Insert good
DIMMs into cell and/or re-allocate DIMMs that have been deallocated
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1905
- Severity: FATAL
- Event Summary: PDC could not mark parity error DIMM for
deallocation
- Event Class: System
- Problem Description:
PDC could not mark a DIMM for deallocation
that needed to be marked for deallocation because of the presence of a memory
parity error on the MID bus containing that DIMM. This is a fatal error and
will result in the halting of the cell.
- Cause / Action:
Cause: PDC problem Action: Upgrade PDC and
report problem to PDC team
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1906
- Severity: MAJOR
- Event Summary: Cell PDT table is full
- Event Class: System
- Problem Description:
PDC has tried to add a new entry in the
cell PDT table, but the PDT table is currently full. PDC will search the PDT
table for the memory rank with the most number of entries, and deallocate that
rank. PDC will than reset the cell which upon reboot will clear the PDT table
of all entries related to that deallocated rank.
- Cause / Action:
Cause: Bad DIMM/DIMMs/memory system Action:
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1907
- Severity: FATAL
- Event Summary: Slave attempting to execute memory write-random code
- Event Class: System
- Problem Description:
Slave CPU attempting to execute memory
write-random code although only monarch should be executing code.
- Cause / Action:
Cause: PDC error Action: Report the event to
the Response Center Update PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1908
- Severity: FATAL
- Event Summary: Slave failed to complete write-random code
- Event Class: System
- Problem Description:
Slave failed to complete write-random code.
Monarch CPU should only execute code.
- Cause / Action:
Cause: PDC error Action: Report the event to
the Response Center Update PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1909
- Severity: FATAL
- Event Summary: Slave attempting to execute memory read-random code
- Event Class: System
- Problem Description:
Slave CPU attempting to execute memory
read-random code although only monarch should be executing code.
- Cause / Action:
Cause: PDC error Action: Report the event to
the Response Center Update PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1910
- Severity: FATAL
- Event Summary: Slave failed to complete read-random code
- Event Class: System
- Problem Description:
Slave failed to complete read-random code.
Monarch CPU should only execute code.
- Cause / Action:
Cause: PDC error Action: Report the event to
the Response Center Update PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1911
- Severity: INFORMATION
- Event Summary: The Chassis Code output FIFO is full
- Event Class: System
- Problem Description:
While attempting to output a chassis log,
PDC detected that the chassis code output FIFO is full. The current chassis
log may have been lost. Future logs may be lost.
- Cause / Action:
Cause: The Cell PDH Controller (PDHC) or the
GSP is no longer reading logs from the FIFO or is unable to read them fast
enough. Action: Check the integrity of the PDHC, GSP, and USB Contact HP
Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1912
- Severity: MAJOR
- Event Summary: The checksum for the Cell Info data structure is
invalid
- Event Class: System
- Problem Description:
The Cell Info (AKA Cell Configuration)
structure contains an incorrect checksum while attempting to validate the
structure. The Cell Info structure may be corrupt. Data Field: Pointer to the
Cell Info structure.
- Cause / Action: Cause: Corruption of ICM Internal PDC error Action:
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1913
- Severity: MAJOR
- Event Summary: Cell Info data is not marked invalid while updating
Core I/O Present
- Event Class: System
- Problem Description:
Whenever the Cell Info (AKA Cell
Configuration) structure is being updated, its valid bit should be deasserted.
This was not the case while updating the Core I/O Present field. Data Field:
Global cell # of cell containing the target Cell Info structure.
- Cause / Action:
Cause: Corruption of ICM Internal PDC error
Action: Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1914
- Severity: MAJOR
- Event Summary: Value of Boot Inhibit field of the Cell Info data
structure is illegal
- Event Class: System
- Problem Description:
The new value for the Boot Inhibit field of
the Cell Info (AKA Cell Configuration) structure is found to be illegal while
updating the Cell Info structure. Data Field: Illegal inhibit value
- Cause / Action:
Cause: Internal PDC error Action: Contact HP
Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1915
- Severity: MAJOR
- Event Summary: An error occurred during initialization of the Cell
Info structure
- Event Class: System
- Problem Description:
An error occurred during initialization of
the Cell Info (AKA Cell Configuration) structure header. The Cell Info data
may be incomplete. Data Field: Return status of the internal PDC function
CellInfoInitHeader().
- Cause / Action: Cause: Corruption of Software semaphores Corruption
of ICM Action: Locate source of corruption Contact HP Support personnel to
troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1916
- Severity: MAJOR
- Event Summary: Detected an invalid state value when updating the
Cell Info structure
- Event Class: System
- Problem Description:
While updating the Cell_State field of the
Cell Info (AKA Cell Configuration) structure in ICM, PDC detected an invalid
value for the cell state value. This indicates an internal problem within PDC.
Data Field: Invalid value
- Cause / Action: Cause: Internal PDC error. Action: Contact HP
Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1917
- Severity: FATAL
- Event Summary: Unable to update the cell state field of the Cell
Info structure.
- Event Class: System
- Problem Description:
Unable to update the Cell Info structure
with the cell state for all cells within the partition. This code is issued
for a number of problems including fabric problems, target cells not in
partition, invalid arguments, PDC semaphore problems, and corruption of the
cell info structure.
- Cause / Action:
Cause: Loss of fabric connectivity. Cause:
Corruption of PDH memory. Cause: Internal PDC error. Action: Action: Reset
Partition. Action: Contact HP Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1918
- Severity: FATAL
- Event Summary: A hardware failure with a PDH Raiser Card
- Event Class: System
- Problem Description:
A hardware failure has been detected in a
PDH raiser board. The previous chassis log indicates the nature of the
failure. Data Field: Physical location of the cell containing the faulty
Dillon
- Cause / Action:
Cause: The cause is in the previously output
chassis log Action: Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1919
- Severity: FATAL
- Event Summary: Unable to access remote cell's revision register.
- Event Class: System
- Problem Description:
The local cell is not able to access the
cell board revision register on the target cell.
- Cause / Action:
Cause: Fabric connectivity problem. Action:
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1920
- Severity: FATAL
- Event Summary: PDC detected an error reading PDH register
- Event Class: System
- Problem Description:
PDC detected an error trying to read the
given PDH register. This chassis code is associated with
PDH_GET_PDH_REGS_FAILED_PDH_REGISTER which identifies the PDH register that
was trying to be read.
- Cause / Action:
Cause: Hardware error Action: Contact HP
Support personnel to troubleshoot the problem Cause: PDC error Action: Upgrade
PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1921
- Severity: MAJOR
- Event Summary: CPU already owns the hardware semaphore when
attempting to lock it
- Event Class: System
- Problem Description:
The executing CPU already owns the Dillon
"hardware" semaphore when attempting to lock it. This is the register located
at offsets 0x5F00B0 through 0x5F04A8. Data Field: Target cell's physical
location.
- Cause / Action:
Cause: Corruption of the hardware semaphore
Internal PDC error Action: Contact HP Support personnel to troubleshoot the
problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1922
- Severity: MAJOR
- Event Summary: The PDH Raiser Card's "h/w" semaphore is not locked
when attempting to unlock it
- Event Class: System
- Problem Description:
The PDH RAiser Card's "hardware" semaphore
is not locked when attempting to unlock it. This is the register located at
offsets 0x5F00B0 through 0x5F04A8. Data Field: Target cell's physical location
- Cause / Action: Cause: Corruption of the hardware semaphore
Internal PDC error Action: Contact HP Support personnel to troubleshoot the
problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1923
- Severity: MAJOR
- Event Summary: CPU does not own PDH Raiser Card's "h/w" semaphore
when attempting to unlock it
- Event Class: System
- Problem Description:
The executing CPU does not own the Dillon
"hardware" semaphore when attempting to unlock it. Another CPU owns the
semaphore. This is the register located at offsets 0x5F00B0 through 0x5F04A8.
Data Field: Physical location of the current owning CPU
- Cause / Action:
Cause: Corruption of the hardware semaphore
Internal PDC error Action: Contact HP Support personnel to troubleshoot the
problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1924
- Severity: MAJOR
- Event Summary: An invalid local CPU number was detected.
- Event Class: System
- Problem Description:
An internal PDC verification of the local
CPU number detected an illegal value.
- Cause / Action: Cause: Internal PDC error. Action: Contact HP
Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1925
- Severity: MAJOR
- Event Summary: An invalid software semaphore ID was passed as an
argument
- Event Class: System
- Problem Description:
An internal function within PDC passed an
invalid software semaphore ID as an argument. Data Field: Invalid argument
- Cause / Action:
Cause: Internal PDC error Action: Contact HP
Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1926
- Severity: MAJOR
- Event Summary: An invalid software semaphore wait flag was passed
as an argument
- Event Class: System
- Problem Description:
An internal function within PDC passed an
invalid software semaphore wait flag as an argument. Data Field: Invalid
argument
- Cause / Action:
Cause: Internal PDC error Action: Contact HP
Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1927
- Severity: FATAL
- Event Summary: A read-after-write of PDH Raiser Card's Micro
General Purpose 2 register failed.
- Event Class: System
- Problem Description:
A read-after-write test of PDH Raiser
Card's Micro General Purpose 2 register failed. This register contains PDC's
Micro semaphore ownership flag.
- Cause / Action: Cause: Faulty PDH Raiser Card's or CC. Action:
Contact HP Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1928
- Severity: FATAL
- Event Summary: A read-after-write of PDH Raiser Card's Micro
General Purpose 3 register failed.
- Event Class: System
- Problem Description:
A read-after-write test of Dillon's Micro
General Purpose 3 register failed. This register contains the PDHC's Micro
semaphore ownership flag.
- Cause / Action: Cause: Faulty Dillon or path to Dillon. Action:
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1929
- Severity: FATAL
- Event Summary: A read-after-write test of a PDH Micro Status
register failed
- Event Class: System
- Problem Description:
After writing to a PDH Raiser Card's Micro
Status register, PDC reads the register to verify the write took place. This
verification failed. Data Field: Physical location of the cell with the faulty
Dillon
- Cause / Action:
Cause: A defective PDH Raiser Card Action:
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1930
- Severity: FATAL
- Event Summary: PDC detected an error trying to write value to PDH
register
- Event Class: System
- Problem Description:
PDC detected an error trying to write to
the given PDH register. This chassis code is associated with
PDH_SET_PDH_REGS_FAILED_PDH_REGISTER which identifies the PDH register that
was trying to be written.
- Cause / Action:
Cause: Hardware error Action: Contact HP
Support personnel to troubleshoot the problem Cause: PDC error Action: Upgrade
PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1931
- Severity: MAJOR
- Event Summary: PDC detected a semaphore error during a Proc call
- Event Class: System
- Problem Description:
PDC detected a semaphore error during a
Proc call. The previous chassis log indicates the nature of the error. This
log indicates which Proc was being executed. The data field contains the Proc
number in the upper 32 bits and the Proc option in the lower 32 bits. This log
is only output when a CRITICAL condition exists and is useful for debugging.
Data Field: PDC procedure call # << 32 | PDC procedure call option
- Cause / Action:
Cause: See the previously emitted chassis log
Action: See the previously emitted chassis log
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1932
- Severity: MAJOR
- Event Summary: PDC is unable to report the Proc number and option
- Event Class: System
- Problem Description:
An error was detected by PDC during a Proc
call. The previous chassis log indicates the nature of the error. PDC is
unable to report which Proc was executing when the error occurred. The data
field contains the return status from PDC's internal function
GetCurrentPdceCall(). Data Field: proc return status
- Cause / Action:
Cause: Memory corruption of the Proc log
Corruption of the DR_2 register Action: Locate source of corruption Contact HP
Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1933
- Severity: MAJOR
- Event Summary: Software semaphore already owned when attempting to
lock it
- Event Class: System
- Problem Description:
A software semaphore is already owned by
the executing CPU when attempting to lock it. Data Field: Identifier of the
target software semaphore
- Cause / Action:
Cause: A TOC or HPMC has interrupted a PDC
procedure call. NVM corruption Internal PDC error Action: Locate the source of
corruption Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1934
- Severity: MAJOR
- Event Summary: Software semaphores were not initialized when
attempting to use them
- Event Class: System
- Problem Description:
The software semaphores are not initialized
when attempting to use them. PDC should not attempt to use the software
semaphores before they are initialized. Data Field: Target cell's physical
location
- Cause / Action: Cause: Corruption of Dillon's MP Selection 5
register Internal PDC error Action: Locate the source of the corruption
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1935
- Severity: MAJOR
- Event Summary: A Software semaphore is not locked as expected
- Event Class: System
- Problem Description:
During an internal verification, PDC finds
a Software semaphore is not locked as expected. PDC may have been accessing a
semaphore protected resource without owning the semaphore. Corruption may have
resulted. The data field contains information on the target semaphore. The
software semaphore ID is in the upper 32 bits and the cell's global number is
in the lower 32 bits. Data Field: S/W SM4 ID << 32 | Cell #
- Cause / Action:
Cause: NVM corruption Internal PDC error
Action: Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1936
- Severity: MAJOR
- Event Summary: A Software semaphore is not locked when attempting
to unlock it
- Event Class: System
- Problem Description:
While attempting to unlock a Software
semaphore, PDC finds the target semaphore is not locked. PDC may have been
accessing semaphore protected resources without owning the semaphore.
Corruption may have resulted. The data field contains the Software semaphore
ID of the target semaphore. Data Field: S/W SM4 ID
- Cause / Action:
Cause: NVM corruption Internal PDC error
Action: Locate the source of the corruption Contact HP Support personnel to
troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1937
- Severity: MAJOR
- Event Summary: Attempting to release a Software semaphore owned by
another CPU
- Event Class: System
- Problem Description:
While attempting to unlock a Software
semaphore, PDC finds the target semaphore is owned by another CPU. A CPU
should only unlock semaphores which it owns. Corruption may have resulted. The
data field contains the Software semaphore ID of the target semaphore. Data
Field: S/W SM4 ID
- Cause / Action:
Cause: NVM corruption Internal PDC error
Action: Locate the source of the corruption Contact HP Support personnel to
troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1938
- Severity: MAJOR
- Event Summary: Attempt to lock another cell's micro SM4 before soft
SM4 initialized
- Event Class: System
- Problem Description:
PDC was attempting to illegally lock
another cell's PDH Raiser Card's Micro semaphore. By convention within PDC, a
remote Micro semaphore can only be obtained if the Cell Global Software
semaphore on the remote cell is owned. This chassis log indicates the remote
cell's software semaphore's have not been initialized; hence the required
Software semaphore is not owned. Data Field: Target cell's physical location
- Cause / Action:
Cause: Corruption of Dillon's MP Selection 5
register Internal PDC error Action: Locate the source of the corruption
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1939
- Severity: MAJOR
- Event Summary: Unexpected reset of the Cell PDH Controller (PDHC)
has been detected
- Event Class: System
- Problem Description:
During the execution of the Proc
PDC_PAT_EVENT[Scan Event], the EXT_AH event is detected. This means the cell
PDH controller (PDHC) has been reset. The Proc will return a -3 status to the
caller. Data Field: Physical location of the cell containing the PDHC
- Cause / Action:
Cause: Unknown source of cell PDH controller
(PDHC) reset Action: Cause: Determine source of reset Action: Contact HP
Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1940
- Severity: MAJOR
- Event Summary: PDH Micro semaphore already owned by PDC when
attempting to lock it
- Event Class: System
- Problem Description:
PDC was attempting to lock a PDH Raiser
Card's Micro semaphore and finds it already owned by PDC. Data Field: Physical
location of the cell containing the PDHC
- Cause / Action:
Cause: Corruption of Dillon's Micro semaphore
register Internal PDC error Action: Locate source of corruption Contact HP
Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1941
- Severity: MAJOR
- Event Summary: A PDH Raiser Card's Micro semaphore register was
read from an illegal location
- Event Class: System
- Problem Description:
The owner field of the target Dillon Micro
semaphore is neither PDC nor the Cell PDH Controller (PDHC). This indicates
the PDH Raiser Card's Micro semaphore register was read from an unarchitected
location. The data field contains the owner field of the target Micro
semaphore register. Data Field: SM4 owner field
- Cause / Action:
Cause: Corrupted Micro semaphore register
Action: Find source of corruption Contact HP Support personnel to troubleshoot
the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1942
- Severity: MAJOR
- Event Summary: A PDH Raiser Card's Micro semaphore is not locked
when expected
- Event Class: System
- Problem Description:
During an internal verification by PDC, the
target cell's PDH Raiser Card's Micro semaphore register is not locked as
expected. PDC may have been accessing a FATAL region protected by this
semaphore without owning the semaphore. Corruption may have resulted. Data
Field: Physical location of the cell containing the target Dillon
- Cause / Action:
Cause: Corruption of Dillon's Micro semaphore
register Internal PDC error Action: Locate source of corruption Contact HP
Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1943
- Severity: MAJOR
- Event Summary: Micro semaphore owned by both PDC and the PDHC.
- Event Class: System
- Problem Description:
The Micro semaphore ownership flags
indicate that both PDC and the PDHC believe they own the semaphore.
- Cause / Action:
Cause: Dillon's Micro General Purpose registers
2 and 3 corrupted. Action: Find source of corruption and reboot. Cause: PDC or
the PDHC improperly implementing the algorithms for dealing with the Micro
semaphore. Action: Contact HP Support personnel to troubleshoot the problem.
Cause: Dillon hardware error concerning the Micro semaphore register or the
Micro General Purpose registers 2 and 3. Action: Contact HP Support personnel
to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1944
- Severity: MAJOR
- Event Summary: The PDHC's Micro Semaphore ownership flag is
corrupt.
- Event Class: System
- Problem Description:
The PDHC's Micro Semaphore ownership flag
is corrupt. This flag is contained in Dillon's Micro General Purpose 3
register.
- Cause / Action:
Cause: PDH's Micro General Purpose register 3
corrupted. Action: Find source of corruption and reboot. Cause: PDH hardware
error with the Micro General Purpose register 3. Action: Replace cell board.
Cause: PDC or the PDHC improperly implementing the algorithms for dealing with
the Micro semaphore. Action: Upgrade PDC or the PDHC firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1945
- Severity: MAJOR
- Event Summary: A PDH Micro semaphore is unowned when attempting to
unlock it
- Event Class: System
- Problem Description:
PDC is attempting to unlock a PDH Micro
semaphore when it discovers the semaphore is unlocked. This internal check
indicates that PDC may have been accessing a FATAL region protected by this
semaphore without owning the semaphore. Corruption may have resulted. Data
Field: Physical location of the cell containing the target Dillon
- Cause / Action:
Cause: Corruption of PDH's Micro semaphore
register Internal PDC error Action: Contact HP Support personnel to
troubleshoot the problem. Action: Locate source of corruption Contact HP
Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1946
- Severity: MAJOR
- Event Summary: SM4 is owned by PDH Controller when PDC is
attempting to unlock it
- Event Class: System
- Problem Description:
PDC is attempting to unlock a PDH Micro
semaphore when it discovers the semaphore is owned by the Cell PDH Controller
(PDHC). This internal check indicates that PDC may have been accessing a FATAL
region protected by this semaphore without owning the semaphore. Corruption
may have resulted. Data Field: Physical location of the cell containing the
target Dillon
- Cause / Action:
Cause: Illegal read of PDH's Micro semaphore
register Internal PDC error Action: Find source of corruption Contact HP
Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1947
- Severity: MAJOR
- Event Summary: Semaphore is owned by PDH Controller when PDC is
attempting to verify
- Event Class: System
- Problem Description:
PDC is attempting to verify that PDC owns a
PDH Micro semaphore but finds the Cell PDH Controller (PDHC) currently owns
the semaphore. This internal check indicates that PDC may have been accessing
a FATAL region protected by this semaphore without owning the semaphore.
Corruption may have resulted. Data Field: Physical location of the cell
containing the target Dillon
- Cause / Action:
Cause: Illegal read of PDH's Micro semaphore
register Internal PDC error Action: Find source of corruption Contact HP
Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1948
- Severity: MAJOR
- Event Summary: Attempted access to SM4 on remote cell before SM4 is
initialized
- Event Class: System
- Problem Description:
PDC is attempting to access a remote cell's
Dillon Micro semaphore when the software semaphores are uninitialized on the
remote cell. By convention, PDC must own the Cell Global Software semaphore
before accessing the micro semaphore on a remote cell. Data Field: Physical
location of the cell containing the target Dillon
- Cause / Action:
Cause: Corruption of Dillon's MP Selection 5
register Internal PDC error Action: Locate the source of the corruption
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1949
- Severity: MAJOR
- Event Summary: Unable to lock Micro semaphore.
- Event Class: System
- Problem Description:
PDC has not been able to lock the Micro
semaphore in PDH after repeated attempts. This is due to a bug (HD2496) in
Dillon 2.0 in which read-read conflicts between PDC and the PDHC result in
neither entity locking the semaphore. At this time, there is no intent to fix
the bug in Dillon 2.0 so the only option is to reset the cell.
- Cause / Action:
Cause: Hardware bug in PDH 2.0. Action: Reset
the cell--both PDC and the PDHC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1950
- Severity: MAJOR
- Event Summary: Invalid address passed to TogoWriteVerify().
- Event Class: System
- Problem Description:
An invalid address was passed to
TogoWriteVerify(). Data Field: CSR Address that passed to TogoWriteVerify()
- Cause / Action:
Cause: PDC Runtime Error Possible memory
corruption or misuse of functions Action: Report this error to the Response
Center Reset the cell Update PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1951
- Severity: MAJOR
- Event Summary: Unexpected fabric firmware error
- Event Class: System
- Problem Description:
An unexpected error occurred while
initializing the fabric. The firmware is not able to analyze this error. Clues
to the cause of this error may be found in the IPMI forward progress log (FPL)
either shortly before or after this log entry occurred. The FPL is available
from the management processor using the "sl" command.
- Cause / Action:
An unanticipated
error occurred. Contact HP Support personnel to analyze the IPMI FPL log.
Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1952
- Severity: MAJOR
- Event Summary: Reading the XBC Global Semaphore register failed
- Event Class: System
- Problem Description:
While attempting to get the XBC Global
Semaphore, the read to the register failed. Data Field: XBC address
- Cause / Action:
Cause: XBC read failure. Action: check XBC,
check link, check CC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1953
- Severity: CRITICAL
- Event Summary: An attempted takeover of the XBC Global Semaphore
has failed.
- Event Class: System
- Problem Description:
A failure occurred while attempting to
takeover the XBC Global semaphore. This is a sign of a fabric connectivity
problem. Data Field: (XBC Port Number << 44) | (XBC number << 32)
| return status
- Cause / Action: Cause: Fabric Access Error Action: Check XBC. Check
Links/Flex Cables.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1954
- Severity: FATAL
- Event Summary: The cabinet number for the cabinet containing the
XBC is incorrect.
- Event Class: System
- Problem Description:
The cabinet number for the cabinet
containing the XBC is incorrect. Data Field: (Expected external port <<
32) | external port determined from CC
- Cause / Action:
Cause: The cabinet number for the cabinet
containing the XBC is incorrect. The cabinet numbering rules are: Left
cabinets use even numbers, Right cabinets use odd numbers, and Left / Right
cabinet pairs must be numbered sequentially. Action: Check all the cabinet
numbers Have your HP Support Representative check the System Utilities
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1955
- Severity: FATAL
- Event Summary: The cabinet numbering is non-sequential.
- Event Class: System
- Problem Description:
The cabinet numbering is non-sequential.
Data Field Value (Togo number << 32) | neighbor identification
- Cause / Action:
Cause: The cabinet numbering is non-sequential.
Cause: The cabinet numbering is non-sequential. The cabinet numbering rules
are: Left cabinets use even numbers, Right cabinets use odd numbers, and Left
/ Right cabinet pairs must be numbered sequentially. Action: Check all the
cabinet numbers Have your HP Support Representative check the System Utilities
Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1956
- Severity: MAJOR
- Event Summary: The local cell is not connected to fabric.
- Event Class: System
- Problem Description:
While testing fabric route, the local cell
could not read from the Coherency Controller (CC) or the CC was not connected
to a XBC.
- Cause / Action:
Cause: Hardware problem Action: Check CC to XBC
link. Check CC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1957
- Severity: MAJOR
- Event Summary: The fabric link is not useable due to errors on this
port.
- Event Class: System
- Problem Description:
While testing the route to the target cell,
a port was found to be unusable.
- Cause / Action:
Cause: A XBC port was found to have errors
while traversing the route to the target XBC. Action: Look for additional
chassis codes that provide more detailed information. Contact HP Support
personnel to analyze the flex cables and crossbar chip.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1958
- Severity: MAJOR
- Event Summary: Unexpected errors were encountered while testing the
route to the target cell.
- Event Class: System
- Problem Description:
While testing a port on the route to the
target cell, an unknown error was encountered. Data Field: (target cell
<< 56) | (cell port << 44) | (crossbar num << 32)
- Cause / Action:
Cause: A XBC port was found to have errors
while traversing the route to the target XBC. Action: Look for additional
chassis codes that provide more detailed information. Contact HP Support
personnel to analyze the flex cables, crossbar chip, and CC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1959
- Severity: MAJOR
- Event Summary: The target XBC's link to the target cell is not
useable.
- Event Class: System
- Problem Description:
While examining the route to the target
cell, an unexpected failure occurred while traversing from the target XBC to
the target cell.
- Cause / Action:
Cause: A XBC port was found to have errors
while traversing the route to the target XBC. Action: Look for additional
chassis codes that provide more detailed information. Contact HP Support
personnel to analyze the flex cables, crossbar chip, and CC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1960
- Severity: MAJOR
- Event Summary: The target XBC's link to the target cell is not
useable.
- Event Class: System
- Problem Description:
The fabric route from the local cell to the
target cell is being examined. A problem was encountered on the link to the
target cell. Data Field: (target cell << 56) | (xbc num << 32) \
- Cause / Action:
Cause: There was a problem with the target
cell's link. Either an error such as LOL, FE, or one side of the link is
powered off. Action: Contact HP Support personnel to analyze the cell power,
XBC, CC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1961
- Severity: MAJOR
- Event Summary: A failure occurred when testing the route from the
local XBC to the target XBC.
- Event Class: System
- Problem Description:
While examining the route to the target
cell, an unexpected failure occurred while traversing from the local XBC to
the target XBC. Data Field: (target cell << 56) | (xbc num << 32)
- Cause / Action:
Cause: A XBC port was found to have errors
while traversing the route to the target XBC. Action: Look for additional
chassis codes that provide more detailed information. Contact HP Support
personnel to analyze the flex cables, crossbar chip, and CC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1962
- Severity: MAJOR
- Event Summary: The fabric route from the local XBC to the target
XBC could not be traversed.
- Event Class: System
- Problem Description:
The fabric route from the local cell to the
target cell is being examined. A problem was encountered on the route to the
target's XBC. Data Field: (target cell << 56) | (xbc num << 32)
- Cause / Action:
Cause: A link between the local XBC and the
target XBC was not alive. This means the link is either not yet initialized,
powered off, or a Fatal Error has been encountered preventing the link from
being used. Action: Look for additional chassis codes to provide more detailed
info. Contact HP Support personnel to analyze the port status registers on the
XBC, flex cables, and XBC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1963
- Severity: MAJOR
- Event Summary: The XBC global semaphore was not locked during a PDC
procedure call
- Event Class: System
- Problem Description:
During a PDC procedure, the XBC's global
semaphore was expected to be locked, but the semaphore was found not to be
locked or the lock couldn't be verified. Data Field: (cell port << 44) |
(xbc num << 32) | xbc register
- Cause / Action:
Cause: There was a problem accessing the XBC.
Action: Contact HP Support personnel to analyze the crossbar chip, flex
cables, CC, PDC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1964
- Severity: MAJOR
- Event Summary: Could not route across port 4 of the local XBC.
- Event Class: System
- Problem Description:
Couldn't get the XBC num connected to the
local XBC Port 4 or port 4 is not healthy. Data Field: (port << 44) |
(xbc num << 32) | ret status
- Cause / Action: Cause: Local XBC port 4 link not healthy or local
XBC failing. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1965
- Severity: MAJOR
- Event Summary: Could not complete remote routing of the
kitty-korner XBC.
- Event Class: System
- Problem Description:
There was a problem performing remote
routing on the kitty-korner XBC. Chassis codes sent before this one may
provide more details about the exact nature of the problem. The executing cell
will attempt a fabricless boot. Data Field: (xbc num << 32) | return
status
- Cause / Action:
Cause: A failure was encountered while
performing remote routing on the kitty-korner XBC, most likely due to a
problem with the system backplane or local cell. Action: Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1966
- Severity: MAJOR
- Event Summary: Could not complete remote routing of the mirror XBC.
- Event Class: System
- Problem Description:
There was a problem performing remote
routing on the mirror XBC. Chassis codes sent before this one may provide more
details about the exact nature of the problem. The executing cell will attempt
a fabricless boot. Data Field: (xbc num << 32) | return status
- Cause / Action:
Cause: A failure was encountered while
performing remote routing on the mirror XBC, most likely due to a problem with
the system backplane or local cell. Action: Contact HP Support personnel to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1967
- Severity: MAJOR
- Event Summary: Could not complete remote routing of the sister XBC.
- Event Class: System
- Problem Description:
There was a problem performing remote
routing on the sister XBC. Chassis codes sent before this one may provide more
details about the exact nature of the problem. The executing cell will attempt
a fabricless boot. Data Field: (xbc num << 32) | return status
- Cause / Action:
Cause: A failure was encountered while
performing remote routing on the sister XBC, most likely due to a problem with
the system backplane or local cell. Action: Contact HP Support personnel to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1968
- Severity: MAJOR
- Event Summary: Could not complete remote routing of the sister XBC.
- Event Class: System
- Problem Description:
This system is a Thinboy. There was a
problem performing remote routing on the sister XBC. Chassis codes sent before
this one may provide more details about the exact nature of the problem. The
executing cell will attempt a fabricless boot. Data Field: (xbc num <<
32) | return status
- Cause / Action: Cause: A failure was encountered while performing
remote routing on the sister XBC, most likely due to a problem with the system
backplane or local cell. Action: Contact HP Support personnel to analyze the
fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1969
- Severity: MAJOR
- Event Summary: Could not complete remote routing of the local XBC.
- Event Class: System
- Problem Description:
There was a problem performing remote
routing on the local XBC. Chassis codes sent before this one may provide more
details about the exact nature of the problem. The executing cell will attempt
a fabricless boot. Data Field: (xbc num << 32) | return status
- Cause / Action:
Cause: A failure was encountered while
performing remote routing on the local XBC, most likely due to a problem with
the system backplane or local cell. Action: Contact HP Support personnel to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1970
- Severity: MAJOR
- Event Summary: Too many broken links: ports 4 & 5 not routable
- Event Class: System
- Problem Description:
Ports 4 and 5 of the local XBC were not
routable. However, port 5 of the sister XBC was routable and connected to
another XBC. Therefore, the system is a fatboy with too many broken links. The
executing cell will attempt a fabricless boot. Data Field: (port << 44)
| (xbc num << 32) | ret status
- Cause / Action:
Cause: Ports 4 and 5 of the local XBC are not
healthy. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1971
- Severity: MAJOR
- Event Summary: PDC cannot determine the system's topology
- Event Class: System
- Problem Description:
PDC initially determines the system's
topology early in fabric discovery. Later in fabric discovery PDC compares the
topology found by DiscoverTopology with the topology it sees. If the two do
not match this chassis code is sent. Data Field: (xbc num << 32) |
topology
- Cause / Action:
Cause: There is a fabric problem that causes
two different XBCs to appear as if they have different topologies. Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1972
- Severity: MAJOR
- Event Summary: A XBC read failed due to a Multi-Bit Error
- Event Class: System
- Problem Description:
A XBC read failed due to a Multi-Bit Error.
Data Field: return data
- Cause / Action:
Cause: likely fabric hardware failure Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1973
- Severity: MAJOR
- Event Summary: There was a failure reading from the XBC.
- Event Class: System
- Problem Description:
The routing forward progress is stored in a
scratch register on the XBC. A read of that register failed during an audit of
the XBC Global Semaphore. This indicates a connectivity failure. Data Field:
(port << 44) | (xbc num << 32) | ret status
- Cause / Action:
Cause: likely fabric hardware failure Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1974
- Severity: MAJOR
- Event Summary: PDC failed to unlock the fabric as it tried to
release the XBC Semaphore.
- Event Class: System
- Problem Description:
The fabric has to be unlocked for PDC to
release the fabric semaphore. PDC tried to unlock the fabric and failed. Data
Field: (cell << 56) | (port << 44) | (xbc << 32) | return
status
- Cause / Action: Cause: There was a problem reading the XBC
semaphore. Or there was a problem writing the XBC key. Action: Contact HP
Support personnel to analyze the fabric, crossbar, and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1975
- Severity: MAJOR
- Event Summary: PDC failed to read an XBC SM4 while trying to
release it.
- Event Class: System
- Problem Description:
PDC checks to make sure that the cell
releasing the semaphore actually owns the semaphore it is trying to release.
This chassis code is sent when PDC cannot read the owner of the semaphore.
Data Field: (cell << 56) | (port << 44) | (xbc << 32) |
return status
- Cause / Action:
Cause: There was a fabric failure reading the
XBCs CSRs. Action: Look for FABRIC_READ_ERROR_xxx chassis codes or a chassis
code indicating the data from the XBC slices are different. Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1976
- Severity: MAJOR
- Event Summary: PDC failed while attempting to determine a SM4's
current owner
- Event Class: System
- Problem Description:
PDC is attempting to release a SM4. After
it has released the semaphore it checks to make sure that it no longer owns
the semaphore. This chassis code is sent when PDC fails while reading the XBC
SM4. This chassis code is also sent when PDC fails to read a SM4 as part of
tracking the owner of a semaphore and the length of time the owner has held
the semaphore. Data Field: (cell << 56) | (port << 44) | (xbc
<< 32) | return status
- Cause / Action: Cause: There was a fabric failure reading the XBCs
CSRs. Action: Look for FABRIC_READ_ERROR_xxx chassis codes or a chassis code
indicating the data from the XBC slices are different. Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1977
- Severity: MAJOR
- Event Summary: A timeout occurred while attempting to release the
XBC semaphore.
- Event Class: System
- Problem Description:
The XBC Release Semaphore timeout is
designed to fail last. The semaphore could not be released. Any other cell
(even outside the PD) may be blocked because the XBC is a global resource.
Data Field: (cell << 56) | (port << 44) | (xbc << 32) |
current owner
- Cause / Action:
Cause: XBC Key Contention. Hardware Failure
Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1978
- Severity: MAJOR
- Event Summary: PDC tried to write to a fabric SM4 and failed
- Event Class: System
- Problem Description:
PDC attempted to write the XBC SM4 register
and detected a problem in doing the write. PDC was unable release the SM4.
Data Field: (cell << 56) | (port << 44) | (xbc << 32) |
return status
- Cause / Action:
Cause: There was a problem determining if the
fabric was in a writable state. Action: Contact HP Support personnel to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1979
- Severity: MAJOR
- Event Summary: PDC failed reading an XBC SM4 while insuring the
cell didn't errantly hold SM4s
- Event Class: System
- Problem Description:
This chassis code is sent when PDC cannot
read the XBC's global port SM4.Data Field: (port << 44) | (xbc num
<< 32) | XBC semaphore read data
- Cause / Action:
Cause: There was a fabric failure reading the
XBCs CSRs. Action: Look for FABRIC_READ_ERROR_xxx chassis codes or a chassis
code indicating the data from the XBC slices are different. Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1980
- Severity: MAJOR
- Event Summary: PDC failed releasing a XBC SM4 while insuring the
cell held no XBC SM4s
- Event Class: System
- Problem Description:
PDC checks to make sure that the cell on
which it is running does not hold any XBC SM4s during fabric discovery and
during a number of error handling conditions. The purpose behind this check is
to make sure that a failure or a previous failure on this cell does not result
in XBC SM4s remaining locked. This chassis code is sent when PDC detects that
the cell holds a semaphore that it shouldn't hold and fails when it attempts
to release the SM4. Data Field: (port << 44) | (xbc num << 32) |
ret status
- Cause / Action:
Cause: There was a problem determining if the
fabric was in a writable state. Action: Contact HP Support personnel to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1981
- Severity: MAJOR
- Event Summary: PDC failed reading an XBC SM4 while making sure it
didn't errantly hold SM4s
- Event Class: System
- Problem Description:
PDC checks to make sure that the cell on
which it is running does not hold any XBC SM4s during fabric discovery and
during a number of error handling conditions. The purpose behind this check is
to make sure that a failure or a previous failure on this cell does not result
in XBC SM4s remaining locked. This chassis code is sent when PDC cannot read
the owner of a port's XBC semaphore Data Field: (port << 44) | (xbc num
<< 32) | read data
- Cause / Action:
Cause: There was a fabric failure reading the
XBCs CSRs. Action: Look for FABRIC_READ_ERROR_xxx chassis codes or a chassis
code indicating the data from the XBC slices are different. Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1982
- Severity: MAJOR
- Event Summary: PDC failed releasing a XBC SM4 while insuring the
cell held no XBC SM4s
- Event Class: System
- Problem Description:
PDC checks to make sure that the cell on
which it is running does not hold any XBC SM4s during fabric discovery and
during a number of error handling conditions. The purpose behind this check is
to make sure that a failure or a previous failure on this cell does not result
in XBC SM4s remaining locked. This chassis code is sent when PDC detects that
the cell holds a semaphore that it shouldn't hold and fails when it attempts
to release the SM4. Data Field: (port << 44) | (xbc num << 32) |
return status
- Cause / Action:
Cause: There was a problem determining if the
fabric was in a writable state. Action: Contact HP Support personnel to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1983
- Severity: MAJOR
- Event Summary: PDC failed while making sure the cell on which it's
running didn't hold any SM4s
- Event Class: System
- Problem Description:
PDC checks to make sure that the cell on
which it is running does not hold any XBC SM4s during fabric discovery and
during a number of error handling conditions. The purpose behind this check is
to make sure that a failure or a previous failure on this cell does not result
in XBC SM4s remaining locked. This chassis code is sent when PDC cannot figure
out which XBCs are in the system. Data Field: return status
- Cause / Action:
Cause: PDC couldn't read the XBC register that
contained the topology. Look for additional chassis codes that provide
additional details about the problem. This is probably the result of a fabric
failure, but the nature of the failure cannot be determined from this chassis
code alone. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1984
- Severity: MAJOR
- Event Summary: The XBC write to the remote routing register failed.
- Event Class: System
- Problem Description:
When a system has already been routed and a
cell is reset, it's remote routing registers are setup by copying the routing
from the built-in XBC port. This copy has failed. There was an error while
attempting to write the register. The cell will reset and reboot, as the copy
may succeed on next boot. Data Field: write address
- Cause / Action:
Cause: The XBC key has been locked or the
Global Semaphore is not owned, thus preventing writes from occurring. XBC
write failure Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1985
- Severity: MAJOR
- Event Summary: A failure occurred when testing the route from the
target XBC to the local XBC.
- Event Class: System
- Problem Description:
An unexpected failure occurred while
traversing from the target XBC to the local XBC. Data Field: (target cell
<< 56) | (xbc num << 32)
- Cause / Action:
Cause: There was a failure (most likely during
a XBC read) while traversing the route to the target XBC. Action: Contact HP
Support personnel to analyze the fabric, crossbar, flex cables
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1986
- Severity: MAJOR
- Event Summary: The fabric route from the target XBC is not
traversable.
- Event Class: System
- Problem Description:
The fabric data route from the local XBC to
the target XBC is being examined. A problem was encountered on the return
route from the target XBC to the local XBC. Data Field: (target cell <<
56) | (xbc num << 32)
- Cause / Action:
Cause: A link between the target XBC and the
local XBC was not useable. Since this is the return path (and the to path has
already been tested), then a fabric link was probably broken during routing.
This would be the second broken link. Action: Contact HP Support personnel to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1987
- Severity: MAJOR
- Event Summary: A failure occurred when testing the route from the
local XBC to the target XBC.
- Event Class: System
- Problem Description:
An unexpected failure occurred while
traversing from the local XBC to the target XBC. Data Field: (target cell
<< 56) | (xbc num << 32)
- Cause / Action:
Cause: There was a failure (most likely during
a XBC read) while traversing the route to the target XBC. Action: Contact HP
Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1988
- Severity: MAJOR
- Event Summary: The fabric route to the target XBC is not
traversable.
- Event Class: System
- Problem Description:
The fabric route from the local XBC to the
target XBC is being examined. A problem was encountered on the route to the
target's XBC. Data Field: (target cell << 56) | (xbc num << 32)
- Cause / Action: Cause: A link between the local XBC and the target
XBC was not alive. This means the link is either not yet initialized, powered
off, or a Fatal Error has been encountered preventing the link from being
used. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1989
- Severity: MAJOR
- Event Summary: The cell cannot reach the fabric. It's link is not
initialized.
- Event Class: System
- Problem Description:
Testing the fabric link between a cell and
its XBC. This chassis code indicates that a cell is no longer visible on the
fabric or that the cell can no longer see the fabric. Any cells in this PD
should have already HPMC'd. Data Field: (cell << 56) | (port <<
44) | (xbc << 32)
- Cause / Action:
Cause: Fabric link error. Hardware failure.
Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1990
- Severity: MAJOR
- Event Summary: A failure occurred when reading an XBC's local
routing table
- Event Class: System
- Problem Description:
While examining a cell link, a read of the
XBC port's local routing register failed. Data Field: (target cell <<
56) | (xbc num << 32) | return status
- Cause / Action:
Cause: An XBC read failed. Possibly a new
failure or an intermittent failure. Action: Contact HP Support personnel to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1991
- Severity: MAJOR
- Event Summary: The Cell Local Semaphore was not locked.
- Event Class: System
- Problem Description:
The fabric walk code needs to have the Cell
Local Semaphore locked in order to send chassis codes safely. This semaphore
was not locked, so the fabric walk has failed. Data Field: (target cell
<< 56) |
- Cause / Action: Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1992
- Severity: MAJOR
- Event Summary: An unknown backplane was detected during the fabric
walk.
- Event Class: System
- Problem Description:
During a fabric walk, the fabric code needs
to know the system type. This chassis code indicates that there was an error
in determining the system type. Perhaps a new type has been added. Data Field:
system type
- Cause / Action: Cause: Unknown system type Action: Contact HP
Support personnel to analyze the backplanes and activity logs
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1993
- Severity: MAJOR
- Event Summary: Couldn't read the local XBC number from the CC.
- Event Class: System
- Problem Description:
An error reading the CC prevented PDC from
obtaining the number of the local XBC Data Field: return status
- Cause / Action:
Cause: Failed to read a CSR on the CC. Action:
Contact HP Support personnel to check the CC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1994
- Severity: MAJOR
- Event Summary: Couldn't read a XBC Global General Purpose register.
- Event Class: System
- Problem Description:
Attempted to read the routing state from a
global general purpose register on the XBC. The read access failed. Data
Field: (xbc num << 32) | return status
- Cause / Action:
Cause: XBC register read failure Action:
Contact HP Support personnel to analyze the crossbar chip.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1995
- Severity: MAJOR
- Event Summary: Couldn't release the XBC's global semaphore.
- Event Class: System
- Problem Description:
After attempting to perform routing for the
XBC, the XBC global semaphore could not be released. Data Field: (xbc num
<< 32) | return status
- Cause / Action:
Cause: The XBC global semaphore could not be
released, possibly due to a XBC read/write failure or to XBC contention.
Action: Contact HP Support personnel to analyze the fabric, crossbar chip
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1996
- Severity: MAJOR
- Event Summary: The XBC's routing state was marked as in ERROR
- Event Class: System
- Problem Description:
For the XBC being routed, routing has
already been attempted, but an error occurred. Inspect chassis codes from
other cells for more details regarding the nature of the problem. Data Field:
(xbc num << 32)
- Cause / Action:
Cause: Another cell already attempted routing
for the XBC and found an error. Action: Contact HP Support personnel to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1997
- Severity: MAJOR
- Event Summary: A read after write of a XBC address failed to
contain the expected contents.
- Event Class: System
- Problem Description:
When a system has already been routed and a
cell is reset, it's remote routing registers are setup by copying the routing
from the built-in XBC port. This copy has failed. The first register to fail a
read after write triggers this chassis code. The cell will reset and reboot,
as the copy may succeed on next boot. Data Field: XBC physical location
- Cause / Action:
Cause: The XBC key has been locked or the
Global Semaphore is not owned, thus preventing writes from occurring. This is
frequently a timing/contention issue and the cell will probably succeed on
next boot. Action: Contact HP Support personnel to analyze the fabric Cause:
XBC write failure Action: Contact HP Support personnel to analyze the XBC, CC,
and XBC to CC link
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1998
- Severity: MAJOR
- Event Summary: The Cell Local Semaphore was not locked.
- Event Class: System
- Problem Description:
The fabric walk code needs to have the Cell
Local Semaphore locked in order to send chassis codes safely. This semaphore
was not locked, so the fabric walk has failed. Data Field: (target cell
<< 56) | return value
- Cause / Action:
Cause: The cell local semaphore was not locked
Action: Contact HP Support personnel to analyze the XBC semaphores and
activity logs
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 1999
- Severity: MAJOR
- Event Summary: An unknown backplane was detected during the fabric
call.
- Event Class: System
- Problem Description:
During a fabric walk, the fabric code needs
to know the system type. This chassis code indicates that there was an error
in determining the system type. Perhaps a new type has been added. Data Field:
system type
- Cause / Action:
Cause: Unknown system type Action: Contact HP
Support personnel to analyze the backplanes and activity logs
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2000
- Severity: MAJOR
- Event Summary: Cell's XBC port has been marked in error
- Event Class: System
- Problem Description:
A cell's XBC port has been marked in error
because it is in FE, has failed link to link tests or is already marked in
error. Data Field: XBC number << 32 | internal port number (8-F)
- Cause / Action: Cause: The XBC port is in FE, has failed link to
link tests, or has already been marked in error. Action: Reset the cell Reset
the system backplane Contact HP Support personnel to troubleshoot problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2001
- Severity: MAJOR
- Event Summary: Could not unlock the XBC Global Key.
- Event Class: System
- Problem Description:
During a write of a protected XBC register,
the global semaphore was owned by this cell, however the global key was
locked. The locked key would prevent the lock from occurring. Since the cell
owns the semaphore, the key will be unlocked to allow the write. However, the
write to unlock the key failed. The cell will halt. Data Field: return status
- Cause / Action:
Cause: XBC write failure Action: Contact HP
Support personnel to analyze the XBC, CC, and XBC to CC link
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2002
- Severity: MAJOR
- Event Summary: The XBC global semaphore was not locked.
- Event Class: System
- Problem Description:
The XBC's global semaphore was expected to
be locked, but the semaphore was found not to be locked or the lock couldn't
be verified. Data Field: (xbc num << 32) | error id
- Cause / Action:
Cause: There was a problem accessing the XBC.
Action: Contact HP Support personnel to analyze the crossbar chip, flex cable,
CC, PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2003
- Severity: MAJOR
- Event Summary: The XBC global semaphore was not locked
- Event Class: System
- Problem Description:
The XBC's global semaphore was expected to
be locked, but the semaphore was found not to be locked or the lock couldn't
be verified. Data Field: (port << 44) | (xbc num << 32) | error id
- Cause / Action: Cause: There was a problem accessing the XBC.
Action: Contact HP Support personnel to analyze the crossbar chip, flex cable,
CC, PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2004
- Severity: CRITICAL
- Event Summary: A failure occurred while releasing the XBC Global
Semaphore
- Event Class: System
- Problem Description:
At the end of Fabric Discovery, the local
XBC's Global Semaphore needs to be released. An error has occurred that
prevented the release of the XBC semaphore. Data Field: return value
- Cause / Action:
Cause: Fabric failure Action: Contact HP
Support personnel to analyze the fabric, XBC, XBC to CC link Look for
additional event ids that may indicate XBC Key Contention.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2005
- Severity: CRITICAL
- Event Summary: Fabric Discovery did not complete correctly
- Event Class: System
- Problem Description:
A failure or problem was encountered in
fabric state validation at the end of fabric discovery. Chassis codes sent
before this one should give more details about the nature of the problem. Data
Field: return status
- Cause / Action:
Cause: Hardware failure or PDC runtime error.
Action: Contact HP Support personnel to check the flex cables, crossbar chips,
CC, and PDC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2006
- Severity: CRITICAL
- Event Summary: PDC failed to lock an XBC after it took over the
XBC's semaphore.
- Event Class: System
- Problem Description:
When a cell holds a fabric semaphore for an
extended period of time, PDC will attempt to takeover the semaphore so that
the rest of the cells will have access to it. PDC tried to lock the fabric
after taking the XBC semaphore from the hung cell and failed. Data Field: (xbc
num << 32) | return status
- Cause / Action:
Cause: There was a problem accessing the
fabric. There could be a problem with PDC where it fails to keep track which
cells owns an XBC semaphore (unlikely after PDC 32.4). Action: Look for other
chassis codes providing more information about the problem. Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2007
- Severity: CRITICAL
- Event Summary: In taking over a port SM4, PDC attempted to read the
SM4 and failed.
- Event Class: System
- Problem Description:
When a cell holds a fabric semaphore for an
extended period of time, PDC will attempt to takeover the semaphore so that
the rest of the cells will have access to it. When this chassis code is sent,
PDC cannot access the XBC semaphore and is probably unable to access anything
else on the XBC. Data Field: (xbc num << 32) | return status
- Cause / Action:
Cause: There was a fabric failure reading the
XBCs CSRs. Action: Look for FABRIC_READ_ERROR_xxx chassis codes or a chassis
code indicating the data from the XBC slices are different. Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2008
- Severity: MAJOR
- Event Summary: PDC failed attempting to force an XBC to unlock as
part of taking over a SM4
- Event Class: System
- Problem Description:
When a cell holds a fabric semaphore for an
extended period of time, PDC will attempt to takeover the semaphore so that
the rest of the cells will have access to it. This chassis code is sent we PDC
encounters a problem in trying to enable the XBC key for the semaphore that it
is trying to take over. Data Field: (xbc num << 32) | return status
- Cause / Action:
Cause: This could be a hardware problem that
prevents PDC from manipulating the fabric CSRs. This could be a problem with
XBC key contention. Action: Look for other chassis codes that contain more
specific data as to why enabling the XBC key failed. If the problem is
repeatable, note the circumstances under which this event is occurring and
capture complete activity logs. Contact HP Support personnel to analyze the
fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2009
- Severity: MAJOR
- Event Summary: PDC is attempting to take over a fabric SM4 and had
trouble reading the SM4 CSR
- Event Class: System
- Problem Description:
When a cell holds a fabric semaphore for an
extended period of time, PDC will attempt to takeover the semaphore so that
the rest of the cells will have access to it. This chassis code is sent after
PDC has already tried to take over the semaphore and is reading it to see if,
now that the semaphore is unlocked, another cell has taken ownership of it.
PDC will perform this check for a certain period of time and will then emit a
chassis code indicating that it timed out. Data Field: (xbc num << 32) |
return status
- Cause / Action:
Cause: There was a fabric failure reading the
XBCs CSRs. Action: Look for FABRIC_READ_ERROR_xxx chassis codes or a chassis
code indicating the data from the XBC slices are different. Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2010
- Severity: CRITICAL
- Event Summary: PDC attempted a fabric SM4 takeover and timed out
trying to unlock the SM4
- Event Class: System
- Problem Description:
When a cell holds a fabric semaphore for an
extended period of time, PDC will attempt to takeover the semaphore so that
the rest of the cells will have access to it. PDC will attempt to take the SM4
for a period of time. If it is unable to unlock the SM4 within the timeout
period, it will send this chassis code and halt the cell. Data Field: (xbc num
<< 32) | return status
- Cause / Action:
Cause: PDC cannot takeover a fabric semaphore
that has been held for a long time. Action: Look for other fabric chassis
codes that explain why the current owner of the SM4 was unable to release it.
Contact HP Support personnel to analyze the fabric and backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2011
- Severity: MAJOR
- Event Summary: In taking over an XBC SM4, PDC failed to write the
unlocked value to the SM4
- Event Class: System
- Problem Description:
When a cell holds a fabric semaphore for an
extended period of time, PDC will attempt to takeover the semaphore so that
the rest of the cells will have access to it. This chassis code is sent when
PDC attempts to write the unlocked value to the semaphore and the write fails.
Data Field: (xbc num << 32) |return status
- Cause / Action:
Cause: There was a problem determining if the
fabric was in a writable state. Look for other chassis codes indicating a
fabric problem. There may be a backplane problem. Action: Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2012
- Severity: MAJOR
- Event Summary: Could not get neighbor information.
- Event Class: System
- Problem Description:
The XBC could not get neighbor information.
Data Field: XBC # << 32 | internal port attempting to access neighbor
- Cause / Action: Cause: Fabric Failure Action: Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2013
- Severity: MAJOR
- Event Summary: PDC attempted to lock the fabric after not getting
the SM4 and failed.
- Event Class: System
- Problem Description:
PDC attempted to get a fabric semaphore,
but another cell got the SM4 before this cell could obtain it. PDC tried to
lock the fabric and failed. Data Field: (cell << 56) | (port <<
44) | (xbc << 32) | return status
- Cause / Action:
Cause: PDC could not read or write the fabric
SM4 to see if it was already owned. There was a problem reading the fabric.
Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2014
- Severity: MAJOR
- Event Summary: PDC attempted to unlock the fabric and failed while
trying to get the XBC SM4
- Event Class: System
- Problem Description:
The fabric has to be unlocked for PDC to
get the fabric semaphore. PDC tried to unlock the fabric and failed. Data
Field: (cell << 56) | (port << 44) | (xbc << 32) | return
status
- Cause / Action:
Cause: PDC could not read or write the fabric
SM4 to see if it was already owned. There was a problem reading the fabric.
Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2015
- Severity: MAJOR
- Event Summary: PDC failed while attempting to determine a SM4's
current owner
- Event Class: System
- Problem Description:
PDC is attempting to get a SM4. As part of
the attempt it checks to see who owns the SM4 and how long they have owned it.
This chassis code is sent when PDC fails while reading the XBC SM4. Data
Field: (cell << 56) | (port << 44) | (xbc << 32) | return
status
- Cause / Action:
Cause: There was a fabric failure reading the
XBCs CSRs. Action: Look for FABRIC_READ_ERROR_xxx chassis codes or a chassis
code indicating the data from the XBC slices are different. Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2016
- Severity: MAJOR
- Event Summary: PDC failed while attempting to determine a SM4's
current owner
- Event Class: System
- Problem Description:
PDC is attempting to get a SM4. As part of
the attempt it checks to see who owns the SM4 and how long they have owned it.
This chassis code is sent when PDC fails while reading the XBC SM4. Data
Field: (cell << 56) | (port << 44) | (xbc << 32) | return
status
- Cause / Action:
Cause: There was a fabric failure reading the
XBCs CSRs. Action: Look for FABRIC_READ_ERROR_xxx chassis codes or a chassis
code indicating the data from the XBC slices are different. Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2017
- Severity: MAJOR
- Event Summary: PDC tried to write to a fabric SM4 and failed
- Event Class: System
- Problem Description:
PDC attempted to write the XBC SM4 register
and detected a problem in doing the write. PDC was unable obtain the SM4. Data
Field: (cell << 56) | (port << 44) | (xbc << 32) | return
status
- Cause / Action:
Cause: There was a problem determining if the
fabric was in a writable state. Look for other chassis codes indicating a
fabric problem. There may be a backplane problem. Action: Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2018
- Severity: MAJOR
- Event Summary: PDC could not find data for the Domelight fabric
neighbor table.
- Event Class: System
- Problem Description:
PDC uses tables to drive the fabric code.
The data in the neighbor table was empty. Data Field: address of neighbor info
table
- Cause / Action:
Cause: The backplane type in the PDH external
backplane type register was incorrect. Action: Contact HP Support personnel to
analyze the backplane and CC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2019
- Severity: MAJOR
- Event Summary: PDC could not find data for the single-cabinet
Superdome fabric neighbor table.
- Event Class: System
- Problem Description:
PDC uses tables to drive the fabric code.
The data in the neighbor table was empty. Data Field: return status
- Cause / Action:
Cause: Unknown system type Action: Contact HP
Support personnel to analyze the backplanes and activity logs
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2020
- Severity: MAJOR
- Event Summary: An invalid XBC number was encountered.
- Event Class: System
- Problem Description:
An invalid XBC number was passed to an
internal PDC function. Data Field: (port << 44) | (xbc num << 32)
- Cause / Action:
Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2021
- Severity: MAJOR
- Event Summary: PDC could not find data for the simple crossbar
topology fabric neighbor table.
- Event Class: System
- Problem Description:
PDC uses tables to drive the fabric code.
The data in the neighbor table was empty. Data Field: return status
- Cause / Action:
Cause: Unknown system type Action: Contact HP
Support personnel to analyze the backplanes and activity logs
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2022
- Severity: MAJOR
- Event Summary: PDC called an XBC function on a back to back
topology that is not supported.
- Event Class: System
- Problem Description:
PDC was searching for information about
what is supposed to be connected to an XBC, but Matterhorn systems do not
support XBCs. Data Field: return status
- Cause / Action:
Cause: Unknown system type Action: Contact HP
Support personnel to analyze the backplanes and activity logs
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2023
- Severity: MAJOR
- Event Summary: Failed to fill in the neighbor info from the table
of expected values.
- Event Class: System
- Problem Description:
Attempted to read a table of expected
neighbor information but was unable to do so. Data Field: (port << 44) |
(xbc num << 32) | ret status
- Cause / Action:
Cause: The table could not be accessed. PDC
runtime error. Action: Contact HP Support personnel to check the CC and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2024
- Severity: MAJOR
- Event Summary: Failed to fill in the expected neighbor info because
no neighbor was expected.
- Event Class: System
- Problem Description:
Attempted to read a table of expected
neighbor information but was unable to do so because no neighbor was expected
for the specified XBC and XBC port numbers. Data Field: system type
- Cause / Action:
Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2025
- Severity: MAJOR
- Event Summary: An invalid XBC port number was encountered.
- Event Class: System
- Problem Description:
An invalid XBC port number was passed to an
internal PDC function. An external port number was expected, but the port
number encountered was not one. Data Field: (port << 44) | (xbc num
<< 32)
- Cause / Action:
Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2026
- Severity: MAJOR
- Event Summary: An invalid XBC port number was encountered.
- Event Class: System
- Problem Description:
An invalid XBC port number was passed to an
internal PDC function. An internal port number was expected, but the port
number encountered was not one. Data Field: (port << 44) | (xbc num
<< 32)
- Cause / Action:
Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2027
- Severity: MAJOR
- Event Summary: PDC could not find data for the single-cabinet
Superdome fabric neighbor table.
- Event Class: System
- Problem Description:
PDC uses tables to drive the fabric code.
The data in the neighbor table was empty. Data Field: address of neighbor info
table
- Cause / Action:
Cause: Unknown system type Action: Contact HP
Support personnel to analyze the backplanes and activity logs
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2028
- Severity: MAJOR
- Event Summary: An unrecognized backplane type was read from PDH.
- Event Class: System
- Problem Description:
The system backplane type that was read
from PDH was not a recognized type. Data Field: system type
- Cause / Action:
Cause: Unknown system type Action: Contact HP
Support personnel to analyze the backplanes and activity logs
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2029
- Severity: MAJOR
- Event Summary: PDC did not recognize the fabric topology of the
system.
- Event Class: System
- Problem Description:
PDC was verifying that the neighbor for an
XBC port was the neighbor that was expected based on the system's topology.
The topology was stored on the XBC during fabric discovery by the PDC that
routed the fabric. PDC did not recognize the topology stored on the XBC or did
not expect the topology it found. Data Field: topology
- Cause / Action:
Cause: Unknown or unsupported topology. Perhaps
the XBC information became corrupted. Action: Contact HP Support personnel to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2030
- Severity: MAJOR
- Event Summary: PDC could not determine the fabric topology of the
system
- Event Class: System
- Problem Description:
This chassis code is sent when PDC is
trying to determine which XBCs exist in the current system's topology. PDC
determines the topology during boot and stores it in an XBC CSR. This chassis
code is sent when PDC cannot read that CSR. Data Field: return status
- Cause / Action: Cause: The failure was probably one of the
following: a multi-bit error reading a fabric CSR, unable to access an XBC,
XBC bit slices returned inconsistent data. Action: Look for chassis codes that
indicate a fabric read failed. These chassis codes may provide more
information about the failure. Contact HP Support personnel to analyze the
fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2031
- Severity: MAJOR
- Event Summary: Could not traverse to a XBC that was expected to be
present in the system.
- Event Class: System
- Problem Description:
When collecting the fabric ICM neighbor
information, the route to the first XBC (XBC 0) was not traversable. Based on
the fabric topology (obtained from the XBC general purpose register on the
local XBC), the first XBC was expected to be present. Data Field: (xbc num
<< 32)
- Cause / Action:
Cause: A hardware failure prevented traversal
to the XBC. Action: Contact HP Support personnel to check the flex cables,
crossbar chips, PDC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2032
- Severity: MAJOR
- Event Summary: Could not traverse to a XBC that was expected to be
present in the system.
- Event Class: System
- Problem Description:
When collecting the fabric ICM neighbor
information, the route to the fourth XBC was not traversable. Based on the
fabric topology (obtained from the XBC general purpose register on the local
XBC), the first XBC was expected to be present. Data Field: (xbc num <<
32)
- Cause / Action:
Cause: A hardware failure prevented traversal
to the XBC. Action: Contact HP Support personnel to check the flex cables,
crossbar chips, PDC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2033
- Severity: MAJOR
- Event Summary: Could not traverse to a XBC that was expected to be
present in the system.
- Event Class: System
- Problem Description:
When collecting the fabric ICM neighbor
information, the route to the second XBC was not traversable. Based on the
fabric topology (obtained from the XBC general purpose register on the local
XBC), the first XBC was expected to be present. Data Field: (xbc num <<
32)
- Cause / Action: Cause: A hardware failure prevented traversal to
the XBC. Action: Contact HP Support personnel to check the flex cables,
crossbar chips, PDC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2034
- Severity: MAJOR
- Event Summary: Could not traverse to a XBC that was expected to be
present in the system.
- Event Class: System
- Problem Description:
When collecting the fabric ICM neighbor
information, the route to the mirror XBC was not traversable. Based on the
fabric topology (obtained from the XBC general purpose register on the local
XBC), the first XBC was expected to be present. Data Field: (xbc num <<
32)
- Cause / Action: Cause: A hardware failure prevented traversal to
the XBC. Action: Contact HP Support personnel to check the flex cables,
crossbar chips, PDC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2035
- Severity: MAJOR
- Event Summary: Invalid address passed to one a fabric functions
- Event Class: System
- Problem Description:
An invalid XBC address is being used Data
Field: XBC #
- Cause / Action: Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2036
- Severity: MAJOR
- Event Summary: The address to write is not a fabric address
- Event Class: System
- Problem Description:
An attempted XBC write has failed because
the address provided is not a Fabric address. Data Field: CSR address Cause /
Action:
Cause: PDC runtime error. Action: Contact HP Support personnel to
analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2037
- Severity: CRITICAL
- Event Summary: Invalid neighbor found attached to fabric.
- Event Class: System
- Problem Description:
Invalid neighbor found attached to fabric.
Data Field: Neighbor Type 0x00 CC 0x01 XBC 0x02 - 0xFE Reserved 0xFF No
connection
- Cause / Action: Cause: An invalid neighbor was found attached to
the fabric. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2038
- Severity: MAJOR
- Event Summary: While routing around an unhealthy port, the reroute
port calculated is invalid.
- Event Class: System
- Problem Description:
While routing around an unhealthy port, the
reroute port calculated is invalid. Data Field: (xbc num << 32) | port
- Cause / Action: Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2039
- Severity: MAJOR
- Event Summary: The fabric topology does not match a known topology.
- Event Class: System
- Problem Description:
The topology is unknown. The fabric
information cannot be gathered. Data Field: (togo num << 32) | topology
- Cause / Action: Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2041
- Severity: MAJOR
- Event Summary: The quadrant calculated is unknown.
- Event Class: System
- Problem Description:
The kitty corner XBC cannot be calculated
because an invalid quadrant number was calculated. Data Field: (xbc num
<< 32) | xbc quadrant
- Cause / Action: Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2042
- Severity: MAJOR
- Event Summary: An error occurred while testing the CC to XBC link.
- Event Class: System
- Problem Description:
At the beginning of Fabric Discovery a link
test is performed. After writing a few pattern tests, an SBE or LPE error was
logged on the CC for this link. Data Field: (port << 44) | (xbc num
<< 32) | 0x1E
- Cause / Action: Cause: crossbar link failure, parity error Action:
Contact HP Support personnel to check link connectivity, XBC, CC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2043
- Severity: MAJOR
- Event Summary: The link is not useable due to problems with this
port.
- Event Class: System
- Problem Description:
While examining a fabric link, one of the
ports was found to have problems that prevent its use. Data Field: (port
<< 44) | (xbc num << 32)
- Cause / Action: Cause: A XBC port was found to have errors while
traversing the route to the target XBC. Action: Contact HP Support personnel
to check the flex cables, crossbar chip, etc.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2044
- Severity: MAJOR
- Event Summary: There was a fabric access error while examining a
XBC port.
- Event Class: System
- Problem Description:
While examining a fabric link for
traversability, there was an error accessing a fabric resource. Data Field:
(port << 44) | (xbc num << 32)
- Cause / Action: Cause: An unknown error was encountered. This is
probably due to a fabric read error or trouble accessing a fabric resource.
Action: Contact HP Support personnel to check the flex cables, crossbar chip,
etc.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2045
- Severity: MAJOR
- Event Summary: An error occurred while reading the Neighbor Info
register.
- Event Class: System
- Problem Description:
While examining a XBC link for usability,
there was an error reading from the XBC. Data Field: (port << 44) | (xbc
num << 32) | ret status
- Cause / Action:
Cause: There was a failure performing a read
while traversing a fabric link. Action: Contact HP Support personnel to check
the flex cables, crossbar chip, etc.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2046
- Severity: MAJOR
- Event Summary: An error occurred while reading the Neighbor Info
register.
- Event Class: System
- Problem Description:
While examining a XBC link for usability,
there was an error reading from the XBC. Data Field: (port << 44) | (xbc
num << 32) | ret status
- Cause / Action:
Cause: There was a failure performing a read
while traversing a fabric link. Action: Contact HP Support personnel to check
the flex cables, crossbar chip, etc.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2047
- Severity: MAJOR
- Event Summary: The link is not useable due to problems with this
port.
- Event Class: System
- Problem Description:
While examining a fabric link, one of the
ports was found to have problems that prevent its use. Data Field: (port
<< 44) | (xbc num << 32)
- Cause / Action: Cause: A XBC port was found to have errors while
traversing the route to the target XBC. Action: Look for additional chassis
codes that provide more detailed information. Contact HP Support personnel to
check the flex cables, crossbar chip, etc.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2048
- Severity: MAJOR
- Event Summary: There was a fabric access error while examining a
XBC port.
- Event Class: System
- Problem Description:
While examining a fabric link for
traversability, there was an error accessing a fabric resource. Data Field:
(port << 44) | (xbc num << 32)
- Cause / Action: Cause: An unknown error was encountered. This is
probably due to a fabric read error or trouble accessing a fabric resource.
Action: Look for additional chassis codes that provide more detailed
information. Contact HP Support personnel to check the flex cables, crossbar
chip, etc.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2049
- Severity: MAJOR
- Event Summary: Failure reading XBC Port status register.
- Event Class: System
- Problem Description:
While initiating the CC to XBC link test, a
read failure occurred. This link will now be landmined to prevent use since it
is considered unreliable. Data Field: (port << 44) | (xbc num <<
32)
- Cause / Action: Cause: likely fabric hardware failure Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2050
- Severity: MAJOR
- Event Summary: The pattern 0 test failed
- Event Class: System
- Problem Description:
The test write of all zeroes to both slices
of a XBC failed. Data Field: (port << 44) | (xbc num << 32) |
pattern
- Cause / Action: Cause: likely fabric hardware failure Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2051
- Severity: MAJOR
- Event Summary: The pattern 5 test failed
- Event Class: System
- Problem Description:
The test write of all 0x5's to both slices
of a XBC failed. Data Field: (port << 44) | (xbc num << 32) |
pattern
- Cause / Action: Cause: likely fabric hardware failure Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2052
- Severity: MAJOR
- Event Summary: The pattern A test failed
- Event Class: System
- Problem Description:
The test write of all 0xA's to both slices
of a XBC failed. Data Field: (port << 44) | (xbc num << 32) |
pattern
- Cause / Action: Cause: likely fabric hardware failure Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2053
- Severity: MAJOR
- Event Summary: The pattern F test failed
- Event Class: System
- Problem Description:
The test write of all ones to both slices
of a XBC failed. Data Field: (port << 44) | (xbc num << 32) |
pattern
- Cause / Action: Cause: likely fabric hardware failure Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2054
- Severity: MAJOR
- Event Summary: The CC to XBC Link pattern test failed
- Event Class: System
- Problem Description:
The CC to XBC Link pattern test failed.
Data Field: (port << 44) | (xbc num << 32) | (pattern & 0xf)
pattern = all F's, A's, 5's, 0's pattern test (XBC # << 32) | (internal
port # << 16) | (0x5BE) failed logging Togo SBE or LPE errors (XBC #
<< 32) | (internal port # << 16) | (0x1E) failed logging DNA SBE
or LPE errors
- Cause / Action: Cause: The CC to XBC Link is corrupted. Either the
CC, the local XBC, or the connection is faulty Action: Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2055
- Severity: MAJOR
- Event Summary: Could not read the link landmine state.
- Event Class: System
- Problem Description:
While testing the CC to XBC link, PDC could
not determine if the link is landmined. The link will be landmined. Data
Field: (port << 44) | (xbc num << 32)
- Cause / Action:
Cause: Failed reading the XBC port state
register Action: Contact HP Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2056
- Severity: MAJOR
- Event Summary: The XBC number provided is not a valid XBC Chip Id.
- Event Class: System
- Problem Description:
The argument passed into the function is
invalid. If this code was called from a proc, then the argument should have
been checked at the proc entrance. Data Field: (port << 44) | (xbc num
<< 32)
- Cause / Action: Cause: An invalid XBC number was provided. Action:
Capture chassis logs Document events leading up to the error Contact HP
Support personnel to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2057
- Severity: MAJOR
- Event Summary: The port number provided is not a valid XBC Internal
Port Number
- Event Class: System
- Problem Description:
The argument passed into the function is
invalid. If this code was called from a proc, then the argument should have
been checked at the proc entrance. Data Field: (port << 44) | (xbc num
<< 32)
- Cause / Action: Cause: An invalid XBC number was provided. Action:
Capture chassis logs Document events leading up to the error Contact HP
Support personnel to check the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2058
- Severity: MAJOR
- Event Summary: An SBE or LPE was logged on the XBC during the link
test.
- Event Class: System
- Problem Description:
After completing all the pattern tests, a
Single Bit Error or a Link Parity Error was logged on the XBC. The link is not
good. Data Field: (port << 44) | (xbc num << 32) | 0x5BE
- Cause / Action: Cause: XBC Link failure, XBC failure Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2059
- Severity: MAJOR
- Event Summary: The XBC global semaphore was not locked during a PDC
procedure call
- Event Class: System
- Problem Description:
During a PDC procedure, the XBC's global
semaphore was expected to be locked, but the semaphore was found not to be
locked or the lock couldn't be verified. Data Field: (port << 44) | (xbc
num << 32) | log
- Cause / Action: Cause: There was a problem accessing the XBC.
Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2060
- Severity: MAJOR
- Event Summary: The XBC global semaphore was not locked during a PDC
procedure call
- Event Class: System
- Problem Description:
During a PDC procedure, the XBC's global
semaphore was expected to be locked, but the semaphore was found not to be
locked or the lock couldn't be verified. Data Field: (port << 44) | (xbc
num << 32) | log
- Cause / Action: Cause: There was a problem accessing the XBC.
Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2061
- Severity: FATAL
- Event Summary: Exceeded max number of failed XBC links during
initial fabric routing
- Event Class: System
- Problem Description:
The maximum number of failed crossbar links
has been exceeded during initial fabric routing. The cell will halt. Data
Field: (XBC # attempting to route << 32) | number of failed ports
- Cause / Action: Cause: The maximum number of failed crossbar links
has been exceeded during initial fabric routing. Review the previous chassis
codes to determine which links have failed. The routing table could be
corrupt, i.e. links marked in error when healthy but perceived as
non-functional Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2062
- Severity: MAJOR
- Event Summary: Multi-bit error occurred in fabric function
- Event Class: System
- Problem Description:
A multi-bit error occurred while reading
the XBC Data Field: XBC read data
- Cause / Action: Cause: A multi-bit read error occurred Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2063
- Severity: MAJOR
- Event Summary: Could not check the neighbor port's health status
- Event Class: System
- Problem Description:
When routing a XBC link, the neighbor side
of the link needs to be tested. This chassis code indicates that a read of
that neighbor side failed. The failure prevents testing of the neighbor port
and causes the link to be landmined. Data Field: (xbc num << 32) | xbc
port
- Cause / Action: Cause: XBC Read failure Action: Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2064
- Severity: MAJOR
- Event Summary: During remote routing, the current port's neighbor
is not healthy.
- Event Class: System
- Problem Description:
An XBC port was found that is not healthy.
This indicates at least one of the following about the port: - Hardware link
is not okay - Presence detect is false - Fatal error detected - SBE detected -
LPE detected - Port landmined The data field of the chassis code indicates
which port is unhealthy, as well as the fabric routing state before the
problem was encountered.
- Cause / Action: Cause: An XBC port is not healthy. Action: Contact
HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2065
- Severity: MAJOR
- Event Summary: PDC could not read the topology of the system from
an XBC register.
- Event Class: System
- Problem Description:
The topology is stored in a XBC scratch
register during FabricDiscovery(). The read of this register failed. Data
Field: return status
- Cause / Action: Cause: Look for chassis codes that indicate a
fabric read failed. These chassis codes may provide more information about the
failure. The failure was probably one of the following: a multi-bit error
reading a fabric CSR, unable to access an XBC, XBC bit slices returned
inconsistent data. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2066
- Severity: CRITICAL
- Event Summary: No local XBC was present.
- Event Class: System
- Problem Description:
Could not communicate with local XBC. The
cell will attempt to reboot without fabric.
- Cause / Action: Cause: Could not communicate with local XBC.
Action: Contact HP Support personnel to analyze the local XBC, CC, and
backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2067
- Severity: MAJOR
- Event Summary: Could not clear XBC error injection register for
bits [14:0]
- Event Class: System
- Problem Description:
Could not clear XBC error injection
register for bits [14:0] Data Field: (port << 44) | (xbc num <<
32) | 0x1400
- Cause / Action: Cause: write to XBC failed Action: Contact HP
Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2068
- Severity: MAJOR
- Event Summary: Could not clear XBC error injection register for
bits [29:15]
- Event Class: System
- Problem Description:
Could not clear XBC error injection
register for bits [29:15] Data Field: (port << 44) | (xbc num <<
32) | 0x2915
- Cause / Action: Cause: write to XBC failed Action: Contact HP
Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2069
- Severity: MAJOR
- Event Summary: Could not clear XBC error injection register for
bits [44:30]
- Event Class: System
- Problem Description:
Could not clear XBC error injection
register for bits [44:30] Data Field: (port << 44) | (xbc num <<
32) | 0x4430
- Cause / Action: Cause: write to XBC failed Action: Contact HP
Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2070
- Severity: MAJOR
- Event Summary: Could not clear XBC error injection register for
bits [59:45]
- Event Class: System
- Problem Description:
Could not clear XBC error injection
register for bits [59:45] Data Field: (port << 44) | (xbc num <<
32) | 0x5945
- Cause / Action: Cause: write to XBC failed Action: Contact HP
Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2071
- Severity: MAJOR
- Event Summary: Could not clear XBC error injection register for
bits [73:60]
- Event Class: System
- Problem Description:
Could not clear XBC error injection
register for bits [73:60] Data Field: (port << 44) | (xbc num <<
32) | 0x7360
- Cause / Action: Cause: write to XBC failed Action: Contact HP
Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2072
- Severity: MAJOR
- Event Summary: Could not clear CC debug control register
- Event Class: System
- Problem Description:
The write to the CC debug control register
failed. Data Field: (port << 44) | (xbc num << 32) | 0xDDC2
- Cause / Action: Cause: write to CC failed Action: Contact HP
Support personnel to check the CC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2073
- Severity: MAJOR
- Event Summary: Could not clear CC debug counter register
- Event Class: System
- Problem Description:
Could not clear CC debug counter register.
Data Field: (port << 44) | (xbc num << 32) | 0xDDC1
- Cause / Action: Cause: write to CC failed Action: Contact HP
Support personnel to check the CC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2074
- Severity: MAJOR
- Event Summary: The CC SBE and LPE errors were not cleared properly
- Event Class: System
- Problem Description:
The CC logged a SBE or LPE after they
should have been cleared. Either the clear failed, or a new error was logged
immediately. Data Field: (port << 44) | (xbc num << 32) | 0x1E
- Cause / Action: Cause: write to CC Debug registers failed C2: the
link generated a new error A2: check CC, check link Check logs for other
errors. If error is persistent, replace cell board Action: Contact HP Support
personnel to check the CC Cause: the link generated a new error Action:
Contact HP Support personnel to check the CC, link
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2075
- Severity: MAJOR
- Event Summary: Could not clear CC seed error register
- Event Class: System
- Problem Description:
Could not clear CC seed error register Data
Field: (port << 44) | (xbc num << 32) | 0x5DE
- Cause / Action:
Cause: write to CC failed Action: Contact HP
Support personnel to check the CC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2076
- Severity: MAJOR
- Event Summary: The XBC SBE and LPE errors were not cleared properly
- Event Class: System
- Problem Description:
The XBC logged a SBE or LPE after they
should have been cleared. Either the clear failed, or a new error was logged
immediately. Data Field: (port << 44) | (xbc num << 32) | 0x5BE
- Cause / Action: Cause: write to CC Debug registers failed C2: the
link generated a new error A2: check CC, check link Check logs for other
errors. If error is persistent, replace cell board Action: Contact HP Support
personnel to check the CC Cause: the link generated a new error Action:
Contact HP Support personnel to check the CC, link
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2077
- Severity: MAJOR
- Event Summary: Could not clear XBC link parity error logs
- Event Class: System
- Problem Description:
Could not clear XBC link parity error logs.
Data Field: (port << 44) | (xbc num << 32) | 0xF01E Cause /
Action:
Cause: write to XBC failed Action: Contact HP Support personnel to
check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2078
- Severity: MAJOR
- Event Summary: Could not clear XBC routing table error logs
- Event Class: System
- Problem Description:
Could not clear XBC routing table error
logs. Data Field: (port << 44) | (xbc num << 32) | 0xF01F Cause /
Action:
Cause: write to XBC failed Action: Contact HP Support personnel to
check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2079
- Severity: MAJOR
- Event Summary: Could not clear XBC single bit error logs
- Event Class: System
- Problem Description:
Could not clear XBC single bit error logs
Data Field: (port << 44) | (xbc num << 32) | 0xF5DE
- Cause / Action:
Cause: write to XBC failed Action: Contact HP
Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2080
- Severity: MAJOR
- Event Summary: Could not read the local XBC number from the CC.
- Event Class: System
- Problem Description:
A read to the CC's XIN_LINK_STATE register
failed. As a result, the local XBC number could not be determined. There must
be problems with the CC's link to the fabric. Data Field: return status
- Cause / Action: Cause: likely fabric hardware failure Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2081
- Severity: MAJOR
- Event Summary: The XBC number provided is not a valid XBC Chip Id.
- Event Class: System
- Problem Description:
The argument passed into the function is
invalid. If this code was called from a proc, then the argument should have
been checked at the proc entrance. This is a firmware bug. Data Field: (local
xbc num << 32) | target xbc
- Cause / Action: Cause: An invalid XBC number was provided. Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2082
- Severity: MAJOR
- Event Summary: A link in the fabric PIOB route is not useable.
- Event Class: System
- Problem Description:
The PIOB route was found to have errors
preventing its use. Data Field: (port << 44) | (xbc num << 32)
- Cause / Action: Cause: Fabric Access Failure Action: Contact HP
Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2083
- Severity: MAJOR
- Event Summary: A link in the fabric PIOB route is not useable.
- Event Class: System
- Problem Description:
An error was encountered while testing the
PIOB route. The test could not complete. Therefore, the route is not
traversable. Data Field: (port << 44) | (xbc num << 32)
- Cause / Action:
Cause: Fabric Access Failure Action: Contact HP
Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2084
- Severity: MAJOR
- Event Summary: The CC to XBC link is not initialized.
- Event Class: System
- Problem Description:
When testing the PIOB route to a XBC, the
local cell's fabric link was found to be uninitialized. This cell cannot talk
to the fabric. Data Field: XIN link state
- Cause / Action:
Cause: likely fabric hardware failure Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2085
- Severity: MAJOR
- Event Summary: Could not read the landmine state from the XBC
register.
- Event Class: System
- Problem Description:
Testing the fabric PIOB route to a XBC.
There was a failure reading from the XBC registers. The landmine state could
not be determined. Data Field: (port << 44) | (xbc num << 32) |
ret status
- Cause / Action: Cause: likely fabric hardware failure Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2086
- Severity: MAJOR
- Event Summary: Error reading the Remote Routing Register on the
XBC.
- Event Class: System
- Problem Description:
While traversing a fabric PIOB route, a
port on the neighbor XBC was found to be uninitialized or in error. This
should never happen since the routing should have already been completed. Data
Field: return status
- Cause / Action: Cause: The remote routing register does not contain
a valid, initialized value. There may have been a failure reading from the
XBC. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2087
- Severity: MAJOR
- Event Summary: The cell local semaphore was not locked during a
fabric function call.
- Event Class: System
- Problem Description:
The cell local semaphore is needed to send
chassis codes. This fabric traversable function found the semaphore unlocked
during execution. Data Field: (xbc num << 32) | return status
- Cause / Action: Cause: Firmware forgot to lock the semaphore. Or
another cpu has unlocked the semaphore behind the owner's back. Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2088
- Severity: MAJOR
- Event Summary: A fabric call has been attempted on a back to back
system
- Event Class: System
- Problem Description:
While testing a fabric PIOB route, the
system type was determined to be a Matterhorn. Matterhorn systems do not have
fabric, so it cannot be tested. Data Field: system type
- Cause / Action:
Cause: The fabric function is being used on the
wrong system. Firmware bug. Action: Capture Chassis Codes. Document the events
that led up to the problem. Contact the PDC team.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2089
- Severity: MAJOR
- Event Summary: Fabric could not determine the system type from the
backplane.
- Event Class: System
- Problem Description:
While testing a fabric PIOB route, the
system type could not be determined. This indicates that either a new system
type has been created, or the register contains faulty data. Data Field:
system type
- Cause / Action: Cause: Firmware does not support this type of
system. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2090
- Severity: MAJOR
- Event Summary: Failed to read the port's Neighbor Information
register.
- Event Class: System
- Problem Description:
A failure occurred while reading the XBC
Port's Neighbor Information register. Data Field: (port << 44) | (xbc
num << 32) | ret status
- Cause / Action: Cause: Fabric Access Failure Action: Contact HP
Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2091
- Severity: MAJOR
- Event Summary: A read of a XBC Port Status register failed.
- Event Class: System
- Problem Description:
A read of a XBC Port Status register
failed. Data Field: (xbc num << 32) | xbc port
- Cause / Action:
Cause: write to XBC failed Action: Contact HP
Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2092
- Severity: MAJOR
- Event Summary: This port's hardware has experienced a Fatal Error.
It cannot be used.
- Event Class: System
- Problem Description:
While examining a XBC port for
traversability, the Port Status Register was read from the XBC. The FE bit is
set indicating that there was a fatal problem with the link. Data Field: (port
<< 44) | (xbc num << 32) | port status
- Cause / Action:
Cause: The link may have experienced a
Multi-Bit Error. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2093
- Severity: MAJOR
- Event Summary: The port does not have the Hardware Link bit set on.
- Event Class: System
- Problem Description:
While examining a XBC port for
traversability, the Port Status Register was read from the XBC. The HW Link
bit was not set in the data read from the register. The hardware has not
detected a link connected to this port. Data Field: (port << 44) | (xbc
num << 32) | port status
- Cause / Action: Cause: The port is not connected to another chip.
Either the link is physically not attached, one side of the link is not
powered, or there are problems with the hardware. Action: Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2094
- Severity: MAJOR
- Event Summary: An unexpected failure occurred while checking the
port landmine state.
- Event Class: System
- Problem Description:
While examining a fabric route between two
XBCs, there was a failure reading a XBC port's landmine state from an XBC
scratch register. Data Field: (port << 44) | (xbc num << 32) | ret
status
- Cause / Action: Cause: There was a failure performing a read while
traversing the route to the target XBC. Action: Contact HP Support personnel
to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2095
- Severity: MAJOR
- Event Summary: The port's output enable bit is not set. The link
has not been configured.
- Event Class: System
- Problem Description:
While examining a XBC port for
traversability, the Port Status Register was read from the XBC. The OE bit is
not set, indicating that the link was not configured during boot (Fabric
Discovery). Data Field: (port << 44) | (xbc num << 32) | port
status
- Cause / Action: Cause: The link probably experienced errors before
routing occurred. The link may also have been reset which would have cleared
the OE bit. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2096
- Severity: MAJOR
- Event Summary: The XBC port is not connected to its expected
neighbor!
- Event Class: System
- Problem Description:
Each XBC port is expected to be connected
in a specific configuration according to the topology. The current
configuration is not appropriate for the topology being used. Data Field:
(port << 44) | (xbc num << 32)
- Cause / Action: Cause: The XBC link is connected wrong. The XBc
link may also have been reset which would have cleared the neighbor
information. The XBC link may be programmed incorrectly Action: Contact HP
Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2097
- Severity: MAJOR
- Event Summary: The port's presence detect bit is not set. The
fabric link is not connected.
- Event Class: System
- Problem Description:
While examining a XBC port for
traversability, the Port Status Register was read from the XBC. The presence
detect bit was not set in the data read from the register. Data Field: (port
<< 44) | (xbc num << 32) | port status
- Cause / Action:
Cause: The port is not connected to another
chip. Either the link is physically not attached, one side of the link is not
powered, or there are problems with the hardware. Action: Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2098
- Severity: MAJOR
- Event Summary: The port was found to be landmined
- Event Class: System
- Problem Description:
The port being examined has experienced
errors and has been marked to not be used. Data Field: (port << 44) |
(xbc num << 32) | ret status
- Cause / Action: Cause: There was a failure performing a read while
traversing the route to the target XBC. Action: Contact HP Support personnel
to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2099
- Severity: MAJOR
- Event Summary: There was an error reading from a fabric table in
the PDC ROM.
- Event Class: System
- Problem Description:
While checking if a route is traversable,
there was an error getting the address to the XBC Neighbor Info table stored
in the PDC ROM. Data Field: return status
- Cause / Action: Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2100
- Severity: MAJOR
- Event Summary: There was an error reading from a fabric table in
the PDC ROM.
- Event Class: System
- Problem Description:
While checking if a route is traversable,
there was an error getting the address to the XBC Neighbor Info table stored
in the PDC ROM. Data Field: return status
- Cause / Action: Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2101
- Severity: MAJOR
- Event Summary: The XBC number provided is not a valid XBC Chip Id.
- Event Class: System
- Problem Description:
The argument passed into the function is
invalid. If this code was called from a proc, then the argument should have
been checked at the proc entrance. Data Field: (port << 44) | (xbc num
<< 32)
- Cause / Action: Cause: An invalid XBC number was provided. Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2102
- Severity: MAJOR
- Event Summary: The current fabric port is connected to a CC when a
XBC was expected.
- Event Class: System
- Problem Description:
When checking if a route is traversable,
each link is put through a sanity check. This test ensures that the link
connects ports that are supposed to be connected. In this case the port
indicates it is connected to a CC, however the topology indicates it should be
connected to a XBC. Data Field: (port << 44) | (xbc num << 32)
- Cause / Action: Cause: Hardware failure. Invalid topology. Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2103
- Severity: MAJOR
- Event Summary: The port number provided is not a valid internal
port number.
- Event Class: System
- Problem Description:
The port number passed into this function
is not an internal port number. This is a misuse of the functionality. Data
Field: (port << 44) | (xbc num << 32)
- Cause / Action:
Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2104
- Severity: MAJOR
- Event Summary: The current fabric port is connected to a XBC when a
CC was expected.
- Event Class: System
- Problem Description:
When checking if a route is traversable,
each link is put through a sanity check. This test ensures that the link
connects ports that are supposed to be connected. In this case the port
indicates it is connected to a XBC, however the topology indicates it should
be connected to a CC. Data Field: (port << 44) | (xbc num << 32)
- Cause / Action: Cause: Hardware failure. Invalid topology. Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2105
- Severity: MAJOR
- Event Summary: The neighbor port number is not an internal port
number.
- Event Class: System
- Problem Description:
Fabric code uses internal port numbers for
XBC ports except when an external number is absolutely necessary. The port
number used here breaks the convention. Data Field: (port << 44) | (xbc
num << 32)
- Cause / Action: Cause: Hardware failure. Invalid topology. Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2106
- Severity: MAJOR
- Event Summary: The neighbor fabric port was expected to be
connected to a XBC.
- Event Class: System
- Problem Description:
When checking if a route is traversable,
each link is put through a sanity check. This test ensures that the link
connects ports that are supposed to be connected. In this case the port is
expected to be connected to a XBC, however the topology indicates it should be
connected to a CC. Data Field: neighbor port
- Cause / Action:
Cause: Hardware failure. Invalid topology.
Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2107
- Severity: MAJOR
- Event Summary: There was an error reading a XBC port status
register.
- Event Class: System
- Problem Description:
While checking if a route is traversable,
there was an error reading the port status register on a XBC. Data Field:
return status
- Cause / Action: Cause: Hardware problem. Intermittent XBC errors.
Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2108
- Severity: MAJOR
- Event Summary: The neighbor chip does not indicate it is connected
to the correct chip.
- Event Class: System
- Problem Description:
When checking if a route is traversable,
each link is put through a sanity check. This test ensures that the link
connects ports that are supposed to be connected. In this case the neighbor
chip thinks it is connected to a chip other than the source chip. Data Field:
(port << 44) | (xbc num << 32)
- Cause / Action:
Cause: Hardware failure. Invalid topology.
Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2109
- Severity: MAJOR
- Event Summary: The neighbor chip is not registering its neighbor
appropriately.
- Event Class: System
- Problem Description:
When checking if a route is traversable,
each link is put through a sanity check. This test ensures that the link
connects ports that are supposed to be connected. In this case the current
chip correctly identified its neighbor. However, its neighbor indicates it is
connected to something different. Data Field: (expected neighbor xbc num
<< 32) | expected neighbor port
- Cause / Action: Cause: Hardware failure. Invalid topology. Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2110
- Severity: MAJOR
- Event Summary: The XBC port is connected incorrectly.
- Event Class: System
- Problem Description:
When checking if a route is traversable,
each link is put through a sanity check. This test ensures that the link
connects ports that are supposed to be connected. In this case the neighbor
port thinks it is connected to a port other than the source port. Data Field:
(Expected neighbor port << 16) | actual neighbor port num Cause /
Action:
Cause: Hardware failure. Invalid topology. Action: Contact HP
Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2111
- Severity: MAJOR
- Event Summary: The XBC port connected is not the expected port.
- Event Class: System
- Problem Description:
When checking if a route is traversable,
each link is put through a sanity check. This test ensures that the link
connects ports that are supposed to be connected. In this case the port is
connected to the right chip, but the wrong port on that chip. Data Field:
(expected neighbor port << 16) | (actual neighbor port)
- Cause / Action:
Cause: Hardware failure. Invalid topology.
Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2112
- Severity: MAJOR
- Event Summary: The neighbor type read is not the type that was
expected.
- Event Class: System
- Problem Description:
Each fabric chip is setup a specific way
for its topology. This chip was found to be connected in an unexpected way.
Data Field: (expected neighbor type << 48) | (actual neighbor type)
- Cause / Action: Cause: Hardware failure. Invalid topology. Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2113
- Severity: MAJOR
- Event Summary: The fabric chip's neighbor info indicates an unknown
neighbor type.
- Event Class: System
- Problem Description:
When checking if a route is traversable,
each link is put through a sanity check. This test ensures that the link
connects ports that are supposed to be connected. In this case the neighbor
info register contains invalid information. Data Field: neighbor type Cause /
Action:
Cause: Hardware failure. Invalid topology. Action: Contact HP
Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2114
- Severity: MAJOR
- Event Summary: A XBC port route around has occurred
- Event Class: System
- Problem Description:
During fabric routing a port on a XBC was
found in error or had been previously marked as in error. PDC will route
around this XBC port. Data Field: (XBC # << 32) | external XBC port
number
- Cause / Action: Cause: During routing, when a XBC to XBC port is
found to be in error, or was previously marked in error, it is routed around.
This chassis code indicates that which XBC port was routed around. A
subsequent FABRIC_REMOTE_ROUTING chassis code should indicate what the route
around for the port is. Action: Contact HP Support personnel to analyze the
crossbars, flex cables, backplanes, and other fabric components.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2115
- Severity: MAJOR
- Event Summary: Could not determine health state of a XBC port.
- Event Class: System
- Problem Description:
During the collection of neighbor info, the
health of the port could not be determined. This port was expected to be
healthy. Data Field: (xbc num << 32) | xbc port
- Cause / Action:
Cause: XBC register read failure Action:
Contact HP Support personnel to analyze the crossbar chip.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2116
- Severity: MAJOR
- Event Summary: Failed reading the XBC port register
- Event Class: System
- Problem Description:
While checking the port health, a read to
the Port Status register or a Scratch Register failed. Data Field: (xbc num
<< 32) | xbc port
- Cause / Action: Cause: XBC register read failure Action: Contact HP
Support personnel to analyze the crossbar chip.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2117
- Severity: MAJOR
- Event Summary: The XBC number provided is not a valid XBC Chip Id.
- Event Class: System
- Problem Description:
The argument passed into the function is
invalid. If this code was called from a proc, then the argument should have
been checked at the proc entrance. Data Field: (port << 44) | (xbc num
<< 32)
- Cause / Action: Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2118
- Severity: MAJOR
- Event Summary: The port number provided is not a valid XBC Internal
Port Number
- Event Class: System
- Problem Description:
The argument passed into the function is
invalid. If this code was called from a proc, then the argument should have
been checked at the proc entrance. Data Field: (port << 44) | (xbc num
<< 32)
- Cause / Action: Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2119
- Severity: MAJOR
- Event Summary: PDC cannot determine the system's topology
- Event Class: System
- Problem Description:
PDC initially determines the system's
topology early in fabric discovery. Later in fabric discovery PDC compares
the topology found by Discover Topology with the topology it sees. If the two
do not match this chassis code is sent. This chassis code should only come out
when port 4 is not routable and PDC sees a connection to a fabric component on
port 5 that it does not expect. Data Field: (xbc num << 32) | topology
- Cause / Action: Cause: There is a fabric problem that causes two
different XBCs to appear as if they have different topologies. There is
probably a broken link that needs to be repaired. Action: Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2120
- Severity: MAJOR
- Event Summary: Could not complete routing of the kitty-korner XBC
- Event Class: System
- Problem Description:
Port 4 on the local XBC was broken and a
route-around was attempted. During the route-around, there was a problem
performing remote routing on the kitty-korner XBC. Chassis codes sent before
this one may provide more details about the exact nature of the problem. The
executing cell will attempt a fabricless boot. Data Field: (xbc num <<
32) | return status
- Cause / Action: Cause: The local XBC's port 4 is not healthy. A
failure was encountered while performing remote routing on the kitty-korner
XBC, most likely due to a problem with the system backplane or local cell.
Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2121
- Severity: MAJOR
- Event Summary: Could not complete routing of the sister XBC
- Event Class: System
- Problem Description:
Port 4 on the local XBC was broken and a
route-around was attempted. During the route-around, there was a problem
performing remote routing on the sister XBC. Chassis codes sent before this
one may provide more details about the exact nature of the problem. The
executing cell will attempt a fabricless boot. Data Field: (xbc num <<
32) | return status
- Cause / Action: Cause: The local XBC's port 4 is not healthy. A
failure was encountered while performing remote routing on the sister XBC,
most likely due to a problem with the system backplane or local cell. Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2122
- Severity: MAJOR
- Event Summary: PDC cannot determine the system's topology
- Event Class: System
- Problem Description:
PDC initially determines the system's
topology early in fabric discovery. Later in fabric discovery PDC compares
the topology found by DiscoverTopology with the topology it sees. If the two
do not match this chassis code is sent. This chassis code should only come out
when ports 4 and 5 are not routable. In such a case, PDC could be running in a
dual-cabinet configuration with two links (to the other cabinet) being broken.
Data Field: (xbc num << 32) | topology
- Cause / Action:
Cause: There is a fabric problem that causes
two different XBCs to appear as if they have different topologies. There is
likely a broken link in the fabric. Action: Contact HP Support personnel to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2123
- Severity: CRITICAL
- Event Summary: Remote routing failed
- Event Class: System
- Problem Description:
Remote routing failed. Chassis codes sent
before this one may provide more details about the exact nature of the
problem. The executing cell will attempt a fabricless boot. Data Field: return
status
- Cause / Action: Cause: Remote routing failure Action: Contact HP
Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2124
- Severity: MAJOR
- Event Summary: This system is a Fatboy with a bad built-in port and
a bad port 5
- Event Class: System
- Problem Description:
The system was determined to be a fatboy.
The local XBC's built-in port and port 5 were both unhealthy. Therefore, too
many links are broken to continue. The executing cell will attempt a
fabricless boot. Data Field: (built-in port health << 48) | port 5 health
- Cause / Action: Cause: The built-in port and port 5 of the local
XBC are not healthy. Action: Contact HP Support personnel to analyze the
crossbar and flex cables.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2125
- Severity: MAJOR
- Event Summary: There are too many broken XBC links in the system
- Event Class: System
- Problem Description:
System is a fatboy. Ports 4 & 5 of the
local XBC are both broken. Chassis codes sent before this one may provide more
details about the exact nature of the problem. The executing cell will attempt
a fabricless boot. Data Field: (xbc num << 32) Cause / Action:
Cause:
Both the ports 4 and 5 of the local XBC had errors. Action: Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2126
- Severity: MAJOR
- Event Summary: PDC cannot determine the system's topology.
- Event Class: System
- Problem Description:
PDC initially determines the system's
topology early in fabric discovery. Later in fabric discovery PDC compares
the topology found by DiscoverTopology with the topology it sees. If the two
do not match this chassis code is sent. Data Field: (xbc num << 32) |
topology
- Cause / Action: Cause: There is a fabric problem that causes two
different XBCs to appear as if they have different topologies. There is likely
a broken link in the fabric. Action: Contact HP Support personnel to analyze
the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2127
- Severity: MAJOR
- Event Summary: Couldn't get the XBC number connected to port 5 of
the sister XBC
- Event Class: System
- Problem Description:
Port 5 of the sister XBC is connected to
something, but the number of the XBC to which it is connected could not be
determined. Could be because the link is not healthy. Data Field: (port
<< 44) | (xbc num << 32) | ret status
- Cause / Action: Cause: A hardware problem with the sister XBC or
the link connected to port 5 of the sister XBC. Action: Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2128
- Severity: MAJOR
- Event Summary: Too many XBC-to-XBC were broken in the complex.
- Event Class: System
- Problem Description:
Both the built-in port and port 4 of the
Local XBC are broken. The executing cell will halt. Data Field: (built-in port
health << 32) | port 4 health
- Cause / Action: Cause: Port status indicated that both the built-in
port and port 4 had errors. Action: Contact HP Support personnel to analyze
the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2129
- Severity: MAJOR
- Event Summary: Error while writing the topology to the XBC general
purpose register.
- Event Class: System
- Problem Description:
During remote routing, a failed XBC
register access (read or write) prevented the fabric topology from being
written to a XBC global general purpose register. Data Field: (xbc num
<< 32) | return status
- Cause / Action: Cause: XBC register read failure Action: Contact HP
Support personnel to analyze the crossbar chip.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2130
- Severity: CRITICAL
- Event Summary: A routing error has been discovered
- Event Class: System
- Problem Description:
A routing error was read from a XBC's
General Purpose Register 3. The cell will attempt a fabricless boot. Data
Field: 0x0BADBADBADBADBAD - failed routing opposite corner XBC - must find
current XBC being routed from previous chassis codes 0x0000000000000000 -
cell's local XBC has been noted having a routing error.
- Cause / Action:
Cause: Possibilities include (but are not
limited to):: Failed link Defective XBC port Found a invalid device on the
fabric System backplane error Action: Contact HP Support personnel to analyze
the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2131
- Severity: MAJOR
- Event Summary: While routing the XBC to XBC ports, a read of the
XBC Forward Progress failed.
- Event Class: System
- Problem Description:
A read to the XBC scratch register used to
store the forward progress state failed. This state indicates which port is to
be routed next. Since the read failed, the state cannot be determined. The
processor will indicate that it encountered routing errors. Data Field: return
status
- Cause / Action: Cause: XBC register read failure Action: Contact HP
Support personnel to analyze the crossbar chip.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2132
- Severity: MAJOR
- Event Summary: Fabric Discovery was stuck in a loop for 10 seconds
- Event Class: System
- Problem Description:
While routing a remote XBC, a cell
occasionally gets stuck in a loop because the XBC's forward progress state is
not updated correctly. This chassis code indicates that the cell has been in
this loop for ten seconds and will now reboot. The cell should join the
partition properly on the next boot. Data Field: (target cell << 56) |
(xbc num << 32) | forward progress
- Cause / Action: Cause: The XBC forward progress state is trashed.
Upon reboot, the cell should join the PD and finish booting. Action: Contact
HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2133
- Severity: MAJOR
- Event Summary: While trying to route the XBC ports, an unexpected
fwd progress state was found
- Event Class: System
- Problem Description:
While trying to route the XBC ports, an
unexpected forward progress state was found. This may cause the processor to
get stuck in an endless loop. A timer will be started to prevent the processor
from being assassinated. Data Field: (target cell << 56) | (xbc num
<< 32) | forward progress
- Cause / Action: Cause: The XBC forward progress state is trashed.
Upon reboot, the cell should join the PD and finish booting. Action: Contact
HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2134
- Severity: MAJOR
- Event Summary: The XBC SM4 is being taken from another cell
- Event Class: System
- Problem Description:
A cell that owns the SM4 has not made
sufficient progress in routing so another cell is attempting to take
ownership. Data Field: Data Field: (cell << 56) | (port << 44) |
(xbc << 32) | return status
- Cause / Action: Cause: The cell which owns the SM4 has not made
sufficient routing progress Action: Contact HP Support personnel to analyze
the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2135
- Severity: MAJOR
- Event Summary: This cell did not get the XBC Global Semaphore.
- Event Class: System
- Problem Description:
After unlocking the XBC Global Semaphore
for a takeover, this cell did not get the semaphore. Data Field: (cell
<< 56) | (port << 44) | (xbc << 32) | return status Cause /
Action:
Cause: Another cell won the race and got the semaphore before this
cell. This would be apparent in chassis codes. XBC write or read failure.
Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2136
- Severity: MAJOR
- Event Summary: PDC attempted a fabric SM4 takeover but had a
problem reading the SM4
- Event Class: System
- Problem Description:
When a cell holds a fabric semaphore for an
extended period of time, PDC will attempt to takeover the semaphore so that
the rest of the cells will have access to it. This chassis code is sent when
PDC successfully releases the SM4 from the cell that hung, but then fails to
read the SM4 as part of obtaining the SM4 for itself. Data Field: (cell
<< 56) | (port << 44) | (xbc << 32) | return status
- Cause / Action: Cause: There was a fabric failure reading the XBCs
CSRs. Action: Look for FABRIC_READ_ERROR_xxx chassis codes or a chassis code
indicating the data from the XBC slices are different. Contact HP Support
personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2137
- Severity: CRITICAL
- Event Summary: A problem occurred in routing the fabric
- Event Class: System
- Problem Description:
A problem occurred in routing the complex.
The cell will halt. Refer to the FABRIC_ROUTING_ERROR chassis code for more
information. Data Field: 0x0000000000000000
- Cause / Action:
Cause: A problem occurred in routing the
fabric. See the FABRIC_ROUTING_ERROR chassis code for more details Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2138
- Severity: MAJOR
- Event Summary: An unknown routing state was encountered
- Event Class: System
- Problem Description:
An unknown routing state was read from the
XBC scratch register. Data Field: xbc num
- Cause / Action: Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2139
- Severity: MAJOR
- Event Summary: XBC ECC error
- Event Class: System
- Problem Description:
An ECC error was detected across the XBC
link.
- Cause / Action: Cause: An ECC error detected on the XBC Link
Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2140
- Severity: MAJOR
- Event Summary: PDC did not recognize the fabric topology of the
system.
- Event Class: System
- Problem Description:
PDC was determining which XBCs are expected
to be present based on the system's topology. The topology was stored on the
XBC during fabric discovery by the PDC that routed the fabric. PDC did not
recognize the topology stored on the XBC or did not expect the topology it
found. Data Field: return status
- Cause / Action: Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2141
- Severity: MAJOR
- Event Summary: PDC did not recognize the system type in the
external backplane type register.
- Event Class: System
- Problem Description:
PDC uses the backplane type to control how
it determines what parts of the fabric are present. PDC did not recognize the
backplane type. Data Field: system type
- Cause / Action:
Cause: Unknown system type Action: Contact HP
Support personnel to analyze the backplanes and activity logs
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2142
- Severity: MAJOR
- Event Summary: PDC did not recognize the fabric topology of the
system
- Event Class: System
- Problem Description:
PDC was verifying that a cell could exist
in the fabric topology of the machine on which it is running. The topology was
stored on the XBC during fabric discovery by the PDC that routed the fabric.
PDC did not recognize the topology stored on the XBC. Data Field: topology
- Cause / Action: Cause: Unknown system type Action: Contact HP
Support personnel to analyze the backplanes and activity logs
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2143
- Severity: MAJOR
- Event Summary: PDC could not determine the fabric topology of the
system
- Event Class: System
- Problem Description:
This chassis code is sent when PDC is
trying to see if a cell exists in the current system's topology. PDC
determines the topology during boot and stores it in an XBC CSR. This chassis
code is sent when PDC cannot read that CSR. Data Field: return status
- Cause / Action: Cause: The failure was probably one of the
following: a multi-bit error reading a fabric CSR, unable to access an XBC,
XBC bit slices returned inconsistent data. Look for fabric problems. Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2144
- Severity: MAJOR
- Event Summary: Tried all XBC links to do route around
- Event Class: System
- Problem Description:
Tried all XBC links while trying to route
around the fabric Data Field: XBC # currently trying to route
- Cause / Action:
Cause: No more links to try to route around
Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2145
- Severity: MAJOR
- Event Summary: An unexpected fabric status error has occurred
- Event Class: System
- Problem Description:
Could not determine if there was a MBE, if
the XBC slices are different, if address was not in range. Data Field: Status
which could not be determined
- Cause / Action: Cause: Error in reading / writing a XBC CSR Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2146
- Severity: MAJOR
- Event Summary: Error in determining the numbers of the 1st and last
XBCs in the complex.
- Event Class: System
- Problem Description:
Failed in an attempt to determine the
numbers of the first and last XBCs in the complex. A chassis code preceding
this one will give more details about the nature of the problem. The executing
cell will attempt a fabricless boot. Data Field: return status
- Cause / Action: Cause: A problem accessing the local XBC. Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2147
- Severity: CRITICAL
- Event Summary: A XBC register read access failed.
- Event Class: System
- Problem Description:
An attempt was made to read the landmine
state from the XBC general purpose register, but the read access failed. The
executing cell will attempt a fabricless boot. Data Field: (port << 44)
| (xbc num << 32) | ret status
- Cause / Action: Cause: A hardware failure caused an error during a
XBC register read access. Action: Contact HP Support personnel to analyze the
fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2148
- Severity: CRITICAL
- Event Summary: The number of landmined XBC ports is not allowed.
- Event Class: System
- Problem Description:
The number of landmined XBC ports was not
within the allowable range. There is a minimum number of landmined ports
because some ports are always unused. There is a maximum number of landmined
ports because there is a limit to the number of broken links allowed in a
system. The executing cell will attempt a fabricless boot due to this error.
Data Field: landmine count
- Cause / Action: Cause: PDC runtime error, which was probably
exposed due to a hardware failure. Action: Contact HP Support personnel to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2149
- Severity: CRITICAL
- Event Summary: The port on one side of a link was landmined, but
the neighbor port was not.
- Event Class: System
- Problem Description:
If a link is landmined, the XBC ports on
both sides of the link should indicate the landmine. The port specified in the
data field of this chassis code was NOT landmined, even though the port on the
other side of the link was. The executing cell will attempt a fabricless boot.
Data Field: (port << 44) | (xbc num << 32) | ret status
- Cause / Action: Cause: PDC runtime error, probably exposed by a
hardware failure. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2150
- Severity: CRITICAL
- Event Summary: A XBC register read access failed.
- Event Class: System
- Problem Description:
Attempted to read a neighbor XBC's port
status register, but failed. The executing cell will attempt a fabricless
boot. Data Field: (port << 44) | (xbc num << 32) | ret status
- Cause / Action: Cause: A hardware failure caused an error during a
XBC register read access. Action: Contact HP Support personnel to analyze the
fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2151
- Severity: CRITICAL
- Event Summary: A XBC register read access failed.
- Event Class: System
- Problem Description:
An attempt was made to read the landmine
state from the XBC general purpose register, but the read access failed. The
executing cell will attempt a fabricless boot. Data Field: (port << 44)
| (xbc num << 32) | ret status
- Cause / Action: Cause: A hardware failure caused an error during a
XBC register read access. Action: Contact HP Support personnel to analyze the
fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2152
- Severity: CRITICAL
- Event Summary: A XBC register read access failed.
- Event Class: System
- Problem Description:
Attempted to read the XBC port status
register, but failed. The executing cell will attempt fabricless boot. Data
Field: (port << 44) | (xbc num << 32) | ret status
- Cause / Action:
Cause: A hardware failure caused an error
during a XBC register read access. Action: Contact HP Support personnel to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2153
- Severity: CRITICAL
- Event Summary: A XBC register read access failed.
- Event Class: System
- Problem Description:
Attempted to obtain the port's neighbor
information by reading the XBC port neighbor information register, but the
read access failed. The executing cell will attempt fabricless boot. Data
Field: (port << 44) | (xbc num << 32) | ret status
- Cause / Action:
Cause: A hardware failure caused an error
during a XBC register read access. Action: Contact HP Support personnel to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2154
- Severity: MAJOR
- Event Summary: The specified XBC could not be reached via its PIOB
route.
- Event Class: System
- Problem Description:
Could not traverse to the XBC using the
PIOB route. Chassis codes sent before this one should give more details about
the exact nature of the problem. The executing cell will attempt a fabricless
boot. Data Field: (port << 44) | (xbc num << 32) | ret status
- Cause / Action: Cause: A hardware failure caused the PIOB route to
be invalid. PDC runtime error. Action: Contact HP Support personnel to analyze
the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2155
- Severity: MAJOR
- Event Summary: A XBC register read access failed.
- Event Class: System
- Problem Description:
There was an error reading one of the XBC
routing registers. Data Field: (port << 44) | (xbc num << 32) |
ret status
- Cause / Action: Cause: A XBC register read access failed due to a
hardware problem. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2156
- Severity: MAJOR
- Event Summary: The XBC general purpose register indicates an
unknown topology.
- Event Class: System
- Problem Description:
The fabric topology read from the XBC
general purpose register was unrecognized. Data Field: (port << 44) |
(xbc num << 32) | ret status
- Cause / Action: Cause: XBC register read failure Action: Contact HP
Support personnel to analyze the crossbar chip.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2157
- Severity: MAJOR
- Event Summary: The routing for this XBC is not useable.
- Event Class: System
- Problem Description:
The routing tables for the XBC were not
valid. Refer to preceding chassis codes for details about the nature of the
problem. The executing cell will attempt a fabricless boot. Data Field: (port
<< 44) | (xbc num << 32) | ret status
- Cause / Action:
Cause: Could not access the XBC. PDC runtime
error. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2158
- Severity: MAJOR
- Event Summary: An internal PDC function was called for a topology
not supported by that fcn.
- Event Class: System
- Problem Description:
At the end of fabric discovery, an internal
PDC function was called to validate the fabric state. However, the function
does not support the fabric topology. The executing cell will attempt a
fabricless boot. Data Field: topology
- Cause / Action: Cause: PDC runtime error. Action: Contact HP
Support personnel to analyze the fabric and PDC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2159
- Severity: CRITICAL
- Event Summary: PDC tried to get a fabric semaphore and detected a
fatal error.
- Event Class: System
- Problem Description:
There was a fabric access problem when
trying to grab the Global SM4. The executing cell will attempt a fabricless
boot. Data Field: (port << 44) | (xbc num << 32) | ret status
- Cause / Action: Cause: This is probably an intermittent hardware
failure. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2160
- Severity: CRITICAL
- Event Summary: Auditing the XBC Global Semaphore for ownership has
failed.
- Event Class: System
- Problem Description:
A failure occurred while checking if the
semaphore's owner is making progress. This is a sign of a fabric connectivity
problem. The cell will attempt a fabricless boot! Data Field: (port <<
44) | (xbc num << 32) | ret status
- Cause / Action:
Cause: likely fabric hardware failure Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2161
- Severity: CRITICAL
- Event Summary: Waiting for the XBC Global Semaphore has timed out.
- Event Class: System
- Problem Description:
During Fabric Discovery, the cell will wait
until it gets the XBC's Global Semaphore. It waits for a very long time. This
chassis code indicates that the wait has timed out. As a result, the cell will
reboot. Data Field: (port << 44) | (xbc num << 32) | ret status
- Cause / Action: Cause: XBC Key Contention. Hardware Failure Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2162
- Severity: MAJOR
- Event Summary: There was a Multi-Bit Error detected during the XBC
write.
- Event Class: System
- Problem Description:
There was a Multi-Bit Error detected during
the XBC write. Data Field: xbc num
- Cause / Action: Cause: likely fabric hardware failure Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2163
- Severity: MAJOR
- Event Summary: Fabric write did not compare
- Event Class: System
- Problem Description:
A read after write did not compare Data
Field: (data read after write << 32) | (desired data to be written)
- Cause / Action: Cause: XBC slice has failed CC to XBC link failed
Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2164
- Severity: MAJOR
- Event Summary: Could not read the routing register for the PIOB
route.
- Event Class: System
- Problem Description:
Testing the fabric data route between two
XBCs requires testing of the PIOB route as well. During this test, a read
failure occurred which prevent the read of the routing register needed for the
PIOB route. Or the routing register was uninitialized. Data Field: return
status
- Cause / Action: Cause: XBC or Port reset prior to the read. Fabric
access failure. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2165
- Severity: MAJOR
- Event Summary: A link in the fabric Data route is not useable.
- Event Class: System
- Problem Description:
Testing the fabric data route between two
XBCs. During this test, a link on the PIOB route was found to have errors
preventing its use. Data Field: (port << 44) | (xbc num << 32)
- Cause / Action: Cause: The Data link has errors Action: Contact HP
Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2166
- Severity: MAJOR
- Event Summary: A link in the fabric Data route is not useable.
- Event Class: System
- Problem Description:
Testing the fabric data route between two
XBCs. During this test, a fabric access failure occurred which prevented
completion of the testing. The link is not traversable. Data Field: (port
<< 44) | (xbc num << 32)
- Cause / Action: Cause: The Data link has errors Action: Contact HP
Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2167
- Severity: MAJOR
- Event Summary: Couldn't get Local XBC num from CC
- Event Class: System
- Problem Description:
An error reading the CC prevented PDC from
obtaining the number of the local XBC Data Field: return status
- Cause / Action:
Cause: Failed to read a CSR on the CC. Action:
Contact HP Support personnel to analyze the CC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2168
- Severity: MAJOR
- Event Summary: The XBC number provided is not a valid XBC Chip Id.
- Event Class: System
- Problem Description:
The argument passed into the function is
invalid. If this code was called from a proc, then the argument should have
been checked at the proc entrance. Data Field: (local xbc num << 32) |
target XBC
- Cause / Action: Cause: An invalid XBC number was provided. Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2169
- Severity: MAJOR
- Event Summary: The PIOB route to the neighbor XBC is not useable.
- Event Class: System
- Problem Description:
The PIOB route to the neighbor XBC is no
longer traversable. It can no longer be used. Data Field: (xbc num <<
32)
- Cause / Action: Cause: Errors have been found somewhere on the
fabric route. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2170
- Severity: MAJOR
- Event Summary: The PIOB route to the neighbor XBC is not useable.
- Event Class: System
- Problem Description:
The PIOB route to the neighbor XBC could
not be fully tested. An error was encountered which prevented completion of
the tests. The route is no longer useable. Data Field: (xbc num << 32)
- Cause / Action: Cause: Errors have been encountered somewhere on
the fabric route. Fabric access failure. Action: Contact HP Support personnel
to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2171
- Severity: MAJOR
- Event Summary: The local XBC to local cell link is bad
- Event Class: System
- Problem Description:
The link between the local cell and the
local XBC is not healthy, as indicated by either the CC or the XBC. Data
Field: (cell << 56) | (xbc num << 32)
- Cause / Action:
Cause: The link between the local cell and
local XBC is bad. Action: Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2172
- Severity: MAJOR
- Event Summary: A failure was encountered on the cell's link to the
fabric.
- Event Class: System
- Problem Description:
While testing the fabric route between two
XBCs, an error was encountered on the link between the local cell and the
local XBC. Probably a fabric access failure. Data Field: (cell << 56) |
(xbc num << 32)
- Cause / Action: Cause: The CC to XBC link may not be connected.
There may be intermittent errors. Action: Contact HP Support personnel to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2173
- Severity: MAJOR
- Event Summary: An invalid XBC number was passed into a fabric walk
function.
- Event Class: System
- Problem Description:
This function is only intended to traverse
routes that either start or end at the local XBC. This chassis code indicates
that this construct is not satisfied. Data Field:(target xbc << 60) |
(xbc num << 32)
- Cause / Action: Cause: An invalid XBC number was provided. Action:
Contact HP Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2174
- Severity: MAJOR
- Event Summary: A link in the fabric PIOB route is not useable.
- Event Class: System
- Problem Description:
Testing the fabric data route between two
XBCs requires testing of the PIOB route as well. During this test, a link on
the PIOB route was found to have errors preventing its use. Data Field: (port
<< 44) | (xbc num << 32)
- Cause / Action: Cause: The PIOB link has errors Action: Contact HP
Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2175
- Severity: MAJOR
- Event Summary: A link in the fabric PIOB route is not useable.
- Event Class: System
- Problem Description:
Testing the fabric data route between two
XBCs requires testing of the PIOB route as well. During this test, a fabric
access failure occurred which prevented completion of the testing. The link is
not traversable. Data Field: (port << 44) | (xbc num << 32)
- Cause / Action: Cause: The PIOB link has errors Action: Contact HP
Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2176
- Severity: MAJOR
- Event Summary: There was an error reading the XBC port's remote
routing register.
- Event Class: System
- Problem Description:
While testing the fabric route between two
XBCs, a port's remote routing register either could not be read or was found
to be uninitialized. This code should not be called prior to fabric routing,
so this indicates that there is a problem on the port. Data Field: return
status
- Cause / Action: Cause: Routing register no longer contains valid
information or is no longer accessible. Action: Contact HP Support personnel
to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2177
- Severity: MAJOR
- Event Summary: Could not read the XBC Port's Neighbor Info Register
- Event Class: System
- Problem Description:
Testing the fabric route between two XBCs.
An unexpected error occurred while reading the port's Neighbor Info register
on the XBC. Data Field: (port << 44) | (xbc num << 32) | ret
status
- Cause / Action: Cause: Fabric access error. Action: Contact HP
Support personnel to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2178
- Severity: MAJOR
- Event Summary: The XBC Global Semaphore is not owned, yet is
required to write this register.
- Event Class: System
- Problem Description:
During XBC writes, the CSR addresses are
scanned to determine if they are protected by the Global Semaphore. If they
are protected, then the semaphore must be owned in order for a write to
proceed. Data Field: csr address
- Cause / Action: Cause: A semaphore takeover has occurred. This cell
took too long to route the fabric and now it must halt. PDC tried to write a
protected CSR without the semaphore. Action: Contact HP Support personnel to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2179
- Severity: FATAL
- Event Summary: PDC failed to read the cabinet type for another cell
in the partition
- Event Class: System
- Problem Description:
PDC checks that all of the cells in a
partition are installed in the same type of cabinet. PDC failed to read the
cabinet type for another cell in the partition. PDC will reset all of the
cells in the partition when this error is detected. The data field contains
the physical location of the cell reporting the event.
- Cause / Action:
Cause: PDC was unable to read a data structure
for another cell in the partition. This should never happen unless there is an
intermittent problem with the main backplane. Action: Contact HP support to
confirm that the main backplane is functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2180
- Severity: FATAL
- Event Summary: PDC failed to read the CPU HVERSION for another cell
in the partition
- Event Class: System
- Problem Description:
PDC attempts to insure that all of the CPUs
in the partition have the same HVERSION. PDC failed to read the CPU HVERSION
for another cell in the partition. PDC will reset all of the cells in the
partition when this error is detected. The data field contains the physical
location of the cell detected the event.
- Cause / Action:
Cause: PDC was unable to read a data structure
for another cell in the partition. This should never happen unless there is an
intermittent problem with the main backplane. Action: Contact HP support to
confirm that the main backplane is functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2181
- Severity: FATAL
- Event Summary: PDC failed to read the CPU speeds for another cell
in the partition
- Event Class: System
- Problem Description:
PDC attempts to make sure that all of the
CPUs in the partition run at the same speed. This chassis code is sent when
PDC is unable to perform this check PDC will reset all of the cells in the
partition when this error is detected. The data field contains the physical
location of the cell detecting the event.
- Cause / Action: Cause: PDC was unable to read a data structure for
another cell in the partition. This should never happen unless there is an
intermittent problem with the main backplane. Action: Contact HP support to
confirm that the main backplane is functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2182
- Severity: FATAL
- Event Summary: Could not access local rendezvous set data in cell
previously accessed
- Event Class: System
- Problem Description:
All of the cells create a set of cells that
they could rendezvous with. This cell tried to read that set on another cell
and failed. PDC will reset all of the cells in the partition when this error
is detected.
- Cause / Action: Cause: PDC was unable to read a data structure for
another cell in the partition. This should never happen unless there is an
intermittent problem with the main backplane. Action: Contact HP support to
confirm that the main backplane is functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2183
- Severity: FATAL
- Event Summary: Unable to access the PDC version info on cell in
rendezvous set
- Event Class: System
- Problem Description:
PDC checks to insure that all of the cells
have the same version of PDC. PDC failed accessing the PDC version on another
cell. PDC will reset all of the cells in the partition when this error is
detected. The data field contains the physical location of the cell detecting
the event.
- Cause / Action: Cause: PDC was unable to read a data structure for
another cell in the partition. This should never happen unless there is an
intermittent problem with the main backplane. Action: Contact HP support to
confirm that the main backplane is functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2184
- Severity: MAJOR
- Event Summary: PDC was unable to read a data structure on the local
cell board.
- Event Class: System
- Problem Description:
PDC was unable to read a data structure on
the local cell board. When this error is detected, the cell will be reset for
reconfiguration and will not join the partition on this boot. The data field
contains the physical location of the cell that detected the event.
- Cause / Action: Cause: The cell board or PDH riser card may not be
functioning correctly. Action: Contact HP support to confirm that the cell
board and PDH riser card are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2185
- Severity: MAJOR
- Event Summary: Boot failed because the IPL address was either zero
or not 2K aligned.
- Event Class: System
- Problem Description:
IPL address is the byte offset from the
start of the boot device to the program IPL. This chassis code comes out when
a boot fails because the IPL address was either zero or not 2K aligned.
- Cause / Action: Cause: Bad boot disk image or network boot image.
Action: Ensure the correct boot path is being used to access the boot device.
Replace boot disk image or network boot image.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2186
- Severity: MAJOR
- Event Summary: Boot failed because the LIF image's checksum was
invalid.
- Event Class: System
- Problem Description:
Boot failed because the LIF image's
checksum was invalid. The system should return to BCH.
- Cause / Action:
Cause: Bad boot disk image or network boot
image. Action: Ensure the correct boot path is being used to access the boot
device. Replace boot disk image or network boot image.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2187
- Severity: MAJOR
- Event Summary: Boot failed because IPL ENTRY offset was invalid
- Event Class: System
- Problem Description:
IPL ENTRY is the offset into the IPL
program where execution starts. A chassis codes is emitted when a boot fails
because this offset is not less than the size of the IPL image or is not word
aligned.
- Cause / Action: Cause: Bad boot disk image or network boot image.
Action: Ensure the correct boot path is being used to access the boot device.
Replace boot disk image or network boot image.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2188
- Severity: MAJOR
- Event Summary: Boot failed because size of IPL was invalid
- Event Class: System
- Problem Description:
IPL size is the total size (in bytes) of
the IPL program. This chassis code is emitted on a failed boot due to IPL size
being zero, not 2K aligned, or greater than 256 K.
- Cause / Action:
Cause: Bad boot disk image or network boot
image. Action: Ensure the correct boot path is being used to access the boot
device. Replace boot disk image or network boot image.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2189
- Severity: MAJOR
- Event Summary: Boot failed - LIF image had an invalid value for
magic number
- Event Class: System
- Problem Description:
Boot failed because LIF (boot disk or
network boot) image had an invalid value for the HP-architected magic number.
System should return to BCH.
- Cause / Action: Cause: The boot path did not specify a valid HP
boot disk image or network boot image. Action: Ensure the correct boot path is
being used to access the boot device. Replace boot disk image or network boot
image.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2190
- Severity: MAJOR
- Event Summary: At rendezvous, a cell is found to be incompatible
with the core cell.
- Event Class: System
- Problem Description:
At rendezvous, a cell is found to be
incompatible with the core cell. Data field contains physical location of the
incompatible cell. This chassis code should be immediately preceded by a
chassis code explaining the specific incompatibility, (e.g.
BOOT_INCOMPATIBLE_CPU_ID)
- Cause / Action: Cause: Cell is incompatible with core cell. Action:
See the preceding chassis code in the log for specific incompatibility and
proper action.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2191
- Severity: MAJOR
- Event Summary: Failed to program the partition set into the CC
- Event Class: System
- Problem Description:
PDC could not program the CC to recognize
which cells are in the partition. The cell will be reset. The data field
contains the value that was attempted to program.
- Cause / Action: Cause: A hardware problem with the CC or the cell
board. Action: Contact HP Support to confirm the CC and cell board are
functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2192
- Severity: MAJOR
- Event Summary: Cell did not rendezvous with the rest of the cells
in the partition
- Event Class: System
- Problem Description:
A cell that is booting checks to determine
if the other cells in the partition are ready to rendezvous with it. If the
other cells in the partition have already rendezvoused, the cell cannot join
the partition. PDC accommodates a relatively large skew between cells, but
will eventually give up waiting on a cell. Data field is the physical location
of the cell that "missed the boat".
- Cause / Action:
Cause: The cell booted too slowly. It either
started booting too late or has some problem that caused it to boot slowly.
Action: Reboot the partition from the MP and see if the cell is able
rendezvous into the partition. If the cell is still too slow, contact HP
Support to confirm the cell board and CPUs are function properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2193
- Severity: MAJOR
- Event Summary: Physical location of the cell from
BOOT_CELL_STATE_ERROR_STATUS
- Event Class: System
- Problem Description:
This is an informational IPMI event used to
provide the physical location of the cell board affected by an error indicated
by a preceding IPMI event. The data field holds the physical location of the
cell board.
- Cause / Action: Cause: Refer to preceding high-alert level IPMI
events for cause/action information. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2194
- Severity: FATAL
- Event Summary: Cells in the partition have different complex
profiles.
- Event Class: System
- Problem Description:
Cell boards in the same partition have
different complex profiles. The partition will be rebooted and cannot be fully
booted until the problem is resolved. The data field is a bitmap of cells
where cell 0 is the least significant bit and cell 63 is the most significant
bit. A one on a cell's bit indicated that the cell has a complex profile that
did not match that of the core cell.
- Cause / Action:
Cause: An error occurred which prevented the
complex profiles from being distributed properly. Action: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause: A hardware problem exists with MP or PDHC hardware.
Action: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2195
- Severity: FATAL
- Event Summary: Cells in the partition have different complex
profiles.
- Event Class: System
- Problem Description:
Cell boards in the same partition have
different complex profiles. The partition will be rebooted and cannot be fully
booted until the problem is resolved. The data field is a bitmap of cells
where cell 0 is the least significant bit and cell 63 is the most significant
bit. A one on a cell's bit indicated that the cell has a complex profile that
did not match that of the core cell.
- Cause / Action:
Cause: An error occurred which prevented the
complex profiles from being distributed properly. Action: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause: A hardware problem exists with MP or PDHC hardware.
Action: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2196
- Severity: FATAL
- Event Summary: Cells in the partition have different complex
profiles.
- Event Class: System
- Problem Description:
Cell boards in the same partition have
different complex profiles. The partition will be rebooted and cannot be fully
booted until the problem is resolved. The data field is a bitmap of cells
where cell 0 is the least significant bit and cell 63 is the most significant
bit. A one on a cell's bit indicated that the cell has a complex profile that
did not match that of the core cell.
- Cause / Action:
Cause: An error occurred which prevented the
complex profiles from being distributed properly. Action: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause: A hardware problem exists with MP or PDHC hardware.
Action: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2197
- Severity: FATAL
- Event Summary: Cells in the partition have different complex
profiles
- Event Class: System
- Problem Description:
Cell boards in the same partition have
different complex profiles. The partition will be rebooted and cannot be fully
booted until the problem is resolved. The data field is a bitmap of cells
where cell 0 is the least significant bit and cell 63 is the most significant
bit. A one on a cell's bit indicated that the cell has a complex profile that
did not match that of the core cell.
- Cause / Action:
Cause: An error occurred which prevented the
complex profiles from being distributed properly. Action: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause: A hardware problem exists with MP or PDHC hardware.
Action: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2198
- Severity: FATAL
- Event Summary: Cells in the partition have different complex
profiles.
- Event Class: System
- Problem Description:
Cell boards in the same partition have
different complex profiles. The partition will be rebooted and cannot be fully
booted until the problem is resolved. The data field is a bitmap of cells
where cell 0 is the least significant bit and cell 63 is the most significant
bit. A one on a cell's bit indicated that the cell has a complex profile that
did not match that of the core cell.
- Cause / Action:
Cause: An error occurred which prevented the
complex profiles from being distributed properly. Action: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause: A hardware problem exists with MP or PDHC hardware.
Action: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2199
- Severity: FATAL
- Event Summary: Cells in the partition have different complex
profiles.
- Event Class: System
- Problem Description:
Cell boards in the same partition have
different complex profiles. The partition will be rebooted and cannot be fully
booted until the problem is resolved. The data field is a bitmap of cells
where cell 0 is the least significant bit and cell 63 is the most significant
bit. A one on a cell's bit indicated that the cell has a complex profile that
did not match that of the core cell.
- Cause / Action:
Cause: An error occurred which prevented the
complex profiles from being distributed properly. Action: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause: A hardware problem exists with MP or PDHC hardware.
Action: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2200
- Severity: MAJOR
- Event Summary: Unable to access core I/O data on a cell that
rendezvoused with the partition.
- Event Class: System
- Problem Description:
PDC was unable to access data on another
cell that rendezvoused with the partition. The executing cell will be reset.
The data field contains the physical location of the cell that will be reset.
- Cause / Action: Cause: Hardware problem with the main backplane.
Action: Contact HP Support to confirm that the main backplane is functioning
properly. Cause: Hardware problem with the cell board, CPU, or PDH riser card.
Action: Contact HP Support to confirm the cell board, CPUs, and PDH riser card
are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2201
- Severity: FATAL
- Event Summary: Unable to read a cell board register on another cell
board
- Event Class: System
- Problem Description:
Unable to read a cell board register on a
cell board that rendezvoused with the executing cell. All the cells that have
rednezvoused (i.e. the entire partition) will be reset.
- Cause / Action:
Cause: Hardware problem with the main
backplane. Action: Contact HP Support to confirm that the main backplane is
functioning properly. Cause: Hardware problem with the cell board, CPU, or PDH
riser card. Action: Contact HP Support to confirm the cell board, CPUs, and
PDH riser card are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2202
- Severity: FATAL
- Event Summary: The partition console device wasn't ready.
- Event Class: System
- Problem Description:
Before accessing the console, PDC first
checked to see if the console device was ready and found that it was not. The
partition will be reset for reconfiguration. The data field contains the
status from the PDC function that checks the console device.
- Cause / Action:
Cause: Console timed out or PDC could not read
the core I/O card. Action: Make sure that the core I/O card is installed
correctly. Make sure the I/O chassis is installed correctly. Contact HP
Support to confirm the core I/O card and I/O chassis are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2203
- Severity: MAJOR
- Event Summary: The partition console device could not be
configured.
- Event Class: System
- Problem Description:
PDC could not map the console device path
to a PCI functional address (PFA) and therefore, could not configure the
console.
- Cause / Action: Cause: A hardware problem with the core I/O card or
I/O chassis. Action: Make sure that the core I/O card is installed correctly.
Make sure the I/O chassis is installed correctly. Contact HP Support personnel
to confirm the core I/O card and I/O chassis are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2204
- Severity: MAJOR
- Event Summary: A write attempt to a PDC data structure failed.
- Event Class: System
- Problem Description:
A write attempt to a data structure failed,
most likely to another cell in the partition. The data field contains number
of cell whose control structure could not be written. Cause /
Action:
Cause: Hardware problem with the main backplane. Action: Contact HP
Support to confirm that the main backplane is functioning properly. Cause:
Hardware problem with the cell board, CPU, or PDH riser card. Action: Contact
HP Support to confirm the cell board, CPUs, and PDH riser card are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2205
- Severity: FATAL
- Event Summary: Could not write a cell board register on a cell that
has rendezvoused.
- Event Class: System
- Problem Description:
A write attempt to a cell board register
failed for a cell that has rendezvoused into the partition. The cell that
encountered the error will be reset. The data field contains the return value
from PDC function that detected the error.
- Cause / Action:
Cause: Hardware problem with the main
backplane. Action: Contact HP Support to confirm that the main backplane is
functioning properly. Cause: Hardware problem with the cell board, CPU, or PDH
riser card. Note that this may be a problem on the remote cell rather than the
cell that sent the chassis code. Action: Contact HP Support to confirm the
cell board, CPUs, and PDH riser card are functioning properly, for both the
local and remote cells.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2206
- Severity: MAJOR
- Event Summary: A cell board register could not be accessed or
register contents were invalid.
- Event Class: System
- Problem Description:
A cell board register could not be accessed
or the register contents were invalid.
- Cause / Action:
Cause: Hardware problem with the cell board,
CPUs, or PDH riser card. Action: Contact HP Support to confirm the cell board,
CPUs, and PDH riser card are functioning properly. Cause: The MP or PDHC are
malfunctioning. Action: Check communication with the MP. Contact HP Support to
confirm the MP and PDHC are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2207
- Severity: FATAL
- Event Summary: The stable complex profile in this cell has an
invalid sequence ID.
- Event Class: System
- Problem Description:
PDC detected an invalid sequence ID in the
Stable complex profile on this cell. This means that the complex profile is
not valid and that PDC cannot rendezvous the cells into a partition. The cell
will be reset for reconfiguration, allowing another complex profile to be
pushed out before it attempts to boot again.
- Cause / Action: Cause: The cell did not have a valid complex
profile. Action: Push a new complex profile out. Make sure that the utilities
system is still functioning.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2208
- Severity: FATAL
- Event Summary: The stable complex profile in this cell has an
invalid sequence ID.
- Event Class: System
- Problem Description:
PDC detected an invalid checksum in the
stable complex profile on this cell. This means that the complex profile is
not valid and that PDC cannot rendezvous the cells into a partition. The cell
will be reset for reconfiguration, allowing another complex profile to be
pushed out before it attempts to boot again.
- Cause / Action: Cause: The cell did not have a valid complex
profile. Action: Push a new complex profile out. Make sure that the utilities
system is still functioning.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2209
- Severity: MAJOR
- Event Summary: Halting cell because PDC can't determine CPU type or
revision.
- Event Class: System
- Problem Description:
While trying to determine whether the CPU
is supported given the backplane and cell board revision, PDC couldn't access
CPU type or revision, which is supposed to be available through a data
structure.
- Cause / Action: Cause: Hardware problem where either the PDH memory
is bad or the CPU or CC corrupted the write or read to this area. Action:
Contact HP Support personnel to troubleshoot the cell board
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2210
- Severity: MAJOR
- Event Summary: Halting cell because PDC was unable to determine the
operating mode of the sys.
- Event Class: System
- Problem Description:
While trying to determine whether the CPU
is supported on the backplane and cell board revision present, PDC needs to
determine the operating mode of the system because PDC is more lenient if the
system is in Manufacturing mode. This chassis log is sent if PDC wasn't able
to determine whether or not the system is in MFG mode.
- Cause / Action:
Cause: Hardware problem either with PDH memory
or with the CPU or CC corrupting the read or write to the location containing
the operating mode. Action: Contact HP Support personnel to troubleshoot the
cell board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2212
- Severity: MAJOR
- Event Summary: Halting cell because PDC doesn't support the main
backplane type.
- Event Class: System
- Problem Description:
While trying to determine whether or not
the CPU is supported on the backplane and cell board revision present, PDC
obtained an invalid backplane type.
- Cause / Action: Cause: firmware is running on a machine with a
different backplane type than it supports Action: ensure firmware version is
correct for machine type and whether or not new backplane type might require
new firmware C2: Hardware problem either with PDH memory or with the CPU or CC
corrupting the read or write led to PDC obtaining an invalid backplane type
value. A2: Contact HP Support personnel to troubleshoot the cell board. cause
2: PDC could not correctly determine the backplane type action2:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2213
- Severity: MAJOR
- Event Summary: MBE free area of memory for late CPU self tests could
not be found
- Event Class: System
- Problem Description:
A multi-bit error free area of memory,
large enough for late CPU self tests, could not be found. The cell will be
halted.
- Cause / Action: Cause: Excessive errors due to defective DIMMs
Coherency controller seating Action: Reseat DIMM(s) and reboot Replace DIMM(s)
based on PDT entries or previous chassis codes Contact HP Support personnel to
troubleshoot the Cell board
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2214
- Severity: MAJOR
- Event Summary: Couldn't find a large enough area of error free
memory to load PDC
- Event Class: System
- Problem Description:
Couldn't find a large enough area of error
free memory to load PDC into memory. The cell will be hard halted.
- Cause / Action: Cause: Excessive errors in memory. Action:
Reseat/troubleshoot DIMMs. Contact HP Support personnel to troubleshoot cell
board
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2215
- Severity: MAJOR
- Event Summary: Couldn't find large enough area of error free memory
to load PDC
- Event Class: System
- Problem Description:
Couldn't find a large enough area of error
free memory to load PDC ROM into memory. Data field contains the ROM
relocation address. Cell will halt. The cell will be halted.
- Cause / Action:
Cause: Excessive errors in memory. Action:
Reseat/troubleshoot DIMMs. Contact HP Support personnel to troubleshoot cell
board
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2216
- Severity: FATAL
- Event Summary: More than one cell in a partition believes it is the
core cell
- Event Class: System
- Problem Description:
More than one cell in a partition believes
it is the core cell. Chassis code from one core cell will contain the physical
location of the other core cell in its data field.
- Cause / Action:
Cause: Cells in partition have different
partition configuration data Fabric problem prevents the cells from seeing one
another Action: Fix utilities problem/push out new complex profile Check
fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2217
- Severity: FATAL
- Event Summary: On a boot action of "Attempt next path", PDC tries
invalid next path.
- Event Class: System
- Problem Description:
On a boot action of "Attempt next path",
PDC tries invalid next path. The data field contains the invalid path. Cause /
Action:
Cause: Should never happen, but it if does it would be caused by an
internal PDC error. Action: Check for PDC upgrade Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2218
- Severity: MAJOR
- Event Summary: Error occurred in NVM initialization
- Event Class: System
- Problem Description:
Error occurred in NVM initialization. Cell
will reset and halt.
- Cause / Action: Cause: NVM error Action: Contact HP Support
personnel to troubleshoot the cell board
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2219
- Severity: MAJOR
- Event Summary: AutoBoot attempt failed, so stopping at BCH
according to Boot Action
- Event Class: System
- Problem Description:
System was configured for AutoBoot. This
chassis code indicates that the system attempted and failed to boot off a path
whose Boot Action specified to attempt a boot and if the boot failed, return
to BCH.
- Cause / Action: Cause: No boot disk or defective boot disk. Invalid
Path Action: Insert a good disk Specify a valid path from the BCH MAin menu
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2220
- Severity: FATAL
- Event Summary: Partition numbers mismatch in stable and partition
configuration data
- Event Class: System
- Problem Description:
The stable complex profile can be used to
get a cell's partition number because it contains the partition number to
which the cell is assigned. The partition configuration data also holds a
partition number. PDC compares the partition number in the partition
configuration data with the partition to which the cell is assigned in the
stable complex profile. If the two partition numbers are not the same, PDC
sends this chassis code and resets the partition for reconfiguration. The data
field is the partition number from the partition configuration data.
- Cause / Action: Cause: The system utilities did not update ICM to
contain the partition configuration data for this cell's partition. ICM is
corrupted and the utilities system cannot write a corrected partition
configuration data to the cell Action: Make sure the system utilities are
working correctly. Make sure that PDH self tests are enabled. Try rebooting the
cell. Try pushing a new complex profile to the cell Contact HP Support
personnel to troubleshoot the cell board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2221
- Severity: FATAL
- Event Summary: Cell has been assigned to partition number not
supported by MP
- Event Class: System
- Problem Description:
The stable complex profile has a field that
tells PDC the maximum number of partitions that the system utilities will
support. If PDC finds itself in a partition with a partition number greater
than the maximum number of partitions supported by the system utilities, PDC
will reset the cell for reconfiguration, allowing another stable complex
profile to be pushed out to fix the problem.
- Cause / Action:
Cause: The stable complex profile contains an
illegal partition number for this cell. Action: Run SAM and remove the
partition with the illegal number.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2222
- Severity: FATAL
- Event Summary: Cell has different Stable Complex Profile sequence
ID then core cell
- Event Class: System
- Problem Description:
PDC checks the sequence ID of the Stable
complex profile for each of the cells in the partition. If they do not match,
this means that the cells have different complex profiles. Since the complex
profiles are used to assign resources to a partition, PDC, at this point, is
unable to tell which version of the complex profile is correct. The partition
cannot be booted until this problem is resolved. The data field contains the
stable complex profile sequence ID from the cell that did not match the core
cell.
- Cause / Action: Cause: The core cell detected that a cell in its
partition has a different complex profile than it does. Action: Look for a
chassis code called,BOOT_CORE_CHECK_HCELL_PROFILE, to see which cell's complex
profile was being checked. That cell is the cell that had the inconsistent
complex profile. Make sure the utilities system is functioning and reboot the
partition. If the reboot does not solve the problem, make sure PDH tests are
enabled. Replace the cell with the inconsistent complex profile. Change core
cells to see if the core cell is the cell that has the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2223
- Severity: FATAL
- Event Summary: Cell has different Stable Complex Profile checksum
then core cell
- Event Class: System
- Problem Description:
PDC checks the sequence ID of the Stable
complex profile for each of the cells in the partition. If they do not match,
this means that the cells have different complex profiles. Since the complex
profiles are used to assign resources to a Partition, PDC, at this point, is
unable to tell which version of the complex profile is correct. The partition
cannot be booted until this problem is resolved. The data field contains the
checksum of the slave cell's stable complex profile.
- Cause / Action: Cause: The core cell detected that a cell in its
partition has a different complex profile than it does. Action: Look for a
chassis code called,BOOT_CORE_CHECK_HCELL_PROFILE, to see which cell's complex
profile was being checked. That cell is the cell that had the inconsistent
complex profile. Make sure the utilities system is functioning and reboot the
partition. If the reboot does not solve the problem, make sure PDH tests are
enabled. Replace the cell with the inconsistent complex profile. Change core
cells to see if the core cell is the cell that has the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2224
- Severity: FATAL
- Event Summary: Cell has different Dynamic Complex Profile sequence
ID than core cell
- Event Class: System
- Problem Description:
PDC checks the sequence ID of the Dynamic
complex profile for each of the cells in the partition. If they do not match,
this means that the cells have different complex profiles. At this point, is
unable to tell which version of the complex profile is correct. The partition
cannot be booted until this problem is resolved. This chassis code indicates
all of the cells that have complex profiles that do not match the core cell's.
The data field is sequence ID from the dynamic complex profile for the slave
cell that did not match the core cell.
- Cause / Action:
Cause: The core cell detected that a cell in
its partition has a different complex profile than it does. Action: Look for a
chassis code called,BOOT_CORE_CHECK_HCELL_PROFILE, to see which cell's complex
profile was being checked. That cell is the cell that had the inconsistent
complex profile. Make sure the utilities system is functioning and reboot the
partition. If the reboot does not solve the problem, make sure PDH tests are
enabled. Replace the cell with the inconsistent complex profile. Change core
cells to see if the core cell is the cell that has the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2225
- Severity: FATAL
- Event Summary: Cell has different Dynamic Complex Profile checksum
then core cell
- Event Class: System
- Problem Description:
PDC checks the checksum of the Dynamic
complex profile for each of the cells in the partition. If they do not match,
this means that the cells have different complex profiles. At this point, is
unable to tell which version of the complex profile is correct. The partition
cannot be booted until this problem is resolved. This chassis code indicates
all of the cells that have complex profiles that do not match the core cell's.
The data field is the checksum of the slave cell's dynamic complex profile.
- Cause / Action: Cause: The core cell detected that a cell in its
partition has a different complex profile than it does. Action: Look for a
chassis code called,BOOT_CORE_CHECK_HCELL_PROFILE, to see which cell's complex
profile was being checked. That cell is the cell that had the inconsistent
complex profile. Make sure the utilities system is functioning and reboot the
partition. If the reboot does not solve the problem, make sure PDH tests are
enabled. Replace the cell with the inconsistent complex profile. Change core
cells to see if the core cell is the cell that has the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2226
- Severity: FATAL
- Event Summary: Cell has different Partition Config Data sequence ID
then core cell
- Event Class: System
- Problem Description:
PDC checks the sequence ID of the partition
configuration data for each of the cells in the partition. If they do not
match, this means that the cells have different complex profiles. At this
point, is unable to tell which version of the complex profile is correct. The
partition cannot be booted until this problem is resolved. This chassis code
indicates all of the cells that have complex profiles that do not match the
core cell's. The data field is the sequence ID of the partition configuration
data for the slave cell.
- Cause / Action:
Cause: The core cell detected that a cell in
its partition has a different complex profile than it does. Action: Look for a
chassis code called,BOOT_CORE_CHECK_HCELL_PROFILE, to see which cell's complex
profile was being checked. That cell is the cell that had the inconsistent
complex profile. Make sure the utilities system is functioning and reboot the
partition. If the reboot does not solve the problem, make sure PDH tests are
enabled. Replace the cell with the inconsistent complex profile. Change core
cells to see if the core cell is the cell that has the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2227
- Severity: FATAL
- Event Summary: Cell has different Partition Config Data checksum
then core cell
- Event Class: System
- Problem Description:
PDC checks the checksum of the partition
configuration data for each of the cells in the partition. If they do not
match, this means that the cells have different complex profiles. At this
point, is unable to tell which version of the complex profile is correct. The
partition cannot be booted until this problem is resolved. This chassis code
indicates all of the cells that have complex profiles that do not match the
core cell's. The data field is the checksum for the PD profile of the slave
cell.
- Cause / Action: Cause: The core cell detected that a cell in its
partition has a different complex profile than it does. Action: Look for a
chassis code called,BOOT_CORE_CHECK_HCELL_PROFILE, to see which cell's complex
profile was being checked. That cell is the cell that had the inconsistent
complex profile. Make sure the utilities system is functioning and reboot the
partition. If the reboot does not solve the problem, make sure PDH tests are
enabled. Replace the cell with the inconsistent complex profile. Change core
cells to see if the core cell is the cell that has the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2228
- Severity: FATAL
- Event Summary: PDC detected that cells in partition have different
complex profiles
- Event Class: System
- Problem Description:
PDC on the core cell checks the sequence
IDs and checksums for all of the complex profiles (stable, dynamic and
partition data) on each of the cells in the partition. If they do not match
the core cell's profiles, this means that the cells have different complex
profiles. At this point, is unable to tell which version of the complex
profile is correct. The partition cannot be booted until this problem is
resolved. This chassis code data field contains a bitmap of all of the cells
that have complex profiles that do not match the core cell's, where cell 0 is
the least significant bit and cell 63 is the most significant bit. Cause /
Action:
Cause: The complex profiles for the slaves cells in the partition
do not match the complex profiles on the core cell. Action: Try to push out a
new complex profile. Check for failures in the system utilities. As a last
resort, try pushing out a genesis complex profile.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2229
- Severity: FATAL
- Event Summary: Unable to write alive set to all coherency
controllers in partition
- Event Class: System
- Problem Description:
Once PDC in the core cell has determined
which cells are going to make it into the partition, it programs the coherency
controllers on each of the slave cells and its own cell. If the write to the
cell fails, PDC will send this chassis code, which contains, in the data
field, the status from the function that failed to write to the coherency
controllers.
- Cause / Action: Cause: A Coherency controller (CC) in the partition
failed a read after write test. Action: Reboot the partition. Look for chassis
codes that indicate a primary or secondary CC error. Cause: Core Cell lost
communication with a remote cell whose coherency controller it was attempting
to write Action: Check for problems with the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2230
- Severity: FATAL
- Event Summary: Failed while writing coherency set registers in
coherency controller
- Event Class: System
- Problem Description:
Once PDC in the core cell has determined
which cells are going to make it into the partition, it programs the coherency
controllers on each of the slave cells and its own cell. If the write to the
cell fails, PDC will send this chassis code, which contains, in the data
field, the alive set of cells.
- Cause / Action: Cause: A coherency controller (CC) in the partition
failed a read after write test. Action: Reboot the partition. Look for chassis
codes that indicate a primary or secondary CC error. Cause: Core Cell lost
communication with a remote cell whose coherency controller it was attempting
to write Action: Check for problems with the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2231
- Severity: MAJOR
- Event Summary: PDC could not read the time of day on the RTC
- Event Class: System
- Problem Description:
PDC could not read the time of day (TOD) on
the real time clock (RTC). Data field contains the status returned from the
attempt to read the TOD.
- Cause / Action: Cause: Semaphore problem. Action: Contact HP
Support personnel to troubleshoot the cell board (suspect PDH) Check for PDC
upgrade for possible internal software problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2232
- Severity: MAJOR
- Event Summary: A CPU's data area has overflowed its bounds.
- Event Class: System
- Problem Description:
A CPU's data area has overflowed its
bounds. The data field contains the physical location of the CPU whose data
area overflowed.
- Cause / Action: Cause: Hardware problem with the CPU, cell board,
or CC. Action: Contact HP Support to confirm the CPU, cell board and CC are
function properly. Update PDC if a version is available to fix this problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2233
- Severity: MAJOR
- Event Summary: A CPU is being stopped and deconfigured.
- Event Class: System
- Problem Description:
A CPU is being stopped and deconfigured.
See the previous IPMI events to determine the reason that the CPU is being
deconfigured. The data field is the physical location of the CPU being
deconfigured.
- Cause / Action: Cause: A CPU is being stopped and deconfigured.
Action: See previous IPMI events to determine the reason that the CPU is being
deconfigured. Contact HP Support personnel to confirm the CPU is functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2234
- Severity: MAJOR
- Event Summary: An invalid CPU number was passed to a procedure to
deconfigure the CPU
- Event Class: System
- Problem Description:
An invalid CPU number was passed into a
procedure to deconfigure the CPU. The cell will be halted. The data field is
the value of the invalid CPU number. See BOOT_HALT_DUE_TO_PDC_ERROR following
this chassis code for physical location of cell that has been halted.
- Cause / Action: Cause: Hardware problem with the CPU, cell board,
or CC. Action: Contact HP Support to confirm the CPU, cell board and CC are
function properly. Update PDC if a version is available to fix this problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2235
- Severity: MAJOR
- Event Summary: Invalid CPU number passed to procedure to schedule
CPU deconfiguration
- Event Class: System
- Problem Description:
AAn invalid CPU number was passed into a
procedure to deconfigure the CPU. The cell will be halted. The data field is
the value of the invalid parameter.
- Cause / Action: Cause: Hardware problem with the CPU, cell board,
or CC. Action: Contact HP Support to confirm the CPU, cell board and CC are
function properly. Update PDC if a version is available to fix this problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2236
- Severity: MAJOR
- Event Summary: A CPU's stack has overflowed its allocated area
- Event Class: System
- Problem Description:
A CPU's stack has overflowed its allocated
area. The data field contains the physical location of the CPU whose stack
overflowed.
- Cause / Action: Cause: Hardware problem with the CPU, cell board,
or CC. Action: Contact HP Support to confirm the CPU, cell board and CC are
function properly. Update PDC if a version is available to fix this problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2237
- Severity: FATAL
- Event Summary: PDC encountered a fatal error after a boot device
failed.
- Event Class: System
- Problem Description:
PDC encountered a fatal error after a boot
device failed. The partition will be rebooted. The data field contains the
return status from the PDC function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the I/O device,
I/O chassis, or I/O cables between cells and I/O chassis. Action: Contact HP
Support to confirm the I/O device is functioning properly. Contact HP Support
to confirm the I/O chassis is functioning properly. Contact HP Support to
confirm the I/O cables between cells and I/O chassis are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2238
- Severity: MAJOR
- Event Summary: PDC could not access a data structure on the local
cell
- Event Class: System
- Problem Description:
PDC could not access one of its own data
structures on the local cell. The cell will be halted. The data field contains
the return status from the PDC function that encountered the error.
- Cause / Action: Cause: Hardware problem with the PDH riser card.
Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2239
- Severity: MAJOR
- Event Summary: PDC could not access a data structure on the local
cell
- Event Class: System
- Problem Description:
PDC could not access one of its own data
structures on the local cell. The cell will be halted. The data field contains
the return status from the PDC function that encountered the error.
- Cause / Action: Cause: Hardware problem with the PDH riser card.
Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2240
- Severity: FATAL
- Event Summary: Core cell failed to write to a PDC data structure on
all cells in the partition
- Event Class: System
- Problem Description:
Core cell failed to write to a PDC data
structure on all cells in the partition. Data field is the return status from
the PDC function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the main
backplane. Action: Contact HP Support to confirm the main backplane is
functioning properly. Cause: Hardware problem with the cell board, CPU, or PDH
riser card, possibly on another cell in the partition. Action: Look for IPMI
events indicating errors on other cells in the partition. Contact HP Support
to confirm the cell board, CPUs, and PDH riser card are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2241
- Severity: MAJOR
- Event Summary: An error was detected in CC before HPMC handling was
enabled.
- Event Class: System
- Problem Description:
An error was detected in the coherency
controller (CC) before HPMC handling was enabled. The cell will be halted. The
data field is a bit mask where bit numbers correspond to CC block numbers and
a set bit indicates that block logged an error. The least-significant bit is
bit 0.
- Cause / Action: Cause: Hardware problem with the cell board, CPUs,
or CC. Action: Contact HP Support personnel to confirm the cell board is
functioning properly. Contact HP Support personnel to confirm the CPUs and CC
are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2242
- Severity: MAJOR
- Event Summary: The end marker separating two PDC data structures
has been overwritten
- Event Class: System
- Problem Description:
The end marker separating two PDC data
structures has been overwritten. Data field contains the expected value of the
end marker.
- Cause / Action: Cause: Hardware problem with PDH riser card Action:
Contact HP Support to confirm PDH riser card is functioning properly. Upgrade
PDC if a newer version is available to fix this problem. Cause: Hardware
problem with CPU or cell board. Action: Contact HP Support to confirm CPUs and
cell board are functioning properly. Upgrade PDC if a newer version is
available to fix this problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2243
- Severity: MAJOR
- Event Summary: Error occurred accessing a PDC data structure
- Event Class: System
- Problem Description:
Error occurred accessing a PDC data
structure. Depending upon the situation, the cell or entire partition will be
reset. The data field contains the return status for the function that
encountered the error.
- Cause / Action: Cause: Hardware problem with the PDH riser card.
Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2244
- Severity: MAJOR
- Event Summary: PDC could not access a complex profile
- Event Class: System
- Problem Description:
PDC could not access a complex profile.
Depending on when this error occurs, the local cell may be reset, the entire
partition may be reset, or no action will be taken whatsoever. Data field
contains the return status from the function that encountered the error.
- Cause / Action: Cause: An error occurred which prevented the
complex profiles from being distributed properly. Action: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause: A hardware problem exists with MP or PDHC hardware.
Action: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2245
- Severity: MAJOR
- Event Summary: PDC could not access a complex profile
- Event Class: System
- Problem Description:
PDC could not access a complex profile.
Depending on when this error occurs, the local cell may be reset, the entire
partition may be reset, or no action will be taken whatsoever. Data field
contains the return status from the function that encountered the error.
- Cause / Action: Cause: An error occurred which prevented the
complex profiles from being distributed properly. Action: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause: A hardware problem exists with MP or PDHC hardware.
Action: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2246
- Severity: MAJOR
- Event Summary: PDC could not access a complex profile
- Event Class: System
- Problem Description:
PDC could not access a complex profile.
Depending on when this error occurs, the local cell may be reset, the entire
partition may be reset, or no action will be taken whatsoever. Data field
contains the return status from the function that encountered the error.
- Cause / Action: Cause: An error occurred which prevented the
complex profiles from being distributed properly. Action: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause: A hardware problem exists with MP or PDHC hardware.
Action: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2247
- Severity: FATAL
- Event Summary: PDC could not access a CC hardware register.
- Event Class: System
- Problem Description:
PDC could not access a CC hardware
register. The data field contains the physical location of the cell on which
the CC hardware register could not be accessed.
- Cause / Action:
Cause: Hardware problem with the CC. Action:
Contact HP Support to confirm the CC is functioning properly. Cause: Hardware
problem with the CPU or cell board. Action: Contact HP Support to confirm the
CPU or cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2248
- Severity: MAJOR
- Event Summary: PDC could not access a complex profile
- Event Class: System
- Problem Description:
PDC could not access a complex profile. No
action will be taken. Data field contains the return status from the function
that encountered the error.
- Cause / Action: Cause: An error occurred which prevented the
complex profiles from being distributed properly. Action: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause: A hardware problem exists with MP or PDHC hardware.
Action: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2249
- Severity: MAJOR
- Event Summary: PDC could not access a complex profile
- Event Class: System
- Problem Description:
PDC could not access a complex profile. The
cell will be reset.. Data field contains the return status from the function
that encountered the error.
- Cause / Action: Cause: An error occurred which prevented the
complex profiles from being distributed properly. Action: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause: A hardware problem exists with MP or PDHC hardware.
Action: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2250
- Severity: MAJOR
- Event Summary: Error occurred initializing a PDC data structure
- Event Class: System
- Problem Description:
Error occurred initializing a PDC data
structure. The cell will be reset. The data field contains the return status
for the function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser
card. Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2251
- Severity: FATAL
- Event Summary: Error occurred accessing a PDC data structure
- Event Class: System
- Problem Description:
Error occurred accessing a PDC data
structure. The partition will be reset. The data field contains the return
status for the function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser
card. Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2252
- Severity: FATAL
- Event Summary: Error occurred accessing a PDC data structure
- Event Class: System
- Problem Description:
Error occurred accessing a PDC data
structure. The partition will be reset. The data field contains the return
status for the function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser
card. Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2253
- Severity: FATAL
- Event Summary: PDC could not access a complex profile
- Event Class: System
- Problem Description:
PDC could not access a complex profile. The
partition will be reset.. Data field contains the return status from the
function that encountered the error.
- Cause / Action: Cause: An error occurred which prevented the
complex profiles from being distributed properly. Action: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause: A hardware problem exists with MP or PDHC hardware.
Action: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2254
- Severity: FATAL
- Event Summary: PDC does not recognize CC chip revision.
- Event Class: System
- Problem Description:
PDC does not recognize the CC chip
revision. The cell will be halted. The data field physical location of the
cell that is having the CC revision problem.
- Cause / Action:
Cause: Hardware problem with the CC. Action:
Contact HP Support to confirm the CC is functioning properly. Upgrade PDC if a
newer version is available to fix this problem. Cause: Hardware problem with
the CPU or cell board. Action: Contact HP Support to confirm the CPU or cell
board is functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2255
- Severity: FATAL
- Event Summary: Error occurred accessing a PDC data structure
- Event Class: System
- Problem Description:
Error occurred accessing a PDC data
structure. Depending upon the situation the cell or entire partition will be
reset. The data field contains the return status for the function that
encountered the error.
- Cause / Action: Cause: Hardware problem with the PDH riser card.
Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2256
- Severity: MAJOR
- Event Summary: Cell could not communicate with all the other cells
in rendezvous set
- Event Class: System
- Problem Description:
PDC checks to make sure that all of the
cells in the partition rendezvous set can communicate bilaterally. This cell
could communicate with at least one of the other cells in the partition, but
could not communicate with every cell that made the rendezvous. The cell will
reset. Data field is the physical location of the cell.
- Cause / Action:
Cause: This may indicate an intermittent
problem with the main backplane. Action: Contact HP Support to confirm the
main backplane is functioning properly. Contact HP Support to confirm the cell
board is functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2257
- Severity: FATAL
- Event Summary: PDC could not access a complex profile
- Event Class: System
- Problem Description:
PDC could not access a complex profile. The
partition will be reset. Data field contains the return status from the
function that encountered the error.
- Cause / Action: Cause: An error occurred which prevented the
complex profiles from being distributed properly. Action: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause: A hardware problem exists with MP or PDHC hardware.
Action: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2258
- Severity: FATAL
- Event Summary: PDC could not access a complex profile
- Event Class: System
- Problem Description:
PDC could not access a complex profile. The
partition will be reset. Data field contains the return status from the
function that encountered the error.
- Cause / Action: Cause: An error occurred which prevented the
complex profiles from being distributed properly. Action: Create and
distribute a new complex profile using ParMgr on a functional partition in the
complex. Restore the last complex profile using the "CC" command from the MP,
then use ParMgr to create a new complex profile. Generate a genesis complex
profile using the "CC" command from the MP, then use ParMgr to create a new
complex profile. Cause: A hardware problem exists with MP or PDHC hardware.
Action: Contact HP Support to confirm the MP and PDHC are functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2259
- Severity: FATAL
- Event Summary: PDC_IODC failed to read information about the
console
- Event Class: System
- Problem Description:
There was a problem attempting to use an
architected PDC procedure to read information about the console. This failure
in the PDC call is considered fatal, so the partition will be reset. The data
field contains the return value from the function that encountered the error.
- Cause / Action: Cause: PDC procedure failed. Action: Look for
another error IPMI event such as BOOT_CONSOLE_PDC_IODC_HEADER_ERR that
indicates that a problem occurred. Try rebooting the cell and then changing
the core cell. Contact HP Support personnel to confirm the cell board is
functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2260
- Severity: FATAL
- Event Summary: PDC detected an illegal CPU number passed to an
internal function
- Event Class: System
- Problem Description:
An invalid CPU number was passed into an
internal PDC function. The data field contains the invalid parameter. Cause /
Action:
Cause: Hardware failure with CPU, CC or cell board. Action: Contact
HP Support to confirm the CPUs, CC, and cell board are functioning properly.
Update PDC if a version is available to fix this problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2261
- Severity: MAJOR
- Event Summary: Error occurred accessing a PDC data structure
- Event Class: System
- Problem Description:
Error occurred accessing a PDC data
structure. The executing CPU will be stopped. The data field contains the
return status for the function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser
card. Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2262
- Severity: MAJOR
- Event Summary: PDC detected an illegal CPU number passed to an
internal function
- Event Class: System
- Problem Description:
PDC detected an illegal CPU number passed
to an internal function. The data field contains the invalid parameter.
- Cause / Action: Cause: Hardware failure with CPU, CC or cell board.
Action: Contact HP Support to confirm the CPUs, CC, and cell board are
functioning properly. Update PDC if a version is available to fix this
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2263
- Severity: MAJOR
- Event Summary: PDC received an error from the PDHC while trying to
communicate with the MP
- Event Class: System
- Problem Description:
PDC received an error from the PDHC while
trying to communicate with the MP. Default or cached platform configuration
information will be used. Data field contains the error return value from the
PDHC.
- Cause / Action: Cause: Hardware problem with the MP or PDHC.
Action: Contact HP Support to confirm the manageability subsystem is
functioning properly. Cause: PDHC, MP, and/or PDC firmware are not compatible.
Action: Upgrade PDHC, MP, and/or PDC firmware to supported and compatible
revisions.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2264
- Severity: MAJOR
- Event Summary: Error occurred accessing a PDC data structure
- Event Class: System
- Problem Description:
Error occurred accessing a PDC data
structure. The cell will be halted. Error occurred while deconfiguring a CPU.
The data field contains the physical location of the CPU.
- Cause / Action:
Cause: Hardware problem with the PDH riser
card. Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2265
- Severity: MAJOR
- Event Summary: PDC detected an illegal CPU number passed to an
internal function
- Event Class: System
- Problem Description:
An invalid CPU number was passed into an
internal PDC function. The cell will be halted. That data field contains the
physical location of the cell being halted.
- Cause / Action:
Cause: Hardware failure with CPU, CC or cell
board. Action: Contact HP Support to confirm the CPUs, CC, and cell board are
functioning properly. Update PDC if a version is available to fix this
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2266
- Severity: MAJOR
- Event Summary: Cell board has a CPU with an unsupported CPU
revision
- Event Class: System
- Problem Description:
PDC found a CPU on the cell board which has
an unsupported CPU revision. The cell will be halted. The data field reports
the physical location of the cell.
- Cause / Action: Cause: PDC found a CPU with an unsupported CPU
revision Action: Contact HP Support to confirm the cell board is functioning
properly, install supported CPUs, or upgrade PDC to a version that supports
the installed CPUs.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2267
- Severity: MAJOR
- Event Summary: PDC and the manageability subsystem interface
revisions do not match
- Event Class: System
- Problem Description:
PDC and the manageability subsystem
interface revisions do not match. The cell will be halted. The data contents
have the format: 0x5500PPGG5000ppgg where: PP = Utilities' PDHC/PDC revision
number GG = Utilities' MP/PDC revision number pp = PDC's PDHC/PDC revision
number gg = PDC's MP/PDC revision number
- Cause / Action: Cause: Incorrect PDC and/or PDHC firmware
installed. Action: Install compatible versions of PDC and/or PDHC firmware.
Cause: Hardware problem with the PDH riser card. Action: Contact HP Support to
confirm the PDH riser card is functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2268
- Severity: MAJOR
- Event Summary: PDC received an error from the PDHC while trying to
communicate with the MP
- Event Class: System
- Problem Description:
PDC received an error from the PDHC while
trying to communicate with the MP. The cell will be halted.
- Cause / Action:
Cause: Hardware problem with the MP or PDHC.
Action: Contact HP Support to confirm the manageability subsystem is
functioning properly. Cause: PDHC, MP, and/or PDC firmware are not compatible.
Action: Upgrade PDHC, MP, and/or PDC firmware to supported and compatible
revisions.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2269
- Severity: MAJOR
- Event Summary: PDC detected an invalid complex profile change.
- Event Class: System
- Problem Description:
PDC detected an invalid complex profile
change. The cell will be reset. Data field contains the return status from the
function that encountered the error. The data field contains the partition
configuration data sequence ID.
- Cause / Action: Cause: An error occurred which prevented the
complex profiles from being created properly. Action: Create and distribute a
new complex profile using ParMgr on a functional partition in the complex.
Restore the last complex profile using the "CC" command from the MP, then use
ParMgr to create a new complex profile. Generate a genesis complex profile
using the "CC" command from the MP, then use ParMgr to create a new complex
profile. Check for an OS patch or firmware upgrade that fixes this problem.
Cause: A hardware problem exists with MP or PDHC hardware. Action: Contact HP
Support to confirm the MP and PDHC are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2270
- Severity: MAJOR
- Event Summary: PDC detected an illegal CPU number passed to an
internal function
- Event Class: System
- Problem Description:
An invalid CPU number was passed into an
internal PDC function. Previous IPMI events may indicate why a CPU was being
deconfigured. Depending upon the situation, either the cell will be halted or
the entire partition will be reset. The data field contains the invalid
parameter.
- Cause / Action: Cause: Hardware failure with CPU, CC or cell board.
Action: Contact HP Support to confirm the CPUs, CC, and cell board are
functioning properly. Update PDC if a version is available to fix this
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2271
- Severity: MAJOR
- Event Summary: Cell attempting rendezvous had different main
backplane type than core cell
- Event Class: System
- Problem Description:
A cell attempting to rendezvous in a
partition had a different main backplane type than the core cell. Differing
cell will be reset. The data field contains the main backplane type that
differed from the core cell's.
- Cause / Action: Cause: Main backplanes are misconfigured. Action:
Contact HP Support to confirm main backplanes are setup and functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2272
- Severity: MAJOR
- Event Summary: Cell attempting rendezvous has CPU HVERSION
different than core cell
- Event Class: System
- Problem Description:
A cell attempting to rendezvous in a
partition had Processor Module HVERSION that differed from the core cell's.
Differing cell will be reset. The data field contains the HVERSION that
differed from the core cell.
- Cause / Action: Cause: Partition was created with incompatible cell
boards. Action: Reassign cells into partitions with compatible cell boards.
Cause: CPU or cell board is misconfigured or not functioning properly. Action:
Contact HP Support to confirm the CPUs and cell boards are configured and
functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2273
- Severity: MAJOR
- Event Summary: Cell has CPUs with different CPU speed than core
cell
- Event Class: System
- Problem Description:
A cell attempting to rendezvous in a
partition had a different CPU speed than the core cell for that partition.
Differing cell will be reset. The data field contains the speed of the CPUs in
the cell that differs from the core cell.
- Cause / Action:
Cause: Partition was created with incompatible
cell boards. Action: Reassign cells into partitions with compatible cell
boards. Cause: CPU or cell board is misconfigured or not functioning properly.
Action: Contact HP Support to confirm the CPUs and cell boards are configured
and functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2274
- Severity: MAJOR
- Event Summary: A cell had a different version of PDC than the core
cell.
- Event Class: System
- Problem Description:
A cell attempting to rendezvous in a
partition had a different PDC revision than the core cell for that partition.
The cell with the PDC revision differing from the core cell's will be reset.
The data field contains the PDC revision of the cell that differs.
- Cause / Action: Cause: Cells in a partition have different PDC
revisions Action: Upgrade PDC to the same revision on all cells in the
partition. Cause: Partition was created with incompatible cell boards. Action:
Reassign cells into partitions with compatible cell boards.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2275
- Severity: MAJOR
- Event Summary: Cell(s) in the dead set could not be reset.
- Event Class: System
- Problem Description:
Cell(s) in the dead set could not be reset
by the core cell.
- Cause / Action: Cause: Fabric or PDC bug Action: If intermittent
problem, check fabric. If repetitive, check for PDC upgrade.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2276
- Severity: MAJOR
- Event Summary: The scratch RAM test failed.
- Event Class: System
- Problem Description:
The scratch RAM test failed. This is most
likely a failure in the scratch RAM and should be replaced. The cell will be
halted. Data field contains the physical location of the cell with the
failure.
- Cause / Action: Cause: Bad scratch RAM Action: Contact HP Support
personnel to troubleshoot the cell board
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2277
- Severity: MAJOR
- Event Summary: Could not read the Complex Serial number or write it
to the SCSI Parms area.
- Event Class: System
- Problem Description:
There was an error while writing the
Complex Serial Number, during the validation of the PDC NVRAM SCSI Parms area
at boot time. The cell will Reset.
- Cause / Action: Cause: The local Cell Global or Cell Micro
semaphores were not locked. Or the target cell global semaphore was not
locked. Action: Capture logs, contact HP Support Cause: Couldn't read the
Complex Serial Number from the Complex Profile. Action: : Capture logs,
contact HP Support Cause: Couldn't get the address of the SCSI Parms area in
PDC NVRAM. Action: : Capture logs, contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2278
- Severity: MAJOR
- Event Summary: Could not read the SCSI Parms Layout Version.
- Event Class: System
- Problem Description:
Could not access the PDC NVRAM SCSI Parms
area. The cell will reset!
- Cause / Action: Cause: Couldn't get the address of the SCSI Parms
area in PDC NVRAM. Action: Capture logs, contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2279
- Severity: MAJOR
- Event Summary: Could not read the SCSI Parameters Checksum.
- Event Class: System
- Problem Description:
Could not access the PDC NVRAM SCSI Parms
area. The cell will reset!
- Cause / Action: Cause: Couldn't get the address of the SCSI Parms
area in PDC NVRAM. Action: Capture logs, contact HP Support Cause: Calculation
of SCSI Parms checksum failed Action: Upgrade PDC, capture logs, contact HP
Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2280
- Severity: MAJOR
- Event Summary: Could not get access to the SCSI Parms area of PDC
NVRAM.
- Event Class: System
- Problem Description:
During boot, the SCSI Parameters area of
PDC NVRAM was found to be unavailable. The area could not be validated. The
cell will Reset. Data field contains failure from function call to access SCSI
Parameters area.
- Cause / Action: Cause: Couldn't get the address of the SCSI Parms
area in PDC NVRAM. Action: Upgrade PDC if available, capture logs, contact HP
Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2281
- Severity: MAJOR
- Event Summary: Clearing the SCSI Parms area of PDC NVRAM failed.
- Event Class: System
- Problem Description:
Could not re-initialize the SCSI Parms area
of PDC NVRAM. The area has not been cleared. The cell will reset. Cause /
Action:
Cause: The appropriate semaphores were not acquired. Action: Capture
logs, contact HP Support Cause: Writing the Complex Serial Number failed.
Possibly couldn't access the Complex Profile. Action: Capture logs, contact
PDC team. Cause: The SCSI Parms checksum algorithm failed Action: Capture
logs, contact PDC team. Cause: Couldn't get the address to the SCSI NVM area.
Action: Capture logs, contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2282
- Severity: MAJOR
- Event Summary: There was a failure while getting or releasing a
semaphore.
- Event Class: System
- Problem Description:
The SCSI Parms proc needs 4 different
semaphores: the Cell Local Semaphore, Local Cell's Global Semaphore, Target
Cell's Global Semaphore, Micro Semaphore. This chassis code indicates that an
unknown error occurred while either getting or releasing one of the
semaphores.
- Cause / Action: Cause: Error accessing a semaphore Action: Capture
chassis logs. Document events that led up to the error. Contact HP Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2283
- Severity: MAJOR
- Event Summary: The Cell Global Semaphore was not locked.
- Event Class: System
- Problem Description:
Bootstrap did not own the Cell Global
Semaphore when it verified that the SCSI parms area has been initialized. The
cell will reset.
- Cause / Action: Cause: Could not obtain the Micro Semaphore Action:
Reset. If that does not resolve issue capture Logs, contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2284
- Severity: MAJOR
- Event Summary: Could not release the Micro Semaphore after
initializing the SCSI Parms area.
- Event Class: System
- Problem Description:
Could not release the Micro Semaphore after
initializing the SCSI Parms area.
- Cause / Action: Cause: The semaphore is owned by another entity
Action: Reset Cause: PDC bug Action: Capture logs, contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2285
- Severity: FATAL
- Event Summary: PDC tried to tell a slave cell monarch CPU to do
something and failed.
- Event Class: System
- Problem Description:
The core cell send a command to the slave
cells and failed.
- Cause / Action: Cause: The core cell could not communicate with the
slave cells. There is an intermittent problem with the fabric. Action: Check
the fabric for intermittent problems.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2286
- Severity: MAJOR
- Event Summary: Halting cell because this code shouldn't execute
before monarch selection.
- Event Class: System
- Problem Description:
There is a per-cell flag that indicates
whether or not the deconfig bytes are valid in an internal PDC data structure.
The code that sets this flag should therefore be called once per boot. PDC
expects the monarch CPU to be the only CPU that executes this code. This
chassis log indicates that a monarch has not yet been selected. This is a PDC
bug.
- Cause / Action: Cause: PDC bug. Action: Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2287
- Severity: MAJOR
- Event Summary: Halting cell because PDC failed a read-after-write
check to a Dillon register.
- Event Class: System
- Problem Description:
While trying to set a flag in a cell board
register to indicate that the deconfig bytes are now valid in the
CELL_CPU_STATE structure, PDC failed on the read-after-write to that register.
Data field contains the value PDC expected to read from the register (i.e.,
the value just written to it).
- Cause / Action:
Cause: Hardware problem with the cell board,
but could be that CC or the CPU corrupted the write or read. Action: Contact
HP Support personnel to troubleshoot the cell board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2288
- Severity: MAJOR
- Event Summary: PDC could not set the time of day on the RTC.
- Event Class: System
- Problem Description:
PDC could not set the time of day on the
RTC. Data field contains the status returned from the attempt to set the TOD.
- Cause / Action: Cause: Semaphore problem. Action: Contact HP
Support personnel to troubleshoot cell board or to check for PDC upgrade if
possible software problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2289
- Severity: FATAL
- Event Summary: The core cell could not read the cell state of a
slave cell.
- Event Class: System
- Problem Description:
The core cell could not read the cell state
of a slave cell. The core cell is waiting for the monarch CPU on all of the
slave cells to change their cell state to indicate that the slave cell has
entered the slave cell rendezvous after the core cell is selected. Data field
contains the physical location of the slave cell that could not be read.
- Cause / Action: Cause: There is an intermittent problem with the
fabric. Action: Look for problems in the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2290
- Severity: FATAL
- Event Summary: Slave cell picked a different core cell than the one
that wrote to it
- Event Class: System
- Problem Description:
The slave cells wait for the core cell to
write its cell number to their micro general-purpose register 1. When the
slaves see that the core cell has written its cell number to this register, it
compares the core cell number in the micro general-purpose register 1 with the
core cell that was selected by the slave cell. If the core cells don't match,
then the slave cell knows that there is a split-brain problem where the cells
did not rendezvous properly. Data Field: The core cell that wrote its number
to this cell's micro general-purpose register 1.
- Cause / Action:
Cause: There is a split brain problem. Action:
Look for fabric problems. Contact HP support if this can't be resolved
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2291
- Severity: MAJOR
- Event Summary: Slave CPU has noticed the monarch CPU has failed and
will deconfig it
- Event Class: System
- Problem Description:
A slave CPU has noticed that the monarch
CPU has failed and will deconfigure it. The cell will be reset.
- Cause / Action:
Cause: The monarch CPU has failed. A slave CPU
has noticed this and will deconfigure the monarch CPU and reset the cell.
Action: Contact HP Support personnel to troubleshoot the CPU/cell board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2292
- Severity: MAJOR
- Event Summary: Write to the Speedy Boot Data Structure Failed
- Event Class: System
- Problem Description:
Write to the Speedy Boot Data Structure
Failed. The most likely reason for this failure is that a remote cell could
not be reached. Data field contains the physical location of the cell that
could not be written.
- Cause / Action: Cause: Fabric problem. CC is defective Action:
Contact HP Support personnel to troubleshoot fabric connections or the cell
board
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2293
- Severity: MAJOR
- Event Summary: Halting cell because operating mode should be
available by this point.
- Event Class: System
- Problem Description:
During boot, PDC sends a command to the
Utilities to get "platform configuration info", which includes the operating
mode. This chassis log is sent if the command to the Utilities completed in
error and PDC doesn't have valid cached values.
- Cause / Action: Cause: PDH memory problem or other cell problem in
which the values previously cached were corrupted. Action: Contact HP Support
personnel to troubleshoot the cell board. Cause: PDC has a bug in which it
didn't write the cached values and validate them correctly or it read the
cached value incorrectly. Action: Upgrade PDC if newer PDC is known to have
fixed such a problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2294
- Severity: MAJOR
- Event Summary: Halting cell because PDC couldn't write the cached
operating mode.
- Event Class: System
- Problem Description:
The Utilities system tells PDC what the
operating mode is (Mfg or Normal), among other things, and PDC then writes
this value to a data structure in NVM to cache it. This chassis log is sent if
PDC can't write the mode to the data structure. After writing the cached
value, PDC does a read-after-write check. Cell will hard halt if it
experiences this failure.
- Cause / Action: Cause: Hardware problem like PDH memory or
corrupted reads and writes. Action: Contact HP Support personnel to
troubleshoot the cell board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2295
- Severity: FATAL
- Event Summary: PDC could not synchronize the cells' RTCs with the
core cell's RTC
- Event Class: System
- Problem Description:
PDC synchronizes all of the Real Time
Clocks (RTCs) on each cell in the PD with the core cell's RTC. If this fails,
this chassis code is sent before the PD is reset.
- Cause / Action:
Cause: Look for either of the following chassis
codes and their cause action statements: CC_BOOT_READ_TOD_FAILED
CC_BOOT_SET_TOD_FAILED These chassis codes will contain status information
indicating why reading the core cell's RTC or setting the slave cells' RTCs
failed. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2296
- Severity: MAJOR
- Event Summary: Cell left another cell behind and it missed the
partition rendezvous
- Event Class: System
- Problem Description:
Before trying to rendezvous with the other
cells assigned to the PD, a cell will check to make sure that none of the
other cells in the partition have left this cell behind. Data Field: The first
cell that this cell noticed had left this cell behind
- Cause / Action:
Cause: This cell did not boot quickly enough to
rendezvous with the other cells in the PD. Action: Try to figure out why the
cell that sent this chassis code booted so late. See if the cell was powered
up much later than the other cells in the PD. Look for chassis codes that
indicate that the cell found a problem. See if the other cell whose physical
location is in the data field of this chassis code is hung. Try resetting all
of the cells in the partition using the GSP commands so that the partition is
reset almost simultaneously.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2297
- Severity: FATAL
- Event Summary: Other cells in partition did not create local
rendezvous set in time
- Event Class: System
- Problem Description:
PDC creates a rendezvous set that consists
of all of the cells that can communicate bilaterally. Each cell has to do this
at the same time. If some of the cells do not create their local rendezvous
set and make it available to the other cells in time for them to make their
rendezvous set, the cells that are waiting will timeout and send this chassis
code. The data field of the chassis code contains the cells that delivered
their local rendezvous set in time. The data field is a bitmap of cells where
cell 0 is the least significant bit and cell 63 is the most significant bit. A
one on a cell's bit indicates that the cell delivered its local rendezvous set
in time.
- Cause / Action: Cause: Some of the cells in the partition did not
deliver their local rendezvous set in time. Action: Look at the data field of
the chassis code. Find the cells that are configured to be in the PD that did
not deliver their rendezvous set in time and look for problems in those cells
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2298
- Severity: FATAL
- Event Summary: A cell was unable to access data on the core cell.
- Event Class: System
- Problem Description:
PDC could not access data in the core
cell's data structure. This chassis code is probably a result of a failed
attempt to walk the fabric to the core cell.
- Cause / Action: Cause: There was an intermittent problem in the
fabric and a slave cell could not reach the core cell. Action: Look for
problems in the fabric. Try rebooting the partition.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2299
- Severity: MAJOR
- Event Summary: Received an unexpected interrupt during boot
- Event Class: System
- Problem Description:
Received an unexpected interrupt during
boot. The cell or partition will be reset. Data Field: The interrupt number of
the unexpected interrupt. 1-HPMC 2-Power Failure Interrupt 3-Recovery Counter
Trap 4-External Interrupt 5-LPMC 6-Instruction TLB Miss Fault / Instruction
Page Fault 7-Instruction Memory Protection Trap 8-Illegal Instruction Trap
9-Break Instruction Trap 10-Privileged Operation Trap 11-Privileged Register
Trap 12-Overflow Trap 13-Conditional Trap 14-Assist Exception Trap 15-Data TLB
Miss Fault / Data Page Fault 16-Non-Access Instruction TLB Miss Fault
17-Non-Access Data TLB Miss Fault / Non-Access Data Page Fault 18-Data Memory
Protection Trap / Unaligned Data Reference Trap 19-Data Memory Break Trap
20-TLB Dirty Bit Trap 21-Page Reference Trap 22-Assist Emulation Trap
23-Higher Privilege Transfer Trap 24-Lower Privilege Transfer Trap 25 Taken
Branch Trap 26-Data Memory Access Rights Trap 27-Data Memory Protection ID
Trap 28-Unaligned Data Reference Trap 29-Performance Monitor Interrupt
- Cause / Action: Cause: Received an unexpected interrupt during
boot. The data field contains the interrupt number of the unexpected
interrupt. Action: Actions taken will be dependent on the interrupt class and
previous chassis codes. If the cause of the interrupt can not be determined
from the previous chassis code, contact HP Support for assistance.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2300
- Severity: MAJOR
- Event Summary: PDC read a cell state that it did not recognize on
another cell.
- Event Class: System
- Problem Description:
PDC read a cell state that it did not
recognize on another cell. Data field contains the unknown cell state. Cell
will reset for reconfiguration.
- Cause / Action: Cause: Bad cell hardware or fabric Action: Contact
HP Support personnel to troubleshoot problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2301
- Severity: FATAL
- Event Summary: Data obtained from a core cell data structure was
unintelligible.
- Event Class: System
- Problem Description:
Data obtained from a core cell data
structure was unintelligible. Data field contains the data PDC could not
interpret.
- Cause / Action: Cause: PDC read invalid data from an internal data
structure Action: Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2302
- Severity: FATAL
- Event Summary: Slave cells timed out before reaching correct cell
state.
- Event Class: System
- Problem Description:
Core Cell detected that slave cell(s) did
not reach correct cell state within allocated time. Data field contains bit
mask of cells present.
- Cause / Action: Cause: Cell hung Action: Root cause cell hang -
investigate previous chassis codes from the hung cell(s). Contact HP support
for help troubleshooting the cell. Boot without cells that experienced
failure, either through powering them off and rebooting and waiting for the
partition to detect them as missing or through reconfiguring the complex
profile not to include the failing cell(s). The latter option is faster.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2303
- Severity: MAJOR
- Event Summary: A cell is about to be halted.
- Event Class: System
- Problem Description:
Whenever PDC halts a cell, this IPMI event
will be sent with the physical location of the cell in the data field. One or
more preceding IPMI events should indicate what has gone wrong and why the
cell is being halted.
- Cause / Action: Cause: Refer to preceding IPMI events for
cause/action. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2304
- Severity: FATAL
- Event Summary: No cell rendezvoused in the partition can be the
core cell
- Event Class: System
- Problem Description:
No cell rendezvoused in the partition can
be the core cell. The data field contains the physical location of the cell
reporting the problem.
- Cause / Action: Cause: No cell has core IO. There is an IO problem
with cell(s) that does have core IO Action: Configure the partition to include
cell with core IO. Check IO and IO connections (REO cables)
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2305
- Severity: MAJOR
- Event Summary: End of 10 minute waiting period for other cells to
rendezvous.
- Event Class: System
- Problem Description:
Cells are done waiting the 10 minute period
for other cells to rendezvous.
- Cause / Action: Cause: Cells did not power on at same time and/or
have different amounts of IO, memory, etc that affect booting time. A cell had
a problem that caused it to halt or reset for reconfiguration and wait forever
at SINC_BIB. Action: Do nothing, continue booting without cells that did not
make rendezvous. Reboot partition. Make sure all configured cells are powered
on and reset at approximately the same time. Investigate problem on that cell
via its chassis logs.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2306
- Severity: MAJOR
- Event Summary: Unknown I/O failure
- Event Class: System
- Problem Description:
Data is platform dependent. It might not
mean anything. Indicates firmware issue.
- Cause / Action: Cause: Fatal error on the IO of this cell. IO is
not operational. Action: Contact HP Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2307
- Severity: MAJOR
- Event Summary: (HWE) I/O cable error
- Event Class: System
- Problem Description:
Indicates that an I/O Cable is present on
the cell, but that there is an error. Data is the status returned by the I/O
cable, and can be decoded for more information on the failure.
- Cause / Action:
Cause: Problem with I/O Cable or connector. I/O
connected to the cell will not be initialized. Action: Reseat cable. Reseat
I/O backplane or chassis. Reseat Cell. Replace I/O Cable. Replace I/O
backplane or chassis. Replace System Backplane. Replace Cell.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2308
- Severity: MAJOR
- Event Summary: (HWE) Failed to initialize the errors subsystem
- Event Class: System
- Problem Description:
Error subsystem is not operational. IO
discovery cannot progress. IO will not be operational on this cell.
- Cause / Action:
Cause: Insufficient room in SRAM access
errors Action: Replace PDH riser.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2309
- Severity: MAJOR
- Event Summary: (HWE) Failed to send reset to I/O subsystem
- Event Class: System
- Problem Description:
We could not reset IO subsystem. IO is not
available on this cell.
- Cause / Action: Cause: Bad I/O cable. Action: Check/replace I/O
cable. Check/replace I/O chassis.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2310
- Severity: MAJOR
- Event Summary: (HWE) I/O cable could not initialize
- Event Class: System
- Problem Description:
I/O link could not be initialized for use.
I/O for the cell will not be functional.
- Cause / Action: Cause: Bad hardware Action: Replace I/O cable.
Replace I/O chassis.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2311
- Severity: MAJOR
- Event Summary: (HWE) Failed to init rope units in SBA
- Event Class: System
- Problem Description:
All I/O rope units failed initialization.
I/O for this cell will not be functional.
- Cause / Action: Cause: Rope units in the SBA could not be
initialized. Bad hardware. Action: Check for other failures. Replace I/O
chassis
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2312
- Severity: MAJOR
- Event Summary: (HWE) Multiple failed in LBAs and Ropes, not enough
IO to continue
- Event Class: System
- Problem Description:
All ropes or LBAs have failed
initialization. I/O for this cell will not be functional.
- Cause / Action:
Cause: Bad hardware. Action: Check for other
failures. Replace I/O chassis.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2313
- Severity: MAJOR
- Event Summary: (HWE) Failed to init PCI busses
- Event Class: System
- Problem Description:
PCI bus initialization failed.
- Cause / Action:
Cause: All busses on the cell are deconfigured
due to failures. IO on this cell will be non-functional. Action: Check for
other errors. Replace I/O cards.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2314
- Severity: MAJOR
- Event Summary: (HWE) Failed to map SBA to MMIO
- Event Class: System
- Problem Description:
See Summary.
- Cause / Action:
Cause: Could not map SBA into MMIO. IO is not
operational on this cell. Bad hardware. Action: Check for additional failures.
Replace I/O chassis
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2315
- Severity: MAJOR
- Event Summary: (HWE) Failed to map LBAs to MMIO
- Event Class: System
- Problem Description:
See Summary
- Cause / Action: Cause: We could not map the LBAs into MMIO. This
might mean that IO is not functional on this cell. Probably caused by a
hardware failure. Action: Check system for other errors. If no other errors,
replace I/O chassis.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2316
- Severity: MAJOR
- Event Summary: (Medium weight error) RIN block in CC had errors
(might be correctable)
- Event Class: System
- Problem Description:
See Summary. Data is the value of the RIN
primary error register.
- Cause / Action: Cause: RIN block has a bit set. If it is a
recoverable error logged during the opening of the link, the error will be
cleared and initialization will continue. Otherwise, the link will be
deconfigured, and I/O will not be initialized. May be caused by I/O cable
issues. Action: If I/O configuration fails, check RI cable connections. If
problem persists, replace RI cable. If problem still persists, replace HW in
the following order: I/O chassis, Cell, System Back Plane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2317
- Severity: MAJOR
- Event Summary: (MWE) ROUT block in CC had errors (might be
correctable)
- Event Class: System
- Problem Description:
See summary. The data will be the value of
the ROUT primary error log.
- Cause / Action: Cause: ROUT has an error set. If the error is
correctable, it will be cleared and configuration will continue. If the error
is not correctable, the I/O link will be deconfigured, and any I/O for this
cell will be unreachable. May be caused by I/O cable problems. Action: If I/O
configuration fails: Check RI cables. Reseat RI cable. If problem persists:
Replace RI cable. If problem still persists contact HP support and replace in
the following order: I/O chassis, Cell, System Backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2318
- Severity: MAJOR
- Event Summary: (MWE) IORD returned fatal
- Event Class: System
- Problem Description:
See Summary. Data is the physical address
that we tried to read that failed.
- Cause / Action: Cause: A read from a physical address failed. This
is most likely fatal Caused by Hardware Error. Action: Check system for other
errors. Replace device that the address belongs to.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2319
- Severity: MAJOR
- Event Summary: (MWE) IOWR failed
- Event Class: System
- Problem Description:
See Summary. Data is the physical address
that failed. Could not verify write to I/O space.
- Cause / Action:
Cause: Cause is probably failed/bad hardware.
This error is probably fatal. Action: Check for other errors in the system.
Replace the device that the address is associated with.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2320
- Severity: MAJOR
- Event Summary: The main backplane type is unsupported.
- Event Class: System
- Problem Description:
The main backplane type, read from a
register on the PDH riser card, is unsupported. The cell will be halted. Data
field is the encoded backplane type that was read.
- Cause / Action:
Cause: Misconfigured or failing main backplane.
Action: Contact HP Support to confirm the main backplane is functioning
properly. Cause: Cell board or PDH riser hardware problem preventing PDC from
accessing PDH memory or registers. Action: Contact HP Support to confirm the
cell board and PDH riser card are functioning properly. Cause: Incorrect or
invalid version of PDC installed. Action: Install a supported version of PDC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2321
- Severity: MAJOR
- Event Summary: Legacy
- Event Class: System
- Problem Description:
Unused on Pinnacles Platforms.
- Cause / Action:
Cause: No action is required. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2322
- Severity: MAJOR
- Event Summary: Cannot access complex profile A data
- Event Class: System
- Problem Description:
Cannot get access to complex profile A data
through the utility system. The cell will halt.
- Cause / Action:
Cause: Possible Cause 1: Cannot get data from
complex profile A. Action 1: Reload complex profile A through the utility
interface. Possible Cause 2: Utility firmware version is incompatible with
system firmware. Action 2: Make sure utility firmware and system firmware are
compatible. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2323
- Severity: MAJOR
- Event Summary: Back-to-Back CC link initialization failed
- Event Class: System
- Problem Description:
Coherency Controller (CC) to CC link failed
to establish connection. Data Field: PDC Return Status (-2 is the expected
failure)
- Cause / Action: Cause: Have your HP Support Representative check
the link between each CC Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2324
- Severity: FATAL
- Event Summary: Unexpected condition during the generation of the
Address Map.
- Event Class: System
- Problem Description:
PDC detected a programming error while
trying to build the cell address map. The data field contains the starting
address of the entry
- Cause / Action: Cause: PDC programming problem Action: Contact PDC
team with chassis codes showing problems
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2325
- Severity: CRITICAL
- Event Summary: Compare failure found between global write csr and
cmap local csr
- Event Class: System
- Problem Description:
Before initialization of the cmap registers
during early boot, the cmap registers are tested by writing a pattern to the
global cmap registers and reading the values back through the cmap local
registers. During this test, a compare error was found between the value that
was written and the value that was read. The data field provides the physical
address of the location being written/read.
- Cause / Action:
Cause: This problem is caused by a hardware
problem with the cmap registers on Concorde. Contact a hardware person that
could look into the problem further. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2326
- Severity: CRITICAL
- Event Summary: Failure reading cmap csr register during cmap csr
test
- Event Class: System
- Problem Description:
This chassis code reports that a failure
was returned from the read remote csr routine during the testing of the cmap
csr registers. The data field contains the address of the csr register trying
to be read.
- Cause / Action: Cause: Hardware problem with the cmap csr register
given. Check Hardware. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2327
- Severity: CRITICAL
- Event Summary: Failure was indicated trying to write to the global
cmap csr register
- Event Class: System
- Problem Description:
Failure was detected trying to write to the
cmap global register during the cmap csr register testing during early boot
before cmap csr registers have been set to their initial values. The data
field contains the csr address of the cmap register trying to be written.
- Cause / Action: Cause: Hardware failure/Check Concorde cmap csr
registers for failures. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2328
- Severity: CRITICAL
- Event Summary: Checksum calculated for the cell info structure was
invalid
- Event Class: System
- Problem Description:
The cell info structure within ICM was
detected to be corrupted when the cell map structure within the cell info
structure was being updated. The data field shows the address of the start of
the cell info structure.
- Cause / Action: Cause: ICM data corruption - firmware problem or
hardware problem causing ICM to be corrupted Action: Contact HP Support
personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2329
- Severity: FATAL
- Event Summary: Unable to update the cmap info of the cell info
extensible data structure
- Event Class: System
- Problem Description:
Problem in updating the cmap info structure
within the cell info extensible structure. This code is issued for a number of
problems including fabric problems, target cell not in partition, invalid
arguments, PDC semaphore problems, and corrupted data structures. The data
field contains status information from the call to CellInfoUpdateCmapData().
- Cause / Action: Cause: Loss of fabric connectivity. Cause2:
corruption of pdh memory. Cause3: Internal PDC error. Action: Contact HP
Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2330
- Severity: MAJOR
- Event Summary: There is a previous pending Icm Command when
retrieving SPIROM information
- Event Class: System
- Problem Description:
There is a previous pending Icm Command
when retrieving SPIROM information for the cpu module. The Data Field contents
the cpu module number.
- Cause / Action: Cause: Try to retrieve SPIROM information for the
CPU module but unable to call the ICM command structure due to ICM Command
Structure is in used. Action: Contact HP Support personnel to troubleshoot the
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2331
- Severity: MAJOR
- Event Summary: Failure to modify Command Work Bit in in Icm command
structure
- Event Class: System
- Problem Description:
Failure to modify Command Work Bit in in
Icm command structure when retrieving SPIROM information for the cpu module.
The Data Field contents the cpu module number.
- Cause / Action:
Cause: Failure to modify Command Work Bit in in
Icm command structure when retrieving SPIROM information for the cpu module.
Action: Contact HP Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2332
- Severity: MAJOR
- Event Summary: The Icm Command structure is not valid when calling
IcmCompleteGetSpirom()
- Event Class: System
- Problem Description:
The Icm Command structure is not valid when
calling IcmCompleteGetSpirom().
- Cause / Action: Cause: PDC error. Action: Contact HP Support
personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2333
- Severity: MAJOR
- Event Summary: The Spirom Format in the Returned SPIROM information
is invalid
- Event Class: System
- Problem Description:
The Spirom Format in the Returned SPIROM
information in ICM structure is invalid for the Cpu Module. The Data Field
contents the Cpu Module number (MS Byte) and returned Spirom Format (LS Byte).
- Cause / Action: Cause: SPIROM in the CPU module is invalid. Action:
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2334
- Severity: MAJOR
- Event Summary: The returned SPIROM size in Icm Command Structure is
invalid
- Event Class: System
- Problem Description:
The returned SPIROM size in Icm Command
Structure is invalid for the Cpu Module. The Data Field contents the Cpu
Module Number.
- Cause / Action: Cause: SPIROM in the CPU module is invalid. Action:
Contact HP Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2335
- Severity: MAJOR
- Event Summary: The returned SPIROM header in Icm Command Structure
is invalid.
- Event Class: System
- Problem Description:
The returned SPIROM header in Icm Command
Structure is invalid for the Cpu Module. The Data Field contents the Cpu
Module Number.
- Cause / Action: Cause: SPIROM in the CPU module is invalid. Action:
Contact HP Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2336
- Severity: MAJOR
- Event Summary: The returned SPIROM in Icm Command Structure has
Header Checksum error
- Event Class: System
- Problem Description:
The returned SPIROM in Icm Command
Structure has Header Checksum error for the Cpu Module. The Data Field
contents Cpu Module Number (MS Byte), Expected Checksum Value (2nd LS Byte),
and the returned Checksum (LS Byte).
- Cause / Action: Cause: The returned SPIROM in Icm Command Structure
has Header Checksum error. Action: Contact HP Support personnel to
troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2337
- Severity: MAJOR
- Event Summary: The returned SPIROM in Icm Command Structure has
Core Data Checksum error.
- Event Class: System
- Problem Description:
The returned SPIROM in Icm Command
Structure has Core Data Checksum error for the Cpu Module. The Data Field
contents Cpu Module Number (MS Byte), Expected Checksum Value (2nd LS Byte),
and the returned Checksum (LS Byte).
- Cause / Action: Cause: Utility returns SPIROM with Core Data
Checksum error. Action: Contact HP Support personnel to troubleshoot the
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2338
- Severity: MAJOR
- Event Summary: The returned SPIROM in Icm Command Structure has L1
Cache Checksum error.
- Event Class: System
- Problem Description:
The returned SPIROM in Icm Command
Structure has L1 Cache Checksum error for the Cpu Module. The Data Field
contents Cpu Module Number (MS Byte), Expected Checksum Value (2nd LS Byte),
and the returned Checksum (LS Byte).
- Cause / Action: Cause: The SPIROM in the CPU module is invalid.
Action: Contact HP Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2339
- Severity: MAJOR
- Event Summary: The returned SPIROM in Icm Command Structure has
Part Number Checksum error.
- Event Class: System
- Problem Description:
The returned SPIROM in Icm Command
Structure has Part Number Checksum error for the Cpu Module. The Data Field
contents Cpu Module Number (MS Byte), Expected Checksum Value (2nd LS Byte),
and the returned Checksum (LS Byte).
- Cause / Action: Cause: The SPIROM in the CPU module is invalid.
Action: Contact HP Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2340
- Severity: CRITICAL
- Event Summary: Unable to access ACPI GPE Status registers
- Event Class: System
- Problem Description:
An access attempt to an ACPI GPE Status
register failed. This can either be a read or write access. The data field
contains the physical location of the cell which can't access its GPE
registers.
- Cause / Action: Cause: The ACPI block isn't enabled for access.
Action: Upgrade PDC FW if this problem has been fixed in a new version. Cause:
The PDHC is not correctly providing access to the ACPI block. Action: Upgrade
PDHC if this problem has been fixed in a new version.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2341
- Severity: MAJOR
- Event Summary: Fail to get SPIROM from Utilities but the Previous
SPIROM structure is valid.
- Event Class: System
- Problem Description:
Fail to get SPIROM from Utilities but the
Previous SPIROM structure is valid. The Data Field indicates the Cpu Module
Number.
- Cause / Action: Cause: PDC could not get SPIROM from Utilities.
Action: Contact HP Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2342
- Severity: MAJOR
- Event Summary: A invalid return value returned from
SpiromGetModuleSpirom()
- Event Class: System
- Problem Description:
A invalid return value returned from
SpiromGetModuleSpirom(). The Data Field contains the invalid returned value.
- Cause / Action: Cause: See PDH_GET_SPIROM_END Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2343
- Severity: MAJOR
- Event Summary: The ICM command structure could not be invalidated
after retrieving SPIROM
- Event Class: System
- Problem Description:
FAILURE returned from
IcmDoneWithCmdStruct() in SpiromGetModuleSpirom() due to the Wait Response Bit
is not set. The Data Field contains the return value from
IcmDoneWithCmdStruct().
- Cause / Action: Cause: The ICM command structure could not be
invalidated due to the Wait Response Bit is not set. Action: Contact HP
Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2344
- Severity: MAJOR
- Event Summary: Icm Command is not completed successfully when the
Response Byte is valid
- Event Class: System
- Problem Description:
Icm Command is not completed successfully
when the Response Byte is valid. The Data Field contains the Response Byte.
- Cause / Action: Cause: Icm Command is not completed successfully.
Action: Contact HP Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2345
- Severity: MAJOR
- Event Summary: Returned status from IcmModifyCmdWork() in
IcmInitGetSpirom()
- Event Class: System
- Problem Description:
Returned status from IcmModifyCmdWork() in
IcmInitGetSpirom(). The Data Field contains the returned status.
- Cause / Action:
Cause: Not successfully IcmModifyCmdWork
Action: Contact HP Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2346
- Severity: MAJOR
- Event Summary: The returned SPIROM size in Icm Command Structure is
invalid for the Cpu Module.
- Event Class: System
- Problem Description:
The returned SPIROM size in Icm Command
Structure is invalid for the Cpu Module. The Data Field contents the SPIROM
Format
- Cause / Action: Cause: See PDH_GET_SPIROM_END Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2347
- Severity: MAJOR
- Event Summary: The returned SPIROM header in Icm Command Structure
is invalid.
- Event Class: System
- Problem Description:
The returned SPIROM header in Icm Command
Structure is invalid for the Cpu Module. The Data Field contents the Cpu
Module Number.
- Cause / Action: Cause: Invalid Spirom header. Action: Contact HP
Support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2348
- Severity: FATAL
- Event Summary: invalid pointer to cell info extensible data
- Event Class: System
- Problem Description:
Error discovered in trying to determine
pointer to cell info data within the cell info extensible data structure. Data
field contains the address of the extensible data structure. This is a fatal
error and will result in the halting of the cell along with a fatal chassis
code
- Cause / Action: Cause: This problem is most likely caused by the
corruption of the cell info extensible data structure. It could also be caused
by a PDC error. Action: Contact the PDC team to diagnose problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2349
- Severity: MAJOR
- Event Summary: invalid pointer detected to cell info extensible
data
- Event Class: System
- Problem Description:
Error detected in getting the pointer to
the cell info data within the cell info extensible data structure. The data
field contains the address of the cell info extensible data structure. Cause /
Action:
Cause: This problem is most likely caused by the corruption of the
cell info extensible data structure. Action: Have your HP support
representative check the system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2350
- Severity: MAJOR
- Event Summary: Error detected writing to cell map registers with
checking disabled
- Event Class: System
- Problem Description:
There was an error detected in writing to
the global cell map registers with a passed in parameter. This error was that
the request was to write the cell map registers for all of the cells with no
checking that the cell is not within the alive set. This is an error and will
result in the sending of this chassis code along with checking that all of the
cells are in the alive set. The data field is unused.
- Cause / Action: Cause: PDC firmware problem. Action: None.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2351
- Severity: MAJOR
- Event Summary: Attempts to release BIB during reconfig reset have
failed
- Event Class: System
- Problem Description:
The data field contains the return status
of the call to IcmQueuePartitionReleaseBib to release BIB for reconfig reset
- Cause / Action: Cause: Reset for Reconfiguration was called by the
OS with the reboot_flag set to bypass BIB upon reboot and failed. Action:
Upgrade the Manageability FW to a new version if the newer version has fixed
this problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2352
- Severity: MAJOR
- Event Summary: Error occurred accessing a PDC data structure
- Event Class: System
- Problem Description:
Error occurred accessing a PDC data
structure. The cell will be reset. The data field contains the return status
for the function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser
card. Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2353
- Severity: MAJOR
- Event Summary: The CPU failed to complete the task for which it was
awoken
- Event Class: System
- Problem Description:
The CPU failed to complete the task for
which it was awoken. Data field contains the physical location of the CPU that
didn't complete the task for which it was awoken.
- Cause / Action:
Cause: specified CPU executing slowly Action:
Contact HP support to troubleshoot CPU/cell board Cause2: CPU never got
correctly awakened, so could never finish its task Action2: Contact HP support
to see if there is a PDC upgrade for this issue
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2354
- Severity: MAJOR
- Event Summary: PDC detected an illegal CPU number while trying to
deconfigure a CPU
- Event Class: System
- Problem Description:
Cell is about to be halted because an
invalid CPU number was passed into an internal PDC function. The data field
contains the invalid parameter.
- Cause / Action: Cause: Hardware failure with CPU, CC or cell board.
Action: Contact HP Support to confirm the CPUs, CC, and cell board are
functioning properly. Update PDC if a version is available to fix this
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2355
- Severity: MAJOR
- Event Summary: PDC could not access a data structure on the local
cell
- Event Class: System
- Problem Description:
PDC could not access one of its own data
structures on the local cell. The cell will be halted. The data field contains
the return status from the PDC function that encountered the error.
- Cause / Action: Cause: Hardware problem with the PDH riser card.
Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2356
- Severity: MAJOR
- Event Summary: PDC could not access a hardware register on the
local cell board
- Event Class: System
- Problem Description:
PDC could not access a hardware register on
the local cell board. The cell will be halted. The data field contains the
return status from the PDC function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser
card. Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2357
- Severity: MAJOR
- Event Summary: PDC encountered an error accessing a CC hardware
register.
- Event Class: System
- Problem Description:
PDC encountered an error accessing a CC
hardware register. The value obtained from the register was unexpected. The
cell will be halted. The data field contains the invalid value.
- Cause / Action:
Cause: Hardware problem with the CC. Action:
Contact HP Support to confirm the CC is functioning properly. Cause: Hardware
problem with the CPU or cell board. Action: Contact HP Support to confirm the
CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2358
- Severity: MAJOR
- Event Summary: PDC could not access a CC hardware register
- Event Class: System
- Problem Description:
PDC could not access a CC hardware
register. The cell will be halted. The data field contains the address of the
register.
- Cause / Action: Cause: Hardware problem with the CC. Action:
Contact HP Support to confirm the CC is functioning properly. Cause: Hardware
problem with the CPU or cell board. Action: Contact HP Support to confirm the
CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2359
- Severity: MAJOR
- Event Summary: PDC could not access a CC hardware register
- Event Class: System
- Problem Description:
PDC could not access a CC hardware
register. The cell will be halted. The data field contains the return value
from the PDC function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the CC. Action:
Contact HP Support to confirm the CC is functioning properly. Cause: Hardware
problem with the CPU or cell board. Action: Contact HP Support to confirm the
CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2360
- Severity: MAJOR
- Event Summary: PDC could not access a CC hardware register
- Event Class: System
- Problem Description:
PDC could not access a CC hardware
register. The cell will be halted. The data field contains the address of the
register.
- Cause / Action: Cause: Hardware problem with the CC. Action:
Contact HP Support to confirm the CC is functioning properly. Cause: Hardware
problem with the CPU or cell board. Action: Contact HP Support to confirm the
CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2361
- Severity: MAJOR
- Event Summary: PDC could not access a CC hardware register
- Event Class: System
- Problem Description:
PDC could not access a CC hardware
register. The cell will be halted. The data field contains the return value
from the PDC function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the CC. Action:
Contact HP Support to confirm the CC is functioning properly. Cause: Hardware
problem with the CPU or cell board. Action: Contact HP Support to confirm the
CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2362
- Severity: MAJOR
- Event Summary: PDC could not access a CC hardware register
- Event Class: System
- Problem Description:
PDC could not access a CC hardware
register. The cell will be halted. The data field contains the address of the
register.
- Cause / Action: Cause: Hardware problem with the CC. Action:
Contact HP Support to confirm the CC is functioning properly. Cause: Hardware
problem with the CPU or cell board. Action: Contact HP Support to confirm the
CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2363
- Severity: MAJOR
- Event Summary: PDC could not access a CC hardware register
- Event Class: System
- Problem Description:
PDC could not access a CC hardware
register. The cell will be halted. The data field contains the return value
from the PDC function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the CC. Action:
Contact HP Support to confirm the CC is functioning properly. Cause: Hardware
problem with the CPU or cell board. Action: Contact HP Support to confirm the
CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2364
- Severity: MAJOR
- Event Summary: PDC could not access a data structure on the local
cell
- Event Class: System
- Problem Description:
PDC could not access one of its own data
structures on the local cell. The cell will be halted. The data field contains
the return status from the PDC function that encountered the error.
- Cause / Action: Cause: Hardware problem with the PDH riser card.
Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2365
- Severity: MAJOR
- Event Summary: PDC could not access a hardware register on the
local cell board
- Event Class: System
- Problem Description:
PDC could not access a hardware register on
the local cell board. The cell will be halted. The data field contains the
return status from the PDC function that encountered the error.
- Cause / Action:
Cause: Hardware problem with the PDH riser
card. Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2366
- Severity: MAJOR
- Event Summary: A PDC state variable was set incorrectly or lost.
- Event Class: System
- Problem Description:
A PDC state variable was set incorrectly or
lost. The data field contains the value of an internal PDC variable.
- Cause / Action: Cause: Hardware problem with the CPU, PDH riser
card, or cell board. Action2: Contact HP Support to confirm the CPUs, PDH
riser card, and cell board are functioning properly. Update PDC if a version
is available to fix this problem. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2367
- Severity: MAJOR
- Event Summary: PDC detected an illegal CPU number passed to an
internal function
- Event Class: System
- Problem Description:
An invalid CPU number was passed into an
internal PDC function. Depending upon the situation, either the cell will be
halted or the entire partition will be reset. The data field contains the
invalid parameter.
- Cause / Action: Cause: Hardware failure with CPU, CC or cell board.
Action: Contact HP Support to confirm the CPUs, CC, and cell board are
functioning properly. Update PDC if a version is available to fix this
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2368
- Severity: MAJOR
- Event Summary: Multiple uncorrected errors have occurred in shared
CPU resource(s)
- Event Class: System
- Problem Description:
During logging of errors within the
executing CPU, multiple uncorrected errors have been detected in shared CPU
resources. The data field contains the physical location of the CPU module
containing the error. This occurrence is stored by PDC for later action by the
cell crash monarch. The monarch will flag this core and its siblings for
deconfiguration.
- Cause / Action: Cause: Multiple uncorrected errors have been
detected in the CPU module shared cache. Action: Contact HP Support to Analyze
HPMC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2369
- Severity: FATAL
- Event Summary: Firmware error detected while trying to select
blocks to interleave
- Event Class: System
- Problem Description:
This error was caused by PDC bug where a
block size was selected for interleaving memory, but there were no cells to
interleave. This should not happen. The data field contains the block size
that was selected to be interleaved.
- Cause / Action: Cause: PDC bug Action Report problem to PDC team
Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2370
- Severity: MAJOR
- Event Summary: Cell map alias value could not be calculated based
on max zi address
- Event Class: System
- Problem Description:
The aliased offset of the memory hole could
not be determined because the given current maximum zi address was out of
range. This indicates a PDC bug because the address should not be over the
maximum allowable value. The data field contains the currently calculated
maximum zi address.
- Cause / Action: Cause: PDC Bug Action: Call PDC team
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2371
- Severity: MAJOR
- Event Summary: The value to be programmed in the cell map alias
value was invalid
- Event Class: System
- Problem Description:
PDC detected a problem in the alias value
to be programmed in the cell map alias register was detected to be out of
range. This should never happen and would indicate a PDC bug. The data field
contains the invalid alias config value that was calculated to be set in the
cell map alias register
- Cause / Action: Cause: Pdc bug Action: Contact the PDC team
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2372
- Severity: MAJOR
- Event Summary: Base address calculation failure of floater cell
- Event Class: System
- Problem Description:
PDC detected a problem in calculating the
base address of the cell memory info structure for a floater cell. The data
field contains the physical location of the cell having the problem
- Cause / Action:
Cause: Hardware problem not allowing access to
the cell info structure of the floater cell Action: Reboot system and if
problem persists, contact PDC team
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2373
- Severity: FATAL
- Event Summary: No Interleaved memory or floater cells detected
- Event Class: System
- Problem Description:
PDC detected that there is no interleaved
memory and no floater cells and thus no memory available for interleaving.
This is a fatal error and will result in halting of the pd
- Cause / Action:
Cause: no memory in cell or memory has been
deallocated. Review chassis logs to verify memory is available for
interleaving and that all the memory has not been deallocated. Action: if no
memory, add memory to cells. If all of the memory has been deallocated,
determine reason that memory was deallocated - hardware deallocation or
software deallocated. Add memory to cells if needed. If there is memory in
cells not deallocated, clear nvm memory and reboot
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2374
- Severity: MAJOR
- Event Summary: Repairs needed in at least one CPU cache array
exceed available repair elements.
- Event Class: System
- Problem Description:
Processor cache errors have accumulated
until there are insufficient repair elements left to effect the repair. When
repair fails on the subsequent boot, the module will be deconfigured. The data
field can be interpreted by lab personnel to determine which array(s) cannot
be repaired.
- Cause / Action: Cause: Cache repairs needed exceed cache repair
elements available. Action: System can continue until reboot, and afterward
with the offending module deconfigured. Replacement of CPU is recommended to
complete repair.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2376
- Severity: FATAL
- Event Summary: There is a failure with communications between PDC
and the BMC's BT interface
- Event Class: System
- Problem Description:
Since PDC depends on the BT interface and
is only allowed to use this interface once the OS boots. PDC will prevent
booting from BCH and will display an appropriate error message at BCH which
can be seen from the "IN WA" command.
- Cause / Action: Cause: Fix BMC's BT Interface Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2378
- Severity: MAJOR
- Event Summary: During initialization of the nvm blocks, a fatal
error was detected
- Event Class: System
- Problem Description:
This chassis code is used to indicate a
fatal status returned for an nvm block being initialized. The data field is of
type actual data and contains the index of the block reporting the problem.
- Cause / Action: Cause: Firmware error Action: Collect chassis codes
and send to HP support team to determine hardware/software problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2379
- Severity: FATAL
- Event Summary: CPU attempted to time the monarch CPU after
partition creation.
- Event Class: System
- Problem Description:
CPU entering a rendezvous detected that a
timeout value had been set for it to time how long it waits on the monarch.
This timing should never occur after a partition is created, so send chassis
log and reset the partition. Data field contains the illegal timeout value.
This should never be seen in the field.
- Cause / Action: Cause: firmware experienced an internal error
Action: contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2380
- Severity: MAJOR
- Event Summary: Page zero contained invalid console or boot path
- Event Class: System
- Problem Description:
See summary
- Cause / Action:
Cause: Crash dump called before system boot.
Action: No action is required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2381
- Severity: MAJOR
- Event Summary: Indicates a rope could not be reset.
- Event Class: System
- Problem Description:
Data will be the physical location of the
LBA on the rope that failed reset. LBA will not be accessible, nor will any of
the devices below it.
- Cause / Action: Cause: Bad hardware Action: Look for other errors.
Replace I/O chassis.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2382
- Severity: CRITICAL
- Event Summary: An undEFIned error was encountered on an LBA.
- Event Class: System
- Problem Description:
An unknown error was detected on an LBA.
- Cause / Action: Cause: Bad hardware Action: Replace I/O chassis.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2383
- Severity: MAJOR
- Event Summary: Console is not responding.
- Event Class: System
- Problem Description:
Console cannot be used. Data is status of
why console isn't responding.
- Cause / Action: Cause: Hardware failure. Action: Look for other
failures. Reboot core I/O. Replace core I/O.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2384
- Severity: MAJOR
- Event Summary: Remapping of Console failed
- Event Class: System
- Problem Description:
The data portion represents the failure
status of the console remap.
- Cause / Action: Cause: HPMC/TOC failure Action: Look for other
errors. Reboot core I/O Replace Core I/O.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2385
- Severity: MAJOR
- Event Summary: Boot device failed remap
- Event Class: System
- Problem Description:
The boot device with the path in the data
portion of this event could not be mapped into MMIO.
- Cause / Action:
Cause: Bad Hardware Action: Look for other
failures. Replace device.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2386
- Severity: MAJOR
- Event Summary: Reading the Primary Error Mode register failed in
the memory subsystem check.
- Event Class: System
- Problem Description:
Reading the Primary Error Mode register
failed in the memory subsystem check.
- Cause / Action: Cause: An unknown hardware or firmware problem
caused the register read to fail. Action: contact HP support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2387
- Severity: MAJOR
- Event Summary: A memory subsystem test wait timeout failed.
- Event Class: System
- Problem Description:
A memory subsystem test wait timeout
failed. The data field is the timeout length that was exceeded.
- Cause / Action:
Cause: An unknown hardware of firmware error
caused a wait timeout on a register. Action: contact HP support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2388
- Severity: MAJOR
- Event Summary: Two memory subsystem errors were detected.
- Event Class: System
- Problem Description:
Two memory subsystem or parity errors were
detected by firmware. The data field is the error log register contents.
- Cause / Action: Cause: A hardware problem has occurred with the
memory subsystem that caused a parity error or other error. Action: Contact HP
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2389
- Severity: MAJOR
- Event Summary: A memory subsystem error was detected on controller
0.
- Event Class: System
- Problem Description:
A memory subsystem error was detected on
controller 0. The data field is the error log register contents.
- Cause / Action:
Cause: Controller 0 has reported a parity or
memory subsystem error. Action: contact HP support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2390
- Severity: MAJOR
- Event Summary: A memory subsystem error was detected on controller
1.
- Event Class: System
- Problem Description:
A memory subsystem error was detected on
controller 1. The data field is the error log register contents.
- Cause / Action:
Cause: Controller 0 has reported a parity or
memory subsystem error. Action: Contact HP support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2391
- Severity: MAJOR
- Event Summary: A memory subsystem error has occurred, but no
details were logged by the system.
- Event Class: System
- Problem Description:
A memory subsystem error has occurred, but
no details were logged by the system. The data field is the error log register
contents.
- Cause / Action: Cause: A hardware error has occurred where no error
detail was logged for a parity or memory subsystem error. Action: contact HP
support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2392
- Severity: MAJOR
- Event Summary: Deallocated DIMM from system due to a memory
subsystem error.
- Event Class: System
- Problem Description:
A DIMM is being deallocated from the system
due to a parity or memory subsystem error. The data field contains the
physical location of the DIMM to be deallocated.
- Cause / Action:
Cause: A memory subsystem or parity error has
been detected and the DIMM is being deallocated. Action: 1. replace the DIMM
in error 2. contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2393
- Severity: MAJOR
- Event Summary: Reconfiguration of SBA failed.
- Event Class: System
- Problem Description:
SBA will not be accessible.
- Cause / Action:
Cause: Hardware failure. Action: Look for other
failures. Replace I/O chassis.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2394
- Severity: MAJOR
- Event Summary: PDC could not access an internal CC register.
- Event Class: System
- Problem Description:
PDC could not access an internal CC
register. The cell will be reset. Data field is the return status from the
function that encountered the error.
- Cause / Action: Cause: Hardware problem with the CC Action: Contact
HP Support to confirm the CC is functioning properly. Cause: Hardware problem
with the CPU. Action: Contact HP Support to confirm the CPU is functioning
properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2395
- Severity: MAJOR
- Event Summary: An internal PDC function was passed an
unexpected/illegal argument.
- Event Class: System
- Problem Description:
An internal PDC function was passed an
unexpected/illegal value based on the contents of an internal PDH register.
The cell will be halted. Data field is the return status from the function
that encountered the error.
- Cause / Action: Cause: Hardware problem with the PDH riser card
Action: Contact HP Support to confirm the PDH riser card is functioning
properly. Cause: Hardware problem with the CPU or cell board. Action: Contact
HP Support to confirm the CPUs and cell board are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2399
- Severity: MAJOR
- Event Summary: Fabric cannot continue boot. Cell will reset.
- Event Class: System
- Problem Description:
Fabric cannot continue boot. Cell will
reset or halt. Data Field: Reset Type: 0x1: cold boot 0x2: warm boot 0x3:
reconfigure reset 0x4: error boot 0x5: hard halt 0xF: fabricless boot Cause /
Action:
Cause: A problem occurred while discovering the fabric. More
information should be available in the preceding impi events. Action: No
action is required. The cell will reboot or halt. However, it is advised that
an HP service representative analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2400
- Severity: MAJOR
- Event Summary: Fabric boot cannot continue. Cell will reboot
without fabric.
- Event Class: System
- Problem Description:
Fabric boot cannot continue. Cell will
reboot without fabric. Data Field: Reset Type: 0x1: cold boot 0x2: warm boot
0x3: reconfigure reset 0x4: error boot 0x5: hard halt 0xF: fabricless boot
- Cause / Action: Cause: A problem occurred while discovering the
fabric. More information should be available in preceding IPMI events. This
cell cannot talk to other cells. Action: No action is required. The cell will
reboot and attempt to boot without it's connection to the fabric. It is
advised to have a HP service representative analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2401
- Severity: MAJOR
- Event Summary: Failed the read the routing forward progress state
from the XBC
- Event Class: System
- Problem Description:
Failed to read the local XBC's routing
progress register. Data Field: (xbc num << 32) | return status
- Cause / Action:
Cause: Failure reading from the local XBC
Action: Contact your local HP support representative
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2402
- Severity: CRITICAL
- Event Summary: Slave(s) have failed the their memory tests.
- Event Class: System
- Problem Description:
Slave(s) have failed the their memory
tests.
- Cause / Action: Cause: The CPU failed during its memory test for an
unknown reason. Action: The CPU may need to be replaced.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2403
- Severity: MAJOR
- Event Summary: Firmware could not get the semaphore when trying to
get a memory range.
- Event Class: System
- Problem Description:
Firmware could not get the semaphore when
trying to get a memory range. The data field is the semaphore status. Cause /
Action:
Cause: A Firmware error occurred with the semaphore. Action:
Contact HP support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2404
- Severity: MAJOR
- Event Summary: Firmware could not release the semaphore when trying
to get a memory range.
- Event Class: System
- Problem Description:
Firmware could not release the semaphore
when trying to get a memory range. The data field is the semaphore status.
- Cause / Action: Cause: A firmware problem occurred with the
semaphores. Action: Contact HP support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2405
- Severity: MAJOR
- Event Summary: Firmware could not get the semaphore when trying to
reset memory ranges.
- Event Class: System
- Problem Description:
Firmware could not get the semaphore when
trying to reset memory ranges. The data field is the semaphore status. Cause /
Action:
Cause: A Firmware semaphore error occurred. Action: Contact HP
Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2406
- Severity: MAJOR
- Event Summary: Firmware could not release the semaphore when trying
to reset memory ranges.
- Event Class: System
- Problem Description:
Firmware could not release the semaphore
when trying to reset memory ranges. The data field is the semaphore status.
- Cause / Action: Cause: A firmware semaphore error occurred. Action:
Contact HP support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2407
- Severity: CRITICAL
- Event Summary: There were no CPUs available for the memory test.
- Event Class: System
- Problem Description:
There were no CPUs available for the memory
test. The data field is the number of active CPUs in the system for the memory
test.
- Cause / Action: Cause: A CPU or firmware error occurred that caused
no CPUs available to execute the memory tests. Action: Contact HP support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2408
- Severity: CRITICAL
- Event Summary: The number of memory ranges allowed has been
exceeded.
- Event Class: System
- Problem Description:
The number of memory ranges allowed has
been exceeded. The data field is the number of the highest range number.
- Cause / Action: Cause: A firmware error has occurred that cause the
number of memory ranges to be too high. Action: Contact HP support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2409
- Severity: FATAL
- Event Summary: Data obtained from a core cell data structure was
unintelligible.
- Event Class: System
- Problem Description:
Data obtained from a core cell data
structure was unintelligible. Data field contains the data PDC could not
interpret.
- Cause / Action: Cause: PDC read invalid data from an internal data
structure Action: Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2410
- Severity: FATAL
- Event Summary: A cell was unable to access data on the core cell
- Event Class: System
- Problem Description:
PDC could not access data in the core
cell's data structure. This chassis code is probably a result of a failed
attempt to walk the fabric to the core cell.
- Cause / Action: Cause: There was an intermittent problem in the
fabric and a slave cell could not reach the core cell. Action: Look for
problems in the fabric. Try rebooting the partition.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2411
- Severity: MAJOR
- Event Summary: Status reporting for cell latches is not
functioning.
- Event Class: System
- Problem Description:
The optical emitters on the cell latch
sensors are not functioning.
- Cause / Action: Cause: There is a hardware failure. Action: The
cell board must be repaired or replaced.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2412
- Severity: WARNING
- Event Summary: The right cell latch is open
- Event Class: System
- Problem Description:
The right ejector latch on the specified
cell board is open.
- Cause / Action: Cause: The right ejector latch is open. Action:
Close the right ejector latch on the specified cell.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2413
- Severity: WARNING
- Event Summary: The left cell latch is open.
- Event Class: System
- Problem Description:
The left cell ejector on the specified cell
is open.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2414
- Severity: MAJOR
- Event Summary: Both cell ejector latches are closed.
- Event Class: System
- Problem Description:
Both ejector latches are latched on the
specified cell.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2415
- Severity: MAJOR
- Event Summary: Could not read the XBC register containing the
fabric halt state
- Event Class: System
- Problem Description:
Could not read the XBC register containing
the fabric halt state. Data Field: (xbc num << 32) | return status
- Cause / Action: Cause: XBC register read failure Action: Contact HP
Support personnel to analyze the crossbar chip.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2416
- Severity: MAJOR
- Event Summary: Could not write the XBC register to change the
fabric halt state
- Event Class: System
- Problem Description:
Could not write the XBC register to change
the fabric halt state. Data Field: (xbc num << 32) | return status
- Cause / Action: Cause: write to XBC failed Action: Contact HP
Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2417
- Severity: MAJOR
- Event Summary: Could not get the XBC semaphore while updating the
fabric halt state
- Event Class: System
- Problem Description:
Could not get the XBC semaphore while
updating the fabric halt state. Data Field: (xbc num << 32) | return
status
- Cause / Action: Cause: Couldn't get the XBC semaphore before
resetting the cell Action: No action is required. The real problem is
indicated in earlier IPMI events.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2418
- Severity: MAJOR
- Event Summary: Could not release the XBC semaphore while updating
the fabric halt state
- Event Class: System
- Problem Description:
Could not release the XBC semaphore while
updating the fabric halt state. Data Field: (xbc num << 32) | return
status
- Cause / Action: Cause: Couldn't get the XBC semaphore before
resetting the cell Action: No action is required. The real problem is
indicated in earlier IPMI events
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2419
- Severity: MAJOR
- Event Summary: A memory SM4 error has occurred.
- Event Class: System
- Problem Description:
A memory SM4 error has occurred.
- Cause / Action:
Cause: An internal firmware semaphore error has
occurred. Action: Contact HP Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2420
- Severity: MAJOR
- Event Summary: Error: PCI Buses are configured for multiple speeds
- Event Class: System
- Problem Description:
The physical location of the PCI
controller with mixed speeds is displayed in the data field.
- Cause / Action:
Correct the mixed speeds configuration
problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2421
- Severity: CRITICAL
- Event Summary: The bus depth was exceeded during IO probing.
- Event Class: System
- Problem Description:
During walking of the IO busses, the
maximum depth was exceeded. This is a hardware configuration problem. The data
field contains the physical location.
- Cause / Action:
Reconfigure the hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2422
- Severity: CRITICAL
- Event Summary: Timeout occurred during RI initialization.
- Event Class: System
- Problem Description:
A Timeout occurred during RI
initialization. The CSR is in the data field. The alert level will vary.
- Cause / Action:
Correct the hardware problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2423
- Severity: MAJOR
- Event Summary: SuperIO has been detected in slot
- Event Class: System
- Problem Description:
SuperIO has been detected in the PCI slot
displayed in the data field.
- Cause / Action:
No action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2424
- Severity: CRITICAL
- Event Summary: Misc Uncorrectable Error encountered.
- Event Class: System
- Problem Description:
Misc Uncorrectable Error encountered on
the card specified in the data field.
- Cause / Action:
Replace the card in the specified data
field.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2425
- Severity: FATAL
- Event Summary: Miscellaneous fatal error discovered on PCI card
- Event Class: System
- Problem Description:
Misc fatal error was discovered on the PCI
card specified in the data field.
- Cause / Action:
Replace the failing card.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2426
- Severity: CRITICAL
- Event Summary: NVRAM test failed with a data compare error
- Event Class: System
- Problem Description:
NVRAM failed with a data compare error. 3
chassis codes will be sent. The first one will have the expected data and the
second the actual data and the third one will be the address.
- Cause / Action:
Bad NVRAM, replace the bad chip.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2427
- Severity: CRITICAL
- Event Summary: An NVRAM CRC Error was detected
- Event Class: System
- Problem Description:
An NVRAM CRC Error was detected. There
will always be 2 chassis codes sent. The first one will give the expected data
and the second the actual data.
- Cause / Action:
Bad Hardware or a coding problem; Replace the
NVRAM chip.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2428
- Severity: CRITICAL
- Event Summary: The BLOCK requested in NVRAM has the wrong revision
- Event Class: System
- Problem Description:
The NVRAM block requested has an
unexpected revision. This chassis code will be emitted in pairs. The first one
will contain the expected revision number, the second will contain the actual
revision number.
- Cause / Action:
Clear NVRAM and reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2429
- Severity:
- Event Summary: Error loading EFI driver.
- Event Class: System
- Problem Description:
Debugging event, not for release. This
event is no longer used on Everest/xPeak systems but its event ID is still
contained in the code base.
- Cause / Action:
Debugging event, not for release.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2430
- Severity: CRITICAL
- Event Summary: A remote cell is in an unknown state of PD
rendezvous
- Event Class: System
- Problem Description:
The reporting cell read an unknown state
from a remote cell. This chassis code will be emitted in pairs. The first one
indicating the cell number in the data field and the second code indicating
the state read in the data field.
- Cause / Action:
Reset the PD.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2431
- Severity: CRITICAL
- Event Summary: Failed to update CSR contents
- Event Class: System
- Problem Description:
Read test on a CSR failed and the CSR
contents were not updated. This chassis code will be emitted in 3's, with the
CSR Address, expected data, and actual data given.
- Cause / Action:
Bad hardware / replace suspect hardware
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2432
- Severity: CRITICAL
- Event Summary: The complex profile is not the same on all cells in
the PD.
- Event Class: System
- Problem Description:
Not all complex profiles in the PD have
the same checksum and sequence ID. The data field provides details. This event
id will be emitted in a series. Data fields will contain the following: First
Cell number that was different; expected profile A checksum, actual profile A
checksum, expected profile A sequence ID, actual profile A sequence ID,
expected profile C checksum, actual profile C checksum, expected profile C
sequence ID, actual profile C sequence ID. The trouble shooter than determine which cell and profile is the problem. Note that if a new profile A
is distributed during cell rendezvous, this event can be seen even if the
cells rendezvousing were not directly affected by the change.
- Cause / Action:
The Group A or Group C complex profiles didn't
match on all cells in the partition.
Push out new complex profiles and
reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2433
- Severity: MAJOR
- Event Summary: /options settings for CPUBusConfigValue aren't
compatible with PAL
- Event Class: System
- Problem Description:
CPUBusConfigValue setting in options as
been compared with what is reported by PAL as settable and CPUBusConfigValue
is specifying at least one bit that is not supported by this version of PAL
using the Get Processor Bus Dependent Configuration Features. This chassis
code is emitted in pairs, the first data field contains the PAL expected
values; the second data field contains the requested setting from options.
- Cause / Action:
Update the /options value to comply with the
desired features and what is supported.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2434
- Severity: CRITICAL
- Event Summary: Data field contains data meant for firmware debug
only.
- Event Class: System
- Problem Description:
This event is a tool to aid in the
debugging and testing of firmware.
- Cause / Action:
Debug event.
Contact your hp support
representative.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2435
- Severity: FATAL
- Event Summary: Too Few Bulk Power Supplies Available
- Event Class: System
- Problem Description:
The power requirements for the cabinet
exceeds the capabilities of the available Bulk Power Supplies. The PM3 section
of the Utilities Subsystems keeps a tally of the power requirements for all
installed entities in a cabinet. The power available from the Bulk Power
Supplies does not meet the requirements of the installed entities, according
to the Cabinet Power Tally.
- Cause / Action:
Cause: One or more Bulk Power Supplies are
missing or in fault condition at Cabinet Power Up. Action: Contact your HP
support representative to check for faulty Bulk Power Supplies Add Bulk Power
Supplies, if under populated. Cause: Too many entities installed in the
cabinet for the Power Available. Action: Remove entities until Power
Requirements are met. Add Bulk Power Supplies, if under populated.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2436
- Severity: MAJOR
- Event Summary: 1 Side Converter Over Temp
- Event Class: System
- Problem Description:
The 48 Volt DC Converter powered by the
specified PDCA in the designated Bulk Power Supply, is in over temp condition.
The PDCA powering the converter on the BPS that failed can be identified by
the activity status (last byte of chassis code): e = PDCA 1 f = PDCA 0
- Cause / Action:
Cause: If this code is not accompanied by other
codes pointing to the same Bulk Power Supply, then it is an internal component
failure. Action: Contact HP Support personnel to troubleshoot problem. Cause:
If there are chassis codes indicating an elevated ambient air temperature,
other chassis codes will be generated and system shutdown will be necessary.
Action: Resolve the elevated ambient air temperature problem as quickly as
possible. If the air temperature gets too high, the complex must be shutdown
to avoid damage.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2437
- Severity: MAJOR
- Event Summary: The CPU Node reported a problem initializing its
node in the device tree
- Event Class: System
- Problem Description:
The CPU node retuned an error when calling
its initialize node function. The specific status is displayed in the data
field.
- Cause / Action:
See additional logs in determine why initialize
node failed. There could be a hardware or PAL problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2438
- Severity: CRITICAL
- Event Summary: The CC to XBC link pattern test failed.
- Event Class: System
- Problem Description:
The CC to XBC Link pattern test failed.
Data Field: This event will be emitted three times 1st event: XBC number
(32:43), port number (44:55) 2nd event: expected pattern (0:63) 3rd event:
actual data read (0:63)
- Cause / Action:
Unspecified
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2439
- Severity: CRITICAL
- Event Summary: CPUProcConfigValue in /options is not compatible
with the current PAL/CPU
- Event Class: System
- Problem Description:
CPUProcConfigValue setting in options as
been compared with what is reported by PAL as settable and CPUProcConfigValue
is specifying at least one bit that is not supported by this version of PAL
using the Get Processor Dependent Features. This chassis code is emitted in
pairs, the first data field contains the PAL expected values; the second data
field contains the requested setting from options.
- Cause / Action:
Modify CPUProcConfigValue in /options to set
the correct bits.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2440
- Severity: CRITICAL
- Event Summary: Data field contains data meant for firmware debug
only.
- Event Class: System
- Problem Description:
This event is a tool to aid in the
debugging and testing of firmware.
- Cause / Action:
Debug event.
Contact your hp support
representative.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2441
- Severity: CRITICAL
- Event Summary: Complex Profile Group C CRC didn't match the
expected value
- Event Class: System
- Problem Description:
The Calculated CRC for the group C
Partition Profile did not match the stored value. The expected value and the
actual value will be emitted in 2 sequential chassis codes.
- Cause / Action:
Push out a new Group C complex profile.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2442
- Severity: MAJOR
- Event Summary: The Dillon IPR is not cleared after a
MAX_RD_CLR_IPR_TRIES on a cell.
- Event Class: System
- Problem Description:
The Dillon IPR is not cleared after a
MAX_RD_CLR_IPR_TRIES on a cell. The Data Filed in the Cell number. See the
next Chassis Code to see the MAX_RD_CLR_IPR_TRIES
- Cause / Action:
Cause: The Dillon IPR is not cleared after a
MAX_RD_CLR_IPR_TRIES on a cell. Action: Contact HP Support personnel to
troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2443
- Severity: MAJOR
- Event Summary: Firmware could not set Pal Proc features
- Event Class: System
- Problem Description:
Firmware call to set processor feature
resulted in an error.
- Cause / Action:
FW MCA code uses this PAL_SET_PROC_FEATURES to
escalate future MCA to BINIT. This event id when emitted during an MCA
processing means that any subsequent global MCA would not cause all the
processors in the PD to MCA again.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2444
- Severity: UNKNOWN
- Event Summary: Invalid parameter when setting cpu frequency ratios
- Event Class: System
- Problem Description:
An invalid parameter was specified when
trying to set the cpu bus config register, cpu frequencies ratio field. Valid
range is 0xe - 0x17
- Cause / Action:
Specify a valid parameter.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2445
- Severity: MAJOR
- Event Summary: The RTC is providing inconsistent data.
- Event Class: System
- Problem Description:
While validating the internal RTC data the
data is read twice and compared. If the comparison fails to match the read is
performed again up a maximum of 50 times. If we cannot get a match in 50
retries we issue this event.
- Cause / Action:
Cause: there is a known issue with some Dallas
Semiconductor DS1501 RTC parts that can cause read data anomalies but they
should never be this consistent, this event indicates faulty PDH hardware.
Action: replace the PDH hardware.
Action: Contact HP Support personnel to
replace the PDH hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2446
- Severity: CRITICAL
- Event Summary: Error encountered while collecting PCI error logs
- Event Class: System
- Problem Description:
error code
- Cause / Action:
Cause: An error occurred while collecting the
PCI error logs
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2447
- Severity: CRITICAL
- Event Summary: Error encountered while collecting SBA error logs
- Event Class: System
- Problem Description:
error code
- Cause / Action:
Cause: An error occurred while collecting the
SBA error logs
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2448
- Severity: CRITICAL
- Event Summary: It indicates loss of cell connectivity in the
partition.
- Event Class: System
- Problem Description:
It indicates loss of cell connectivity in
the partition during a global MCA processing.
- Cause / Action:
It will lead to cells performing
RESET_FOR_RECONFIG after getting the error logs.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2449
- Severity: MAJOR
- Event Summary: Chassis Code Fifo pointers are reinitialized due to
their invalid values in ICM
- Event Class: System
- Problem Description:
Chassis Code Fifo pointers are
reinitialized due to their invalid value in ICM. The chassis code is sent when
PDC intend to send a chassis code but fail to verify the Chassis Code Log Fifo
Pointers and Size. PDC then reinitialized the Chassis Code Log Fifo Pointers
and send this chassis code before sending the intended one.
- Cause / Action:
Cause: There is a problem with ICM in the
system. The Chassis Code Log Fifo Pointers are invalid in ICM and are
reinitialized.
Action: None.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2450
- Severity: CRITICAL
- Event Summary: Status of building the cell-level firmware device
tree
- Event Class: System
- Problem Description:
Error status of building the cell-level
firmware device tree
- Cause / Action:
Data field displays the status of creating the
cell level device tree for use by firmware. See previous errors already
reported.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2452
- Severity: CRITICAL
- Event Summary: SetViewRoot on a remote cell failed.
- Event Class: System
- Problem Description:
System firmware on the Core cell was unable
to update a slave cell with the location of the root of the partition tree.
The CPU that was unable to be contacted is printed in the data field.
- Cause / Action: Cause: Inter Processor interrupts failed. Be sure
that the partition rendezvous was successfully completed. Reset. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2453
- Severity: CRITICAL
- Event Summary: Complex Profile A has a checksum error
- Event Class: System
- Problem Description:
Complex Profile A (Stable Profile) stored
checksum does not match the calculated checksum. The expected data and actual
data are displayed in the 2 reporting event Ids.
- Cause / Action:
Cause: Reconfigure the system to get a new
complex profile distributed and reset. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2454
- Severity: CRITICAL
- Event Summary: The Stable Complex Profile Sequence Id is invalid
- Event Class: System
- Problem Description:
The Complex Profile Group A sequence ID is
invalid. Booting cannot continue. The actual data is in the chassis code data
field.
- Cause / Action: Cause: Push out a new complex profile and reset the
system. The cell will be waiting for reconfiguration. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2455
- Severity: MAJOR
- Event Summary: The Dynamic Complex Profile Sequence ID is invalid
- Event Class: System
- Problem Description:
The Dynamic Complex Profile (Group B)
sequence ID is invalid. The invalid Sequence ID is displayed in the data
field.
- Cause / Action: Cause: Push out a new complex profile. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2456
- Severity: CRITICAL
- Event Summary: The Partition Profile Sequence ID is invalid
- Event Class: System
- Problem Description:
The Group C Partition Complex Profile
Sequence ID is invalid. The value read is displayed in the data field. Cause /
Action:
Cause: Push out a new complex profile and reset. The cell will be
waiting for reconfiguration. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2457
- Severity: CRITICAL
- Event Summary: The PD numbers in Group A and Group C of the complex
profile are inconsistent.
- Event Class: System
- Problem Description:
The Complex Profile Group A PD assignment
for this cell does not match the PD or Partition number in Group C of the
complex profile. This is a fatal condition for the cell. The PD number from
group A will be emitted first, followed by a subsequent code for the PD
assigned in group C.
- Cause / Action: Cause: Push out consistent Complex profiles and
reset the system. The cell will be waiting for reconfiguration. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2458
- Severity: CRITICAL
- Event Summary: The PD number specified in the complex profile is
out of range.
- Event Class: System
- Problem Description:
The Partition (PD) assigned to this cell in
the complex profile group A and C is larger than the maximum allowed number of
PDs as specified by Group A.
- Cause / Action: Cause: Reconfigure the partition number, push out a
new profile and reset. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2459
- Severity: CRITICAL
- Event Summary: The PDH component encountered an error dealing with
a property on a node.
- Event Class: System
- Problem Description:
The PDH service was unable to either get or
set the property specified in the data field as an ascii message.
- Cause / Action:
Cause: This is usually due to a memory
allocation problem. Verify that SRAM is usable and there is memory available.
Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2460
- Severity: CRITICAL
- Event Summary: Error creating the acpi_hw node.
- Event Class: System
- Problem Description:
PDH encountered an error creating the ACPI
Hardware Node in the device tree or installing its properties.
- Cause / Action:
Cause: May be out of malloc space or a previous
tree error prevented this from being successful. Check for earlier errors.
Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2461
- Severity: CRITICAL
- Event Summary: Error encountered creating or initializing the IPMI
node
- Event Class: System
- Problem Description:
The PDH service encountered an error while
creating the IPMI node or adding properties to it. The status is in the data
field.
- Cause / Action: Cause: Possibly out of memory or an earlier error
left the tree in an unusable state. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2462
- Severity: CRITICAL
- Event Summary: Supplemental cpu tests generated an unexpected
result
- Event Class: System
- Problem Description:
Supplemental tests have been run on the
CPUs in the node and an error was encountered. If this event id is only
emitted once, then the cpu detected that it failed on its own and the
offending CPU id is in the data field. If the event is emitted three times,
then the monarch CPU on the cell detected the error and the offending CPU id,
expected data and actual data are reported in the data field of the three
events.
- Cause / Action: Cause: A CPU failed supplemental self tests.
Replace the offending CPU. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2463
- Severity: MAJOR
- Event Summary: Too many parameters were passed to the utilities
system
- Event Class: System
- Problem Description:
Too many parameters were passed in a
request for the utilities system to perform an operation. No more data is
provided.
- Cause / Action: Cause: This is a firmware error. Contact FW
engineering. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2464
- Severity: MAJOR
- Event Summary: A bad parameter was passed to the LED function in
the utilities component
- Event Class: System
- Problem Description:
A bad parameter was passed to the utilities
function that manipulates the LED on replaceable parts. The offending
parameter is displayed in the data field.
- Cause / Action:
Cause: Contact FW engineering. This is a bug.
Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2465
- Severity: MAJOR
- Event Summary: Utilities reported an error while trying to
manipulate the LED
- Event Class: System
- Problem Description:
The utilities system reported an error
while trying to carry out the command to turn on, flash or turn off the LED.
The status returned by the command is displayed in the data field.
- Cause / Action:
Cause: It is likely the GSP is not present or
the device specified is not present. Solve these problems and try again.
Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2467
- Severity: FATAL
- Event Summary: Testing the Reset Status register in Dillon failed.
- Event Class: System
- Problem Description:
Testing the Reset Status register in Dillon
failed. PDC was unable to correctly write and read a test pattern to the
register. The cell will be halted.
- Cause / Action: Cause: PDC cannot communicate properly to the cell
board. Action: Contact HP Support personnel to troubleshoot the cell board
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2468
- Severity: FATAL
- Event Summary: Unable to report to the MP that PDC firmware has
reached BIB
- Event Class: System
- Problem Description:
During a boot, PDC attempts to inform the
MP that it has reached the SINC BIB point. This is performed via a command
sent to the MP. Some problem occurred while sending this command. It is
possible that the MP did not receive PDC's notification. The data field
contains the return status from PDC's internal function IcmReportAtSincBib().
- Cause / Action: Cause: Loss of communication to the MP Error with
Cell PDH Controller (PDHC) Error with MP. Action: Check USB, GSP, and PDHC.
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2469
- Severity: FATAL
- Event Summary: PDC/Utilities ICM protocol revision mismatch.
- Event Class: System
- Problem Description:
Both the Utilities and PDC have written
their ICM protocol revisions to ICM, PDC has checked them, and at least one of
the revision numbers is not an exact match between PDC and the Utilities. The
cell is about to be halted. The data contents have the format:
0x5500PPGG5000ppgg where: PP = Utilities' PDHC/PDC revision number GG =
Utilities' GSP/PDC revision number pp = PDC's PDHC/PDC revision number gg =
PDC's GSP/PDC revision number
- Cause / Action: Cause: Incorrect PDC and/or PDHC firmware
installed. Cause2: ICM corrupted. Action: Contact HP Support personnel to
troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2470
- Severity: MAJOR
- Event Summary: Cell Info data is not marked invalid while updating
RIO failed field
- Event Class: System
- Problem Description:
Whenever the Cell Info (AKA Cell
Configuration) structure is being updated, its valid bit should be deasserted.
This was not the case while updating the RIO failed field. Data Field: Global
cell # of the cell containing the target Cell Info structure.
- Cause / Action: Cause: Corruption of ICM Internal PDC error Action:
Contact HP Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2471
- Severity: MAJOR
- Event Summary: PDC's Micro Semaphore ownership flag is corrupt.
- Event Class: System
- Problem Description:
PDC's Micro Semaphore ownership flag is
corrupt. This flag is contained in Dillon's Micro General Purpose 2 register.
- Cause / Action: Cause: PDH''s Micro General Purpose register 2
corrupted. Action: Find source of corruption and reboot. Cause: PDC or the
PDHC improperly implementing the algorithms for dealing with the Micro
semaphore. Action: Upgrade PDC or the PDHC firmware. Cause: PDH hardware error
with the Micro General Purpose register 2. Action: Contact HP Support
personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2472
- Severity: MAJOR
- Event Summary: An error has occurred loading the instruction cache.
- Event Class: System
- Problem Description:
An error has occurred loading the
instruction cache. Subsequent chassis codes hold the pointers to the code
address, actual and expected data and indicate whether the error was a data or
tag error? The CPU will be deconfigured.
- Cause / Action: Cause: An error occurred loading the instruction
cache on the CPU. Action: Contact HP Support personnel to troubleshoot
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2473
- Severity: FATAL
- Event Summary: The PDC ROM has a checksum error.
- Event Class: System
- Problem Description:
The PDC ROM has a checksum error. This
could be caused by a defective PDH, errors when loading the image from the GSP
or a bit error in the image. The cell is halted. Data field contains physical
location of cell to be halted.
- Cause / Action: Cause: Error in the PDC ROM image Action: Use FWUU
to reload PDC. This should take care of most instances. Otherwise, contact HP
Support personnel to troubleshoot the cell board and associated cables.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2474
- Severity: MAJOR
- Event Summary: Halting cell because PDC found a non-monarch CPU
setting the valid flag.
- Event Class: System
- Problem Description:
There is a per-cell flag that indicates
whether or not the deconfig bytes are valid in an internal data structure. The
code that sets this flag should therefore be called once per boot. PDC expects
the monarch CPU to be the only CPU that executes this code. This chassis log
indicates that PDC has detected a CPU other than the monarch to be executing
this code. Data field contains the CPU number of the CPU PDC expected to
execute this code, the monarch CPU.
- Cause / Action: Cause: PDC issue Action: Contact HP support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2475
- Severity: MAJOR
- Event Summary: An invalid global CPU number was passed as an
argument
- Event Class: System
- Problem Description:
An internal function within PDC passed an
invalid global CPU number as an argument. Data Field: Invalid argument Cause /
Action:
Cause: Internal PDC error Action: Contact HP Support personnel to
troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2476
- Severity: MAJOR
- Event Summary: PDC has detected a bus number that it doesn't
support
- Event Class: System
- Problem Description:
PDC supports a certain number of processor
buses, like 2. In this case, PDC has found that is it using a bus number that
it (PDC) does not know how to handle - it's an unsupported bus number. The
data field contains the bus number that PDC was working with.
- Cause / Action: Cause: PDC could either be incorrectly determining
the bus number or it is not handling all the bus numbers it should. Action:
Note the bus number in the data field of this chassis log. Contact HP support
to troubleshoot firmware and cell board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2477
- Severity: MAJOR
- Event Summary: SpiromGetModuleSpirom() returned Failure.
- Event Class: System
- Problem Description:
SpiromGetModuleSpirom() returned Failure
for the Cpu Module. The Data Field contents the Cpu Module Number. There is no
SPIROM information for the Cpu Module. The CPU Module will be deconfigured.
- Cause / Action: Cause: Fail to get SPIROM information from at least
one of the Cpu Module in the cell. Action: Look at the Previous Chassis Code
for more information. Contact HP Support personnel to troubleshoot the
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2478
- Severity: FATAL
- Event Summary: Problem trying to retrieve PD Address Map base
pointer
- Event Class: System
- Problem Description:
Problem detected trying to retrieve the pd
base address. The data field contains the return status from
GetCellMemStructBaseAddress used to retrieve the cell memory structure
address.
- Cause / Action: Cause: There are a number of possible causes for
this problem with the data being corrupted. This could be caused by corruption
of the data by PDC or utilities. In addition, there could be fabric problems
in trying to retrieve information for a remote cell. This will cause the cell
to be reset. Action: Contact the PDC team.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2484
- Severity: MAJOR
- Event Summary: The mp in the master slot of the specified cabinet
is reporting failure.
- Event Class: System
- Problem Description:
The master mp has failed. The slave has
taken over management of the cabinet.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2485
- Severity: MAJOR
- Event Summary: The vrm on the specified cell is reporting a voltage
fault.
- Event Class: System
- Problem Description:
The specified vrm is reporting a fault.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2486
- Severity: MAJOR
- Event Summary: The vrm on the specified cell is reporting a
temperature fault.
- Event Class: System
- Problem Description:
The specified vrm is reporting a fault.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2487
- Severity: MAJOR
- Event Summary: The rail on the specified cell is reporting a fault.
- Event Class: System
- Problem Description:
The specified vrm is reporting a fault.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2494
- Severity: INFORMATION
- Event Summary: Power for the specified cell has been shut off due
to open ejector latches.
- Event Class: System
- Problem Description:
N/a
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2498
- Severity: CRITICAL
- Event Summary: An error was encountered communicating with
utilities during a cell OLA.
- Event Class: System
- Problem Description:
An error was encountered communicating
with Utilities during a cell OLA.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2499
- Severity: CRITICAL
- Event Summary: No additional blocks can be allocated from NVRAM.
- Event Class: System
- Problem Description:
The NVRAM service was unable to satisfy an
NVRAM allocation request.
- Cause / Action:
NVRAM is full. This is extremely
unlikely.
Clear NVRAM and reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2500
- Severity: CRITICAL
- Event Summary: Error creating internal firmware data structure for
the "mp" node.
- Event Class: System
- Problem Description:
Internal firmware error creating firmware
tree node.
- Cause / Action: Cause: This event is probably the side-effect of a
prior hardware issue, or corruption in internal firmware data structures
located in SRAM. Action: Use prior events to identify faulty hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2501
- Severity: CRITICAL
- Event Summary: One DIMM in a set is not installed.
- Event Class: System
- Problem Description:
One DIMM in a set is not installed. The
data field is the physical location of the DIMM that is not installed. Cause /
Action:
Cause: One DIMM in a Echelon is not installed. Action: Install a
DIMM in the Slot that is reported in the Chassis code data field
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2503
- Severity: UNKNOWN
- Event Summary: There was a problem accessing Group A Profile
- Event Class: System
- Problem Description:
There was a problem accessing Group A
Profile. Most likely it is due to a SM4 failure. The data field contains the
status of attempting to access the Group A Profile
- Cause / Action:
Cause: There was a problem accessing Group A
Profile. Action: Update System Firmware if a newer version has fixed it.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2505
- Severity: UNKNOWN
- Event Summary: Error obtaining/releasing SM4 during access to Group
A Profile
- Event Class: System
- Problem Description:
There was an error obtaining/releasing SM4
while attempting to access Group A Profile. The data field contains the status
of attempting to access the Profile SM4s
- Cause / Action:
Cause: Unable to obtain/release SM4 Action:
Upgrade System Firmware if a newer version fixes the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2506
- Severity: UNKNOWN
- Event Summary: PDC_SEED_ERROR call to stop a single processor
failed
- Event Class: System
- Problem Description:
Failed to stop a processor. The data field
contains the status of stopping the processor
- Cause / Action:
Cause: Failed to stop a processor Action:
Contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2507
- Severity: CRITICAL
- Event Summary: FATAL event corresponding to legacy 20-bit chassis
codes
- Event Class: System
- Problem Description:
This event is used for translated FATAL
legacy 20-bit chassis codes to E0 format. The data field will be that of the
legacy 20-bit chassis code.
- Cause / Action: Cause: A legacy 20-bit chassis code is emitted
Action: The alert level is FATAL. Contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2508
- Severity: WARNING
- Event Summary: Warning event corresponding to legacy 20-bit chassis
codes
- Event Class: System
- Problem Description:
This event is used for translated warning
legacy 20-bit chassis codes to E0 format. The data field will be that of the
legacy 20-bit chassis code.
- Cause / Action: Cause: A warning legacy 20-bit chassis code is
emitted Action: The alert level is warning. Contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2509
- Severity: FATAL
- Event Summary: Event corresponding to PAT encoded chassis codes
- Event Class: System
- Problem Description:
This event is used for translated PAT
encoded chassis codes to E0 format. Data field contains the legacy chassis
code.
- Cause / Action: Cause: A PAT encoded chassis code is emitted
Action: Alert level is fatal. Contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2510
- Severity: CRITICAL
- Event Summary: Event corresponding to PAT encoded chassis codes
- Event Class: System
- Problem Description:
This event is used for translated PAT
encoded chassis codes to E0 format. Data field contains the legacy chassis
code.
- Cause / Action: Cause: A PAT encoded chassis code is emitted
Action: The alert level is FATAL. Contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2511
- Severity: WARNING
- Event Summary: Event corresponding to PAT encoded chassis codes
- Event Class: System
- Problem Description:
This event is used for translated PAT
encoded chassis codes to E0 format. Data field contains the legacy chassis
code.
- Cause / Action: Cause: A PAT encoded chassis code is emitted
Action: The alert level is warning. Contact HP support since system may be
degraded.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2512
- Severity: CRITICAL
- Event Summary: The RTC appears to not be functioning correctly.
- Event Class: System
- Problem Description:
The RTC seems to not be updating its
internal time registers. The cell cannot boot without the RTC.
- Cause / Action:
Cause: Bad RTC chip on the PDH daughtercard.
Action: Replace PDH daughtercard
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2513
- Severity: FATAL
- Event Summary: Event corresponding to PAT encoded chassis codes'
data field
- Event Class: System
- Problem Description:
This event is used for translated PAT
encoded chassis codes' data field to E0 format. Data field contains the legacy
chassis code's data.
- Cause / Action: Cause: A PAT encoded chassis code is emitted
Action: Contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2514
- Severity: CRITICAL
- Event Summary: Event corresponding to PAT encoded chassis codes'
data field
- Event Class: System
- Problem Description:
This event is used for translated PAT
encoded chassis codes' data field to E0 format. Data field contains the legacy
chassis code's data.
- Cause / Action: Cause: A PAT encoded chassis code is emitted
Action: Contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2515
- Severity: WARNING
- Event Summary: Event corresponding to PAT encoded chassis codes'
data field
- Event Class: System
- Problem Description:
This event is used for translated PAT
encoded chassis codes' data field to E0 format. Data field contains the legacy
chassis code's data.
- Cause / Action: Cause: A PAT encoded chassis code is emitted
Action: System may be degraded. Contact HP support if system is not running
optimally.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2516
- Severity: MAJOR
- Event Summary: Attempt to initialize PA CPU intrigue engine for
debugger trapping has failed.
- Event Class: System
- Problem Description:
PA CPU intrigue engine load during
bootstrap to enable the intrigue-triggered Low Level Debugger trap failed.
This CPU will be deconfigured.
- Cause / Action: Cause: CPU or system firmware failure at the
conclusion of CPU self tests. Action: Contact HP service personnel for failure
analysis.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2517
- Severity: MAJOR
- Event Summary: Cell will be halted because PDC couldn't determine
relocated address of code.
- Event Class: System
- Problem Description:
PDC is about to halt the cell because it
was unable to determine the GNI address of the CpuInstallIntrigueLdb()
function relocated to memory. The data field contains the error return value
from the function GetGniCodeAddrFromRomAddr.
- Cause / Action:
Cause: Hardware connecting cells in the
partition experienced a problem. Action: Contact HP support personnel to
verify intercell connection hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2518
- Severity: WARNING
- Event Summary: An uncorrectable Power Aware Architecture (PAA)
fault has occurred on the MX2 module
- Event Class: System
- Problem Description:
An uncorrectable PAA fault has occurred on
the MX2 module and PAA firmware has halted. The system will continue to run
without PAA firmware.
- Cause / Action: Cause: An uncorrectable PAA fault has occurred on
the MX2 module and PAA firmware has halted. Action: Contact your HP support
personnel.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2519
- Severity: FATAL
- Event Summary: Firmware is unable to set the PageZero Memory Size
field.
- Event Class: System
- Problem Description:
During page zero initialization, system
firmware is unable to set the Memory Size field to be used in the Mako API PIM
structure.
- Cause / Action: Cause: PDC has failed to update a FATAL data
structure during PD boot. The partition will be reset. Action: Contact HP
personnel for error analysis.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2520
- Severity: CRITICAL
- Event Summary: Clear Error Logs routine failed
- Event Class: System
- Problem Description:
Clear Error Logs routine failed. Data field
contains error status.
- Cause / Action: Cause: Clear Error Logs routine returned a
non-recoverable error Action: CPU will be halted.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2686
- Severity: FATAL
- Event Summary: BOOT_BAD_CPU_ORDER
- Event Class: System
- Problem Description:
The Data Field contains the value used to
determine a valid order.
- Cause / Action: Cause: Check CPU load ordering Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2687
- Severity: FATAL
- Event Summary: The clock ratio reported by manageability does not
match the actual clock ratio
- Event Class: System
- Problem Description:
The clock ratio that the processor is
running at does not match the clock ratio that manageability is reporting.
- Cause / Action: Cause: This is a hardware error and it is fatal
Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2689
- Severity: FATAL
- Event Summary: The Nvm section specified could not be accessed
- Event Class: System
- Problem Description:
The section of NVM referenced by the data
field could not be accessed during initialization. This will cause the machine
to be halted as without NVM we cannot function properly.
- Cause / Action:
Cause: Should never happen, but if it does, you
will probably require a new system board. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2690
- Severity: MAJOR
- Event Summary: The RTC time/date were invalid and have been reset
to the Epoch.
- Event Class: System
- Problem Description:
One of the RTC time/date validity checks
failed and the RTC time and date have been reset back to the Epoch 1/1/1970
00:00:00.
- Cause / Action: Cause: consult prior events for a determination of
specific cause. Action: manually update the RTC time and date to the current
time and date.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2691
- Severity: MAJOR
- Event Summary: The RTC internal RAM data was invalid.
- Event Class: System
- Problem Description:
The RTC internal RAM data did not have the
expected value. The actual value read is displayed in the data field.
- Cause / Action: Cause: this error can be caused by first system
turn-on, bad PDH battery, or removal of the PDH battery. Action: the RTC will
be reset to the Epoch of 1/1/1970 00:00:00 and must be manually set to current
time and date.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2692
- Severity: MAJOR
- Event Summary: The RTC internal weekday register contained an
invalid value.
- Event Class: System
- Problem Description:
The RTC internal weekday register contained
an invalid value. Valid values are 1-7 and the actual value read is contained
in the data field.
- Cause / Action: Cause: this error may be caused by first system
turn-on, bad PDH battery, or removal of PDH battery. Action: the RTC time/date
will be reset back to the Epoch 1/1/1970 00:00:00 and must manually be reset
to current time/date.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2693
- Severity: MAJOR
- Event Summary: The value contained in the internal RTC year
register was invalid.
- Event Class: System
- Problem Description:
The value contained in the internal RTC
year register was invalid. Normal expected values are limited to the range of
1970-3000. The actual year value read is contained in the data field.
- Cause / Action: Cause: this error may be caused by first system
turn-on, bad PDH battery, or removal of the PDH battery. Action: the RTC
time/date will be reset back to the Epoch 1/1/1970 00:00:00 and must be
manually reset to the current time and date.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2694
- Severity: MAJOR
- Event Summary: The RTC indicates it's backup battery is discharged.
- Event Class: System
- Problem Description:
The RTC internal status has flagged the
backup battery as being discharged. The RTC will likely lose time if the
system power is turned off and be reset back to the Epoch 1/1/1970 00:00:00.
- Cause / Action: Cause: the PDH battery is either discharged or
disconnected. Action: install a new battery.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2695
- Severity: MAJOR
- Event Summary: The requested NVM record module ID was not found.
- Event Class: System
- Problem Description:
During operation NVM is dynamically mapped
and bound through the use of NVM record module IDs. System firmware modules
will request a address in NVM by providing a module ID. This event is produced
when the requested module ID is invalid and is not found in the NVM managers
internal mapping table.
- Cause / Action: Cause: internal system firmware error. Action: one
of more of the following until error is corrected: first reset system and if
error persists, reset all NVM to defaults.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2696
- Severity: MAJOR
- Event Summary: Provides NVM-based index and RAM-based index module
ID numbers.
- Event Class: System
- Problem Description:
During startup the NVM manager creates an
index of NVM records in RAM as it maps requests to storage. This index is
processed in parallel with a similar index saved in NVM. This event is
produced when the module IDs in the two indexes do not match. The data field
provides the two module IDs in low 32-bits and high 32-bits.
- Cause / Action:
Cause: system firmware error. Action: reset
system and if error persists reset NVM to default values.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2697
- Severity: MAJOR
- Event Summary: The NVM manager has detected an index
synchronization error.
- Event Class: System
- Problem Description:
During startup the NVM manager creates a
record index in RAM and then processes this index in parallel with a similar
index saved in NVM. This event is produced when the NVM record module IDs in
the two indexes to not match. The data field contains the index value where
the error occurred.
- Cause / Action: Cause: system firmware error. Action: first reset
the system and if the error persists reset NVM to default values.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2698
- Severity: MAJOR
- Event Summary: The specified CPU model does not match the Monarch
CPU model
- Event Class: System
- Problem Description:
The model of the CPU specified by the data
field does not match the model of the Monarch CPU. The system will be
prevented from booting past BCH and a warning message will be issued at BCH.
- Cause / Action: Cause: unsupported mixture of CPU models installed.
Action: determine which CPU is invalid from BCH and remove it.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2699
- Severity: MAJOR
- Event Summary: BOOT_INIT_OTHR_PATH_FAILED
- Event Class: System
- Problem Description:
PDC loading ISL.
- Cause / Action:
Cause: PDC attempted to initialize the
non-primary boot path and failed. Action: Data field contains error status.
Verify non-primary boot path is correct. Check console for other messages.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2700
- Severity: MAJOR
- Event Summary: BOOT_SS_ERROR
- Event Class: System
- Problem Description:
Event ID can occur during early boot when
attempting to initialize and validate stable store or during an attempt to
Autoboot.
- Cause / Action: Cause: During early boot or Autoboot PDC could not
access stable store. Action: Note error status in data field and verify that
BMC stable store tokens can be accessed my MP directly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2701
- Severity: MAJOR
- Event Summary: BOOT_NO_CONS_FOUND
- Event Class: System
- Problem Description:
Failed to find a default console path.
- Cause / Action: Cause: PDC has attempted to initialize all console
paths and has failed to find one good console. Action: Verify console paths
are correct and console hardware is installed in correct slot.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2702
- Severity: FATAL
- Event Summary: BOOT_SS_FATAL_ERROR
- Event Class: System
- Problem Description:
PDC could not access stable store will
booting to ISL.
- Cause / Action: Cause: PDC could not access the primary boot path
in architected stable store or autostart flag in unarchitected stable store
while trying to boot to ISL. Action: Data field contains the error code for
the failure. Verify that the BMC stable store tokens can be accessed directly
using the MP IPMI commands.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2703
- Severity: MAJOR
- Event Summary: BOOT_NO_GO_SS_CONS
- Event Class: System
- Problem Description:
PDC is attempting to select a console.
- Cause / Action: Cause: PDC either got an error trying to read the
architected stable store console path or failed to initial the path it was
able to read. PDC will try the other options to find a console path that
works. Action: Data field contains the failure status for the architected
stable store read or the console initialization failure. PDC should be able to
find a console path that works.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2704
- Severity: UNKNOWN
- Event Summary: MC_UNKNOWN
- Event Class: System
- Problem Description:
PDC has received an unknown machine check.
- Cause / Action: Cause: The machine check PDC has processed is not
an HPMC, TOC or LPMC. Action: Data field contains the contains the offending
API failure code. This code needs to be reported to the labs for RCA
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2705
- Severity: MAJOR
- Event Summary: BOOT_BAD_IPL_SIZE_PRI
- Event Class: System
- Problem Description:
PDC is validating LIF header.
- Cause / Action:
Cause: While validating LIF header, PDC found a
bad size for the primary boot path. Action: IPL must be non-zero, less than
256KB and a multiple of 2KB.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2706
- Severity: MAJOR
- Event Summary: BOOT_OTHR_IPL_FAULT
- Event Class: System
- Problem Description:
PDC reading IPL into memory.
- Cause / Action:
Cause: While trying to read IPL for the
non-primary boot path, PDC found an error. Action: Data field contains error
status. Verify IPL meets requirements.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2707
- Severity: MAJOR
- Event Summary: BOOT_SER_NUM_WARNING
- Event Class: System
- Problem Description:
PDC is validating system serial number.
- Cause / Action: Cause: PDC was unable to validate the system serial
number. Action: Data field contains string BADSSN#. Check validity of system
serial number. Warning bit will be set and displayed by BCH.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2708
- Severity: MAJOR
- Event Summary: BOOT_BAD_IPL_ADDR_OTHR
- Event Class: System
- Problem Description:
PDC is validating LIF header.
- Cause / Action:
Cause: The LIF on the non primary boot path is
invalid. Action: Verify IPL start address is non-zero and aligned on 2KB
boundary.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2709
- Severity: FATAL
- Event Summary: MC_HPMC_MONARCH_SELECTED
- Event Class: System
- Problem Description:
PDC is processing an HPMC and has selected
a Crash Monarch to be used in processing.
- Cause / Action:
Cause: PDC has selected a HPMC Crash Monarch.
Action: Data field contains the base address of PDC.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2710
- Severity: MAJOR
- Event Summary: BOOT_BAD_IPL_ENTRY_OTHR
- Event Class: System
- Problem Description:
PDC is reading LIF.
- Cause / Action:
Cause: PDC found an error while verifying the
LIF header from the non primary path. Action: Verify the LIF header has an
entry address less than the IPL size and is word-aligned.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2711
- Severity: MAJOR
- Event Summary: BOOT_BAD_IPL_ENTRY_PRI
- Event Class: System
- Problem Description:
PDC is reading LIF.
- Cause / Action:
Cause: While validating the LIF header for the
primary boot path, PDC found an error in the entry address. Action: Verify
that the LIF header entry address is less than IPL size and word aligned.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2712
- Severity: MAJOR
- Event Summary: BOOT_BAD_IPL_ADDR_PRI
- Event Class: System
- Problem Description:
PDC is reading LIF.
- Cause / Action:
Cause: PDC is validating the LIF header for the
primary boot path and found the start address is either zero or not aligned on
2KB boundary. Action: Verify that the start address is non-zero and on a 2KB
boundary.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2713
- Severity: MAJOR
- Event Summary: BOOT_INIT_PRI_PATH_FAILED
- Event Class: System
- Problem Description:
PDC is loading ISL.
- Cause / Action:
Cause: PDC attempted to initialize the primary
boot path and failed. Action: Data field contains the error status. Verify
primary boot path is correct. Check console for other messages.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2714
- Severity: FATAL
- Event Summary: BOOT_BOOT_FAILURE
- Event Class: System
- Problem Description:
PDC is halting the machine.
- Cause / Action:
Cause: PDC found a halt machine condition.
Action: Examine other event codes to determine reason for halting the machine.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2715
- Severity: MAJOR
- Event Summary: BOOT_BAD_IPL_CHECKSUM_OTHR
- Event Class: System
- Problem Description:
PDC is reading IPL.
- Cause / Action:
Cause: PDC found bad checksum in IPL from non
primary path. Action: Verify non primary path IPL is good.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2716
- Severity: MAJOR
- Event Summary: BOOT_BAD_IPL_CHECKSUM_PRI
- Event Class: System
- Problem Description:
PDC is reading IPL.
- Cause / Action:
Cause: PDC found a bad checksum for the IPL on
the primary boot path. Action: Verify the IPL checksum.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2717
- Severity: FATAL
- Event Summary: BOOT_FATAL_ERR_WRITING_NVM
- Event Class: System
- Problem Description:
PDC initializing NVM data.
- Cause / Action:
Cause: PDC found an error while testing NVM.
Action: Data field contains last value read back during test.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2718
- Severity: MAJOR
- Event Summary: BOOT_PRI_IPL_FAULT
- Event Class: System
- Problem Description:
PDC reading IPL into memory.
- Cause / Action:
Cause: While trying to read IPL for the primary
boot path, PDC found an error. Action: Data field contains error status.
Verify IPL meets requirements.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2719
- Severity: FATAL
- Event Summary: BOOT_NO_BOOT_SELECTION
- Event Class: System
- Problem Description:
PDC failed to boot from selected device.
- Cause / Action: Cause: PDC failed to boot from selected device.
Action: Data field contains ASCII string to indicate Autoboot, Autosearch or
Manual Boot failure and whether it failed validation or loading ISL. Primary
and alternate paths are also indicated for Autosearch case.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2720
- Severity: MAJOR
- Event Summary: BOOT_BAD_LIF_MAGIC_OTHR
- Event Class: System
- Problem Description:
PDC is validating LIF header.
- Cause / Action:
Cause: PDC found a magic number mismatch in the
LIF header for the non primary boot path. Action: Verify magic number in LIF
header.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2721
- Severity: MAJOR
- Event Summary: BOOT_BAD_LIF_MAGIC_PRI
- Event Class: System
- Problem Description:
PDC is validating LIF header.
- Cause / Action:
Cause: PDC found a magic number mismatch in the
LIF header for the primary boot path. Action: Verify magic number in LIF
header.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2722
- Severity: MAJOR
- Event Summary: BOOT_BAD_IPL_SIZE_OTHR
- Event Class: System
- Problem Description:
PDC is validating LIF header.
- Cause / Action:
Cause: While validating LIF header, PDC found a
bad size for the non primary boot path. Action: IPL must be non-zero, less
than 256KB and a multiple of 2KB.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2726
- Severity: MAJOR
- Event Summary: The Creator Product Name is invalid
- Event Class: System
- Problem Description:
The Creator Product Name partition variable
is invalid. This is entered at the manageability processor, typically by the
factory. System firmware was unable to extract the platform ID
- Cause / Action: Cause: The Creator Product Name partition variable
is invalid. This is entered at the manageability processor, typically by the
factory. System firmware was unable to extract the platform ID. Action: From
the manageability processor, enter a valid Creator Product Name.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2747
- Severity: MAJOR
- Event Summary: I/O host bridge is deconfigured
- Event Class: System
- Problem Description:
Firmware has deconfigured an I/O host
bridge due to an error (see earlier error event). Data Field: Physical
location of the deconfigured I/O host bridge.
- Cause / Action: Cause: See earlier error event. Action: See earlier
error event.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2759
- Severity: FATAL
- Event Summary: Cell boards in the same partition have different
complex profiles.
- Event Class: System
- Problem Description:
Cell boards in the same partition have
different complex profiles. The partition will be rebooted and cannot be fully
booted until the problem is resolved.
- Cause / Action: Cause: 1. An error occurred, which prevented the
complex profiles from being distributed properly. 2. A hardware problem exists
with MP or PDHC hardware. Action: 1. Create and distribute a new complex
profile using parMgr on a functional partition in the complex. Restore the
last complex profile using the "CC" command from the MP, then use ParMgr to
create a new complex profile. 2. Contact HP Support to confirm the MP and PDHC
are functioning properly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2760
- Severity: FATAL
- Event Summary: Failed to acquire all needed semaphores to access
partition complex profiles.
- Event Class: System
- Problem Description:
An attempt to access complex profiles of
all cells in the partition has failed due to a failure to acquire all the
needed semaphores.
- Cause / Action: Cause: Unexpected semaphore contention. Partition
will reboot. Action: Contact HP Support to interpret associated events in
listing.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2761
- Severity: FATAL
- Event Summary: Failed to release all semaphores after attempt to
access complex profiles.
- Event Class: System
- Problem Description:
An attempt to access complex profiles of
all cells in the partition has failed due to a failure to release all
associated semaphores.
- Cause / Action:
Cause: Unexpected semaphore contention.
Partition will be rebooted. Action: Contact HP Support to interpret associated
events in listing.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2762
- Severity: FATAL
- Event Summary: Failure to acquire semaphores needed to access
complex profile
- Event Class: System
- Problem Description:
Failure to acquire semaphores needed to
access complex profile. Data field indicates the reason for failure (usually
indicates which semaphore could not be locked.
- Cause / Action:
Cause: Unexpected semaphore contention. Action:
Contact HP Support to interpret associated events in listing.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2763
- Severity: FATAL
- Event Summary: Failure to acquire semaphores needed to access
complex profile
- Event Class: System
- Problem Description:
Failure to acquire semaphores needed to
access complex profile. The data field contains the cell number that owns the
resources the reporting processor is attempting to lock.
- Cause / Action:
Cause: Unexpected semaphore contention. Action:
Contact HP Support to interpret associated events in listing.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2764
- Severity: FATAL
- Event Summary: Failure to release semaphores that were acquired to
access complex profile
- Event Class: System
- Problem Description:
Failure to release semaphores that were
acquired to access complex profile. The data field indicates which semaphore
could not be released (-8 for cell global software semaphore, -9 for the cell
micro semaphore).
- Cause / Action:
Cause: Unexpected semaphore contention Action:
Contact HP Support to interpret associated events in listing.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2765
- Severity: FATAL
- Event Summary: Failure to release semaphores that were acquired to
access complex profile.
- Event Class: System
- Problem Description:
Failure to release semaphores that were
acquired to access complex profile. The data field contains the cell number
that owns the resources the reporting processor is attempting to unlock.
- Cause / Action:
Cause: Unexpected semaphore contention. Action:
Contact HP Support to interpret associated events in listing
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2904
- Severity: FATAL
- Event Summary: CPU fan failed
- Event Class: System
- Problem Description: The fan on a CPU is not operating in a manner
that provides enough air flow to cool the processor
- Cause / Action:
Cause: The CPU fan identified by the attached
physical location has failed.
Action: Contact HP Support personnel to
troubleshoot problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2905
- Severity: MAJOR
- Event Summary: CPU fan failing
- Event Class: System
- Problem Description: The fan on a CPU is operating in a manner that
provides enough air flow to cool the processor but is not at normal operation
- Cause / Action:
Cause: The CPU fan identified by the attached
physical location is not operating at expected speed.
Action: Contact HP
Support personnel to troubleshoot problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2906
- Severity: FATAL
- Event Summary: CC chip fan failed
- Event Class: System
- Problem Description: The fan on a CC chip is not operating in a
manner that provides enough air flow to cool the processor
- Cause / Action:
Cause: The CC chip fan identified by the
attached physical location has failed.
Action: Contact HP Support personnel
to troubleshoot problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2907
- Severity: MAJOR
- Event Summary: CC chip fan failing
- Event Class: System
- Problem Description: The fan on a CC chip is operating in a manner
that provides enough air flow to cool the processor but is not at normal
operation
- Cause / Action:
Cause: The CC chip fan identified by the
attached physical location is not operating at expected speed.
Action:
Contact HP Support personnel to troubleshoot problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2908
- Severity: CRITICAL
- Event Summary: An Arches fabric CSR read resulted in a timeout
- Event Class: System
- Problem Description:
An Arches fabric CSR read was unsuccessful
because of a timeout.
- Cause / Action:
Cause: An error occurred while reading a
hardware register. Other events should detail the error. Action: Contact your
HP Support Representative to investigate the fabric and firmware vertex
modules.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2909
- Severity: CRITICAL
- Event Summary: An Arches fabric CSR write was unsuccessful
- Event Class: System
- Problem Description:
An Arches fabric CSR write was unsuccessful
- Cause / Action:
Cause: An error occurred while writing a
hardware register. Other events should detail the error. Action: Contact your
HP Support Representative to investigate the fabric and firmware vertex
modules.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2910
- Severity: CRITICAL
- Event Summary: A failure occurred while prepping a back-to-back
system for rendezvous
- Event Class: System
- Problem Description:
The event data details the error that was
encountered
- Cause / Action:
Cause: A fabric error occurred while preparing
for rendezvous Action: Contact your HP Support Representative to investigate
the fabric and firmware vertex modules.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2911
- Severity: FATAL
- Event Summary: The maximum memory supported by this system has been
exceeded.
- Event Class: System
- Problem Description: The maximum memory supported by this system
has been exceeded. The data field contains the amount of memory supported by
the system.
- Cause / Action:
Cause: Too much memory has been installed in the
system.
Action: Remove DIMMs to get the total memory size below the amount
of memory returned in the data field
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2914
- Severity: MAJOR
- Event Summary: fabric API is about to open the a link going out of
the local cell
- Event Class: System
- Problem Description:
fabric API is about to open the a link
going out of the local cell
- Cause / Action: Cause: An error prevented the use of a fabric link.
Action: Collect IPMI event logs for more information regarding the failure.
Contact your HP Support Representative to investigate the fabric subsystem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2917
- Severity: CRITICAL
- Event Summary: A command to the memory buffer chip failed to
complete.
- Event Class: System
- Problem Description: A command to the memory buffer chip failed to
complete.
- Cause / Action:
Cause: System firmware did not get a completion
status from a command to a memory buffer chip.
Action: Contact HP support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2918
- Severity: MAJOR
- Event Summary: ArfIsCsrRouteTraversable was called on a
back-to-back system.
- Event Class: System
- Problem Description:
The Arches Fabric function
ArfIsCsrRouteTraversable was called for a back-to-back system. This is
unsupported, ArfIsCsrRouteTraversable should only be called for systems with
crossbars. No data.
- Cause / Action: Cause: An unsupported fabric call was made. Action:
Contact your HP Support Representative to investigate the fabric and firmware
vertex modules.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2919
- Severity: MAJOR
- Event Summary: An invalid port number was passed to an ARF
Traversibility function.
- Event Class: System
- Problem Description:
An invalid port number was given as an
input to an Arches Fabric Traversibility function. The port is expected to be
an internal XBC num (8-15), but the actual port was not. Data field: (XBC
number << 32) | (port number << 48)
- Cause / Action:
Cause: Internal firmware error. Action: Contact
your HP Support Representative to investigate the fabric and firmware vertex
modules.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2920
- Severity: MAJOR
- Event Summary: Error reading the XBC neighbor info (ALB ID) in
ArfIsPortPairValid.
- Event Class: System
- Problem Description:
Unable to read XBC port neighbor
information. Data field: (xbcNum << 32) | (portNum << 48) Cause /
Action:
Cause: System firmware was unable to read a fabric hardware
register. Action: Capture IPMI event logs, look for additional errors. Contact
your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2921
- Severity: MAJOR
- Event Summary: An XBC port has an unexpected neighbor type.
- Event Class: System
- Problem Description:
XBC port with an unexpected neighbor chip
type. Data field: (expected chip type) | (actual chip type << 16) |
(xbcNum << 32) | (portNum << 48)
- Cause / Action: Cause: System firmware noticed a discrepancy from
the expected topology of the system. Action: Capture IPMI event logs, look for
additional errors. Contact your HP Support Representative to analyze the
fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2922
- Severity: MAJOR
- Event Summary: An XBC port was found to have an unexpected neighbor
chip ID.
- Event Class: System
- Problem Description:
An XBC port was found to have an unexpected
neighbor ID. Data field: (expected NID) | (actual NID << 16) | (xbcNum
<< 32) | (portNum << 48)
- Cause / Action:
Cause: System firmware found a discrepancy in
the expected topology of the system. Action: Capture IPMI event logs, look for
additional errors. Contact your HP Support Representative to analyze the
fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2923
- Severity: MAJOR
- Event Summary: An XBC port was found to have an unexpected neighbor
port connection.
- Event Class: System
- Problem Description:
An XBC was found to have an unexpected
neighbor port connection. Data field: (expected port) | (actual port <<
16) | (xbcNum << 32) | (portNum << 48)
- Cause / Action:
Cause: System firmware found a discrepancy in
the expected topology of the system. Action: Capture IPMI event logs, look for
additional errors. Contact your HP Support Representative to analyze the
fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2924
- Severity: MAJOR
- Event Summary: Unable to find the expected neighbor information
from the Arches Fabric data.
- Event Class: System
- Problem Description:
The expected data was not found in the
Arches Fabric expected edge list. Data field is unused
- Cause / Action:
Cause: Internal system firmware error or data
corruption. Action: Capture IPMI event logs, look for additional errors.
Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2925
- Severity: MAJOR
- Event Summary: An XBC port has an unsupported neighbor chip type.
- Event Class: System
- Problem Description:
An XBC port has an unexpected neighbor chip
type. Data field: neighbor type found
- Cause / Action: Cause: System firmware found an unexpected chip in
the system. Action: Capture IPMI event logs, look for additional errors.
Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2926
- Severity: MAJOR
- Event Summary: A previously good XBC-XBC link was found not
connected.
- Event Class: System
- Problem Description:
A previously good XBC-XBC link was found to
be down. Data field: (xbcNum << 32) | (portNum << 48)
- Cause / Action:
Cause: A link that was previously healthy was
found to be down. Action: Capture IPMI event logs, look for additional errors.
Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2927
- Severity: MAJOR
- Event Summary: An FE was found on an XBC port during an ARF
traversability test.
- Event Class: System
- Problem Description:
An XBC port was found to have a Fatal Error
during traversability test. Data field: (xbcNum << 32) | (portNum
<< 48)
- Cause / Action: Cause: A link was found to have fatal errors,
possible bad link/connection. Action: Capture IPMI event logs, look for
additional errors. Contact your HP Support Representative to analyze the
fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2928
- Severity: MAJOR
- Event Summary: Unable to read the local cell's LINK_SEL_FABRIC CSR
- Event Class: System
- Problem Description:
Unable to read the local cell's
LINK_SEL_FABRIC CSR Data field: unused
- Cause / Action: Cause: System firmware was unable to read a
hardware register. Action: Capture IPMI event logs, look for additional
errors. Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2929
- Severity: MAJOR
- Event Summary: Unable to read the XBC route table index.
- Event Class: System
- Problem Description:
Unable to read the XBC route table. Data
field: (route index) | (xbcNum << 32) | (portNum << 48)
- Cause / Action:
Cause: System firmware was unable to read a
fabric hardware register. Action: Capture IPMI event logs, look for additional
errors. Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2930
- Severity: MAJOR
- Event Summary: An XBC link was found to be unexpectedly not
connected.
- Event Class: System
- Problem Description:
An XBC link was found to be not connected
while testing CSR traffic traversibility. Data field: (xbcNum << 32) |
(portNum << 48)
- Cause / Action: Cause: A previously healthy link was found to be
down. Possible bad cable/connection. Action: Capture IPMI event logs, look for
additional errors. Contact your HP Support Representative to analyze the
fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2931
- Severity: MAJOR
- Event Summary: Error reading ALB ID information from a Skyline port
- Event Class: System
- Problem Description:
Error reading Skyline ALREC_ALB_ID CSR Data
field: (Skyline port) | (Cell number << 56)
- Cause / Action:
Cause: System firmware was unable to read a
fabric hardware register. Action: Capture IPMI event logs, look for additional
errors. Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2932
- Severity: MAJOR
- Event Summary: A circular route was found while testing XBC CSR
traffic traversability.
- Event Class: System
- Problem Description:
A circular route was found while testing
XBC CSR traffic traversibility. Data field: (target XBC << 32) |
(cellNum << 56)
- Cause / Action: Cause: Internal firmware error, fabric was routed
incorrectly. Action: Capture IPMI event logs, look for additional errors.
Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2933
- Severity: MAJOR
- Event Summary: Error reading the ALREC_ALB_ID CSR for an XBC port
- Event Class: System
- Problem Description:
Error reading ALREC_ALB_ID CSR for an XBC
port. Data field: (xbcNum << 32) | (portNum << 48)
- Cause / Action:
Cause: System firmware was unable to read a
fabric hardware register. Action: Capture IPMI event logs, look for additional
errors. Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2934
- Severity: MAJOR
- Event Summary: An XBC-XBC port had an unexpected neighbor type
- Event Class: System
- Problem Description:
An XBC-XBC port had an invalid chip type
connection. Data field: (expected chip type) | (actual chip type << 32)
- Cause / Action: Cause: System firmware found a discrepancy in the
system topology. Action: Capture IPMI event logs, look for additional errors.
Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2935
- Severity: MAJOR
- Event Summary: ArfIsXbcRouteTraversable was called on a
back-to-back system.
- Event Class: System
- Problem Description:
ArfIsXbcRouteTraversable was called on a
back-to-back system which is an improper use of the function. Data field:
unused
- Cause / Action: Cause: Internal firmware error. Action: Capture
IPMI event logs, look for additional errors. Contact your HP Support
Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2936
- Severity: MAJOR
- Event Summary: In ARF Traversability code, an XBC link had an
unexpected error.
- Event Class: System
- Problem Description:
In ArfIsXbcRouteTraversable, an XBC-XBC
link was found to have fatal errors. Data field: (xbcNum << 32) |
(portNum << 48)
- Cause / Action: Cause: A previously healthy link was found to have
fatal errors. Action: Capture IPMI event logs, look for additional errors.
Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2937
- Severity: MAJOR
- Event Summary: Unable to read the Moab ROUTE_TABLE_ENABLE_MASK CSR
- Event Class: System
- Problem Description:
Unable to read the XBC
ROUTE_TABLE_ENABLE_MASK CSR. Data field: (xbcNum << 32) | (portNum
<< 48)
- Cause / Action: Cause: System firmware was unable to read a fabric
hardware register. Action: Capture IPMI event logs, look for additional
errors. Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2938
- Severity: MAJOR
- Event Summary: Unable to read the XBC ROUTEx CSR.
- Event Class: System
- Problem Description:
Error reading an XBC ROUTE_TABLE CSR. Data
field: route index | (xbcNum << 32) | (portNum << 48) Cause /
Action:
Cause: System firmware was unable to read a fabric hardware
register. Action: Capture IPMI event logs, look for additional errors. Contact
your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2939
- Severity: MAJOR
- Event Summary: Error reading the ALB ALREC_ALB_ID CSR for an XBC
port.
- Event Class: System
- Problem Description:
Error reading ALB ALREC_ALB_ID CSR for an
XBC port. Data field: (xbcNum << 32) | (portNum << 48)
- Cause / Action:
Cause: System firmware was unable to read a
fabric hardware register. Action: Capture IPMI event logs, look for additional
errors. Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2940
- Severity: MAJOR
- Event Summary: An XBC port has an unexpected neighbor chip
- Event Class: System
- Problem Description:
An XBC port was found with an unexpected
neighbor chip. Data field: expected chip type | (actual chip type << 32)
- Cause / Action: Cause: System firmware found a discrepancy in the
expected system topology. Action: Capture IPMI event logs, look for additional
errors. Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2942
- Severity: MAJOR
- Event Summary: A cell and port pair was not found in the ARF
expected graph data.
- Event Class: System
- Problem Description:
A cell/port pair was not found in the
expected graph data. Data field: (cellNum << 32) | CC portNum
- Cause / Action:
Cause: Internal firmware error or data
corruption. Action: Capture IPMI event logs, look for additional errors.
Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2943
- Severity: MAJOR
- Event Summary: A cell is not connected to the expected fabric
during a traversability test.
- Event Class: System
- Problem Description:
Arches Fabric Traversability code
unexpectedly found the local cell link not connected. Data field: linkState (0
= connected, 1 = link pending, 2 = not connected, -1 = error)
- Cause / Action:
Cause: A previously healthy link was found to
have fatal errors. Action: Capture IPMI event logs, look for additional
errors. Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2944
- Severity: MAJOR
- Event Summary: Unable to read the ROUTE_TABLE CSR on the XBC.
- Event Class: System
- Problem Description:
Error reading the ROUTEx CSR on the XBC.
Data field: route index | (xbcNum << 32) | (portNum << 48)
- Cause / Action:
Cause: System firmware was unable to read a
fabric hardware register. Action: Capture IPMI event logs, look for additional
errors. Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2945
- Severity: MAJOR
- Event Summary: Error reading the ALB ALREC_ALB_ID CSR for an XBC
port.
- Event Class: System
- Problem Description:
Unable to read the ALB ALREC_ALB_ID CSR for
an XBC port. Data field: (xbcNum << 32) | (portNum << 48)
- Cause / Action: Cause: System firmware was unable to read a fabric
hardware register. Action: Capture IPMI event logs, look for additional
errors. Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2946
- Severity: MAJOR
- Event Summary: An XBC port has an unexpected neighbor chip type
- Event Class: System
- Problem Description:
An XBC port has an unexpected neighbor chip
type. Data field: expected neighbor chip type | (actual neighbor chip type
<< 32)
- Cause / Action: Cause: System firmware found a discrepancy in the
expected system topology. Action: Capture IPMI event logs, look for additional
errors. Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2947
- Severity: MAJOR
- Event Summary: A Cell-Cell link has an unexpected neighbor chip
type.
- Event Class: System
- Problem Description:
A Cell-Cell link has an unexpected neighbor
chip type. Data field: expected neighbor chip type | (actual neighbor chip
type << 32)
- Cause / Action:
Cause: System firmware has found a discrepancy
in the expected system topology. Action: Capture IPMI event logs, look for
additional errors. Contact your HP Support Representative to analyze the
fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2948
- Severity: MAJOR
- Event Summary: An error while getting the expected neighbor
information for a CC port.
- Event Class: System
- Problem Description:
The cell/port pair does not exist in the
Arches Fabric expected graph data. Data field: CC port | (cellNum << 32)
- Cause / Action:
Cause: System firmware was unable to read a
fabric hardware register. Action: Capture IPMI event logs, look for additional
errors. Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2949
- Severity: MAJOR
- Event Summary: A Cell-Cell link is connected to an unexpected
neighbor port.
- Event Class: System
- Problem Description:
A Cell-Cell link is connected to an
unexpected neighbor port. Data field: expected neighbor port | (actual
neighbor port << 32)
- Cause / Action:
Cause: System firmware found a discrepancy in
the expected system topology. Action: Capture IPMI event logs, look for
additional errors. Contact your HP Support Representative to analyze the
fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2950
- Severity: MAJOR
- Event Summary: A Cell-Cell link is connected to an unexpected cell
- Event Class: System
- Problem Description:
A Cell-Cell link is connected to an
unexpected cell. Data field: expected neighbor ID | (actual neighbor ID
<< 16) | (target cell << 32) | (local cell << 48)
- Cause / Action:
Cause: System firmware found a discrepancy in
the expected system topology. Action: Capture IPMI event logs, look for
additional errors. Contact your HP Support Representative to analyze the
fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2951
- Severity: MAJOR
- Event Summary: ARF vertex module returned an unexpected error
during SetupNCTable.
- Event Class: System
- Problem Description:
VM_FindAllRoutes returned an unexpected
error during Non-Coherent table setup. Data field: (return value) | (source
cell << 32) | (target cell << 48)
- Cause / Action:
Cause: Internal firmware error or data
corruption. Action: Capture IPMI event logs, look for additional errors.
Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2952
- Severity: MAJOR
- Event Summary: A write to the LINK_SEL_NONCOH global CSR was not
successful.
- Event Class: System
- Problem Description:
In SetupNCTable, a write to the
GLOBAL_LINK_SEL_NONCOH CSR failed. Data field: (data written to CSR) | (actual
CSR data read << 32)
- Cause / Action:
Cause: System firmware was unable to write a
fabric hardware register. Action: Capture IPMI event logs, look for additional
errors. Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2953
- Severity: MAJOR
- Event Summary: Arches Fabric pre-rendezvous setup of the
Non-Coherent link select CSR failed.
- Event Class: System
- Problem Description:
SetupNCTable failed in ArfPhase3. Data
field: cell set passed in to SetupNCTable
- Cause / Action: Cause: System firmware was unable to write some
fabric hardware registers. Action: Capture IPMI event logs, look for
additional errors. Contact your HP Support Representative to analyze the
fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2954
- Severity: MAJOR
- Event Summary: A failure occurred while prepping a back-to-back
system for ArfPhase4.
- Event Class: System
- Problem Description:
A failure occurred while prepping a
back-to-back system for post-rendezvous Arches Fabric setup. Data field:
return value from Phase4BackToBackPrep
- Cause / Action: Cause: System firmware was unable to write various
fabric hardware registers. Action: Capture IPMI event logs, look for
additional errors. Contact your HP Support Representative to analyze the
fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2955
- Severity: MAJOR
- Event Summary: ArfPhase4 was unable to set up the NC link select to
a cell.
- Event Class: System
- Problem Description:
ArfPhase4 was unable to set up the
Non-Coherent link select to at least one cell in its alive/gsm sharing set.
Data field: bitmap of unreachable cells
- Cause / Action: Cause: System firmware was unable to find valid
routes to each cell in a previously good partition. Possible cable/connection
problems. Action: Capture IPMI event logs, look for additional errors. Contact
your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2956
- Severity: MAJOR
- Event Summary: Unable to setup the Non-Coherent route table for a
cell in our partition.
- Event Class: System
- Problem Description:
Unable to set the Non-Coherent route. Data
field: (sourceCell << 32) | (cell set passed in to SetupNCTable)
- Cause / Action:
Cause: System firmware was unable to write
various fabric hardware registers. Action: Capture IPMI event logs, look for
additional errors. Contact your HP Support Representative to analyze the
fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2957
- Severity: MAJOR
- Event Summary: Unable to setup the Coherent route table for a cell
in our partition.
- Event Class: System
- Problem Description:
Unable to setup the Coherent route table
for a cell in our partition. Data field: (sourceCell << 32) | (cell set
passed in to SetupCohTable)
- Cause / Action: Cause: System firmware was unable to write various
fabric hardware registers. Action: Capture IPMI event logs, look for
additional errors. Contact your HP Support Representative to analyze the
fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2958
- Severity: MAJOR
- Event Summary: Error writing the XBC ROUTE_TABLE_ENABLE_MASK CSR.
- Event Class: System
- Problem Description:
Error writing the XBC
ROUTE_TABLE_ENABLE_MASK CSR. Data field: (route set to disable) | (xbcNum
<< 32) | (xbc port << 48)
- Cause / Action: Cause: System firmware was unable to write various
fabric hardware registers. Action: Capture IPMI event logs, look for
additional errors. Contact your HP Support Representative to analyze the
fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2959
- Severity: MAJOR
- Event Summary: The ARF vertex module returned an unexpected error
in SetupCohTable.
- Event Class: System
- Problem Description:
The Arches Fabric vertex module returned an
unexpected error during Non-Coherent table setup. Data field: (return value) |
(sourceCell << 32) | (dest cell << 48)
- Cause / Action:
Cause: Internal firmware error or data
corruption. Action: Capture IPMI event logs, look for additional errors.
Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2960
- Severity: MAJOR
- Event Summary: A write to a Skyline Global LINK_SEL_COHx CSR
failed.
- Event Class: System
- Problem Description:
A write to a Skyline Global LINK_SEL_COHx
CSR failed. Data field: cell number of the Skyline that failed
- Cause / Action:
Cause: System firmware was unable to write a
fabric hardware register. Action: Capture IPMI event logs, look for additional
errors. Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2961
- Severity: WARNING
- Event Summary: Firmware detected an incompatible processor
installed for this type of system.
- Event Class: System
- Problem Description:
System has an incompatible processor
installed.
- Cause / Action: Cause: One or more of the processors installed in a
system is incompatible with that system. Action: Data field contains details
about HVersion, L2 cache size and processor number that was detected.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2962
- Severity: MAJOR
- Event Summary: Error writing the XBC scratch CSR with the routing
state.
- Event Class: System
- Problem Description:
Unable to write the XBC port scratch CSR
with the routing state. Data field: (routing state) | (xbcNum << 32) |
(portNum << 48)
- Cause / Action:
Cause: System firmware was unable to write a
fabric hardware register. Action: Capture IPMI event logs, look for additional
errors. Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2963
- Severity: MAJOR
- Event Summary: Input power to the UPS has failed. A power failure
has occurred.
- Event Class: System
- Problem Description: Input power to the specified UPS has failed.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2964
- Severity: MAJOR
- Event Summary: Input power to the UPS has been restored.
- Event Class: System
- Problem Description: Input power to the specified UPS has been
restored.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2969
- Severity: WARNING
- Event Summary: Thread Configuration Mismatch
- Event Class: System
- Problem Description:
Firmware has detected that the tread
enablement settings are not consistent on all CPUs in the partition. The
offending CPU is reported in the data field.
- Cause / Action: Cause: At least one processor has an inconsistent
setting for multi-thread enablement. Action: Firmware will correctly set the
thread enablement settings for all cpu cores in the hard partition and reset
the partition with the correct settings.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2970
- Severity: FATAL
- Event Summary: The system is being powered down as UPS power is
exhausted.
- Event Class: System
Problem Description: The system is
being powered down as UPS power is exhausted.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2971
- Severity: WARNING
- Event Summary: A crc problem was detected while writing the default
control structure in NVM.
- Event Class: System
- Problem Description:
This chassis code indicates that a crc
problem was detected when trying to write the default control structure. The
calculated crc value did not match the value stored in the structure. The cell
will be reset.
- Cause / Action: Cause: Firmware/software error Action: Check for
new version of firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 2972
- Severity: WARNING
- Event Summary: Stored crc does not match calculated crc in mfg mode
structure.
- Event Class: System
- Problem Description:
NVM crc problem was detected in the mfg
mode structure. The calculated crc value did not match the value stored in the
structure. The partition will be reset.
- Cause / Action:
Cause: Firmware/software error Action: Check
for new version of firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4523
- Severity: OTHER
- Event Summary: The Error Response Mode has been determined
- Event Class: System
- Problem Description:
Get Error Response Mode has been called.
The first 8 bytes of the response mode string are displayed in the data field
and must be converted to ascii from the hex values.
- Cause / Action:
Cause: Decode the hex vales to ascii to
determine the mode. Other errors will determine action. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4524
- Severity: MAJOR
- Event Summary: FW was unable to merge two cells' graphs
- Event Class: System
- Problem Description:
SynchGraphs() failed during Arches Fabric
Phase 4. Data field: return value from SynchGraphs()
- Cause / Action:
Cause: Internal firmware error or data
corruption. Action: Capture IPMI event logs, look for additional errors.
Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4525
- Severity: MAJOR
- Event Summary: VM_DeleteVertex() failed in the Arches Fabric
function SynchGraphs().
- Event Class: System
- Problem Description:
VM_DeleteVertex() failed in SynchGraphs().
Data field: (vertex id << 32) | (vertex type << 56) | (return
value from VM_DeleteVertex())
- Cause / Action:
Cause: Internal firmware error or data
corruption. Action: Capture IPMI event logs, look for additional errors.
Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4526
- Severity: MAJOR
- Event Summary: VM_DeleteEdge() failed in the Arches Fabric function
SynchGraphs().
- Event Class: System
- Problem Description:
VM_DeleteEdge() failed in the Arches Fabric
function SynchGraphs(). Data field:(vertex id1 << 32) | (vertex port1
<< 48) | (vertex type1 << 56) | (vertex id2 << 0) | (vertex
port2 << 16) | (vertex type2 << 24)
- Cause / Action:
Cause: Internal firmware error or data
corruption. Action: Capture IPMI event logs, look for additional errors.
Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4527
- Severity: MAJOR
- Event Summary: Unexpected error when calling an Arches Fabric
vertex module function.
- Event Class: System
- Problem Description:
Unexpected error when calling an Arches
Fabric vertex module function. Data field: return value from vertex module
function
- Cause / Action: Cause: Internal firmware error or data corruption.
Action: Capture IPMI event logs, look for additional errors. Contact your HP
Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4528
- Severity: MAJOR
- Event Summary: Unexpected return from an ARF vertex module
function.
- Event Class: System
- Problem Description:
Unexpected return from an Arches Fabric
vertex module function while copying a cell's graph. Data field: return value
from vertex module function
- Cause / Action: Cause: Internal firmware error or data corruption.
Action: Capture IPMI event logs, look for additional errors. Contact your HP
Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4529
- Severity: WARNING
- Event Summary: The checksum update for the PDT has failed.
- Event Class: System
- Problem Description:
The checksum update for the PDT has failed.
- Cause / Action: Cause: The call to update the checksum for the PDT
failed. The PDT will most likely be cleared upon reboot. Action: Reboot
Upgrade SFW Contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4530
- Severity: WARNING
- Event Summary: The checksum in the memory area of NVM is bad.
Memory NVM will be cleared.
- Event Class: System
- Problem Description:
The checksum in the memory area of NVM is
bad. Memory NVM will be cleared.
- Cause / Action: Cause: The checksum in the memory area of NVM is
bad. Action: Upgrade SFW Contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4531
- Severity: WARNING
- Event Summary: Checksum calculation failed.
- Event Class: System
- Problem Description:
Checksum calculation failed.
- Cause / Action:
Cause: The checksum calculation failed. The
data field contains the address that the checksum calculation was attempted
on. Action: Upgrade SFW Contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4532
- Severity: WARNING
- Event Summary: The token has been initialized to the default value
due to access errors.
- Event Class: System
- Problem Description:
The BMC copy and SAL copy of the token
specified in the data field has been initialized from the default value for
the token due to an IPMI error and SAL NVM error.
- Cause / Action:
Cause: Both SAL's copy and BMC copy of the
token specified in the data field is bad. SFW is reinitializing that token to
it's default value Action: Verify that the default settings are appropriate.
For example, run the EFI baud and boot test command.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4533
- Severity: CRITICAL
- Event Summary: Detected incompatibility between the cell board and
the I/O backplane and/or I/O backplane cables.
- Event Class: System
- Problem Description:
Detected incompatibility between the cell
board and the I/O backplane and/or I/O backplane cables.
- Cause / Action:
Cause: Incorrect I/O backplane and/or I/O
backplane cables are connected to the cell board. The cell board is not
compatible with I/O backplane and/or I/O backplane cables. Action: Remove I/O
backplane and/or I/O backplane cables and replace with compatible I/O
backplane and/or I/O backplane cables.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4535
- Severity: CRITICAL
- Event Summary: CPU modules have been installed in an illegal
configuration on the cell board.
- Event Class: System
- Problem Description:
The CPU modules have been installed in an
illegal configuration on the cell board.
- Cause / Action: Cause: The CPU modules have been installed in an
illegal configuration on the cell board. Action: Contact your HP support
representative.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4536
- Severity: CRITICAL
- Event Summary: An invalid terminator card is installed on the cell
board
- Event Class: System
- Problem Description:
An invalid CPU terminator card is installed
on the cell board. The data field indicates the physical location of the
offending socket.
- Cause / Action: Cause: An invalid CPU terminator card is installed
on the cell board. Action: Contact your HP support representative.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4538
- Severity: CRITICAL
- Event Summary: The initialization of the memory buffer to CEC
timing has failed.
- Event Class: System
- Problem Description:
The initialization of the memory buffer to
CEC timing has failed. The return variable is the status.
- Cause / Action:
Cause: The memory buffer to CEC timing
calibration failed to produce expected results. Action: Contact HP support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4539
- Severity: CRITICAL
- Event Summary: SPD data for an invalid DIMM slot was requested from
manageability.
- Event Class: System
- Problem Description:
System Firmware has tried to retrieve SPD
data on an invalid DIMM slot from manageability. The data field is the return
status from manageability.
- Cause / Action: Cause: System Firmware has tried to retrieve SPD
data on an invalid DIMM slot from manageability. Action: Contact HP support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4540
- Severity: WARNING
- Event Summary: Releasing of a semaphore in the memory PDC code
failed.
- Event Class: System
- Problem Description:
Releasing of a semaphore in the memory PDC
code failed. The data field is the semaphore status
- Cause / Action:
Cause: Releasing of a semaphore in the memory
PDC code failed. Action: Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4541
- Severity: WARNING
- Event Summary: A memory selftest error is unable to be interpreted
correctly.
- Event Class: System
- Problem Description:
Firmware was unable to interpret a memory
error in the memory selftest correctly. The data field is the error type that
was incorrect.
- Cause / Action: Cause: Firmware was unable to interpret a memory
error in the memory selftest correctly. Action: Contact HP Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4542
- Severity: WARNING
- Event Summary: Many corrected memory errors were detected in the
memory subsystem
- Event Class: System
- Problem Description:
Significant numbers of corrected memory
errors have been detected on the memory subsystem
- Cause / Action:
Cause: You will receive this message if the
system is correcting a lot of ECC single bit errors. It may mean that
the module is about to fail, or environmental conditions in the server are
causing more errors than usual Action: If you receive this message, contact
your support provider to determine if a predictive repair should be made
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4542
- Severity: WARNING
- Event Summary: Many corrected memory errors were detected in the
memory subsystem
- Event Class: System
- Problem Description: Significant numbers of corrected memory errors
have been detected on the memory subsystem
- Cause / Action:
Cause: You will receive this message if the
system is correcting a lot of ECC single bit errors. It may mean that
the module is about to fail, or environmental conditions in the server are
causing more errors than usual
Action: If you receive this message, contact
your support provider to determine if a predictive repair should be made
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4543
- Severity: WARNING
- Event Summary: Over-temperature condition detected on a processor
- Event Class: System
- Problem Description: Over-temperature condition detected on a
processor
- Cause / Action:
Cause: The processor temperature is above the
operating range. The processor's performance is throttled by the firmware to
reduce the temperature
Action: The firmware has already taken necessary
action. Once the temperature comes down, the firmware will restore the
processor's performance to normal
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4544
- Severity: WARNING
- Event Summary: Cache errors detected on a processor
- Event Class: System
- Problem Description: Cache errors detected on a processor
- Cause / Action:
Cause: Threshold parity errors have been
detected in the Instruction or Data Cache Memory (I-Cache or D-Cache). The
operating system has recovered from the errors, but this is an abnormally high
failure rate
Action: Contact your HP support representative to check the
processor
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4545
- Severity: WARNING
- Event Summary: Corrected errors detected in the memory cache for a
processor module
- Event Class: System
- Problem Description: Corrected errors detected in the cache portion
of the memory for a processor module
- Cause / Action:
Cause: Threshold corrected platform errors have
been detected in the cache portion of the memory for the processor module. The
operating system has recovered from the errors, but this is an abnormally high
failure rate
Action: Contact your HP support representative to check the
processor
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4546
- Severity: WARNING
- Event Summary: Corrected errors detected on the system bus for a
processor module
- Event Class: System
- Problem Description: Corrected errors detected on the system bus
for a processor module
- Cause / Action:
Cause: Threshold corrected platform errors have
been detected on the system bus for the processor module. The operating system
has recovered from the errors, but this is an abnormally high failure
rate
Action: Contact your HP support representative to check the processor
module or the system bus for the processor if an excessive number of these
errors are generated
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4547
- Severity: WARNING
- Event Summary: Corrected errors detected on the processor bus for a
processor module
- Event Class: System
- Problem Description: Corrected errors detected on the processor bus
for a processor module
- Cause / Action:
Cause: Threshold corrected platform errors have
been detected on the processor bus for the processor module. The operating
system has recovered from the errors, but this is an abnormally high failure
rate
Action: Contact your HP support representative to check the processor
module or the system bus for the processor if an excessive number of these
errors are generated
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4548
- Severity: WARNING
- Event Summary: Corrected errors detected in the memory tag for a
processor module
- Event Class: System
- Problem Description: Corrected errors detected in the tag portion
of the memory for a processor module
- Cause / Action:
Cause: Threshold corrected platform errors have
been detected in the tag portion of the memory for the processor module. The
operating system has recovered from the errors, but this is an abnormally high
failure rate.
Action: Contact your HP support representative to check the
processor module if an excessive number of these errors are generated
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4594
- Severity: MAJOR
- Event Summary: An Alternate Memory Config has been loaded into the
system
- Event Class: System
- Problem Description:
The control bit to load an alternate memory
configuration is set and an alternate memory configuration has been loaded.
This bit should only be set in the factory and not in the field.
- Cause / Action: Cause: Control bit to use an alternate memory
config are set. Action: Clear NVM Action: Update PDC Action: Contact HP
support to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4598
- Severity: MAJOR
- Event Summary: Unspecified memory interleave error
- Event Class: System
- Problem Description:
Indicates that FW encountered a Fatal
interleaving error. The data field contains the return status from the
interleaving procedure call.
- Cause / Action: Cause: FW encountered a fatal interleaving error.
Action: Update SFW Action: Contact HP support to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4626
- Severity: MAJOR
- Event Summary: LBA has unexpected number of I/O Slots.
- Event Class: System
- Problem Description:
Firmware detected a PCI-to-PCI bridge that
exceeds the maximum supported bridge depth. Firmware will not configure I/O
devices below the maximum bridge depth. Such I/O devices will not be usable as
console nor boot devices but might be usable by the O/S. Data Field: PCI
function address of the bridge that exceeded the maximum depth limit. Bits
24..31: segment number Bits 16..23: bus number Bits 11..15: device number Bits
8..10: function number Bits 0..7: reserved (0)
- Cause / Action:
Cause: Unsupported I/O configuration. Action:
Remove the I/O cards below the specified PCI-to-PCI bridge.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4629
- Severity: MAJOR
- Event Summary: PCI parity error detected.
- Event Class: System
- Problem Description:
An I/O device (or host bridge) detected a
bus parity error. An I/O device (or host bridge) mastered a bus transaction
and received a parity error response from the target. Data Field: Physical
location of the I/O device (or host bridge).
- Cause / Action: Cause: I/O bus parity error. Action: Consult the
error logs for additional information. Determine and replace the failed I/O
device. Cause: I/O host bridge failure. Action: Contact your HP representative
to check the I/O host bridge.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4630
- Severity: MAJOR
- Event Summary: PCI system error detected
- Event Class: System
- Problem Description:
An I/O device (or host bridge) detected an
internal error. An I/O device (or host bridge) detected a bus error. Data
Field: Physical location of the I/O device (or host bridge).
- Cause / Action:
Cause: I/O device failure. Action: Consult the
error logs for additional information. Determine and replace the failed I/O
device. Cause: I/O host bridge failure. Action: Contact your HP representative
to check the I/O host bridge.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4631
- Severity: MAJOR
- Event Summary: DIMM thermal loading order warning
- Event Class: System
- Problem Description:
DIMMs are not loaded on the extender in a
thermally optimal way. Boot is still possible, but the DIMM arrangement should
be changed to the loading order recommended in the users manual. The data
field indicates the number of the extender with incorrectly loaded DIMMs.
- Cause / Action: Cause: The current DIMM loading order does not
follow the guidelines in the user manual Action: Rearrange the DIMMs to follow
the loading order specified in the Maintenance and Operational Manual
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4636
- Severity: FATAL
- Event Summary: memory extender loading order error
- Event Class: System
- Problem Description:
The Memory extenders have not been loaded
in the correct order.
- Cause / Action: Cause: The memory extenders have not been loaded in
the correct order. Action: Load the Memory extenders according to the users
manual.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4637
- Severity: MAJOR
- Event Summary: chipspare not supported on quad
- Event Class: System
- Problem Description:
This code will be sent when FW detects a
rank installed in the system that doesn't support chipspare. The data field is
used to indicate the rank that the x8 DIMMs are installed. It is in the format
0x00000000XDXCXBXA or 0x00000000YBYAXBXA where X and Y are the number of the
rank.
- Cause / Action: Cause: User installed a x8 DIMM in a system
configured for chipspare. Action: If user requires Chipspare, replace the DIMM
with a x4 DIMM. If Chipspare is not required, then no action is required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4638
- Severity: MAJOR
- Event Summary: memory DIMM pair mismatch
- Event Class: System
- Problem Description:
A pair of DIMMs installed in the system are
mismatched, and that pair of DIMMs will not be used. The data field indicates
which pair of DIMMs are mismatched in the format 0x000000000000XBXA where X is
the number of the rank that is mismatched.
- Cause / Action: Cause: The user installed a mismatched pair of
DIMMs in the same rank (ie the DIMMs are different size or width). Action:
Install memory ranks in pairs of DIMMs that are the same size and width.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4639
- Severity: MAJOR
- Event Summary: memory DIMM quad mismatch
- Event Class: System
- Problem Description:
A quad of DIMMs installed in the system are
mismatched, and that quad of DIMMs will not be used. The data field indicates
which quad of DIMMs are mismatched in the format 0x00000000XDXCXBXA where X is
the number of the rank that is mismatched.
- Cause / Action: Cause: The user installed a mismatched quad of
DIMMs in the same rank (ie the DIMMs are different size or width). Action:
Install memory ranks in quads of DIMMs that are the same size and width.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4644
- Severity: MAJOR
- Event Summary: (HWE) IO backplane type unknown
- Event Class: System
- Problem Description:
See Summary.
- Cause / Action:
Cause: This is an alien IO backplane. IO
discovery will fail. I/O for the cell will not be initialized. Action: Replace
I/O backplane or chassis or update firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4646
- Severity: MAJOR
- Event Summary: A rope went fatal from the SBA
- Event Class: System
- Problem Description:
A rope went fatal from the SBA to the LBA.
If all the ropes go fatal the IO subsystem is dead. Any I/O below the rope
will not be accessible. The data field gives the physical location of the rope
that went fatal. Some ROPE_FATAL conditions will not cause the system to go
down. For Example, a rope that has gone fatal during an OLAR operation will
only cause warnings to the user, not bring the partition down.
- Cause / Action: Cause: Mainly a hardware problem causes this
problem. Action: Replace I/O chassis.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4647
- Severity: MAJOR
- Event Summary: A PCI bus on the system went fatal.
- Event Class: System
- Problem Description:
A PCI bus on the system is fatal. Any
device on this bus is unavailable. Firmware was unable to complete card
initialization due to an error. Data field contains physical location of
failed bus.
- Cause / Action: Cause: A bad card, a bad device, or a system
hardware problem can cause this to occur. Card will be powered off (if
possible) and the attention light on the slot will be lit indicating an error.
Action: Reseat Card. Replace Card. If error persists, contact HP support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4648
- Severity: MAJOR
- Event Summary: One of the rope units in the SBA is dead.
- Event Class: System
- Problem Description:
One of the rope units in the SBA failed. If
all of the rope units fail, then IO will not be available on this cell.
- Cause / Action: Cause: Usually hardware. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4650
- Severity: FATAL
- Event Summary: PCI slot exceeds power limit
- Event Class: System
- Problem Description:
A non-hot plug I/O slot's power consumption
increases the total I/O power consumption beyond the supported limit. Firmware
will display the following EFI error message, "I/O configuration exceeds power
limit" and disallow O/S boot. Data Field: Physical location of the I/O slot.
- Cause / Action: Cause: The I/O configuration's power
consumption exceeds the supported limit. Action: Remove the I/O card from each
slot indicated by an IO_PCI_POWER_OVERLOAD_ERR event.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4667
- Severity: MAJOR
- Event Summary: A failure to create an NVRAM-backed fPars existence
variable for the server occurred. The event detail identifies the fPar number.
- Event Class: System
- Problem Description:
The fPar identified by the event detail
will be unusable. Other events may precede this event which will help to
diagnose the failure. A possible cause is that the system exhausted unused
NVRAM.
- Cause / Action: Cause: Firmware ran out of NVRAM before the server completed cold boot initialization.
Action: Initialize
the NVRAM to the factory defaults and reset the server. Soft partitions will
have to be recreated from scratch before they are available.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4668
- Severity: MAJOR
- Event Summary: When firmware attempted to handle a failed vPar
launch, there was no handler registered by the HP OpenVMS vPars monitor. The
CPU will enter a spin-loop which may only be exited by a reset. This event can
only occur in vPars mode.
- Event Class: System
- Problem Description:
This is a vPars monitor failure and can
only occur when booted in vPars mode. Booted in vPars mode and the vPars
monitor did not perform a required handshake with firmware. Likely due to a
vPars monitor defect or use of an unreleased vPars monitor.
- Cause / Action:
Cause: Incorrect vPar monitor behavior. Not a
firmware error. Action: Contact HP Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4669
- Severity: MAJOR
- Event Summary: During handling of a failed vPar launch, firmware
called the monitor launch failure handler, which is NOT supposed to return,
but it did return.
- Event Class: System
- Problem Description:
This is a vPars monitor failure and can
only occur when booted in vPars mode.
- Cause / Action: Cause: Incorrect vPar monitor behavior. Not a
firmware error. Action: Contact HP Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4670
- Severity: FATAL
- Event Summary: Firmware was unable to perform cold-boot
initialization of the fPars functionality. Can occur even when booting in
nPars mode. This error will cause a sequencer fatal error as it is
non-recoverable.
- Event Class: System
- Problem Description:
A FATAL sequencing event that is
Interesting only to firmware developers, but will prevent the server from
booting because it causes a sequencer fatal error.
- Cause / Action:
Cause: This should not show up and Firmware
should have resolved this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4671
- Severity: FATAL
- Event Summary: The server is out of NVRAM for the fPars Device Map
function and will be unable to provide EFI ownership control variables for
fPars devices.
- Event Class: System
- Problem Description:
Firmware is out of NVRAM and cannot built
the ILM memory slice device map
- Cause / Action: Cause: Firmware ran out of NVRAM before the server completed cold boot initialization. Action: Initialize the
NVRAM to the
factory defaults and reset the server. Soft partitions will have to be
recreated from scratch before they are available.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4672
- Severity: FATAL
- Event Summary: An internal firmware inconsistency prevents the
fPars functionality from being initialized.
- Event Class: System
- Problem Description:
The NVRAM for the CPU map is corrupt and
cannot be trusted. It appears that there are more CPUs in the map than the
size of the map can hold.
- Cause / Action: Cause: NVRAM corruption of fPars specific database.
Action: Restore NVRAM to factory defaults. Reset server and reconfigure soft
partitions. Cause: This should not show up and Firmware should have resolved
this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4673
- Severity: FATAL
- Event Summary: An internal firmware inconsistency prevents the
fPars functionality from being initialized.
- Event Class: System
- Problem Description:
The NVRAM for the IO device ownership map
is corrupt and cannot be trusted. It appears that there are more IO devices in
the map than the size of the map can hold.
- Cause / Action: Cause: NVRAM corruption of fPars specific database.
Action: Restore NVRAM to factory defaults. Reset server and reconfigure soft
partitions.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4674
- Severity: FATAL
- Event Summary: An internal firmware inconsistency prevents the
fPars functionality from being initialized.
- Event Class: System
- Problem Description:
The NVRAM for the cell local memory map is
corrupt and cannot be trusted. It appears that there are more memory slices in
the map than the size of the map can hold.
- Cause / Action:
Cause: NVRAM corruption of fPars specific
database. Action: Restore NVRAM to factory defaults. Reset server and
reconfigure soft partitions.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4675
- Severity: FATAL
- Event Summary: An internal firmware inconsistency prevents the
fPars functionality from being initialized.
- Event Class: System
- Problem Description:
An internal firmware inconsistency exists
or the server exhausted its internal resources and is unable to create an
fPars EFI ownership variable for a CPU device in the partition. Either of
these conditions is fatal.
- Cause / Action: Cause: NVRAM corruption of fPars specific database.
Action: Restore NVRAM to factory defaults. Reset server and reconfigure soft
partitions.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4676
- Severity: FATAL
- Event Summary: An internal firmware inconsistency prevents the
fPars functionality from being initialized.
- Event Class: System
- Problem Description:
An internal firmware inconsistency exists
or the server exhausted its internal resources and is unable to create an
fPars EFI ownership variable for an IO device in the partition. Either of
these conditions is fatal.
- Cause / Action: Cause: NVRAM corruption of fPars specific database.
Action: Restore NVRAM to factory defaults. Reset server and reconfigure soft
partitions.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4677
- Severity: FATAL
- Event Summary: An internal firmware inconsistency prevents the
fPars functionality from being initialized.
- Event Class: System
- Problem Description:
An internal firmware inconsistency exists
or the server exhausted its internal resources and is unable to create an
fPars EFI ownership variable for a CLM device in the partition. Either of
these conditions is fatal.
- Cause / Action: Cause: NVRAM corruption of fPars specific database.
Action: Restore NVRAM to factory defaults. Reset server and reconfigure soft
partitions.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4678
- Severity: FATAL
- Event Summary: An internal firmware inconsistency prevents the
fPars functionality from being initialized.
- Event Class: System
- Problem Description:
FATAL (limited) NVRAM storage is
unavailable during early boot when ILM ownership map storage is allocated.
- Cause / Action: Cause: This should not show up and Firmware should
have resolved this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4679
- Severity: FATAL
- Event Summary: The internal hierarchical firmware database is
corrupt and/or an operation on this database failed in an unexpected way. Such
a failure is fatal.
- Event Class: System
- Problem Description:
A FATAL sequencing event that is
Interesting only to firmware developers. The event detail holds the
instruction pointer of the location where the event was emitted to help the
developer identify which of many such failures this one event is indicating.
- Cause / Action: Cause: This should not show up and Firmware should
have resolved this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4680
- Severity: FATAL
- Event Summary: Physical memory allocation failure. This is always
fatal to firmware.
- Event Class: System
- Problem Description:
Firmware failed a physical memory
allocation. The event detail contains the physical instruction address at
which the event was emitted. This is always a fatal error and should never
occur in released firmware. It can occur due to corrupted code or data or
hardware failure, and of course an undetected firmware coding defect. Cause /
Action:
Cause: This should not show up and Firmware should have resolved
this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4681
- Severity: FATAL
- Event Summary: The internal hierarchical firmware database is
corrupt and/or an operation on this database failed in an unexpected way. Such
a failure is fatal.
- Event Class: System
- Problem Description:
The internal hierarchical firmware database
is corrupt and the Read (Get Property) operation on this database failed in an
unexpected way. Such a failure is fatal. This error should never occur with
released firmware and is usually a memory corruption but may be an undetected
coding defect.
- Cause / Action: Cause: This should not show up and Firmware should
have resolved this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4682
- Severity: FATAL
- Event Summary: The internal hierarchical firmware database is
corrupt and/or an operation on this database failed in an unexpected way. Such
a failure is fatal.
- Event Class: System
- Problem Description:
Reading the length of a property
(GetProplen) is the way firmware determines if a datum exists. If the data
must exist but does not, the failure is fatal to correct firmware function and
probably indicates corruption in the database. It may also indicate an
undetected firmware defect. But the error is fatal.
- Cause / Action:
Cause: This should not show up and Firmware
should have resolved this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4683
- Severity: FATAL
- Event Summary: The internal hierarchical firmware database is
corrupt and/or an operation on this database failed in an unexpected way. Such
a failure is fatal.
- Event Class: System
- Problem Description:
An attempt to delete a database record
failed because the record is missing. This error should never occur in
released firmware and is fatal.
- Cause / Action: Cause: This should not show up and Firmware should
have resolved this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4684
- Severity: FATAL
- Event Summary: The internal hierarchical firmware database is
corrupt and/or an operation on this database failed in an unexpected way. Such
a failure is fatal.
- Event Class: System
- Problem Description:
This error should never occur in released
firmware and indicates a corruption in the firmware system. It is always
fatal. The event detail identifies the code location (instruction pointer)
where the event was emitted.
- Cause / Action: Cause: This should not show up and Firmware should
have resolved this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4685
- Severity: FATAL
- Event Summary: General failure in the fPars configuration
subsystem.
- Event Class: System
- Problem Description:
A CRITICAL internal firmware error that is
unrecoverable. Other events from the same CPU immediately preceding this event
may help identify the cause and indicate remedies. The event detail contains
the instruction pointer of the code that emitted the event.
- Cause / Action:
Cause: This should not show up and Firmware
should have resolved this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4686
- Severity: FATAL
- Event Summary: General failure in the fPars configuration
subsystem.
- Event Class: System
- Problem Description:
A CRITICAL internal firmware error that is
unrecoverable. Other events from the same CPU immediately preceding this event
may help identify the cause and indicate remedies. The event detail contains
the instruction pointer of the code that emitted the event.
- Cause / Action:
Cause: This should not show up and Firmware
should have resolved this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4687
- Severity: FATAL
- Event Summary: Memory allocation from SAL instance heap failed.
(I.e., heap exhaustion)
- Event Class: System
- Problem Description:
Call to physical memory allocation from SAL
instance heap failed. Interesting only to firmware developers.
- Cause / Action:
Cause: This should not show up and Firmware
should have resolved this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4688
- Severity: FATAL
- Event Summary: The fPars component called external code to transfer
control and that code returned unexpectedly. This should never occur in
released software and indicates internal firmware corruption.
- Event Class: System
- Problem Description:
A transfer of control failed. There are two
uses of this event: one is to hand off BSP control to the vPars monitor when
DirectedfParResetAndMigrate() is performed. The monitor handler is invoked and
it returned, which should never happen. Firmware will halt in a spinloop in
this case, rather than reset the whole system. The second case is when an fPar
is launched into SequencerII to instantiate the per-fPar firmware interface.
The handoff should NOT return. So the CPU halts.
- Cause / Action:
Cause: This should not show up and Firmware
should have resolved this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4689
- Severity: FATAL
- Event Summary: During cold boot, firmware could not allocate
physical memory to back the IO tlb tables of an IOC. This should never occur
in released firmware.
- Event Class: System
- Problem Description:
FATAL sequencing event interesting only to
firmware developers which should never occur in released firmware.
- Cause / Action: Cause: This should not show up and Firmware should
have resolved this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4690
- Severity: FATAL
- Event Summary: Server contains one or more SBAs but firmware was
unable to initialize and construct the IOPDIR hardware and database for even a
single one of the IOCs so the system will be unbootable.
- Event Class: System
- Problem Description:
FATAL sequencing event interesting only to
firmware developers. Should never occur in released firmware. Could however,
be due to IOC hw failure but there will be other events emitted prior to this
event if that is the case.
- Cause / Action: Cause: This should not show up and Firmware should
have resolved this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4691
- Severity: FATAL
- Event Summary: Inconsistent state during launch of an fPar prevents
any kind of normal processing by the CPU. Therefore the processor is halted so
as not to cause further corruption.
- Event Class: System
- Problem Description:
FATAL sequencing event interesting only to
firmware developers. The event detail contains the instruction pointer to help
identify which occurrence of the event. (There are multiple places in the code
(select_boot_role.c) where this event is emitted)
- Cause / Action:
Cause: This should not show up and Firmware
should have resolved this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4692
- Severity: FATAL
- Event Summary: Firmware failed to set the value of an internal
database record (property) and this is a very CRITICAL error, which might be
fatal in nPars mode, but which we do not want to be fatal in fPars mode.
- Event Class: System
- Problem Description:
Very CRITICAL but do not want to make this
a fatal level.
- Cause / Action: Cause: Assignment to an fPars policy or
configuration variable failed. Action: Reconfigure the soft partitions and
reset the server.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4696
- Severity: FATAL
- Event Summary: Fpars is attempting to reset a cpu which is
currently holding its own cpu_safe_sm4
- Event Class: System
- Problem Description:
Invocation of the soft reset code from
an interrupt or similar vector, or improper locking/releasing of semaphores.
- Cause / Action: Cause: An error in the system firmware. Action:
Reset the nPar.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4697
- Severity: FATAL
- Event Summary: Fpars is resetting a cpu which holds FATAL
resources.
- Event Class: System
- Problem Description:
Data field is count of CPUs held by the sm4
which issues the event (and is trying to reset itself). Indicates invocation
of the SoftResetCpu code from an interrupt or similar vector where the
processor was modifying FATAL resources, or improper locking/releasing of
semaphores.
- Cause / Action: Cause: An error in the system firmware. Action:
Reset the nPar.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4698
- Severity: FATAL
- Event Summary: Fpar reset on cpu holding FATAL resources which were
not released.
- Event Class: System
- Problem Description:
A reset of an fPar discovered that an fPar
failed to release FATAL resources within a reasonable time.
- Cause / Action:
Cause: Fpar reset to an Fpar which failed to
release FATAL resources within a reasonable time. Action: No action is
required. The nPar will be rebooted.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4700
- Severity: MAJOR
- Event Summary: Server was configured to boot in fPars mode but no
fPars were enabled to boot. The server boots in hard partition mode to enable
configuration tools to be executed to repair the misconfiguration.
- Event Class: System
- Problem Description:
fPars-mode EFI variable is set to enable
fPars operation but no fPars are actually enabled to boot. So the server boots
in hard partition mode.
- Cause / Action: Cause: fPars-mode EFI variable is set to enable
fPars operation but no fPars are actually enabled to boot. So the server boots
in hard partition mode. This is a configuration usage problem, possibly a user
error such as might occur if a previously bootable config was made unbootable
by setting the enabled fPar existence variables, e.g., 'fPar0', 'fPar1', ...
'fParN' to 0x00 (disabled) and then rebooting Action: Use the configuration
tool released with the OS to disable fPars mode or to configure and enable
bootable fPars.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4701
- Severity: MAJOR
- Event Summary: System Software directed the migration reset of an
fPar towards an fPar that is not running OS software. The fPar will reboot in
fPars mode and will enter the vPar monitor failed launch error vector in vPars
mode
- Event Class: System
- Problem Description:
An fPar was directed to reset using the
ESIT DirectedfParResetAndMigrate() service but the target (recipient) fPar of
the resources is in a state that cannot receive ownership of these resources.
- Cause / Action: Cause: An fPar was directed to reset using the ESIT
DirectedfParResetAndMigrate() service but the target (recipient) fPar of the
resources is in a state that cannot receive ownership of these resources.
Action: Use the configuration tool released with the OS to correctly assign
these resources to an fPar and reset that fPar so they will be claimed.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4702
- Severity: MAJOR
- Event Summary: Firmware error setting the NVRAM value of
preferred-bsp for the vPars monitor. The partition will boot using the PD
monarch, but the value of preferred-bsp may not match this CPU.
- Event Class: System
- Problem Description:
Firmware error setting the NVRAM value of
preferred-bsp for the vPars monitor. The partition will boot using the PD
monarch, but the value of preferred-bsp may not match this CPU.
- Cause / Action:
Cause: Firmware could not store to the NVRAM
holding the EFI preferred-bsp variable Action: Reset the server, restore
factory default NVRAM and reconfigure soft partitions.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4703
- Severity: FATAL
- Event Summary: Internal firmware table indicates that the number of
CPUs in this config is unknown. Configuration and booting is therefore
impossible in this cell
- Event Class: System
- Problem Description:
The Platform parameters returned error for
the number of CPUs. Should not occur in released ROMS unless ROM corruption
has occurred.
- Cause / Action: Cause: This should not show up and Firmware should
have resolved this. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4704
- Severity: MAJOR
- Event Summary: Unable to write an XBC port ROUTE_TABLE_ENABLE_MASK
CSR.
- Event Class: System
- Problem Description:
Unable to write to an XBC port
ROUTE_TABLE_ENABLE_MASK CSR in Arches Fabric Phase3. Data field: (XBC port
<< 48) | (XBC ID < 32) | (route)
- Cause / Action: Cause: System firmware was unable to write a fabric
hardware register. Action: Capture IPMI event logs, look for additional
errors. Contact your HP Support Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4705
- Severity: FATAL
- Event Summary: An unrecoverable processor interrupt occurred.
- Event Class: System
- Problem Description:
An unrecoverable processor interrupt
occurred. IPF firmware owned the processor interrupt vector table (IVT) at the
time of the interrupt.
- Cause / Action: Cause: Probable bug in System Firmware, EFI driver,
EFI app or OS loader (prior to OS launch). The event data field contains the
IVT offset applicable to the interrupt. See table 5-7 (Interruption Vector
Table) in the Intel ASDM Volume 2. Action: Provide console log containing
register dump to HP Customer Engineer.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4708
- Severity: MAJOR
- Event Summary: During launch of an fPar (cold boot or fPar directed
reset), the BSP was unable to find memory necessary to instantiate the
firmware interface instance for its fPar. The fPar will be disabled and will
require reconfiguration and reset.
- Event Class: System
- Problem Description:
There is none (or inadequate) memory slice
resource assigned to the fPar so it cannot boot its instance. The fPar will be
disabled (enter a spinloop) because it cannot boot, or if in vPars mode will
return control to the monitor.
- Cause / Action: Cause: There is no Memory owned by the soft
partition (fPar) whose CPU emitted this event. The event detail contains
Action: Reconfigure the soft partition containing the CPU that emitted this
event and be sure to give memory resources to that soft partition before reset
of that partition. Until doing so, this soft partition will be unbootable and
emit this event on each attempt.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4709
- Severity: WARNING
- Event Summary: An alternate memory test size is being used.
- Event Class: System
- Problem Description:
An alternate memory test size is being used
for the memory selftest. The data field is the size of memory to be tested.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4710
- Severity: WARNING
- Event Summary: SCSI controller status has changed.
- Event Class: System
- Problem Description: SCSI controller status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the controller status of a SCSI Controller
Action: Check the SCSI
controller status. For additional information, verify System Windows Event Log
entry with Event ID as 1068 and Event Source as Storage Agent
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4711
- Severity: CRITICAL
- Event Summary: SCSI controller status has changed.
- Event Class: System
- Problem Description: SCSI controller status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the controller status of a SCSI Controller.
Action: Check the SCSI
controller status. For additional information, verify System Windows Event Log
entry with Event ID as 1068 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4712
- Severity: WARNING
- Event Summary: SCSI physical drive status has changed.
- Event Class: System
- Problem Description: SCSI physical drive status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a SCSI physical drive.
Action: Check the SCSI physical drive
status. For additional information, verify System Windows Event Log entry with
Event ID as 1070 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4713
- Severity: CRITICAL
- Event Summary: SCSI physical drive status has changed.
- Event Class: System
- Problem Description: SCSI physical drive status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a SCSI physical drive.
Action: Check the SCSI physical drive
status. For additional information, verify System Windows Event Log entry with
Event ID as 1070 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4714
- Severity: WARNING
- Event Summary: SCSI logical drive status has changed.
- Event Class: System
- Problem Description: SCSI logical drive status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a SCSI logical drive.
Action: Check the SCSI logical drive
status. For additional information, verify System Windows Event Log entry with
Event ID as 1069 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4715
- Severity: CRITICAL
- Event Summary: SCSI logical drive status has changed.
- Event Class: System
- Problem Description: SCSI logical drive status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a SCSI logical drive.
Action: Check the SCSI logical drive
status. For additional information, verify System Windows Event Log entry with
Event ID as 1069 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4716
- Severity: WARNING
- Event Summary: Drive Array backup controller became active.
- Event Class: System
- Problem Description: Drive Array backup controller became active.
- Cause / Action:
Cause: Storage Agent has detected that a backup
array controller in a duplexed pair has switched over to the active
role.
Action: Check the partner controller for problems. For additional
information, verify System Windows Event Log entry with Event ID as 1165 and
Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4717
- Severity: WARNING
- Event Summary: Drive Array controller status has changed.
- Event Class: System
- Problem Description: Drive Array controller status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a drive array controller.
Action: Check the Drive Array
controller status. For additional information, verify System Windows Event Log
entry with Event ID as 1199 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4718
- Severity: CRITICAL
- Event Summary: Drive Array controller status has changed.
- Event Class: System
- Problem Description: Drive Array controller status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a drive array controller.
Action: Check the Drive Array
controller status. For additional information, verify System Windows Event Log
entry with Event ID as 1199 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4719
- Severity: WARNING
- Event Summary: Drive Array logical drive status has changed.
- Event Class: System
- Problem Description: Drive Array logical drive status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a drive array logical drive.
Action: Check Drive Array
logical drive status. For additional information, verify System Windows Event
Log entry with Event ID as 1200 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4720
- Severity: CRITICAL
- Event Summary: Drive Array logical drive status has changed.
- Event Class: System
- Problem Description: Drive Array logical drive status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a drive array logical drive.
Action: Check Drive Array
logical drive status. For additional information, verify System Windows Event
Log entry with Event ID as 1200 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4721
- Severity: WARNING
- Event Summary: Drive Array spare drive status has changed.
- Event Class: System
- Problem Description: Drive Array spare drive status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a Drive Array spare drive.
Action: Check Drive Array spare
drive status. For additional information, verify System Windows Event Log
entry with Event ID as 1201 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4722
- Severity: CRITICAL
- Event Summary: Drive Array spare drive status has changed.
- Event Class: System
- Problem Description: Drive Array spare drive status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a Drive Array spare drive.
Action: Check Drive Array spare
drive status. For additional information, verify System Windows Event Log
entry with Event ID as 1201 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4723
- Severity: WARNING
- Event Summary: Drive Array physical drive status has changed.
- Event Class: System
- Problem Description: Drive Array physical drive status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a Drive Array physical drive.
Action: Check Drive Array
physical drive status. For additional information, verify System Windows Event
Log entry with Event ID as 1202 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4724
- Severity: CRITICAL
- Event Summary: Drive Array physical drive status has changed.
- Event Class: System
- Problem Description: Drive Array physical drive status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a Drive Array physical drive.
Action: Check Drive Array
physical drive status. For additional information, verify System Windows Event
Log entry with Event ID as 1202 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4725
- Severity: WARNING
- Event Summary: Drive Array physical drive threshold has exceeded.
- Event Class: System
- Problem Description: Drive Array physical drive threshold has
exceeded.
- Cause / Action:
Cause: Storage Agent has detected a factory
threshold associated with one of the physical drive objects on a Drive Array
has been exceeded.
Action: Check Drive Array physical drive threshold. For
additional information, verify System Windows Event Log entry with Event ID as
1203 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4726
- Severity: WARNING
- Event Summary: Drive Array accelerator status has changed.
- Event Class: System
- Problem Description: Drive Array accelerator status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of an array accelerator cache board.
Action: Check the Drive
Array accelerator board status. For additional information, verify System
Windows Event Log entry with Event ID as 1204 and Event Source as Storage
Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4727
- Severity: CRITICAL
- Event Summary: Drive Array accelerator status has changed.
- Event Class: System
- Problem Description: Drive Array accelerator status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of an array accelerator cache board.
Action: Check the Drive
Array accelerator board status. For additional information, verify System
Windows Event Log entry with Event ID as 1204 and Event Source as Storage
Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4728
- Severity: WARNING
- Event Summary: Drive Array accelerator reported bad data.
- Event Class: System
- Problem Description: Drive Array accelerator reported bad data.
- Cause / Action:
Cause: Storage Agent has detected an array
accelerator cache board that has lost battery power. If data was being stored
in the accelerator cache memory when the server lost power, that data has been
lost.
Action: Verify that no data has been lost. For additional
information, verify System Windows Event Log entry with Event ID as 1205 and
Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4729
- Severity: WARNING
- Event Summary: Drive Array accelerator battery failed.
- Event Class: System
- Problem Description: Drive Array accelerator battery failed.
- Cause / Action:
Cause: Storage Agent has detected a battery
failure associated with the array accelerator cache board.
Action: Replace
the Drive Array accelerator cache board. For additional information, verify
System Windows Event Log entry with Event ID as 1206 and Event Source as
Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4730
- Severity: WARNING
- Event Summary: External Array spare drive status has changed.
- Event Class: System
- Problem Description: External Array spare drive status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a External Channel Array spare drive.
Action: If the spare
drive status is failed, replace the drive. For additional information, verify
System Windows Event Log entry with Event ID as 1147 and Event Source as
Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4731
- Severity: CRITICAL
- Event Summary: External Array spare drive status has changed.
- Event Class: System
- Problem Description: External Array spare drive status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a External Channel Array spare drive.
Action: If the spare
drive status is failed, replace the drive. For additional information, verify
System Windows Event Log entry with Event ID as 1147 and Event Source as
Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4732
- Severity: WARNING
- Event Summary: External Array controller became active.
- Event Class: System
- Problem Description: External Array controller became active.
- Cause / Action:
Cause: External Array controller became
active.
Action: Check the partner controller for problems. For additional
information, verify System Windows Event Log entry with Event ID as 1179 and
Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4733
- Severity: WARNING
- Event Summary: External Array physical drive status has changed.
- Event Class: System
- Problem Description: External Array physical drive status has
changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a physical drive.
Action: Check External Array physical drive
status. For additional information, verify System Windows Event Log entry with
Event ID as 1146 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4734
- Severity: CRITICAL
- Event Summary: External Array physical drive status has changed.
- Event Class: System
- Problem Description: External Array physical drive status has
changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a physical drive.
Action: Check External Array physical drive
status. For additional information, verify System Windows Event Log entry with
Event ID as 1146 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4735
- Severity: WARNING
- Event Summary: External Array accelerator status has changed.
- Event Class: System
- Problem Description: External Array accelerator status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of External Array accelerator.
Action: Check the External Array
accelerator status. For additional information, verify System Windows Event
Log entry with Event ID as 1148 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4736
- Severity: CRITICAL
- Event Summary: External Array accelerator status has changed.
- Event Class: System
- Problem Description: External Array accelerator status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of External Array accelerator.
Action: Check the External Array
accelerator status. For additional information, verify System Windows Event
Log entry with Event ID as 1148 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4737
- Severity: CRITICAL
- Event Summary: External Array accelerator reported bad data.
- Event Class: System
- Problem Description: External Array accelerator reported bad data.
- Cause / Action:
Cause: Storage Agent has detected an Array
Accelerator Cache Board that has lost battery power. If data was being stored
in the accelerator memory when the system lost power, that data has been
lost.
Action: Verify that no data has been lost. For additional
information, verify System Windows Event Log entry with Event ID as 1149 and
Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4738
- Severity: CRITICAL
- Event Summary: External Array accelerator battery has failed.
- Event Class: System
- Problem Description: External Array accelerator battery has failed.
- Cause / Action:
Cause: Storage Agent has detected a battery
failure associated with the Array accelerator cache board.
Action: Replace
the Accelerator Cache Board. For additional information, verify System Windows
Event Log entry with Event ID as 1150 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4739
- Severity: WARNING
- Event Summary: External Array controller status has changed.
- Event Class: System
- Problem Description: External Array controller status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of an External Array Controller.
Action: Check the External
Array controller status. For additional information, verify System Windows
Event Log entry with Event ID as 1151 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4740
- Severity: CRITICAL
- Event Summary: External Array controller status has changed.
- Event Class: System
- Problem Description: External Array controller status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of an External Array Controller.
Action: Check the External
Array controller status. For additional information, verify System Windows
Event Log entry with Event ID as 1151 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4741
- Severity: WARNING
- Event Summary: External Array logical drive status has changed.
- Event Class: System
- Problem Description: External Array logical drive status has
changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of an External Array logical drive.
Action: Check the External
Array logical drive status. For additional information, verify System Windows
Event Log entry with Event ID as 1145 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4742
- Severity: CRITICAL
- Event Summary: External Array logical drive status has changed.
- Event Class: System
- Problem Description: External Array logical drive status has
changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of an External Array logical drive.
Action: Check the External
Array logical drive status. For additional information, verify System Windows
Event Log entry with Event ID as 1145 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4743
- Severity: WARNING
- Event Summary: Fiber Channel controller status has changed.
- Event Class: System
- Problem Description: Fiber Channel controller status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a Fiber Channel Host Controller.
Action: Check the Fiber
Channel controller status. For additional information, verify System Windows
Event Log entry with Event ID as 1215 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4744
- Severity: CRITICAL
- Event Summary: Fiber Channel controller status has changed.
- Event Class: System
- Problem Description: Fiber Channel controller status has changed.
- Cause / Action:
Cause: Storage Agent has detected a change in
the status of a Fiber Channel Host Controller.
Action: Check the Fiber
Channel controller status. For additional information, verify System Windows
Event Log entry with Event ID as 1215 and Event Source as Storage Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4745
- Severity: WARNING
- Event Summary: The cluster service has degraded.
- Event Class: System
- Problem Description: The cluster service has degraded
- Cause / Action:
Cause: Cluster Agent has detected the cluster
node degraded status.
Action: Check the cluster node for the cause of
degraded status. For additional information, verify System Windows Event Log
entry with Event ID as 1171 and Event Source as Cluster Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4746
- Severity: CRITICAL
- Event Summary: The cluster service has failed.
- Event Class: System
- Problem Description: The cluster service has failed.
- Cause / Action:
Cause: Cluster Agent has detected the cluster
node failed status.
Action: Check the cluster node for the cause of failed
status. For additional information, verify System Windows Event Log entry with
Event ID as 1172 and Event Source as Cluster Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4747
- Severity: WARNING
- Event Summary: The cluster resource has degraded.
- Event Class: System
- Problem Description: The cluster resource has degraded.
- Cause / Action:
Cause: Cluster Agent has detected the cluster
resource degraded status.
Action: Check the cluster resource for the cause
of degraded status. For additional information, verify System Windows Event
Log entry with Event ID as 1167 and Event Source as Cluster Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4748
- Severity: CRITICAL
- Event Summary: The cluster resource has failed.
- Event Class: System
- Problem Description: The cluster resource has failed.
- Cause / Action:
Cause: Cluster Agent has detected the cluster
resource failed status.
Action: Check the cluster resource for the cause of
failed status. For additional information, verify System Windows Event Log
entry with Event ID as 1168 and Event Source as Cluster Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4749
- Severity: WARNING
- Event Summary: The cluster network has degraded.
- Event Class: System
- Problem Description: The cluster network has degraded.
- Cause / Action:
Cause: Cluster Agent has detected the cluster
network degraded status.
Action: Check the cluster network for the cause of
degraded status. For additional information, verify System Windows Event Log
entry with Event ID as 1169 and Event Source as Cluster Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4750
- Severity: CRITICAL
- Event Summary: The cluster network has failed.
- Event Class: System
- Problem Description: The cluster network has failed.
- Cause / Action:
Cause: Cluster Agent has detected the cluster
network failed status.
Action: Check the cluster network for the cause of
failed status. For additional information, verify System Windows Event Log
entry with Event ID as 1170 and Event Source as Cluster Agent.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4751
- Severity: WARNING
- Event Summary: An error was encountered when enabling or disabling
threads.
- Event Class: System
- Problem Description:
Unable to change the multi-thread enabled
status on all CPUs.
- Cause / Action:
Cause: Unable to change the multi-thread
enabled status on all CPUs. This occurs if the PAL call which enables or
disable multi-thread capabilities fails or if there is a broken CPU in the
system and the call cannot be attempted. Action: Verify that the CPUs support
multithreading PAL calls and replace any broken CPUs.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4752
- Severity: CRITICAL
- Event Summary: generic FATAL alert level error event for system
firmware
- Event Class: System
- Problem Description:
Generic event whose data field is usually
the firmware instruction pointer at the location the event was emitted
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4753
- Severity: MAJOR
- Event Summary: Generic WARNING level event for Banyan firmware
- Event Class: System
- Problem Description:
Generic warning event with a data field
that is usually the instruction pointer of the emitting location in code
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4754
- Severity: FATAL
- Event Summary: core firmware physical memory heap failure
- Event Class: System
- Problem Description:
core physical memory heap failure
- Cause / Action:
Cause: FW should fix this bug if seen. Action:
-
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4755
- Severity: FATAL
- Event Summary: Unexpected internal fPars configuration database
error
- Event Class: System
- Problem Description:
This generic event is emitted in many
places in the firmware when the code discovers the internal configuration
database to be inconsistent. fPars configuration probably will not function
without a reboot. Could be a firmware bug or it could be that system software
corrupted (wrote over) firmware data.
- Cause / Action:
Cause: Nothing. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4756
- Severity: FATAL
- Event Summary: When firmware attempts to create a new (empty) fPar,
it must create a number of private NVRAM variables. This event indicates the
system is out of NVRAM and cannot do so. The fPar will not be created.
- Event Class: System
- Problem Description:
Firmware has run out of NVRAM and cannot
create the needed variable(s)
- Cause / Action:
Cause: NVRAM is too full for operation. Action:
Delete some EFI variables in existing fPars or boot in nPars mode and delete
some variables. (E.g., too many boot paths in the boot manager can cause
this). Clear NVRAM and reconfigure system.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4757
- Severity: FATAL
- Event Summary: During reset of a soft partition, the LBA was
re-initialized and this operation failed. The soft partition may continue to
boot, but the IO device is probably unusable. This is probably a hardware
failure.
- Event Class: System
- Problem Description:
The call to the IOC firmware component
method "LbaReconfig" failed, probably a low-level hardware error. The fPar BSP
is identified in bits 63..32 of the event detail, the fPar owning this CPU and
LBA is identified in bits 31..16 and the number of the Rope is identified in
bits 15..0.
- Cause / Action:
Cause: Low level IO hw initialization of an LBA
by the firmware failed during an fPar reset. Action: Try booting nPars mode.
Power cycle the hardware. replace the LBA
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4758
- Severity: FATAL
- Event Summary: an error calling GetProp() that should not happen
but don't want to raise to alert level 7
- Event Class: System
- Problem Description:
Intended to be inserted by firmware
developer during debug/development to indicate location in code where
retrieval
of a firmware database datum (property) failed
- Cause / Action:
Cause: FW should fix this bug if seen. Action:
-
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4759
- Severity: MAJOR
- Event Summary: fPar Instance IO reset experienced CRITICAL error,
but firmware will continue to try to boot to Shell anyway.
- Event Class: System
- Problem Description:
If ClaimfParIoResources() returned an
error, this event is emitted by the BSP. The event detail contains the CPU id
of the calling CPU (BSP) in bits 63..31 and the fPar id of the fPar being
reset in bits 15..0
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4760
- Severity: CRITICAL
- Event Summary: SBA Init Node method property error
- Event Class: System
- Problem Description:
SBA InitNode get property error
- Cause / Action:
Cause: GetProperty failure in SBA InitNode
method Action: SBA InitNode failure
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4761
- Severity: CRITICAL
- Event Summary: No root node found in SBA InitNode
- Event Class: System
- Problem Description:
SBA InitNode root node error
- Cause / Action:
Cause: SBA InitNode cannot find root node
Action: SBA InitNode failure
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4762
- Severity: CRITICAL
- Event Summary: SBA InitNode method find node error
- Event Class: System
- Problem Description:
SBA InitNode find node error
- Cause / Action:
Cause: SBA InitNode method cannot find a
required device tree node Action: error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4763
- Severity: CRITICAL
- Event Summary: SBA LBA Configure method call failure
- Event Class: System
- Problem Description:
SBA component LBA Configure method call
failure
- Cause / Action:
Cause: SBA component call to LBA Configure
method failed Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4764
- Severity: CRITICAL
- Event Summary: SBA call to LBA Configure method fails in MCA path
- Event Class: System
- Problem Description:
SBA call to LBA Configure method fails in
MCA path.
- Cause / Action: Cause: SBA call to LBA Configure method fails in
MCA path. Action: Error exit.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4765
- Severity: CRITICAL
- Event Summary: SBA call to LBA PciScan method fails.
- Event Class: System
- Problem Description:
SBA call to LBA PciScan method fails.
- Cause / Action:
Cause: SBA call to LBA PciScan method fails.
Action: Error exit.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4766
- Severity: CRITICAL
- Event Summary: SBA call to LBA SetDeviceMask method fails.
- Event Class: System
- Problem Description:
SBA call to LBA SetDeviceMask method fails.
- Cause / Action:
Cause: SBA call to LBA SetDeviceMask method
fails. Action: Disable rope in non-mca path, error exit in MCA path.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4767
- Severity: CRITICAL
- Event Summary: Attempt to determine I/O backplane type fails.
- Event Class: System
- Problem Description:
SBA attempt to determine I/O backplane type
fails.
- Cause / Action: Cause: SBA attempt to determine I/O backplane type
fails. Action: Error exit.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4768
- Severity: CRITICAL
- Event Summary: SBA unsupported function ID.
- Event Class: System
- Problem Description:
SBA unsupported function ID
- Cause / Action:
Cause: SBA component unsupported function ID
Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4769
- Severity: CRITICAL
- Event Summary: SBA component unsupported class code
- Event Class: System
- Problem Description:
SBA component unsupported class code.
- Cause / Action:
Cause: SBA component unsupported class code
Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4770
- Severity: CRITICAL
- Event Summary: SBA component unsupported revision
- Event Class: System
- Problem Description:
SBA component unsupported revision error
- Cause / Action:
Cause: SBA component unsupported revision
Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4771
- Severity: CRITICAL
- Event Summary: SBA component unsupported module ID
- Event Class: System
- Problem Description:
SBA component unsupported module ID
- Cause / Action:
Cause: SBA component unsupported module ID
Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4772
- Severity: CRITICAL
- Event Summary: SBA link configuration error
- Event Class: System
- Problem Description:
SBA link configuration error
- Cause / Action:
Cause: SBA link configuration error Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4773
- Severity: CRITICAL
- Event Summary: SBA error configuration error
- Event Class: System
- Problem Description:
SBA error configuration error
- Cause / Action:
Cause: SBA error configuration error Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4774
- Severity: CRITICAL
- Event Summary: SBA map sba config register space fails
- Event Class: System
- Problem Description:
SBA map in sba config register space fails
- Cause / Action:
Cause: SBA map in sba config register space
fails Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4775
- Severity: CRITICAL
- Event Summary: SBA map in lba config register space fails
- Event Class: System
- Problem Description:
SBA map in lba config register space fails
- Cause / Action:
Cause: SBA map in lba config register space
fails Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4776
- Severity: CRITICAL
- Event Summary: SBA initialize cacheline size error
- Event Class: System
- Problem Description:
SBA initialize cacheline size error
- Cause / Action:
Cause: SBA initialize cacheline size error
Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4777
- Severity: CRITICAL
- Event Summary: SBA link error clear error
- Event Class: System
- Problem Description:
SBA link error clear error
- Cause / Action:
Cause: SBA link error clear error Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4778
- Severity: CRITICAL
- Event Summary: SBA cache initialization error
- Event Class: System
- Problem Description:
SBA cache initialization error
- Cause / Action:
Cause: SBA cache initialization error Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4779
- Severity: CRITICAL
- Event Summary: SBA initialize hardfail mode error
- Event Class: System
- Problem Description:
SBA initialize hardfail mode error
- Cause / Action:
Cause: SBA initialize hardfail mode error
Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4780
- Severity: CRITICAL
- Event Summary: SBA initialize timeout error
- Event Class: System
- Problem Description:
SBA initialize timeout counters error
- Cause / Action:
Cause: SBA initialize timeout counters error
Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4781
- Severity: CRITICAL
- Event Summary: SBA rope unit error config error
- Event Class: System
- Problem Description:
SBA rope unit error configuration error
- Cause / Action:
Cause: SBA rope unit error configuration error
Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4782
- Severity: CRITICAL
- Event Summary: SBA rope unit error clear error
- Event Class: System
- Problem Description:
SBA rope unit error clear error
- Cause / Action:
Cause: SBA rope unit error clear error Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4783
- Severity: CRITICAL
- Event Summary: SBA rope bundling error
- Event Class: System
- Problem Description:
SBA rope bundling error
- Cause / Action:
Cause: SBA rope bundling error Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4784
- Severity: CRITICAL
- Event Summary: SBA found no valid LBAs during configuration
- Event Class: System
- Problem Description:
SBA found no valid LBAs during
configuration
- Cause / Action:
Cause: SBA found no valid LBAs during
configuration Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4785
- Severity: CRITICAL
- Event Summary: SBA rope width configuration error
- Event Class: System
- Problem Description:
SBA rope width configuration error
- Cause / Action:
Cause: SBA rope width configuration error
Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4786
- Severity: CRITICAL
- Event Summary: SBA rope link error configuration error
- Event Class: System
- Problem Description:
SBA rope link error configuration error
- Cause / Action:
Cause: SBA rope link error configuration error
Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4787
- Severity: CRITICAL
- Event Summary: SBA rope error clear error
- Event Class: System
- Problem Description:
SBA rope error clear error
- Cause / Action:
Cause: SBA rope error clear error Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4788
- Severity: CRITICAL
- Event Summary: SBA disable directed range error
- Event Class: System
- Problem Description:
SBA disable directed range error
- Cause / Action:
Cause: SBA disable directed range error Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4789
- Severity: CRITICAL
- Event Summary: SBA MMIO config error
- Event Class: System
- Problem Description:
SBA MMIO configuration error
- Cause / Action:
Cause: SBA MMIO configuration error Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4790
- Severity: CRITICAL
- Event Summary: SBA IO configuration error
- Event Class: System
- Problem Description:
SBA IO configuration error
- Cause / Action:
Cause: SBA IO configuration error Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4791
- Severity: CRITICAL
- Event Summary: SBA initialize CSR error
- Event Class: System
- Problem Description:
SBA initialize CSR error
- Cause / Action:
Cause: SBA initialize CSR error Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4792
- Severity: CRITICAL
- Event Summary: SBA PCI bus scan operation ran out of PCI bus range
- Event Class: System
- Problem Description:
SBA PCI bus range error
- Cause / Action:
Cause: SBA PCI bus range error Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4793
- Severity: CRITICAL
- Event Summary: SBA call to LBA DisableArb fails
- Event Class: System
- Problem Description:
SBA call to LBA DisableArb fails
- Cause / Action:
Cause: SBA call to LBA DisableArb fails Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4794
- Severity: CRITICAL
- Event Summary: SBA rope phase set error
- Event Class: System
- Problem Description:
SBA rope phase set error
- Cause / Action:
Cause: SBA rope phase set error Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4795
- Severity: CRITICAL
- Event Summary: SBA rope align error
- Event Class: System
- Problem Description:
SBA rope align error
- Cause / Action:
Cause: SBA rope align error Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4796
- Severity: CRITICAL
- Event Summary: SBA rope phase compare error
- Event Class: System
- Problem Description:
SBA rope phase compare error
- Cause / Action:
Cause: SBA rope phase compare error Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4797
- Severity: CRITICAL
- Event Summary: LBA platform data error
- Event Class: System
- Problem Description:
LBA platform data extraction error
- Cause / Action:
Cause: LBA platform data extraction error
Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4798
- Severity: CRITICAL
- Event Summary: LBA get property error failure
- Event Class: System
- Problem Description:
LBA get property error
- Cause / Action:
Cause: LBA get property error Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4799
- Severity: CRITICAL
- Event Summary: LBA attempt to access unavailable error
- Event Class: System
- Problem Description:
LBA attempt to access unavailable lba
- Cause / Action:
Cause: LBA attempt to access unavailable lba
Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4800
- Severity: CRITICAL
- Event Summary: LBA set request depth error
- Event Class: System
- Problem Description:
LBA set request depth error
- Cause / Action:
Cause: LBA set request depth error Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4801
- Severity: CRITICAL
- Event Summary: LBA set RPCE error
- Event Class: System
- Problem Description:
LBA set RPCE error
- Cause / Action:
Cause: LBA set RPCE error Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4802
- Severity: CRITICAL
- Event Summary: LBA rope width parameter error
- Event Class: System
- Problem Description:
LBA rope with parameter error
- Cause / Action:
Cause: LBA rope width parameter error Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4803
- Severity: CRITICAL
- Event Summary: LBA invalid CLASS error
- Event Class: System
- Problem Description:
LBA invalid CLASS error
- Cause / Action:
Cause: LBA invalid CLASS error Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4804
- Severity: CRITICAL
- Event Summary: LBA set bus frequency error
- Event Class: System
- Problem Description:
LBA set bus frequency error
- Cause / Action:
Cause: LBA set bus frequency error Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4805
- Severity: CRITICAL
- Event Summary: LBA slot power on failure
- Event Class: System
- Problem Description:
LBA slot power on failure
- Cause / Action:
Cause: LBA slot power on failure Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4806
- Severity: CRITICAL
- Event Summary: LBA verify DLL lock failure
- Event Class: System
- Problem Description:
LBA verify DLL lock error
- Cause / Action:
Cause: LBA verify DLL lock error Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4807
- Severity: CRITICAL
- Event Summary: LBA check bus capability error
- Event Class: System
- Problem Description:
LBA check bus capability error
- Cause / Action:
Cause: LBA check bus capability error Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4808
- Severity: CRITICAL
- Event Summary: LBA set SLT error
- Event Class: System
- Problem Description:
LBA set SLT error
- Cause / Action:
Cause: LBA set SLT error Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4809
- Severity: CRITICAL
- Event Summary: LBA hint configuration error
- Event Class: System
- Problem Description:
LBA hint configuration error
- Cause / Action:
Cause: LBA hint configuration error Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4810
- Severity: CRITICAL
- Event Summary: LBA configure PCI error failed
- Event Class: System
- Problem Description:
LBA PCI error configuration fails
- Cause / Action:
Cause: LBA PCI error configuration fails
Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4811
- Severity: CRITICAL
- Event Summary: LBA PCIX error configuration fails
- Event Class: System
- Problem Description:
LBA PCIX error configuration fails
- Cause / Action:
Cause: LBA PCIX error configuration fails Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4812
- Severity: CRITICAL
- Event Summary: LBA general error configuration fails
- Event Class: System
- Problem Description:
LBA general error configuration failure
- Cause / Action:
Cause: LBA general error configuration failure
Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4813
- Severity: CRITICAL
- Event Summary: LBA release reset error
- Event Class: System
- Problem Description:
LBA release reset error
- Cause / Action:
Cause: LBA release reset error Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4814
- Severity: CRITICAL
- Event Summary: LBA enable arbitration error
- Event Class: System
- Problem Description:
LBA enable arbitration error
- Cause / Action:
Cause: LBA enable arbitration error Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4815
- Severity: CRITICAL
- Event Summary: LBA create node error
- Event Class: System
- Problem Description:
LBA create node error
- Cause / Action:
Cause: LBA create node error Action: Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4816
- Severity: CRITICAL
- Event Summary: LBA attach service error
- Event Class: System
- Problem Description:
LBA attach service error
- Cause / Action:
Cause: LBA attach service error Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4817
- Severity: CRITICAL
- Event Summary: LBA set property error
- Event Class: System
- Problem Description:
LBA set property error
- Cause / Action:
Cause: LBA set property error Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4818
- Severity: CRITICAL
- Event Summary: LBA InitNode method call failed
- Event Class: System
- Problem Description:
LBA InitNode call failed
- Cause / Action:
Cause: LBA InitNode call failed Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4819
- Severity: CRITICAL
- Event Summary: LBA set hardfail error
- Event Class: System
- Problem Description:
LBA set hardfail error
- Cause / Action:
Cause: LBA set hardfail error Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4820
- Severity: CRITICAL
- Event Summary: LBA slot device e scan error
- Event Class: System
- Problem Description:
LBA slot device scan error
- Cause / Action:
Cause: LBA slot drawing power but no device
found during scan Action: Power off slot
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4821
- Severity: CRITICAL
- Event Summary: LBA PCI bus map failure
- Event Class: System
- Problem Description:
LBA PCI bus map call failure
- Cause / Action:
Cause: LBA PCI bus map call failure Action:
Ignore device
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4822
- Severity: CRITICAL
- Event Summary: LBA set bus number failure
- Event Class: System
- Problem Description:
LBA set bus number failure
- Cause / Action:
Cause: LBA set bus number failure Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4823
- Severity: CRITICAL
- Event Summary: LBA set bus error mode failure
- Event Class: System
- Problem Description:
LBA set bus error mode failure
- Cause / Action:
Cause: LBA set bus error mode failure Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4824
- Severity: CRITICAL
- Event Summary: LBA acquire semaphore error
- Event Class: System
- Problem Description:
LBA acquire semaphore error
- Cause / Action:
Cause: LBA acquire semaphore error Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4825
- Severity: INFORMATION
- Event Summary: LBA acquire AML semaphore error
- Event Class: System
- Problem Description:
LBA acquire AML semaphore error
- Cause / Action:
Cause: LBA acquire AML semaphore error Action:
Error exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4826
- Severity: CRITICAL
- Event Summary: SBA get property error
- Event Class: System
- Problem Description:
SBA get property error
- Cause / Action:
Cause: SBA get property error Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4827
- Severity: CRITICAL
- Event Summary: SBA set property error
- Event Class: System
- Problem Description:
SBA set property error
- Cause / Action:
Cause: SBA set property error Action: Error
exit
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4828
- Severity: CRITICAL
- Event Summary: SCSI host adapter failed
- Event Class: System
- Problem Description: SCSI host adapter failed
- Cause / Action:
Cause: One of the host adapters which were
installed in the server can no longer be detected by the Symbios SCSI agent.
This may be the cause of host adapter failure or by a change in the
configuration of the server.
Action: Make sure adapter card is seated
correctly. If there is a configuration or a sever hardware failure, call the
system administrator.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4829
- Severity: CRITICAL
- Event Summary: SCSI device failed
- Event Class: System
- Problem Description: SCSI device failed
- Cause / Action:
Cause: One of the devices which was installed
in the server can no longer be detected by the Symbios SCSI agent. This
warning may be caused by a failure of the device, by a power or cable problem,
or by a change in the configuration of the server.
Action: Run diagnostics
on all SCSI devices. Call the system administrator for additional assistance
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4830
- Severity: CRITICAL
- Event Summary: SCSI SMART predictive failure detected
- Event Class: System
- Problem Description: SCSI SMART predictive failure detected
- Cause / Action:
Cause: This alarm is generated when Symbios
SCSI agent detects a predictive failure condition for a device. This message
indicates that the device should be replaced before it actually
fails.
Action: The user must copy down the sense info description for
warranty use. Be sure to back-up the information on a drive before replacing
it. For additional help call the system administrator
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4835
- Severity: CRITICAL
- Event Summary: The cell info area failed the checksum test prior to
updating the fabric data
- Event Class: System
- Problem Description:
The cell info area failed the checksum test
prior to updating the fabric data
- Cause / Action: Cause: An error occurred while updating the fabric
portion of cell info. The partition cannot boot. Action: Reboot partition,
Gather IPMI Event Logs, Contact your HP Support Representative
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4836
- Severity: CRITICAL
- Event Summary: An error occurred while updating the cell info
checksum
- Event Class: System
- Problem Description:
An error occurred while updating the cell
info checksum
- Cause / Action: Cause: An error occurred while updating the fabric
portion of cell info. The partition cannot boot. Action: Reboot partition,
Gather IPMI Event Logs, Contact your HP Support Representative
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4837
- Severity: CRITICAL
- Event Summary: System firmware could not obtain the micro semaphore
- Event Class: System
- Problem Description:
System firmware could not obtain the micro
semaphore
- Cause / Action: Cause: An error occurred while updating the fabric
portion of cell info. The partition cannot boot. Action: Reboot partition,
Gather IPMI Event Logs, Contact your HP Support Representative
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4838
- Severity: MAJOR
- Event Summary: The ARF Vertex Module returned an unexpected
failure.
- Event Class: System
- Problem Description:
VM_CollectVertices failed unexpectedly.
Could possibly be a data corruption problem. Data field: (cell number <<
56) | (return value from VM_CollectVertices)
- Cause / Action:
Cause: An error occurred while routing the
fabric, possible data corruption or firmware error. Action: Contact your HP
support representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4839
- Severity: MAJOR
- Event Summary: Unable to generate an ALB address in Arches Fabric
ArfSetDefaultCSRs() function.
- Event Class: System
- Problem Description:
Unable to generate an ALB address in Arches
Fabric ArfSetDefaultCSRs() function. Data field:(cell number << 56) |
(port number << 48) | (xbc ID < 32) | (return value)
- Cause / Action:
Cause: An error occurred while routing the
fabric, possible data corruption or firmware error. Action: Contact your HP
support representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4840
- Severity: MAJOR
- Event Summary: SetDefaultCSRs failed in call from Arches Fabric
Phase 4
- Event Class: System
- Problem Description:
SetDefaultCSRs function failed
unexpectedly. Arches Fabric was unable to set XBC CSRs to default values. Data
field: return value from SetDefaultCSRs
- Cause / Action:
Cause: An error occurred while routing the
fabric Action: Collect IPMI event logs, and contact your HP support
representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4841
- Severity: MAJOR
- Event Summary: Windows IML: Disk storage system battery low
- Event Class: System
- Problem Description: Disk storage system battery power is low.
- Cause / Action:
Cause: Battery power is low.
Action: Check
batteries.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4842
- Severity: MAJOR
- Event Summary: Windows IML: Disk storage system AC Voltage problem
power supply
- Event Class: System
- Problem Description: Disk storage system as problems with A/C line
- Cause / Action:
Cause: The A/C line voltage may be under or
over specification. The A/C power may have gone off.
Action: Check A/C
power line
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4848
- Severity: MAJOR
- Event Summary: failed to delete a bad port while routing the fabric
- Event Class: System
- Problem Description:
Failed to delete a bad port while routing
the fabric. Data field indicates return status
- Cause / Action:
Cause: Experienced an error while routing the
fabric. Action: Contact your HP Support Representative to investigate the
fabric and firmware vertex modules.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4849
- Severity: MAJOR
- Event Summary: failed to delete an edge that was found to be bad
during routing
- Event Class: System
- Problem Description:
failed to delete an edge that was found to
be bad during routing
- Cause / Action:
Cause: Experienced an error while routing the
fabric. Action: Contact your HP Support Representative to investigate the
fabric and firmware vertex modules.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4850
- Severity: MAJOR
- Event Summary: An error occurred while getting the XBC Semaphore
address
- Event Class: System
- Problem Description:
An error occurred while forming the XBC
semaphore address
- Cause / Action:
Cause: Couldn't determine the proper address of
the XBC semaphore Action: Save IPMI event codes, contact HP Support
Representative to analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4851
- Severity: MAJOR
- Event Summary: Error reading the XBC Global Semaphore
- Event Class: System
- Problem Description:
Error reading the XBC Global Semaphore
- Cause / Action: Cause: Couldn't read the XBC semaphore Action: Look
for additional IPMI event logs, contact your HP support representative to
analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4852
- Severity: MAJOR
- Event Summary: Failure to get the XBC Global Semaphore address
- Event Class: System
- Problem Description:
Failure to get the XBC Global Semaphore
address
- Cause / Action:
Cause: Failed to get a fabric address Action:
Look for additional IPMI event logs. Contact your HP Support Representative to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4853
- Severity: MAJOR
- Event Summary: Failed to write the XBC Global Semaphore
- Event Class: System
- Problem Description:
Failed to write the XBC Global Semaphore
- Cause / Action:
Cause: write to XBC failed Action: Contact HP
Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4854
- Severity: MAJOR
- Event Summary: Failed to read the XBC Global Semaphore
- Event Class: System
- Problem Description:
Failed to read the XBC Global Semaphore
- Cause / Action:
Cause: System Firmware was unable to release a
XBC semaphore Action: Collect IPMI Event Logs. Contact HP Support
Representative to investigate the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4855
- Severity: MAJOR
- Event Summary: Failed to get the address of the XBC Global
Semaphore
- Event Class: System
- Problem Description:
Failed to get the address of the XBC Global
Semaphore
- Cause / Action:
Cause: Failed to get a fabric address Action:
Look for additional IPMI event logs. Contact your HP Support Representative to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4856
- Severity: MAJOR
- Event Summary: Failed to write the XBC Global Semaphore
- Event Class: System
- Problem Description:
Failed to write the XBC Global Semaphore
- Cause / Action:
Cause: System Firmware was unable to release a
XBC semaphore Action: Collect IPMI Event Logs. Contact HP Support
Representative to investigate the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4857
- Severity: MAJOR
- Event Summary: Failed to release the XBC Global Semaphore
- Event Class: System
- Problem Description:
Failed to release the XBC Global Semaphore
- Cause / Action:
Cause: System Firmware was unable to release a
XBC semaphore Action: Collect IPMI Event Logs. Contact HP Support
Representative to investigate the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4858
- Severity: MAJOR
- Event Summary: Fabric phases have been executed in an invalid order
- Event Class: System
- Problem Description:
Fabric phases have been executed in an
invalid order
- Cause / Action:
Cause: System firmware failed a validation
check on its routing progress Action: Collect all IPMI event logs, preferably
live logs. Contact you HP Support Representative
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4859
- Severity: MAJOR
- Event Summary: Fabric phases have been executed in an invalid order
- Event Class: System
- Problem Description:
Fabric phases have been executed in an
invalid order. Data field indicates the expected phase.
- Cause / Action:
Cause: System firmware failed a validation
check on its routing progress Action: Collect all IPMI event logs, preferably
live logs. Contact you HP Support Representative
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4860
- Severity: MAJOR
- Event Summary: Couldn't collect vertex information during Fabric
Info call
- Event Class: System
- Problem Description:
Couldn't collect vertex information during
Fabric Info call. Data Field: return status
- Cause / Action:
Cause: There was a problem while collecting a
list of fabric link info. Action: Look for additional IPMI event logs. Contact
your HP Support Representative to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4861
- Severity: MAJOR
- Event Summary: Couldn't collect the vertices during the Fabric Info
call
- Event Class: System
- Problem Description:
Couldn't collect the vertices during the
Fabric Info call. Data Field: number of vertices returned
- Cause / Action:
Cause: An invalid number of fabric vertices was
found while collecting fabric information. Action: Fabric information cannot
be reported properly. Collect IPMI Event Logs and contact your HP Support
Representative.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4862
- Severity: MAJOR
- Event Summary: Couldn't find all the edges during a procedure call.
- Event Class: System
- Problem Description:
Couldn't find all the edges during a
procedure call. Data Field: return status
- Cause / Action:
Cause: There was a problem while collecting a
list of fabric link info. Action: Look for additional IPMI event logs. Contact
your HP Support Representative to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4863
- Severity: MAJOR
- Event Summary: Couldn't find an appropriate number of edges during
a procedure call.
- Event Class: System
- Problem Description:
Couldn't find an appropriate number of
edges during a procedure call. Data Field: the number of edges found
- Cause / Action:
Cause: There was a problem while collecting a
list of fabric link info. Action: Look for additional IPMI event logs. Contact
your HP Support Representative to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4864
- Severity: MAJOR
- Event Summary: Too many edges were encountered during a procedure
call.
- Event Class: System
- Problem Description:
Too many edges were encountered during a
procedure call. Data Field: link count
- Cause / Action:
Cause: There was a problem while collecting a
list of fabric link info. Action: Look for additional IPMI event logs. Contact
your HP Support Representative to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4865
- Severity: MAJOR
- Event Summary: Failed to get an address during a procedure call
- Event Class: System
- Problem Description:
Failed to get an address during a procedure
call. Data Field: return status
- Cause / Action:
Cause: Failed to get a fabric address Action:
Look for additional IPMI event logs. Contact your HP Support Representative to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4866
- Severity: MAJOR
- Event Summary: An unexpected state was encountered during a
procedure call
- Event Class: System
- Problem Description:
An unexpected state was encountered during
a procedure call. Data Field: state returned
- Cause / Action:
Cause: System Firmware encountered an unknown
fabric link state Action: Capture IPMI event logs. Contact your HP Support
Representative to analyze the fabric link.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4867
- Severity: MAJOR
- Event Summary: Unable to get the link health state during a
procedure call.
- Event Class: System
- Problem Description:
Unable to get the link health state during
a procedure call. Data Field: return status
- Cause / Action:
Cause: System Firmware was unable to read the
fabric port's Fatal Error state. The link may or may not be in error. Action:
Collect IPMI event logs for further details. Contact your HP Support
Representative to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4868
- Severity: CRITICAL
- Event Summary: The fabric data failed a CRC check
- Event Class: System
- Problem Description:
The fabric data failed a CRC check. Data
Field: (xbc port << 44 ) | (xbc << 32) | return status)
- Cause / Action:
Cause: The fabric data structure may have been
corrupted. Action: Collect IPMI event logs. Contact your HP support
representative to investigate the health of the system.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4870
- Severity: WARNING
- Event Summary: Windows IML: Disk Array Controller device failure
- Event Class: System
- Problem Description: This event is logged when Drive Array
Subsystem Drivers detect a device failure.
- Cause / Action:
Cause: A device connect to the Disk Array
Controller has failed
Action: Check and replace the device.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4871
- Severity: MAJOR
- Event Summary: A problem during routing prevented a route from
being chosen
- Event Class: System
- Problem Description:
A problem was encountered while routing
which prevented an appropriate route from being chosen. Data Field:
Destination Cell << 56 | XBC Port << 44 | XBC # << 32 |
return status
- Cause / Action:
Cause: An error occurred while routing the
fabric Action: Contact your HP support representative to inspect the fabric
and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4872
- Severity: MAJOR
- Event Summary: A fabric routing error prevents an appropriate route
from being selected
- Event Class: System
- Problem Description:
An error was encountered while routing the
fabric. This error prevents an appropriate route from being selected. Data
Field: Destination Cell << 56 | XBC Port << 44 | XBC # << 32
| return status
- Cause / Action:
Cause: An error occurred while routing the
fabric Action: Contact your HP support representative to inspect the fabric
and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4873
- Severity: MAJOR
- Event Summary: A fabric routing error prevents an appropriate route
from being selected
- Event Class: System
- Problem Description:
An error was encountered while routing the
fabric. This error prevents an appropriate route from being selected. Data
Field: Destination Cell << 56 | XBC Port << 44 | XBC # << 32
| return status
- Cause / Action:
Cause: An error occurred while routing the
fabric Action: Contact your HP support representative to inspect the fabric
and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4875
- Severity: MAJOR
- Event Summary: A fabric routing error prevents an appropriate route
from being selected
- Event Class: System
- Problem Description:
An error was encountered while routing the
fabric. This error prevents an appropriate route from being selected. Data
Field: Destination Cell << 56 | XBC Port << 44 | XBC # << 32
| switch fabric
- Cause / Action:
Cause: An error occurred while routing the
fabric Action: Contact your HP support representative to inspect the fabric
and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4876
- Severity: CRITICAL
- Event Summary: Windows IML:FATAL event got logged
- Event Class: System
- Problem Description: A FATAL IML event was logged by the health
driver.
- Cause / Action:
Cause: A windows IML agent/driver logged a
FATAL event
Action: No action is required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4877
- Severity: WARNING
- Event Summary: Windows IML:Caution event was logged.
- Event Class: System
- Problem Description: A caution level IML event got logged to the
health driver
- Cause / Action:
Cause: A windows agent or driver logged a
caution level IML event.
Action: No action is required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4885
- Severity: MAJOR
- Event Summary: Failed to get an address when opening XBC to XBC
links
- Event Class: System
- Problem Description:
Failed to get an address when opening XBC
to XBC links
- Cause / Action:
Cause: Failed to get a fabric address Action:
Look for additional IPMI event logs. Contact your HP Support Representative to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4886
- Severity: MAJOR
- Event Summary: Error getting address while routing the remote XBC
- Event Class: System
- Problem Description:
Error getting address while routing the
remote XBC
- Cause / Action:
Cause: Failed to get a fabric address Action:
Look for additional IPMI event logs. Contact your HP Support Representative to
analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4887
- Severity: MAJOR
- Event Summary: Error getting the neighbor info
- Event Class: System
- Problem Description:
Error getting the neighbor info
- Cause / Action:
Cause: XBC register read failure Action:
Contact HP Support personnel to analyze the crossbar chip.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4888
- Severity: MAJOR
- Event Summary: Error finding the shortest route
- Event Class: System
- Problem Description:
Error finding the shortest route
- Cause / Action:
Cause: System Firmware encountered a problem
while processing the crossbar fabric graphs Action: Collect IPMI event logs.
Contact your HP Support Representative to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4889
- Severity: MAJOR
- Event Summary: error writing the remote XBC routing register
- Event Class: System
- Problem Description:
error writing the remote XBC routing
register
- Cause / Action:
Cause: write to XBC failed Action: Contact HP
Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4890
- Severity: MAJOR
- Event Summary: error enabling the routes on the remote XBC
- Event Class: System
- Problem Description:
error enabling the routes on the remote XBC
- Cause / Action:
Cause: write to XBC failed Action: Contact HP
Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4891
- Severity: MAJOR
- Event Summary: Error writing the routing register on the local XBC
- Event Class: System
- Problem Description:
Error writing the routing register on the
local XBC
- Cause / Action: Cause: write to XBC failed Action: Contact HP
Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4892
- Severity: MAJOR
- Event Summary: Error writing the local XBC routing registers to
reach a remote cell
- Event Class: System
- Problem Description:
Error writing the local XBC routing
registers to reach a remote cell
- Cause / Action: Cause: write to XBC failed Action: Contact HP
Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4893
- Severity: MAJOR
- Event Summary: Error enabling the local XBC routes
- Event Class: System
- Problem Description:
Error enabling the local XBC routes
- Cause / Action:
Cause: write to XBC failed Action: Contact HP
Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4894
- Severity: MAJOR
- Event Summary: failed to disperse the routes across links
- Event Class: System
- Problem Description:
failed to disperse the routes across links
- Cause / Action:
Cause: System Firmware was unable to route the
crossbar fabric. Action: Collect IPMI Event Logs. Contact your HP Support
Representative to troubleshoot the fabric and/or backplane subsystem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4895
- Severity: MAJOR
- Event Summary: Failed to open a fabric link
- Event Class: System
- Problem Description: Failed to open a fabric link
- Cause / Action:
Cause: System Firmware was unable to open the
crossbar link Action: Collect IPMI Event Logs. Contact your HP Support
Representative to analyze the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4896
- Severity: MAJOR
- Event Summary: Error enabling the XBC return route
- Event Class: System
- Problem Description:
Error enabling the XBC return route
- Cause / Action:
Cause: write to XBC failed Action: Contact HP
Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4897
- Severity: MAJOR
- Event Summary: Error when writing the XBC return route
- Event Class: System
- Problem Description:
Error when writing the XBC return route
- Cause / Action:
Cause: write to XBC failed Action: Contact HP
Support personnel to check the XBC
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4898
- Severity: MAJOR
- Event Summary: Error while routing the remote side of a route
- Event Class: System
- Problem Description:
Error while routing the remote side of a
route
- Cause / Action:
Cause: System Firmware was unable to route the
crossbar fabric. Action: Collect IPMI Event Logs. Contact your HP Support
Representative to troubleshoot the fabric and/or backplane subsystem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4899
- Severity: MAJOR
- Event Summary: Error setting up XBC-XBC link for routing across it
- Event Class: System
- Problem Description:
Error setting up XBC-XBC link for routing
across it
- Cause / Action:
Cause: System Firmware was unable to route the
crossbar fabric. Action: Collect IPMI Event Logs. Contact your HP Support
Representative to troubleshoot the fabric and/or backplane subsystem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4900
- Severity: WARNING
- Event Summary: Event corresponding to PAT encoded chassis codes'
data field
- Event Class: System
- Problem Description:
This event is used for translated PAT
encoded chassis codes' data field from IODC to E0 format. Data field contains
the legacy chassis code's data.
- Cause / Action:
Cause: IODC warning Action: Contact you HP
support representative Cause: FATAL IODC error Action: Contact your HP support
representative
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4901
- Severity: CRITICAL
- Event Summary: Event corresponding to PAT encoded chassis codes'
data field
- Event Class: System
- Problem Description:
This event is used for translated PAT
encoded chassis codes' data field from IODC to E0 format. Data field contains
the legacy chassis code's data.
- Cause / Action:
Cause: FATAL IODC error Action: Contact your HP
support representative
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4902
- Severity: FATAL
- Event Summary: Event corresponding to PAT encoded chassis codes'
data field
- Event Class: System
- Problem Description:
This event is used for translated PAT
encoded chassis codes' data field from IODC to E0 format. Data field contains
the legacy chassis code's data.
- Cause / Action:
Cause: Fatal IODC error Action: Contact your HP
support representative
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4903
- Severity: FATAL
- Event Summary: Event corresponding to PAT encoded chassis codes
- Event Class: System
- Problem Description:
This event is used for translated PAT
encoded chassis codes originated from IODC to E0 format. Data field contains
the legacy chassis code.
- Cause / Action:
Cause: Fatal IODC error Action: Contact your HP
support representative
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4904
- Severity: CRITICAL
- Event Summary: Event corresponding to PAT encoded chassis codes
- Event Class: System
- Problem Description:
This event is used for translated PAT
encoded chassis codes originated from IODC to E0 format. Data field contains
the legacy chassis code.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4905
- Severity: WARNING
- Event Summary: Event corresponding to PAT encoded chassis codes
- Event Class: System
- Problem Description:
This event is used for translated PAT
encoded chassis codes originated from IODC to E0 format. Data field contains
the legacy chassis code.
- Cause / Action: Cause: IODC warning Action: Contact your HP support
representative
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4906
- Severity: CRITICAL
- Event Summary: System backplane power 1.2v LDO fault.
- Event Class: System
- Problem Description:
System backplane power has reported a 1.2v
LDO fault. The data field contains the physical location of the fault.
- Cause / Action:
Cause: System backplane power board has
reported a 1.2v LDO fault. Action: Check system backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4907
- Severity: CRITICAL
- Event Summary: System backplane power 2.5v LDO fault.
- Event Class: System
- Problem Description:
System backplane power has reported a 2.5v
LDO fault. The data field contains the physical location of the fault.
- Cause / Action: Cause: System backplane power board has reported a
2.5v LDO fault. Action: Check system backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4908
- Severity: CRITICAL
- Event Summary: System backplane power board 3.3v house keeping
fault.
- Event Class: System
- Problem Description:
System backplane power has reported a 3.3v
house keeping power fault. The data field contains the physical location of
the fault.
- Cause / Action:
Cause: System backplane power board has
reported a 3.3v house keeping fault. Action: Check system backplane power
board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4909
- Severity: CRITICAL
- Event Summary: System backplane power board 12v fault.
- Event Class: System
- Problem Description:
System backplane power has reported a 12v
power fault. The data field contains the physical location of the fault.
- Cause / Action:
Cause: System backplane power board has
reported a 12v fault. Action: Check system backplane power board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4910
- Severity: CRITICAL
- Event Summary: System backplane power board 3.3v fault.
- Event Class: System
- Problem Description:
System backplane power has reported a 3.3v
power fault. The data field contains the physical location of the fault.
- Cause / Action:
Cause: System backplane power board has
reported a 3.3v fault. Action: Check system backplane power board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4911
- Severity: CRITICAL
- Event Summary: System backplane power board 1.5v fault.
- Event Class: System
- Problem Description:
System backplane power has reported a 1.5v
power fault. The data field contains the physical location of the fault.
- Cause / Action:
Cause: System backplane power board has
reported a 1.5v fault. Action: Check system backplane power board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4912
- Severity: CRITICAL
- Event Summary: System backplane power board 2.5v fault.
- Event Class: System
- Problem Description:
System backplane power has reported a 2.5v
power fault. The data field contains the physical location of the fault.
- Cause / Action:
Cause: System backplane power board has
reported a 2.5v fault. Action: Check system backplane power board.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4913
- Severity: FATAL
- Event Summary: System backplane power is insufficient.
- Event Class: System
- Problem Description:
One or more power rails is providing
insufficient power to the backplane.
- Cause / Action:
Cause: There has been multiple faults on the
same power rail. Action: look for additional events describing action to take.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4914
- Severity: CRITICAL
- Event Summary: RCS board to system backplane clock fault
- Event Class: System
- Problem Description:
Clock fault for clocks supplied from the
Redundant Clock Source (RCS) board to the system backplane. RCS board is no
longer providing clocks to the backplane. The data field contains the physical
location of the RCS.
- Cause / Action:
Cause: RCS assembly has reported a fault.
Action: Check RCS assembly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4915
- Severity: CRITICAL
- Event Summary: HSO board to the system backplane clock fault
- Event Class: System
- Problem Description:
Clock fault for clocks supplied from the
Hot Swap Oscillator (HSO) board to the system backplane. HSO has reported a
fault or has been removed. The data field contains the physical location of
the HSO.
- Cause / Action:
Cause: HSO assembly has reported a fault or has
been removed. Action: Check HSO assembly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4916
- Severity: WARNING
- Event Summary: The operational clock frequencies do not match
between the RCS and HSO.
- Event Class: System
- Problem Description:
The operational clock frequencies do not
match between the Redundant Clock Source (RCS) and the Hot Swap Oscillator
(HSO). The data field contains the physical location of the HSO.
- Cause / Action:
Cause: Operating frequency of the HSO does not
match that of the RCS. Action: Check HSO assembly.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4917
- Severity: CRITICAL
- Event Summary: The clock margin of the system backplane failed.
- Event Class: System
- Problem Description:
The clock margin of the system backplane
failed.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4918
- Severity: FATAL
- Event Summary: System backplane Hot Swap Oscillator (HSO) boards
are insufficient.
- Event Class: System
- Problem Description:
System backplane Hot Swap Oscillator (HSO)
boards are insufficient.
- Cause / Action:
Cause: There are no HSO present or all clock
sources have faulted. Action: Look for additional events describing action to
take.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4919
- Severity: WARNING
- Event Summary: Failure reading RCS or HSO boards EEPROM.
- Event Class: System
- Problem Description:
Failure reading Redundant Clock Source
(RCS) or Hot Swap Oscillator HSO boards EEPROM. Data field contains the
physical location of the device.
- Cause / Action:
Cause: device is not accessible over i2c
Action: check HSO & RCS boards
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4920
- Severity: CRITICAL
- Event Summary: Failure writing Redundant Clock Source RCS or HSO
boards EEPROM.
- Event Class: System
- Problem Description:
Failure writing Redundant Clock Source
(RCS) or Hot Swap Oscillator HSO boards EEPROM. Data field contains the
physical location of the device
- Cause / Action:
Cause: device is not accessible over i2c.
Action: check HSO and RCS boards
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4921
- Severity: CRITICAL
- Event Summary: Failure reading the Reset and Power Monitors (RPM)
EEPROM.
- Event Class: System
- Problem Description:
Failure reading the Reset and Power
Monitors (RPM) EEPROM. Data field contains the physical location of the
device.
- Cause / Action:
Cause: device is not accessible over i2c.
Action: check backplane power.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4922
- Severity: CRITICAL
- Event Summary: Failure writing the Reset and Power Monitors (RPM)
EEPROM.
- Event Class: System
- Problem Description:
Failure writing the Reset and Power
Monitors (RPM) EEPROM. Data field contains the physical location of the device
- Cause / Action:
Cause: device is not accessible over i2c.
Action: check backplane power
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4923
- Severity: CRITICAL
- Event Summary: Failure reading the Onboard System Programmers (OSP)
EEPROM.
- Event Class: System
- Problem Description:
Failure reading the Onboard System
Programmers (OSP) EEPROM. Data field contains the physical location of the
device
- Cause / Action:
Cause: device is not accessible over i2c.
Action: check backplane power.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4924
- Severity: CRITICAL
- Event Summary: Failure writing the Onboard System Programmers (OSP)
EEPROM.
- Event Class: System
- Problem Description:
Failure writing the Onboard System
Programmers (OSP) EEPROM. Data field contains physical location of the device
- Cause / Action:
Cause: device is not accessible over i2c.
Action: check backplane power
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4925
- Severity: FATAL
- Event Summary: SBS fault on startup.
- Event Class: System
- Problem Description:
SBS fault on startup.
- Cause / Action:
Cause: SBS block for one of the MOAB's has
reported an error on startup. Action: check backplane power. Level 2 & 3
detail provides the Moab SBS reporting the fault: 0x00 - Moab 0x00 0x01 - Moab
0x01 0x02 - Moab 0x02 0x10 - Moab 0x40 0x11 - Moab 0x41 0x12 - Moab 0x42
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4926
- Severity: WARNING
- Event Summary: Failure reading IO backplane LPM.
- Event Class: System
- Problem Description:
Failure reading IO backplane LPM. Data
field
contains the physical location of the device.
- Cause / Action:
Cause: IO backplane LPM is not accessible over
i2c. Action: Check power status of IO backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4927
- Severity: WARNING
- Event Summary: Failure writing IO backplane LPM.
- Event Class: System
- Problem Description:
Failure writing IO backplane LPM. Data
field
contains physical location of device.
- Cause / Action:
Cause: IO Backplane LPM is not accessible over
i2c. Action: Check power status of IO backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4928
- Severity: MAJOR
- Event Summary: A CPU owned by one fPar tried to call a procedure in
a firmware instance of a sibling fPar. This is an access violation. The call
is rejected with the error status appropriate to that service (EFI or SAL)
- Event Class: System
- Problem Description:
The system software violated one of the
'well-behavedness' rules of soft partitions and tried to call a firmware
procedure within the firmware instance of a sibling. This is an access
violation and firmware rejects the call
- Cause / Action:
Cause: System Software Defect - using the wrong
entry address for the owner of the CPU. Action: Update the system software
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4929
- Severity: MAJOR
- Event Summary: An unexpected error occurred while calling
AlbInitPrep() for a link.
- Event Class: System
- Problem Description:
AlbInitPrep was unable to read the
ALREC_CONFIG CSR to determine whether the link is up or not. Data field:
unused
- Cause / Action:
Cause: An error occurred while routing the
fabric Action: Collect IPMI event logs, and contact your HP support
representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4930
- Severity: WARNING
- Event Summary: non-FATAL event announces an fPar that is enabled to
boot has no healthy cpu configured. The configuration is modified to disable
but not delete this fPar.
- Event Class: System
- Problem Description:
The fPar whose ID is identified in the
event detail is disabled from booting because it has no bootable, healthy CPU.
It may not own any CPU or the CPUs it owns are Unhealthy (or deconfigured)
- Cause / Action:
Cause: The fPar whose ID is identified in the
event detail is disabled from booting because it has no bootable, healthy CPU.
It may not own any CPU or the CPUs it owns are Unhealthy (or deconfigured)
Action: Use the soft partitioning configuration tool released with the OS
product to reconfigure the fPar and reboot the partition. A cold reset of the
hard partition may be required
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4931
- Severity: WARNING
- Event Summary: Non-FATAL event announces that an fPar is enabled to
boot but has inadequate memory resources to instantiate the firmware for the
fPar. The configuration is modified to disable but not delete this fPar.
- Event Class: System
- Problem Description:
Non-FATAL event announces that an fPar is
enabled to boot but has inadequate memory resources to instantiate the
firmware for the fPar. The configuration is modified to disable but not delete
this fPar.
- Cause / Action:
Cause: The user has configured an fPar to be
booted with inadequate memory resources to instantiate the firmware image for
this fPar. The fPar is software deconfigured (disabled) from booting. Action:
Reconfigure the fPar, giving it at least instance-size (an fPars EFI variable)
amount of physically contiguous RAM. Then re-enable and reset the fPar.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4932
- Severity: WARNING
- Event Summary: Non-FATAL event announces that an fPar is enabled to
boot but has no IO device and will not be able too boot beyond EFI shell
interaction without reconfiguring and rebooting the fPar. The fPar is not
disabled.
- Event Class: System
- Problem Description:
The fPar has been enabled to boot, and
contains at least one CPU and adequate memory but has not been given any IO
resources. The soft partition (fPar) may therefore only boot as far as the EFI
shell. It must be reconfigured and reset before it may boot an operating
system.
- Cause / Action:
Cause: The fPar has been enabled to boot, and
contains at least one CPU and adequate memory but has not been given any IO
resources. The soft partition (fPar) may therefore only boot as far as the EFI
shell. It must be reconfigured and reset before it may boot an operating
system. Action: Reconfigure the soft partition using an
Operating-system-specific utility then reset the soft partition.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4933
- Severity: WARNING
- Event Summary: CRC error detected when executing the PDC Check Run
Time Structure.
- Event Class: System
- Problem Description:
NVM crc problem was detected in mfg mode
structure. The calculated crc value did not match the value stored in the
structure. An error status will be returned to the calling routine.
- Cause / Action:
Cause: firmware/software error Action: Check
for firmware/OS updates Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4934
- Severity: WARNING
- Event Summary: CRC mismatch found in control nvm structure while
performing boot cell init.
- Event Class: System
- Problem Description:
NVM crc problem was detected in the control
structure while performing boot cell init processing. The calculated crc value
did not match the value stored in the structure. The cell will be reset.
- Cause / Action:
Cause: firmware/software error Action: Check
for firmware/OS updates Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4935
- Severity: WARNING
- Event Summary: CRC mismatch found in control structure while
detecting cpu nvm clear status.
- Event Class: System
- Problem Description:
NVM crc problem was detected in the nvm
control structure while checking to see if the cpu nvm structure needs to be
cleared. The global nvm flag will be cleared and the cell will be reset.
- Cause / Action:
Cause: firmware/software error Action: Check
for firmware/OS updates Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4936
- Severity: WARNING
- Event Summary: CRC mismatch seen in nvm control struct while
viewing nvm flag environment menu
- Event Class: System
- Problem Description:
NVM CRC problem was detected in processing
the NVM Flag Environment menu item. The calculated crc value did not match the
value stored in the structure. An error message will be displayed to the user,
and the pd will be reset.
- Cause / Action:
Cause: firmware/software error Action: Check
for firmware/OS updates Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4937
- Severity: WARNING
- Event Summary: CRC mismatch in control structure while checking
number of CPUs left in cell.
- Event Class: System
- Problem Description:
NVM crc problem was detected in nvm control
structure. The calculated crc value did not match the value stored in the
structure. An error status will be returned to the caller.
- Cause / Action:
Cause: firmware/software error Action: Check
for firmware/OS update Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4938
- Severity: WARNING
- Event Summary: CRC mismatch in nvm control structure while trying
to deconfigure the CPUs.
- Event Class: System
- Problem Description:
NVM crc problem was detected in the nvm
control structure while trying to deconfigure some CPUs. The calculated crc
value did not match the value stored in the structure. The cell will be halted
- Cause / Action:
Cause: firmware/software error Action: Check
for firmware/OS updates Contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4939
- Severity: WARNING
- Event Summary: CRC mismatch detected in control struct while
marking a cpu for re-allocation.
- Event Class: System
- Problem Description:
NVM CRC problem was detected in the nvm
control structure while attempting to re-allocate a cpu. The calculated crc
value of the structure did not match the value stored in the structure. An
error status will be returned.
- Cause / Action:
Cause: Firmware/software error Action: Check
for firmware/OS update Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4940
- Severity: WARNING
- Event Summary: CRC mismatch in control struct while checking last
cpu in cell for next boot.
- Event Class: System
- Problem Description:
NVM crc problem was detected in the nvm
control structure while checking for last cpu in cell for next boot. The
calculated crc value in nvm control structure did not match the stored value
in the structure. An error status will be returned to the caller.
- Cause / Action:
Cause: firmware/software error Action: Check
for firmware/OS updates Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4941
- Severity: WARNING
- Event Summary: CRC failure detected in nvm control structure while
interleaving memory.
- Event Class: System
- Problem Description:
Firmware detected an error in the nvm
control structure while perform interleaving between cells. The calculated crc
value of the structure did not match the stored value. The pd will be reset.
- Cause / Action:
Cause: Firmware/software problem Action: Check
for firmware/OS updates Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4942
- Severity: WARNING
- Event Summary: CRC error detected in nvm control structure while
doing memory testing.
- Event Class: System
- Problem Description:
Firmware detected an error in the nvm
control structure while performing memory testing. The calculated crc value of
the structure did not match the stored value. The cell will be reset.
- Cause / Action: Cause: Firmware/software error Action: Check for
updated firmware/OS Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4943
- Severity: WARNING
- Event Summary: Checksum problem detected with the nvm control
structure while performing DRR.
- Event Class: System
- Problem Description:
Firmware detected an error in the nvm
control structure while performing clearing and logging of DRR errors. The
calculated crc value of the structure did not match the stored value. The cell
will be reset.
- Cause / Action:
Cause: Firmware/software failure Action: Check
for updated firmware/OS Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4944
- Severity: MAJOR
- Event Summary: Phase3BackToBackPrep() failed during OLA
- Event Class: System
- Problem Description:
The local cell was unable to clear the
link_off bit on at least one CC-CC link. Data field: return value of
Phase3BackToBackPrep()
- Cause / Action:
Cause: An error occurred while routing the
fabric. Action: Collect IPMI event logs, and contact your HP support
representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4945
- Severity: MAJOR
- Event Summary: ArfOLAPreRendez() was unable to reach the cell being
added, the OLA will fail.
- Event Class: System
- Problem Description:
ArfOLAPreRendez() was unable to reach the
added cell. Data field: bitmap of unreachable cells
- Cause / Action:
Cause: Firmware was unable to find a valid
route to a cell being added by OLA. Action: Collect IPMI event logs, and
contact your HP support representative to inspect the fabric and system
firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4946
- Severity: MAJOR
- Event Summary: FW was unable to setup the NC table entries during
an OLA operation
- Event Class: System
- Problem Description:
An unexpected error occurred in
SetupNCTable(). Data field: return value from SetupNCTable()
- Cause / Action:
Cause: An error occurred while routing the
fabric. Action: Collect IPMI event logs, and contact your HP support
representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4947
- Severity: MAJOR
- Event Summary: ArfOLAPostRendez() was unable to manage Mittelhorn
links during an OLA operation
- Event Class: System
- Problem Description:
Phase4BackToBackPrep() unexpectedly failed.
Data field: return value of Phase4BackToBackPrep()
- Cause / Action:
Cause: An error occurred while routing the
fabric. Action: Collect IPMI event logs, and contact your HP support
representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4948
- Severity: MAJOR
- Event Summary: FW had an unexpected error while synchronizing
graphs during an OLA.
- Event Class: System
- Problem Description:
Data field: (master cell number <<
56) | (bitmap of cells to synch)
- Cause / Action:
Cause: An error occurred while routing the
fabric Action: Collect IPMI event logs, and contact your HP support
representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4949
- Severity: MAJOR
- Event Summary: FW was unable to setup Non-Coherent or Coherent
tables after an OLA.
- Event Class: System
- Problem Description:
Data field: bitmap of unreachable cells.
- Cause / Action:
Cause: An error occurred while routing the
fabric. Action: Collect IPMI event logs, and contact your HP support
representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4950
- Severity: MAJOR
- Event Summary: FW had an unexpected error setting up the NC or Coh
table after an OLA
- Event Class: System
- Problem Description:
Data field: return value from
SetupNCTable() or SetupCohTable()
- Cause / Action:
Cause: An error occurred while routing the
fabric. Action: Collect IPMI event logs, and contact your HP support
representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4951
- Severity: MAJOR
- Event Summary: FW had an unexpected error while enabling routes in
an OLA operation.
- Event Class: System
- Problem Description:
Data field: (local cell number << 56)
| (bitmap of routes to enable)
- Cause / Action:
Cause: An error occurred while routing the
fabric. Action: Collect IPMI event logs, and contact your HP support
representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4952
- Severity: MAJOR
- Event Summary: An unexpected error occurred while disabling routes
during an OLA operation.
- Event Class: System
- Problem Description:
Data field: (bitmap of cells << 32) |
(bitmap of routes to disable)
- Cause / Action:
Cause: An error occurred while routing the
fabric. Action: Collect IPMI event logs, and contact your HP support
representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4953
- Severity: MAJOR
- Event Summary: An unexpected error occurred while disabling routes
during ArfPhase4.
- Event Class: System
- Problem Description:
Data field: (bitmap of source cells
<< 32) | (bitmap of routes to disable)
- Cause / Action:
Cause: An error occurred while routing the
fabric. Action: Collect IPMI event logs, and contact your HP support
representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4954
- Severity: CRITICAL
- Event Summary: System FW and the PDHC have incompatible shared
memory interface revisions
- Event Class: System
- Problem Description:
System FW and the PDHC have incompatible
shared memory interface revisions. Data byte 0 is the PDHC's supported ICM
revision number. Data byte 1 is System FW's supported ICM revision number.
- Cause / Action:
Cause: (Probable) Either System firmware or
PDHC Firmware on the named cell was upgraded to a new major revision number
and not the other. The major numbers must match when checked by the PDHC or
else the cell will not be allowed to boot. Action: Use FWUU (Firmware Update
Utility) to check the revision numbers of PDHC and PDC Firmware on the cell(s)
called out in the event log. Compare to other cells in the complex, or, if
single cell, check firmware revs. available to FWUU.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4955
- Severity: MAJOR
- Event Summary: Indicated the Dillon IPR is not cleared after a
number of PDC attempts.
- Event Class: System
- Problem Description:
Indicated the Dillon IPR is not cleared
after a number of PDC attempts. The Data Field contains the
MAX_RD_CLR_IPR_TRIES. Please look at the previous Chassis Code to see the Cell
number.
- Cause / Action:
Cause: Dillon IPR is not cleared after a number
of PDC attempts. Action: Contact HP Support personnel to troubleshoot the
problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4956
- Severity: WARNING
- Event Summary: Describes a FATAL event when the I/O Link has gone
down for the specified I/O Chassis.
- Event Class: System
- Problem Description:
Data field is a physical location of the
I/O Chassis.
- Cause / Action:
Cause: HW Failure for I/O Link between system
backplane and I/O chassis Action: Re-seat I/O chassis, Replace I/O Chassis,
replace system backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4957
- Severity: FATAL
- Event Summary: Memory allocation has failed
- Event Class: System
- Problem Description:
Memory allocation failed. The data is the
size of the memory block which could not be allocated.
- Cause / Action:
Cause: Memory could not be allocated for the
guest. Action: Reduce memory usage by other programs, add physical memory,
verify sufficient swap space, or configure the guest to use less memory.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4958
- Severity: FATAL
- Event Summary: Memory could not be locked.
- Event Class: System
- Problem Description:
Memory locking failed. The data is the size
of the memory block which could not be locked.
- Cause / Action:
Cause: This can be caused by insufficient
physical memory. Action: Reduce memory usage by other programs, add physical
memory, or configure the guest to use less memory.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4959
- Severity: FATAL
- Event Summary: Insufficient physical processors
- Event Class: System
- Problem Description:
The minimum number of processors required
is more than the number available. The data is the number of available
processors.
- Cause / Action: Cause: The number of physical processors is too few
to run the guest as configured. Action: Add processors or reconfigure the
guest to use fewer processors.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4960
- Severity: CRITICAL
- Event Summary: CC chip fan failed
- Event Class: System
- Problem Description: Cache Controller Turbo Cooler Failure
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4961
- Severity: WARNING
- Event Summary: CC chip fan failing
- Event Class: System
- Problem Description: Cache Controller Turbo Cooler Failing
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4962
- Severity: CRITICAL
- Event Summary: CPU Smartfan Controller Failed
- Event Class: System
- Problem Description: CPU Smart Fan Controller Failure
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4963
- Severity: CRITICAL
- Event Summary: CC Smartfan Controller Failed
- Event Class: System
- Problem Description: Cache Controller Smart Fan Controller Failure
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4964
- Severity: WARNING
- Event Summary: The Serial Presence Detect (SPD) has been skipped.
- Event Class: System
- Problem Description:
The Serial Presence Detect (SPD) has been
skipped due to BYPASS_SPD_BIT being set. In this case, the Alternate Memory
Config will be loaded. This bit should only be set in the factory and not in
the field.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4971
- Severity: MAJOR
- Event Summary: PDCE is unable to get one of the SM4s within its PD
due to unexpected failure
- Event Class: System
- Problem Description:
Failure to obtain a global semaphore
- Cause / Action:
Cause: Attempt to obtain a global semaphore has
failed Action: No action is required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4972
- Severity: MAJOR
- Event Summary: PDCE is unable to get SM4 because of unexpected
failure.
- Event Class: System
- Problem Description:
Request to access a local semaphore has
timed out.
- Cause / Action:
Cause: Unable to access a local semaphore
within the allotted time. Action: No action is required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4973
- Severity: MAJOR
- Event Summary: PDCE is unable to release one of its SM4 from its PD
due to unexpected failure.
- Event Class: System
- Problem Description:
Failure to release the global semaphore.
- Cause / Action:
Cause: Unable to release the global semaphore
Action: No action is required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4974
- Severity: MAJOR
- Event Summary: PDCE is unable to release a local SM4 due to
unexpected failure.
- Event Class: System
- Problem Description:
Failure to release a local semaphore.
- Cause / Action:
Cause: Unable to release a local semaphore.
Action: No action is required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4975
- Severity: CRITICAL
- Event Summary: Fabric phase 1 of routing failed such that the cell
cannot continue
- Event Class: System
- Problem Description:
Fabric phase 1 of routing failed such that
the cell cannot continue
- Cause / Action:
Cause: A fabric phase of routing failed such
that the cell cannot continue Action: Check for additional IPMI Events.
Contact your HP Support representative to diagnose the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4976
- Severity: CRITICAL
- Event Summary: Fabric phase 3 of routing failed such that the cell
cannot continue
- Event Class: System
- Problem Description:
Fabric phase 3 of routing failed such that
the cell cannot continue
- Cause / Action:
Cause: A fabric phase of routing failed such
that the cell cannot continue Action: Check for additional IPMI Events.
Contact your HP Support representative to diagnose the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4977
- Severity: CRITICAL
- Event Summary: Fabric phase 4 of routing failed such that the cell
cannot continue
- Event Class: System
- Problem Description:
Fabric phase 4 of routing failed such that
the cell cannot continue
- Cause / Action:
Cause: A fabric phase of routing failed such
that the cell cannot continue Action: Check for additional IPMI Events.
Contact your HP Support representative to diagnose the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 4978
- Severity: MAJOR
- Event Summary: Firmware was unable to initialize COHx tables after
cell rendezvous.
- Event Class: System
- Problem Description:
Data field: (cell set << 32) |
(return value from SetupInitialCohTables())
- Cause / Action:
Cause: An error occurred while routing the
fabric. Action: Collect IPMI event logs, and contact your HP support
representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5133
- Severity: FATAL
- Event Summary: The virtual machine has been stopped.
- Event Class: System
- Problem Description:
The guest operating system performed an
operation which could not be handled by the virtual machine. The virtual
machine could not continue running the guest so it stopped.
- Cause / Action:
Cause: The guest detected an error and could
not continue. Action: See the HPVM guest operation log for additional
information.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5134
- Severity: FATAL
- Event Summary: Kernel memory lock failed
- Event Class: System
- Problem Description:
Kernel driver failed to lock memory
- Cause / Action:
Cause: Kernel driver was unable to lock memory
pages Action: Possibly insufficient physical memory. Add memory or reduce size
of guest.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5135
- Severity: FATAL
- Event Summary: The MMIO/IOP map is corrupt or has been initialized
twice.
- Event Class: System
- Problem Description:
The code which initializes the MMIO/IOP map
found that the table already contained information.
- Cause / Action:
Cause: An incorrect calling sequence has
occurred or an image is corrupt. Action: If this continues to occur, reinstall
the software. If this does not fix the problem contact your HP support
representative.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5136
- Severity: FATAL
- Event Summary: An incorrect bus number was added
- Event Class: System
- Problem Description:
An attempt was made to add a PCI bus with a
number greater than the maximum allowed.
- Cause / Action:
Cause: Improper configuration Action: Check the
configuration and set all bus numbers to supported values.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5137
- Severity: FATAL
- Event Summary: An unsupported CPU generation was configured
- Event Class: System
- Problem Description:
The configured CPU generation is not
supported
- Cause / Action:
Cause: The configuration contains an
unsupported CPU generation Action: Modify the configuration to use a supported
CPU generation
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5138
- Severity: FATAL
- Event Summary: UART initialization error
- Event Class: System
- Problem Description:
An ISA UART was created without a data
structure. A PCI UART structure exists
- Cause / Action:
Cause: The ISA UART code was called when a PCI
UART structure and no ISA UART structure were created. Action: Verify that the
configuration contains the correct UART type.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5139
- Severity: FATAL
- Event Summary: A PMAN IOCTL to set the UART characteristics has
failed
- Event Class: System
- Problem Description:
A TCGETS or TCSETS IOCTL failed. The errno
status will be in a subsequent event.
- Cause / Action:
Cause: The device is not configured or is not
the proper type. Action: Check the configuration
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5140
- Severity: FATAL
- Event Summary: A stat call on the UART device failed
- Event Class: System
- Problem Description:
A stat call on the PMAN failed. The next
event contains the failure error.
- Cause / Action:
Cause: The device does not exist or is not
available Action: Check the configuration and verify the device
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5141
- Severity: FATAL
- Event Summary: The host virtual machine driver could not be opened
- Event Class: System
- Problem Description:
The virtual machine driver could not be
opened
- Cause / Action:
Cause: The virtual machine software is not
properly installed or is not running Action: Restart the virtual machine. If
this fails verify that it is properly installed.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5142
- Severity: FATAL
- Event Summary: The vm driver could not create a virtual machine
- Event Class: System
- Problem Description:
The virtual machine driver could not create
a virtual machine
- Cause / Action:
Cause: The PMAN kernel has insufficient memory
available to create a guest Action: Add memory or reduce memory use
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5143
- Severity: FATAL
- Event Summary: Could not create a file system node for the virtual
machine
- Event Class: System
- Problem Description:
Could not create a file system node for
communication between the vmm driver and the virtual machine. The next event
contains the error from the Unix system call.
- Cause / Action:
Cause: Insufficient resources or file system
problem Action: Verify that the root file system has no errors and is not
full.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5144
- Severity: FATAL
- Event Summary: Cannot open the guest-specific vm device
- Event Class: System
- Problem Description:
The vm device was created but cannot be
opened
- Cause / Action: Cause: File system error or PMAN has insufficient
resources Action: Verify that the root file system has no errors and that the
PMAN has sufficient memory.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5145
- Severity: FATAL
- Event Summary: The vm driver has not been loaded
- Event Class: System
- Problem Description:
The virtual machine driver has not been
loaded
- Cause / Action: Cause: HPVM has not been started Action: Start HPVM
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5146
- Severity: FATAL
- Event Summary: Creation of a thread has failed
- Event Class: System
- Problem Description:
An attempt to create a thread on the PMAN
has failed. The next event contains the Unix error number.
- Cause / Action:
Cause: Insufficient resources Action: Reduce
guest resources or add memory
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5147
- Severity: FATAL
- Event Summary: Driver could not enter virtual machine
- Event Class: System
- Problem Description:
VM Driver was unable to communicate with
virtual machine
- Cause / Action: Cause: This is an internal error. Action: If this
continues to occur, reinstall the software. If this does not fix the problem
contact your HP support representative.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5148
- Severity: FATAL
- Event Summary: The configuration file could not be read
- Event Class: System
- Problem Description:
The configuration file could not be read
- Cause / Action:
Cause: The configuration file is missing or
damaged. Action: Restore the configuration from a backup or recreate the
guest.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5149
- Severity: FATAL
- Event Summary: Memory allocation for firmware has failed
- Event Class: System
- Problem Description:
Memory allocation for a firmware table has
failed. The data contains the name of the firmware table.
- Cause / Action:
Cause: Insufficient memory available Action:
Increase available memory or swap file space
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5150
- Severity: FATAL
- Event Summary: Memory map tables could not be created
- Event Class: System
- Problem Description:
Driver was unable to build map tables
- Cause / Action:
Cause: Insufficient resources or PMAN kernel
memory too fragmented Action: Add memory or reboot
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5151
- Severity: FATAL
- Event Summary: Reboot failed
- Event Class: System
- Problem Description:
Reboot failed. The virtual machine will
exit.
- Cause / Action:
Cause: The virtual machine was unable to
restart. The next event contains the Unix error. Action: Restart if possible
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5152
- Severity: WARNING
- Event Summary: Error reading stable store flash
- Event Class: System
- Problem Description:
Stable store read or write to flash failed.
- Cause / Action:
Cause: Soft error Flash part Failure Action:
Reboot, if problem persists, contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5153
- Severity: WARNING
- Event Summary: Stable Store NVM cannot be read, written to, or is
not initialized
- Event Class: System
- Problem Description:
Error reading or writing the stable store
NVM area
- Cause / Action:
Cause: A read/write to the NVM stable storage
area failed Action: Reboot, if problem persists, contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5154
- Severity: FATAL
- Event Summary: FW detected an illegal memory configuration using
4GB DIMMs
- Event Class: System
- Problem Description:
SFW has detected an illegal memory config
using 4GB DIMMs. The system will be halted
- Cause / Action:
Cause: An unsupported memory configuration with
4GB DIMMs is installed in the system Action: Refer to the user documentation
to determine the supported memory configurations with 4GB DIMMs
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5155
- Severity: CRITICAL
- Event Summary: CPU Power Module Fault
- Event Class: System
- Problem Description: CPU Power Module Faults
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5156
- Severity: CRITICAL
- Event Summary: CPU Hotswap Controller Fault
- Event Class: System
- Problem Description: CPU hotswap ControllerFault
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5157
- Severity: MAJOR
- Event Summary: Firmware had an unexpected error while initializing
LINK_SEL_COHx CSRs.
- Event Class: System
- Problem Description:
Data field: cell set [63:32], return value
[31:0]
- Cause / Action:
Cause: An error occurred while routing the
fabric. Action: Collect IPMI event logs, and contact your HP support
representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5158
- Severity: MAJOR
- Event Summary: System Firmware heap space is corrupt.
- Event Class: System
- Problem Description:
One of the System Firmware heap areas was
found to be corrupt.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5159
- Severity: MAJOR
- Event Summary: A portion of System Firmware heap space is
unaccounted for.
- Event Class: System
- Problem Description:
A portion of System Firmware heap space is
unaccounted for.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5160
- Severity: MAJOR
- Event Summary: System Firmware heap space is full.
- Event Class: System
- Problem Description:
One of the System Firmware heap space areas
is full. No free space is available.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5161
- Severity: MAJOR
- Event Summary: Firmware was unable to set ALREC and ALTRAN error
masks
- Event Class: System
- Problem Description:
Data field: cell ID [63:56], port num
[55:44], XBC ID [43:32]
- Cause / Action: Cause: An error occurred while routing the fabric.
Action: Collect IPMI event logs, and contact your HP support representative to
inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5162
- Severity: MAJOR
- Event Summary: Firmware had an unexpected internal error from the
vertex module.
- Event Class: System
- Problem Description:
Data field: vType1 [63:56] | vPort1 [55:48]
| vId1 [47:32] | vType2 [31:24] | vPort2 [23:16] | vId2 [15:0] Cause /
Action:
Cause: An error occurred while routing the fabric, possible data
corruption of firmware error. Action: Collect IPMI event logs, and contact
your HP support representative to inspect the fabric and system firmware.
Cause: An error occurred while routing the fabric, possible data corruption or
firmware error. Action: Collect IPMI event logs, and contact your HP support
representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5163
- Severity: CRITICAL
- Event Summary: The OS being booted is not supported in a mixed cpu
revision configuration.
- Event Class: System
- Problem Description:
The OS being booted is not supported in a
partition containing a mix of processor stepping revisions.
- Cause / Action:
Cause: The OS being booted is not supported in
a partition containing a mix of processor stepping revisions. Action: Failed
processors should be replaced with like processors. Reconfigure the hardware
to avoid configurations with mixed steppings.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5164
- Severity: MAJOR
- Event Summary: Firmware had an unexpected error while setting up
Mittelhorn links.
- Event Class: System
- Problem Description:
Data field: vType1 [63:56] | vPort1 [55:48]
| vId1 [47:32] | vType2 [31:24] | vPort2 [23:16] | vId2 [15:0] Cause /
Action:
Cause: An error occurred while routing the fabric, possible data
corruption or firmware error. Action: Collect IPMI event logs, and contact
your HP support representative to inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5165
- Severity: MAJOR
- Event Summary: Firmware was unable to write a Skyline LINK_SEL_COHx
CSR
- Event Class: System
- Problem Description:
Data field: cell num [63:56] | link sel coh
CSR (0 or 1) [0:0]
- Cause / Action: Cause: An error occurred while routing the fabric.
Action: Collect IPMI event logs, and contact your HP support representative to
inspect the fabric and system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5166
- Severity: MAJOR
- Event Summary: The requested OS does not support booting with mixed
CPU revisions.
- Event Class: System
- Problem Description:
The OS suitable for booting in this mode
does not support booting in the current configuration due to the existence of
CPUs of differing revisions.
- Cause / Action: Cause: Mixed CPU revisions exist in a mixed-cpu
stepping configuration. Action: Replace cells to match cpu stepping revisions
or deconfigure the unmatched processors.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5167
- Severity: MAJOR
- Event Summary: System firmware experienced an error while updating
the link sel value
- Event Class: System
- Problem Description:
System firmware experienced an error while
updating the link sel value. Data field indicates return status Cause /
Action:
Cause: Error updating the global link select fabric value Action:
Contact your HP Support Representative to investigate the fabric subsystem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5168
- Severity: FATAL
- Event Summary: The LPM has reported to the Pdhc that the CPU
modules are loaded incorrectly
- Event Class: System
- Problem Description: The Cell LPM has reported to the Pdhc that the
CPU modules are not loaded correctly
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5171
- Severity: MAJOR
- Event Summary: Firmware was unable to generate a Skyline CSR
address
- Event Class: System
- Problem Description:
Fabric was unable to generate a Skyline CSR
address Data field: cell number [63:56] and return value from address function
[31:0]
- Cause / Action: Cause: An error occurred while routing the fabric,
possible data corruption or firmware error. Action: Collect IPMI event logs,
and contact your HP support representative to inspect the fabric and system
firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5172
- Severity: MAJOR
- Event Summary: Firmware was unable to generate a Skyline CSR
address
- Event Class: System
- Problem Description:
Firmware was unable to generate a Skyline
CSR address. Data field: cell number [63:56] | return value [31:0]
- Cause / Action:
Cause: An error occurred while routing the
fabric, possible data corruption or firmware error. Action: Collect IPMI event
logs, and contact your HP support representative to inspect the fabric and
system firmware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5173
- Severity: CRITICAL
- Event Summary: Firmware was unable to get the address for the PDT.
- Event Class: System
- Problem Description:
Firmware was unable to get the address for
the PDT. The data type contains the Cell physical location for the cell that
the PDT was being acquired for.
- Cause / Action: Cause: A firmware error occurred that caused the
PDT to be inaccessible. Action: Contact HP support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5174
- Severity: CRITICAL
- Event Summary: Invalid or no OS boot rendezvous entry point
- Event Class: System
- Problem Description:
Target CPU receives wakeup interrupt to
join the OS but the OS_BOOT_RENDEZ entry point is invalid or none-existent
- Cause / Action: Cause: OS or firmware bugs Action: Call HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5175
- Severity: CRITICAL
- Event Summary: Checksum verification failed for OS_BOOT_RENDEZ
entry point
- Event Class: System
- Problem Description:
The entry point for OS_BOOT_RENDEZ exists,
but fails the checksum test performed by firmware
- Cause / Action:
Cause: Software data corruption Action: Call HP
support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5176
- Severity: MAJOR
- Event Summary: The System Firmware call to PAL_COPY_INFO failed.
- Event Class: System
- Problem Description:
The System Firmware call to PAL_COPY_INFO
failed.
- Cause / Action: Cause: Contact Engineering, This is a bug. Action:
-
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5177
- Severity: MAJOR
- Event Summary: The System Firmware call to PAL_COPY_PAL failed.
- Event Class: System
- Problem Description:
The System Firmware call to PAL_COPY_PAL
failed.
- Cause / Action: Cause: Contact Engineering, This is a bug. Action:
-
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5178
- Severity: MAJOR
- Event Summary: The System Firmware call to PAL_CACHE_FLUSH failed.
- Event Class: System
- Problem Description:
The System Firmware call to PAL_CACHE_FLUSH
failed.
- Cause / Action: Cause: Contact Engineering, This is a bug. Action:
-
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5179
- Severity: MAJOR
- Event Summary: Firmware had an unexpected error during cell OLA
- Event Class: System
- Problem Description:
Unable to setup Skyline LINK_SEL_COHx CSRs
on the running partition to the newly added cell. Data field: cell set [63:32]
| return val [31:0]
- Cause / Action: Cause: New cell cannot communicate with existing
partition. Cell OLA failed. Action: Existing partition can continue to run,
new cell cannot be added to the partition.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5180
- Severity: WARNING
- Event Summary: An MCA occurred before a previous MCA event has been
completely handled.
- Event Class: System
- Problem Description:
This indicates an MCA happening before a
previous MCA was not completely handled.
- Cause / Action: Cause: An MCA happened before a previous MCA event
did not complete. Action: For Fpars/Vpars, system FW will reset the hard
partition. For npars, system FW will still attempt to hand off to the OS_MCA
handler, if OS_MCA handoff fails, the hard partition will get reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5181
- Severity: WARNING
- Event Summary: MCA occurred while an INIT event is getting
processed.
- Event Class: System
- Problem Description:
This indicates an MCA event happening
before a previous INIT event was not completely processed.
- Cause / Action:
Cause: An MCA event happening before a previous
INIT event was not completely processed. Action: For Fpars/Vpars, system FW
will reset the hard partition. In Npars mode, MCA handler will attempt to hand
off to OS_MCA handler. If handoff fails, the hard partition will be reset.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5182
- Severity: MAJOR
- Event Summary: Firmware was unable to find a bad edge in an
untraversable route
- Event Class: System
- Problem Description:
Firmware was unable to find a bad edge in
an untraversable route. Data Field: (destCell << 56 | port num < 44 |
xbc num << 32 | return status)
- Cause / Action: Cause: The bad edge could not be found. Action:
Check for intermittent link errors. Capture IPMI event logs and look for
additional error events. Contact your HP Support Representative to investigate
the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5183
- Severity: WARNING
- Event Summary: Some entity that is not known to the firmware, is
drawing power from the bus bar
- Event Class: System
- Problem Description:
Some entity that is not known to the
firmware, is drawing power from the bus bars.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5184
- Severity: CRITICAL
- Event Summary: Soft Partition cannot boot rebooting to nPars
- Event Class: System
- Problem Description:
Rebooting the partition to nPars because a
soft partition could not be booted.
- Cause / Action: Cause: A soft partition could not be booted. Please
refer to previous events for the complete details. Action: The partition is
rebooted to nPars to allow the soft partition to be re-configured.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5187
- Severity: MAJOR
- Event Summary: Firmware was unable to find a replacement route for
the broken link
- Event Class: System
- Problem Description:
Firmware was unable to find a replacement
route for the broken link. Data field contains: port << 44, chip id
<< 32, and number of attempts to find an alternate route
- Cause / Action:
Cause: A traversable route between the local
cell and the specified destination crossbar could not be found Action: Capture
IPMI event logs. Contact your HP Support Representative to investigate the
fabric subsystem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5188
- Severity: MAJOR
- Event Summary: Firmware was unable to set the APER "fabric
lockdown" bit
- Event Class: System
- Problem Description:
Data field details the cell that failed to
set its bit
- Cause / Action: Cause: Internal FW error. Action: Reset partition
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5189
- Severity: MAJOR
- Event Summary: Error occurred while getting the expected neighbor
for distributed routing
- Event Class: System
- Problem Description:
System firmware experienced an error while
retrieving the expected neighbor fabric chip for distributing traffic over
crossbar links. Data Field: (xbc port # << 44) | (xbc # << 32) |
return status
- Cause / Action: Cause: Unexpected error retrieving information from
the fabric graph. Action: Collect IPMI event logs and contact your HP Support
Representative to investigate the fabric system firmware subsystem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5190
- Severity: MAJOR
- Event Summary: Could not read crossbar neighbor information during
routing
- Event Class: System
- Problem Description:
Could not read crossbar neighbor
information during routing. Data Field: (xbc port << 44) | (xbc <<
32) | error status
- Cause / Action: Cause: System Firmware encountered an error while
reading a fabric crossbar chip. Action: Collect IPMI Event Logs and contact
your HP Service Representative to investigate the fabric subsystem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5191
- Severity: MAJOR
- Event Summary: Could not establish route for local crossbar during
route around routing
- Event Class: System
- Problem Description:
Could not establish route for local
crossbar during route around routing. Data Field: (xbc port << 44) |
(xbc << 32) | error status
- Cause / Action: Cause: System Firmware could not establish routing
from the local crossbar that would be needed to complete the route around
routing. Action: Collect IPMI Event Logs and contact your HP Support
Representative to investigate the fabric subsystem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5192
- Severity: FATAL
- Event Summary: The PDHC has inhibited a processor due to a power or
thermal fault.
- Event Class: System
- Problem Description:
A CPU module's temperature has exceed the
high temperature threshold or a CPU power module fault has occurred. As a
result of this event, the CPU has been inhibited. The Cell must be powered off
then on using the MP's PE command before the CPU module will be powered again.
- Cause / Action: Cause: A CPU module's temperature has exceed the
high temperature threshold or a CPU power module fault has occurred. Action:
Contact HP support personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5193
- Severity: CRITICAL
- Event Summary: Invalid or unsupported TPM in use.
- Event Class: System
- Problem Description:
An invalid TPM has been detected in the
system and is being use. Security might be compromised. Data field unused.
- Cause / Action: Cause: An invalid, non supported TPM is being used
in the system Action: Contact HP for support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5194
- Severity: WARNING
- Event Summary: Invalid TPM detected in the system. TPM is disabled.
- Event Class: System
- Problem Description:
Invalid or unsupported TPM detected in the
system. TPM is disabled. Data filed unused
- Cause / Action: Cause: An invalid or unsupported TPM has been
detected in the system. TPM is disabled. Action: Contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5195
- Severity: WARNING
- Event Summary: TPM initialization failed.
- Event Class: System
- Problem Description:
Firmware cannot initialize the TPM. TPM
related security feature are disabled Data field unused
- Cause / Action:
Cause: Bad TPM chip Action: contact HP support
Cause: Communication failure between the I/O board and the TPM Action: replace
the I/O board. Contact HP support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5230
- Severity: WARNING
- Event Summary: PCI interlock has been opened with slot power on.
- Event Class: System
- Problem Description:
PCI interlock has been opened with PCI slot
power on. The data field can be decoded as follows (where byte 0 is the
rightmost byte): byte offset 2:PCI slot number byte offset 5: IO Chassis
number byte offset 6: IO Bay number byte offset 7: Cabinet number
- Cause / Action:
Cause: PCI interlock has been opened with PCI
slot power on. Action: power down slot attempting to remove PCI card.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5231
- Severity: INFORMATION
- Event Summary: System backplane Hot Swap Oscillator (HSO) boards
are NOT redundant.
- Event Class: System
- Problem Description:
System backplane Hot Swap Oscillator (HSO)
boards are NOT redundant.
- Cause / Action: Cause: HSO assembly has been removed or has a fault
condition. Action: look for additional events describing action to take.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5232
- Severity: CRITICAL
- Event Summary: Windows IML: Fan failure on serial attached SCSI
device, FATAL.
- Event Class: System
- Problem Description:
A fan failure on a serial attached SCSI
device, this is a FATAL error.
- Cause / Action: Cause: Power to the fan has been disconnected or
the fan is not working. Action: Restore power or replace the fan.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5233
- Severity: WARNING
- Event Summary: Windows IML: Fan failure on serial attached SCSI
device, warning
- Event Class: System
- Problem Description:
A fan failure on serial attached SCSI
device, this is a warning.
- Cause / Action: Cause: Power to the fan has been disconnected or
the fan is not working. Action: Restore power or replace the fan.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5234
- Severity: WARNING
- Event Summary: Windows IML: Overheat condition on serial attached
SCSI device, warning
- Event Class: System
- Problem Description:
A overheated condition on the serial
attached SCSI device, this is a warning.
- Cause / Action: Cause: The temperature of the serial attached SCSI
device is high. Action: Check fans to make sure they are working, check room
temperature.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5235
- Severity: WARNING
- Event Summary: Windows IML: Power supply failure on serial attached
device enclosure, warning
- Event Class: System
- Problem Description:
A power supply failure on serial attached
device enclosure occurred.
- Cause / Action: Cause: Power has been removed a powers supply on
the SAS enclosure or the power supply is bad. Action: Check to see if the
power cable is connected and working.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5236
- Severity: CRITICAL
- Event Summary: Windows IML: A device in the serial attached SCSI
enclosure has failed.
- Event Class: System
- Problem Description:
A device in the serial attached SCSI
enclosure has failed.
- Cause / Action: Cause: A SCSI drive in the SAS enclosure has
failed. Action: Replace the failed SCSI drive.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5237
- Severity: MAJOR
- Event Summary: The specified IO backplane low dropout regulator is
reporting a fault.
- Event Class: System
- Problem Description:
The specified low dropout regulator on the
IO backplane is reporting a fault. The specified power rail is reporting a
fault. The number in the slot field of the physical location is the rail
number that caused the fault. Rails are: 0 = PCI brick 12v, 1 = PCI brick
-12v, 2 = PCI brick 5v, 3 = PCI brick 3.3v, 4 = LDO 2.5v for Esca 0, 5 = LDO
2.5v for Esca 1, 6 = VRMs 1-4 1.5v, 7 = Other backplane 2.5v, 8 = Other
backplane 1.2v
- Cause / Action: Cause: A defective component could cause this
fault. Action: Check the backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5238
- Severity: CRITICAL
- Event Summary: The specified low dropout regulator on the main
backplane is reporting a fault.
- Event Class: System
- Problem Description:
The specified low dropout regulator on the
main backplane is reporting a fault. The number in the slot field of the
physical location is the rail number that caused the fault. Rails are: 0 =
SysBP xb0 1.5v, 1 = SysBp xb1 1.5v, 2 = SysBp switched 3.3v, 3 = SysBp LDO xb0
2.5v, 4 = SysBp xb1 LDO 2.5v, 5 = SysBp LDO 1.5v br0, 6 = SysBp LDO 1.5v br1
- Cause / Action: Cause: The specified low dropout regulator on the
main backplane is reporting a fault. Action: Check the main backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5239
- Severity: CRITICAL
- Event Summary: The specified IO backplane vrm is reporting a
voltage fault.
- Event Class: System
- Problem Description:
The specified vrm is reporting a voltage
fault.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5240
- Severity: CRITICAL
- Event Summary: The specified IO backplane power brick is reporting
a voltage fault.
- Event Class: System
- Problem Description:
The specified vrm is reporting a fault.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5241
- Severity: CRITICAL
- Event Summary: The specified IO backplane vrm is reporting a
temperature fault.
- Event Class: System
- Problem Description:
The specified vrm is reporting a fault.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5242
- Severity: CRITICAL
- Event Summary: The specified IO backplane power brick is reporting
a temperature fault.
- Event Class: System
- Problem Description:
The specified vrm is reporting a fault.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5243
- Severity: CRITICAL
- Event Summary: The specified IO backplane vrm rail is reporting a
fault.
- Event Class: System
- Problem Description:
The specified vrm is reporting a fault.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5244
- Severity: CRITICAL
- Event Summary: The specified IO backplane power brick is reporting
a rail fault.
- Event Class: System
- Problem Description:
The specified vrm is reporting a fault.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5245
- Severity: CRITICAL
- Event Summary: The specified system backplane vrm is reporting a
module voltage fault.
- Event Class: System
- Problem Description:
The specified vrm is reporting a fault.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5246
- Severity: CRITICAL
- Event Summary: The specified system backplane vrm is reporting a
temperature fault.
- Event Class: System
- Problem Description:
The specified vrm is reporting a fault.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5247
- Severity: CRITICAL
- Event Summary: The specified system backplane rail is reporting a
fault
- Event Class: System
- Problem Description:
The specified vrm is reporting a fault.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5248
- Severity: WARNING
- Event Summary: Windows crash dump file has not been enabled
- Event Class: System
- Problem Description:
The Windows crash dump file has not been
enabled for this system.
- Cause / Action: Cause: The system has not been configured to save
debugging information in the event of a system crash. Action: Configure
writing debugging information using Windows Control
Panel>System>Advance>Startup and Recovery
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5249
- Severity: WARNING
- Event Summary: Windows paging file size or the memory dump target
volume is too small.
- Event Class: System
- Problem Description:
The paging file or the target volume of the
memory dump file is not large enough to store the crash dump information in
the event of a system crash.
- Cause / Action: Cause: The paging file or the target volume of the
memory dump file is not large enough to store the crash dump information in
the event of a system crash. Action: Check the crash dump settings for the
recommended Paging File size using the HP System Management Home Page or use
Startup and Recovery under Windows System Properties. Also execute the disk
cleanup to free some space on the target volume of the memory dump file.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5250
- Severity: CRITICAL
- Event Summary: The hot swap controller for the specified core IO is
reporting fault.
- Event Class: System
- Problem Description:
The hot swap controller for the specified
core IO is reporting fault.
- Cause / Action: Cause: The hot swap controller for the specified
core IO is reporting fault. Action: Check the specified core IO.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5251
- Severity: MAJOR
- Event Summary: An error occurred while retrieving the crossbar chip
number
- Event Class: System
- Problem Description:
An error occurred while retrieving the
crossbar chip number. Data Field: cell number << 56 | port number
<< 44 | return status
- Cause / Action: Cause: System firmware had difficulty determining
the crossbar number from the graph data structure Action: Capture IPMI event
logs, look for additional errors. Contact your HP Support Representative to
analyze the fabric
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5252
- Severity: MAJOR
- Event Summary: Error occurred attempting to write to the
PJ_ERR_FE_WIRE_MASK register.
- Event Class: System
- Problem Description:
Attempt to write to PJ_ERR_FE_WIRE_MASK for
the purpose of disabling ALB error signaling before cell reset has failed. The
most significant byte of the data field contains the target cell for the
register write.
- Cause / Action: Cause: Intercell connectivity or other hardware
failure. Action: No action necessary. Cell reset will be attempted regardless.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5253
- Severity: MAJOR
- Event Summary: Error occurred attempting to write to the
PJ_ERR_DR_WIRE_MASK register.
- Event Class: System
- Problem Description:
Attempt to write to PJ_ERR_DR_WIRE_MASK for
the purpose of disabling ALB error signaling before cell reset has failed. The
most significant byte of the data field contains the target cell for the
register write.
- Cause / Action: Cause: Intercell connectivity or other hardware
failure. Action: No action necessary. Cell reset will be attempted regardless.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5254
- Severity: MAJOR
- Event Summary: Error occurred attempting to read the
PJ_ERR_FE_WIRE_MASK register.
- Event Class: System
- Problem Description:
Attempt to read PJ_ERR_FE_WIRE_MASK for the
purpose of disabling ALB error signaling before cell reset has failed. The
most significant byte of the data field contains the target cell number. The
remainder is failure status.
- Cause / Action: Cause: Intercell connectivity or other hardware
failure. Action: No action necessary. Cell reset will be attempted regardless.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5255
- Severity: MAJOR
- Event Summary: Error occurred attempting to read the
PJ_ERR_DR_WIRE_MASK register.
- Event Class: System
- Problem Description:
Attempt to read PJ_ERR_DR_WIRE_MASK for the
purpose of disabling ALB error signaling before cell reset has failed. The
most significant byte of the data field contains the target cell number. The
remainder is the failure status.
- Cause / Action: Cause: Intercell connectivity or other hardware
problem. Action: No action necessary. Cell reset will be attempted regardless.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5256
- Severity: MAJOR
- Event Summary: Error occurred attempting to read back the
PJ_ERR_FE_WIRE_MASK register.
- Event Class: System
- Problem Description:
Attempt to read back PJ_ERR_FE_WIRE_MASK
after write for the purpose of disabling ALB error signaling before cell reset
has failed. The most significant byte of the data field contains the target
cell for the register write. The remainder is the failure status.
- Cause / Action: Cause: Intercell connectivity or other hardware
failure. Action: No action necessary. Cell reset will be attempted regardless.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5257
- Severity: MAJOR
- Event Summary: Error occurred attempting to read back the
PJ_ERR_FE_WIRE mask register.
- Event Class: System
- Problem Description:
Attempt to read back PJ_ERR_DR_WIRE_MASK
after write for the purpose of disabling ALB error signaling before cell reset
has failed. The most significant byte of the data field contains the target
cell number for the register write. The remainder is the failure status.
- Cause / Action: Cause: Intercell connectivity or other hardware
failure. Action: No action necessary. Cell reset will be attempted regardless.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5259
- Severity: WARNING
- Event Summary: One or more of the cells are not configured for
Cell-Local Memory (CLM) mode.
- Event Class: System
- Problem Description:
For most workloads, the system performance
may be degraded if CLM mode is not enabled for all of the cells in a
partition. Windows Server 2003 contains additional support to optimize for CLM
systems, increasing the performance compared to an interleaved configuration.
- Cause / Action: Cause: The initial partition configuration may have
not specified 100% Cell-Local Memory or may have been reconfigured
incorrectly. Action: Reconfigure the partition or reset the CLM setting on one
or more cells in your partition using the Partition Manager or ParCLI tools.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5260
- Severity: CRITICAL
- Event Summary: Multiple double-chip sparing has been invoked
- Event Class: System
- Problem Description:
Multiple double-chip spares have been
invoked
- Cause / Action: Cause: System firmware has detected and corrected
memory errors. Multiple double-chip spares have been invoked to help mitigate
this condition. Although the memory errors are corrected and multiple
double-chip spares have been invoked, this condition may indicate a potential
problem. Action: Monitor the situation and contact your HP support
representative to check the affected hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5261
- Severity: WARNING
- Event Summary: A platform error was detected by the
firmware/hardware, and corrected by using a spare channel.
- Event Class: System
- Problem Description:
A error occurred in the crossbar controller
and was corrected by the hardware.
- Cause / Action:
Cause: Hardware has detected many link retries
on one of its channels and has switched to a spare one. Action: Monitor the
situation and contact your HP support representative to check the affected
hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5262
- Severity: WARNING
- Event Summary: CPUs have automatically been reconfigured. This
could be due to all CPUs being deconfigured, or a new type of CPU installed or
NVM clearing due to a SFW version change.
- Event Class: System
- Problem Description:
All CPUs have been auto reconfigured
- Cause / Action: Cause: Either all CPUs were slated for
deconfiguration, or NVM has been cleared, or a new CPU type has been
installed. Action: No action is required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5263
- Severity: WARNING
- Event Summary: A platform error was detected by the
firmware/hardware, and corrected by using a spare channel.
- Event Class: System
- Problem Description:
A platform error was detected by the
firmware/hardware, and corrected by using a spare channel. The error occurred
between the crossbar chips on the backplane(s).
- Cause / Action:
Cause: The error occurred between the crossbar
chips on the backplane(s). Action: Contact your support representative to have
the backplane and / or connections between backplanes checked
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5264
- Severity: WARNING
- Event Summary: A platform error was detected by the
firmware/hardware, and corrected by using a spare channel.
- Event Class: System
- Problem Description:
A platform error was detected by the
firmware/hardware, and corrected by using a spare channel. The error occurred
between the cell controller and the IO controller.
- Cause / Action:
Cause: The error occurred between the cell
controller and the IO controller. Action: Contact your support representative
to have the Cell Controller to IO Interface checked. Check these FRUs: Cell-IO
cable, IO chassis, IO backplane, cell, or system backplane
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5265
- Severity: WARNING
- Event Summary: Multiple platform errors were detected and corrected
by the firmware/hardware.
- Event Class: System
- Problem Description:
Multiple platform errors were detected and
corrected by the firmware/hardware. The errors occurred between the cell
controller and the backplane.
- Cause / Action: Cause: The errors occurred between the cell
controller and the backplane. Action: Contact your support representative to
have the Cell Controller to backplane Interface checked.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5266
- Severity: WARNING
- Event Summary: Multiple platform errors were detected and corrected
by the firmware/hardware.
- Event Class: System
- Problem Description:
Multiple platform errors were detected and
corrected by the firmware/hardware. The errors occurred between the crossbar
chips on the backplane(s).
- Cause / Action: Cause: The errors occurred between the crossbar
chips on the backplane(s). Action: Contact your support representative to have
the backplane and / or connections between backplanes checked.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5267
- Severity: WARNING
- Event Summary: Multiple platform errors were detected and corrected
by the firmware/hardware.
- Event Class: System
- Problem Description:
Multiple platform errors were detected and
corrected by the firmware/hardware. The errors occurred between the cell
controller and the IO controller.
- Cause / Action: Cause: The errors occurred between the cell
controller and the IO controller. Action: Contact your support representative
to have the Cell Controller to IO Interface checked. Check these FRUs: Cell-IO
cable, IO chassis, IO backplane, cell, or system backplane.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5269
- Severity: FATAL
- Event Summary: The server identification information does not match
the cabinet type.
- Event Class: System
- Problem Description:
The server identification information does
not match the cabinet type. The server will not power on until this condition
is resolved.
- Cause / Action: Cause: The cabinet type is not in agreement with
the server identification information. The server identification information
is viewable using the ID command from the MP's command menu. The cabinet type
is displayed in the top line of the cabinet status shown when the PS command
from the MP's command menu is used to target a compute cabinet. Action:
Contact your support representative to have the server identification
information and cabinet type checked.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 5270
- Severity: WARNING
- Event Summary: All CPUs are slated for deconfiguration
- Event Class: System
- Problem Description:
All CPUs were scheduled to be deconfigured
- Cause / Action: Cause: All CPUs were scheduled to be deconfigured
through a combination of user deconfigures and/or failures Action: No action
is required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6517
- Severity: MAJOR
- Event Summary: Found an unexpected or unknown backplane type during
fabric phase preparation
- Event Class: System
- Problem Description:
Fabric was provided with a backplane type
that is unrecognized
- Cause / Action: Cause: Unknown system type Action: Contact HP
Support personnel to analyze the backplanes and activity logs
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6518
- Severity: WARNING
- Event Summary: Getting of a semaphore in the memory PDC code
failed.
- Event Class: System
- Problem Description:
Getting of a semaphore in the memory PDC
code failed. The data field is the sm4 status.
- Cause / Action:
Cause: Getting of a semaphore in the memory PDC
code failed. Action: Contact HP Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6519
- Severity: MAJOR
- Event Summary: Failed getting fabric information during procedure
call
- Event Class: System
- Problem Description:
Failed getting fabric information during
procedure call. Data Field: return status
- Cause / Action: Cause: Firmware failed retrieving fabric portion of
the cell information structure. Action: Collect IPMI Event Logs. Look for
additional error logs providing original cause. Contact your HP Support
Representative.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6520
- Severity: CRITICAL
- Event Summary: Failed to get a valid pointer to the fabric section
of the cell information structure
- Event Class: System
- Problem Description:
Failed to get a valid pointer to the fabric
section of the cell information structure
- Cause / Action:
Cause: Could not get a valid address for the
cell information Action: Collect all IPMI event logs. Contact your HP Support
Representative.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6521
- Severity: CRITICAL
- Event Summary: System firmware was unable to release the micro
semaphore
- Event Class: System
- Problem Description:
System firmware was unable to release the
micro semaphore
- Cause / Action: Cause: An error occurred while updating the fabric
portion of cell info. The partition cannot boot. Action: Reboot partition,
Gather IPMI Event Logs, Contact your HP Support Representative
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6522
- Severity: CRITICAL
- Event Summary: Fabric phase 0 of routing failed such that the cell
cannot continue
- Event Class: System
- Problem Description:
Fabric phase 0 of routing failed such that
the cell cannot continue
- Cause / Action: Cause: Fabric phase 0 of routing failed such that
the cell cannot continue Action: Check for additional IPMI Events. Contact
your HP Support representative to diagnose the fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6523
- Severity: FATAL
- Event Summary: The attempt to distribute the new alive set failed.
- Event Class: System
- Problem Description:
The attempt to distribute the new alive set
failed.
- Cause / Action: Cause: Possible hardware issues on cell board or
system backplane. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6524
- Severity: FATAL
- Event Summary: Could not clear the OLA FATAL flag.
- Event Class: System
- Problem Description:
Attempt to clear OLA FATAL flag failed.
- Cause / Action: Cause: Hardware issue may have caused a fabric
error. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6525
- Severity: FATAL
- Event Summary: Failed to copy core data to OLA cell
- Event Class: System
- Problem Description:
Failed to copy core data to OLA cell.
- Cause / Action: Cause: Problem in PDC may have caused bad CRC in
control structure. Action: Check for PDC updates that may have fixed the
problem. Contact HP Support for investigation. Cause: Hardware problem on cell
board or system backplane may have caused semaphore or fabric error. Action:
Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6526
- Severity: FATAL
- Event Summary: OLA cell could not update MCAST registers on PD it
is going to join.
- Event Class: System
- Problem Description:
OLA cell could not update MCAST registers
on PD it is going to join.
- Cause / Action: Cause: Unable to write registers on the CC.
Probable hardware issue. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6527
- Severity: MAJOR
- Event Summary: OL* cell unable to get update MCAST address.
- Event Class: System
- Problem Description:
OL* cell unable to get update MCAST
address.
- Cause / Action: Cause: Could be problem in PDC. Action: Check for
PDC updates that may fix this problem. Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6528
- Severity: FATAL
- Event Summary: Failed to update needed CC registers for OLA.
- Event Class: System
- Problem Description:
Failed to update needed CC registers for
OLA.
- Cause / Action: Cause: Unable to write registers on the CC.
Probable hardware issue. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6529
- Severity: FATAL
- Event Summary: PD failed to update MCAST registers on OLA cell.
- Event Class: System
- Problem Description:
PD failed to update MCAST registers on OLA
cell.
- Cause / Action: Cause: Unable to write registers on the CC.
Probable hardware issue. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6530
- Severity: MAJOR
- Event Summary: OLA cell could not read wakeup vector.
- Event Class: System
- Problem Description:
OLA cell could not read wakeup vector.
- Cause / Action: Cause: Could be problem in PDC. Action: Check for
PDC updates that may fix this problem. Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6531
- Severity: MAJOR
- Event Summary: Firmware unable to set ready to OLA bit in ICM.
- Event Class: System
- Problem Description:
Firmware unable to set ready to OLA bit in
ICM.
- Cause / Action: Cause: Possible error in PDC. Action: Check for
updates to PDC that may fix this issue. Contact HP Support if no PDC update is
available.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6532
- Severity: CRITICAL
- Event Summary: Can not read config set, SM4 unlocked.
- Event Class: System
- Problem Description:
Can not read config set, SM4 unlocked.
- Cause / Action: Cause: Could be problem with cell hardware. Action:
Contact HP Support for investigation. Cause: Could be problem in PDC. Action:
Check for PDC updates that may fix this problem. Contact HP Support for
investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6533
- Severity: CRITICAL
- Event Summary: Can not read config set, error accessing profile.
- Event Class: System
- Problem Description:
Can not read config set, error accessing
profile.
- Cause / Action: Cause: Could be problem in PDC. Action: Check for
PDC updates that may fix this problem. Contact HP Support for investigation.
Cause: May be problem with cell hardware or MP. Action: Contact HP Support for
investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6534
- Severity: CRITICAL
- Event Summary: Error reading config set.
- Event Class: System
- Problem Description:
Error reading config set.
- Cause / Action:
Cause: Possible error in PDC. Action: Check for
updates to PDC that may fix this issue. Contact HP Support if no PDC update is
available.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6535
- Severity: CRITICAL
- Event Summary: Failed to read poll flag value from core cell.
- Event Class: System
- Problem Description:
Failed to read poll flag value from core
cell.
- Cause / Action: Cause: Possible error in PDC. Action: Check for
updates to PDC that may fix this issue. Contact HP Support if no PDC update is
available.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6536
- Severity: CRITICAL
- Event Summary: Unable to read core cell number.
- Event Class: System
- Problem Description:
Unable to read core cell number.
- Cause / Action:
Cause: Valid bit not set in PDH register. Could
be a hardware or firmware issue. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6537
- Severity: CRITICAL
- Event Summary: Attempted to OLD root cell in PD.
- Event Class: System
- Problem Description:
Attempted to OLD root cell in PD.
- Cause / Action:
Cause: Can not OLD root cell in the PD. Action:
Try OLD again, but select a different cell.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6538
- Severity: CRITICAL
- Event Summary: Target cell has interleaved memory, can not OLD.
- Event Class: System
- Problem Description:
Target cell has interleaved memory, can not
OLD.
- Cause / Action: Cause: Can not OLD a cell with interleaved memory.
Action: Select another cell for OLD, one that has no interleaved memory.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6539
- Severity: CRITICAL
- Event Summary: Unable to determine if all CPUs on target are
stopped.
- Event Class: System
- Problem Description:
Unable to determine if all CPUs on target
are stopped.
- Cause / Action: Cause: Possible error in PDC. Action: Check for
updates to PDC that may fix this issue. Contact HP Support if no PDC update is
available.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6540
- Severity: CRITICAL
- Event Summary: At least one CPU on target cell was not stopped.
- Event Class: System
- Problem Description:
At least one CPU on target cell was not
stopped.
- Cause / Action: Cause: OS may have failed to stop all CPUs on the
OLD cell. Action: Try OLD again, contact HP Support if problem persists.
Cause: Hardware problems may have prevented CPUs on target cell from stopping.
Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6541
- Severity: CRITICAL
- Event Summary: System firmware was unable to flush the SBA queues
for OLD.
- Event Class: System
- Problem Description:
System firmware was unable to flush the
SBA queues for OLD.
- Cause / Action: Cause: Could not flush SBA queues, probably due to
hardware problem with CC or SBA. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6542
- Severity: CRITICAL
- Event Summary: Failed to set OLD FATAL flag.
- Event Class: System
- Problem Description:
Failed to set OLD FATAL flag.
- Cause / Action:
Cause: Could not write to PDH. May be hardware
issue with cell board, or fabric problems. Action: Contact HP Support for
investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6543
- Severity: CRITICAL
- Event Summary: Failed to update PD structures for OLD.
- Event Class: System
- Problem Description:
Failed to update PD structures for OLD.
- Cause / Action: Cause: Possible hardware issues on cell board or
system backplane. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6544
- Severity: CRITICAL
- Event Summary: Failed to stop messages to OLD cell.
- Event Class: System
- Problem Description:
Failed to stop messages to OLD cell.
- Cause / Action: Cause: Unable to write registers on the CC.
Probable hardware issue. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6545
- Severity: CRITICAL
- Event Summary: PD failed to update MCAST registers on non core
cells.
- Event Class: System
- Problem Description:
PD failed to update MCAST registers on non
core cells.
- Cause / Action: Cause: Unable to write registers on the CC.
Probable hardware issue. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6546
- Severity: CRITICAL
- Event Summary: OLD cell failed to update MCAST registers on
executing cell.
- Event Class: System
- Problem Description:
OLD cell failed to update MCAST registers
on executing cell.
- Cause / Action: Cause: Unable to write registers on the CC.
Probable hardware issue. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6547
- Severity: CRITICAL
- Event Summary: Failed to stop messages generated from OLD cell.
- Event Class: System
- Problem Description:
Failed to stop messages generated from OLD
cell.
- Cause / Action: Cause: Unable to write registers on the CC.
Probable hardware issue. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6548
- Severity: CRITICAL
- Event Summary: Failed to update coherency set on OLD cell.
- Event Class: System
- Problem Description:
Failed to update coherency set on OLD cell.
- Cause / Action: Cause: Unable to write registers on the CC.
Probable hardware issue. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6549
- Severity: CRITICAL
- Event Summary: Failed to update cell map on OLD cell.
- Event Class: System
- Problem Description:
Failed to update cell map on OLD cell.
- Cause / Action:
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6550
- Severity: CRITICAL
- Event Summary: Failed to set BIB on OLD cell.
- Event Class: System
- Problem Description:
Failed to set BIB on OLD cell.
- Cause / Action:
Cause: Write to PDH failed. Could be due to
fabric or hardware issues. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6551
- Severity: CRITICAL
- Event Summary: Failed to disable XIN link.
- Event Class: System
- Problem Description:
Failed to disable XIN link.
- Cause / Action:
Cause: Unable to write registers on the CC.
Probable hardware issue. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6552
- Severity: CRITICAL
- Event Summary: Failed to stop processing transactions from OLD
cell.
- Event Class: System
- Problem Description:
Failed to stop processing transactions from
OLD cell.
- Cause / Action: Cause: Unable to write registers on the CC.
Probable hardware issue. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6553
- Severity: CRITICAL
- Event Summary: Failed to clear OLD FATAL section flag.
- Event Class: System
- Problem Description:
Failed to clear OLD FATAL section flag.
- Cause / Action: Cause: Hardware issue may have caused a fabric
error. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6554
- Severity: CRITICAL
- Event Summary: Failed to update OLD poll flag.
- Event Class: System
- Problem Description:
Failed to update OLD poll flag.
- Cause / Action:
Cause: Probably something wrong with cell
hardware. Action: Contact HP Support Personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6555
- Severity: CRITICAL
- Event Summary: Unable to update cell info cell state on OLA cell.
- Event Class: System
- Problem Description:
Unable to update cell info cell state on
OLA cell.
- Cause / Action: Cause: Unable to access cell information in PDH.
Probably due to bad cell hardware. Action: Contact HP Support for
investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6556
- Severity: CRITICAL
- Event Summary: Failed to change OLA cell state to OLA
rendezvousing.
- Event Class: System
- Problem Description:
Failed to change OLA cell state to OLA
rendezvousing.
- Cause / Action: Cause: Could not write to CC register. Probably due
to bad cell hardware. Action: Contact HP support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6557
- Severity: CRITICAL
- Event Summary: Attempt to clear the OLA steering bit failed.
- Event Class: System
- Problem Description:
Attempt to clear the OLA steering bit
failed.
- Cause / Action: Cause: Could not clear register in PDH. Probably
due to problem with cell hardware. Action: Contact HP Support for
investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6558
- Severity: CRITICAL
- Event Summary: Failed to change OLA cell state to OLA slave cell.
- Event Class: System
- Problem Description:
Failed to change OLA cell state to OLA
slave cell.
- Cause / Action: Cause: Could not write to CC register. Probably due
to bad cell hardware. Action: Contact HP support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6559
- Severity: CRITICAL
- Event Summary: Unable to read cell assignment control bits from
complex profile.
- Event Class: System
- Problem Description:
Unable to read cell assignment control bits
from complex profile.
- Cause / Action: Cause: Semaphore was unlocked. Could be an MP,
PDHC, or PDC related problem. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6560
- Severity: CRITICAL
- Event Summary: Failed to set OLD error value.
- Event Class: System
- Problem Description:
Failed to set OLD error value.
- Cause / Action:
Cause: Probably something wrong with cell
hardware. Action: Contact HP Support Personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6561
- Severity: CRITICAL
- Event Summary: Failed to read OLD error value.
- Event Class: System
- Problem Description:
Failed to read OLD error value.
- Cause / Action:
Cause: Probably something wrong with cell
hardware. Action: Contact HP Support Personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6562
- Severity: CRITICAL
- Event Summary: Failed to set OLD start timer value.
- Event Class: System
- Problem Description:
Failed to set OLD start timer value.
- Cause / Action: Cause: Probably something wrong with cell hardware.
Action: Contact HP Support Personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6563
- Severity: CRITICAL
- Event Summary: A memory subsystem parity error has occurred that is
not DIMM related.
- Event Class: System
- Problem Description:
A memory subsystem parity error has
occurred that is not DIMM related.
- Cause / Action: Cause: A non DIMM related memory subsystem parity
error has been detected. Action: The cell board could be faulty causing this
error. Cause: A non DIMM related memory subsystem parity error has been
detected. The system clocks could be causing these errors on multiple cells.
Action: Replace the system clock cable or related hardware.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6564
- Severity: WARNING
- Event Summary: Duplicate DIMM serial numbers have been detected.
- Event Class: System
- Problem Description:
Duplicate DIMM serial numbers have been
detected. Data field gives physical location of DIMM. Check for other events
specifying other DIMMs with same serial number.
- Cause / Action:
Cause: Multiple DIMMs have the same serial
number. Action: Replace DIMMs with duplicate serial numbers. Ensure new DIMMs
have unique serial numbers.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6565
- Severity: CRITICAL
- Event Summary: Failed to read OLD error value.
- Event Class: System
- Problem Description:
Failed to read OLD error value.
- Cause / Action:
Cause: Probably something wrong with cell
hardware. Action: Contact HP Support Personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6566
- Severity: CRITICAL
- Event Summary: Failed to set OLD error value.
- Event Class: System
- Problem Description:
Failed to set OLD error value.
- Cause / Action:
Cause: Probably something wrong with cell
hardware. Action: Contact HP Support Personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6567
- Severity: CRITICAL
- Event Summary: Failed to set OLD start timer value.
- Event Class: System
- Problem Description:
Failed to set OLD start timer value.
- Cause / Action: Cause: Probably something wrong with cell hardware.
Action: Contact HP Support Personnel to troubleshoot the problem.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6568
- Severity: CRITICAL
- Event Summary: Executing cell failed to update MCAST registers on
OLD cell.
- Event Class: System
- Problem Description:
Executing cell failed to update MCAST
registers on OLD cell.
- Cause / Action: Cause: Unable to write registers on the CC.
Probable hardware issue. Action: Contact HP Support for investigation.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6569
- Severity: FATAL
- Event Summary: An unrecoverable processor interrupt occurred.
- Event Class: System
- Problem Description:
An unrecoverable processor interrupt
occurred while in IPF firmware. The data value is the processor IFA.
- Cause / Action:
Cause: Probable bug in System Firmware, EFI
driver, EFI app or OS loader (prior to OS launch). The event data field
contains the IVT offset applicable to the interrupt. See table 5-7
(Interruption Vector Table) in the Intel ASDM Volume 2. Action: Provide
console log containing register dump to HP Customer Engineer.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6570
- Severity: FATAL
- Event Summary: An unrecoverable processor interrupt occurred.
- Event Class: System
- Problem Description:
An unrecoverable processor interrupt
occurred while in IPF firmware. The event data is the processor ISR.
- Cause / Action:
Cause: Probable bug in System Firmware, EFI
driver, EFI app or OS loader (prior to OS launch). The event data field
contains the IVT offset applicable to the interrupt. See table 5-7
(Interruption Vector Table) in the Intel ASDM Volume 2. Action: Provide
console log containing register dump to HP Customer Engineer.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6571
- Severity: WARNING
- Event Summary: Virus-like activity detected
- Event Class: System
- Problem Description:
The Virus Throttle Filter Driver detects
virus like activity.
- Cause / Action: Cause: The Virus Throttle Filter Driver detects
virus like activity. Action: Immediate action required.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6572
- Severity: WARNING
- Event Summary: Double chip sparing has been invoked.
- Event Class: System
- Problem Description:
System firmware has detected and corrected
memory errors. Double chip sparing has been invoked to help mitigate this
condition.
- Cause / Action: Cause: System firmware has detected and corrected
memory errors. Double chip sparing has been invoked to help mitigate this
condition. Action: It is advisable to monitor the situation and contact your
HP support representative to check the specified memory board(s) and/or the
cell board containing these memory components. If system firmware detects a
large number of errors during a future reboot, the affected echelons may be
deallocated.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6573
- Severity: UNKNOWN
- Event Summary: PCI card inaccessible (error status register)
- Event Class: System
- Problem Description: A PCI card is inaccessible due to an error.
The data word reports the error status register.
- Cause / Action:
Cause: No action required.
Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6574
- Severity: UNKNOWN
- Event Summary: PCI card inaccessible (function ID)
- Event Class: System
- Problem Description: A PCI card is inaccessible due to an error.
The data word contains the function ID.
- Cause / Action:
Cause: No action required.
Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6575
- Severity: FATAL
- Event Summary: External clock cable has been removed from the CPU
cabinet.
- Event Class: System
- Problem Description:
External clock cable has been removed from
the CPU cabinet. The data field contains the physical location of the cabinet.
- Cause / Action: Cause: External clock cable has been removed from
the CPU cabinet. Action: Check external clock cable.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6576
- Severity: MAJOR
- Event Summary: System Firmware found problems while opening a
fabric link
- Event Class: System
- Problem Description:
System Fabric encountered a link error
after opening up the fabric link
- Cause / Action: Cause: Fabric link errors were encountered Action:
Collect IPMI event logs. Contact your HP Support Representative to check the
health of the system fabric.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6577
- Severity: MAJOR
- Event Summary: Fatal errors are present on a cell's link to the
fabric
- Event Class: System
- Problem Description:
Fatal errors are present on a cell's link
to the fabric. Data Field: (crossbar port << 44) | (crossbar <<
32) | 1
- Cause / Action: Cause: Fatal errors are present on the cell to
crossbar links Action: Collect IPMI Event Logs. Contact your HP Support
Representative to check the fabric subsystem health.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6578
- Severity: MAJOR
- Event Summary: System Firmware experienced an error while updating
internal data
- Event Class: System
- Problem Description:
System Firmware experienced an error while
updating internal data. Data Field: (crossbar port << 44) | (crossbar
<< 32)
- Cause / Action: Cause: Firmware was unable to update its fabric
graph Action: Collect IPMI Event Logs. Contact your HP Support Representative
to check the fabric subsystem health.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6579
- Severity: MAJOR
- Event Summary: System firmware encountered fabric problems
- Event Class: System
- Problem Description:
System firmware encountered fabric
problems. Data Field: (crossbar port << 44) | (crossbar << 32)
- Cause / Action: Cause: Errors occurred while collecting error data
from a fabric link Action: Collect IPMI Event Logs. Contact your HP Support
Representative to check the fabric subsystem health.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6580
- Severity: MAJOR
- Event Summary: Fabric errors are present on a cell's link to the
fabric.
- Event Class: System
- Problem Description:
Fatal errors are present on a cell's link
to the fabric. Data Field: (crossbar port << 44) | (crossbar <<
32)
- Cause / Action: Cause: Fatal errors are present on a cell to
crossbar link Action: Collect IPMI Event Logs. Contact your HP Support
Representative to check the fabric subsystem health.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6581
- Severity: FATAL
- Event Summary: The request to power on was denied by the Enclosure
Manager
- Event Class: System
- Problem Description:
The request to power on, either via MP user
interface, power button, or other, was denied by the Enclosure Manager.
- Cause / Action: Cause: The enclosure may not have sufficient power
for the new blade Action: Make sure enclosure has the appropriate number and
configuration of power supplies. Reduce power required by the blade (for
instance, remove some memory or a CPU.) Or, if there are powered-on blades
that are not in use, power them off.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6582
- Severity: WARNING
- Event Summary: Power was forced to ON without permission from the
Enclosure Manager
- Event Class: System
- Problem Description:
The blade has been forced to power-on,
either via the MP user interface or the power button. The blade did not
negotiate with the enclosure to insure that there was sufficient power for
this action.
- Cause / Action: Cause: A forced power on occurred. Usually this is
performed when a normal power on request is denied. The enclosure may now be
running in a non-redundant or over-budget power configuration. Action: Make
sure enclosure has the appropriate number and configuration of power supplies.
Reduce power required by the blade (for instance, remove some memory or a
CPU.) Or, if there are powered-on blades that are not in use, power them off.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6583
- Severity: WARNING
- Event Summary: MP still unable to communicate with EM after
multiple retries.
- Event Class: System
- Problem Description:
The MP has repeatedly tried to communicate
with the enclosure manager and has not received any response
- Cause / Action: Cause: The MP has repeatedly requested
communication with the enclosure manager and has not received a response. This
may be because the EM has a fault condition, or is busy, being reset, or
removed. Action: The communication loss may have been temporary and nothing
needs to be done if the communication is now working. Check the Fault LED on
the enclosure manager. Reset the enclosure manager. Reset the management
processor. Replace the enclosure manager
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6584
- Severity: UNKNOWN
- Event Summary: the fabric API failed to complete phase 0
- Event Class: System
- Problem Description:
fabric failed to complete phase 0 of the
API
- Cause / Action: Cause: A problem occurred while setting up fabric.
Action: Collect IPMI Event Logs. Contact your HP Support Representative.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6585
- Severity: UNKNOWN
- Event Summary: there was a problem performing phase one of the
fabric API
- Event Class: System
- Problem Description:
there was a problem completing phase 1 of
the fabric API
- Cause / Action: Cause: error setting up the fabric Action: Refer to
additional events to indicate more detail on the failure
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6586
- Severity: UNKNOWN
- Event Summary: failure while running phase 1 of the fabric API
- Event Class: System
- Problem Description:
failure while running phase 1 of the fabric
API
- Cause / Action: Cause: error setting up the fabric Action: Refer to
additional events to indicate more detail on the failure
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6588
- Severity: FATAL
- Event Summary: DNA 2.0 version Unsupported - NOT USED
- Event Class: System
- Problem Description:
This chassis code is currently unused. This
chassis code was supposed to be used for DNA version 3.0 support where DNA
version 2.0 was not supported
- Cause / Action: Cause: PDC error Action: Upgrade PDC and report to
PDC team
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6589
- Severity: FATAL
- Event Summary: PDC problem retrieving DNA version number
- Event Class: System
- Problem Description:
PDC has a problem trying to retrieve the
DNA version number of the current cell. This is a fatal error and will result
in the halting of the cell.
- Cause / Action: Cause: DNA version unsupported Action: Contact HP
Support personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6590
- Severity: FATAL
- Event Summary: PDC received invalid DNA version parameter
- Event Class: System
- Problem Description:
PDC is attempting to set the memory
controller configuration value based upon the given DNA version parameter. The
DNA version passed in is not supported. This is related to the
MEM_INVALID_DNA_SETTING_VALUE chassis code which reports the invalid DNA
version number.
- Cause / Action: Cause: PDC problem - PDC validates parameters
passed into routine setting memory configuration value Action: Upgrade PDC and
report problem to PDC team
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6591
- Severity: FATAL
- Event Summary: The MID bus parameter to be checked for a memory
parity error is invalid
- Event Class: System
- Problem Description:
The MID bus number that identifies the MID
bus to be checked for a memory parity error is not MID bus 0 or 1. This is a
fatal error and will result in the halting of the cell.
- Cause / Action:
Cause: PDC error Action: Update PDC and report
problem to PDC team
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6592
- Severity: FATAL
- Event Summary: PDC read of cell board version number failed
- Event Class: System
- Problem Description:
PDC attempted to read PDH register
containing cell board version. The return status indicated failure in attempt
to read register. This is a fatal error and will result in the halting of the
cell.
- Cause / Action: Cause: Cell board error Action: Contact HP Support
personnel to troubleshoot the problem
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6593
- Severity: FATAL
- Event Summary: Error trying to read MDP primary error mode register
- Event Class: System
- Problem Description:
PDC detected an error reading the MDP
primary error mode register on one of the MID buses. This should not happen
because the CcRead routine currently only returns SUCCESS status. This chassis
code is related to the MEM_READ_MDP_DATA_MDP_ADDR chassis code which indicates
the address of the MDP primary error mode register trying to be read.
- Cause / Action: Cause: PDC error Action: Upgrade PDC and report
error to PDC team
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6594
- Severity: FATAL
- Event Summary: Syndrome information could not be used to pin point
SBE DIMM
- Event Class: System
- Problem Description:
The syndrome information logged upon the
occurrence of an SBE memory error could not correctly pin point the DIMM
having the memory error. This was due to the syndrome information not matching
any of the syndrome values within the syndrome table within PDC firmware used
to correlate the syndrome value with a specific DIMM number.
- Cause / Action: Cause: This could be caused for a couple of
reasons. The first reason is that this DIMM was also detected to be the cause
of an MBE error whose DIMM cannot be determined based upon the syndrome
information. This could also be caused by a hardware or concorde problem where
the correct syndrome information was not logged by the cell controller.
Finally, this could be caused by a PDC error in which the syndrome information
was not correctly placed within the syndrome table that correlates the
syndrome value with the DIMM number. Upon occurrence of this chassis code, the
chassis code log containing this chassis code plus the associated
MEM_SYNDROME_DATA should be reported to the PDC team. Action: -
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6595
- Severity: WARNING
- Event Summary: NVM Crc checksum failure detected in the nvm control
structure while reading control data byte
- Event Class: System
- Problem Description:
Error detected while trying to perform a
read of the control structure. The crc stored within the structure did not
match the calculated crc.
- Cause / Action: Cause: firmware/software error Action: Check for
firmware/OS updates Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6596
- Severity: WARNING
- Event Summary: Crc error detected in the nvm control structure
while copying speedy boot value
- Event Class: System
- Problem Description:
Error was detected in the nvm control
structure while copying the speedy boot data. The crc value stored in the
control structure did not match the calculated crc data used to validate the
structure. The cell will be reset.
- Cause / Action:
Cause: software/firmware error Action: Check
for firmware/os updates Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6597
- Severity: WARNING
- Event Summary: NVM crc error detected while getting marked speedy
value from control structure
- Event Class: System
- Problem Description:
Firmware error was detected in getting the
marked speedy value from the nvm control structure. The crc value stored in
the structure did not match the calculated value. This indicates that a write
to the structure happened without updating the stored crc value. Cause /
Action:
Cause: Firmware/software error Action: Check for firmware/os
updates Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6598
- Severity: WARNING
- Event Summary: NVM crc failure detected in copying core cell
control structure
- Event Class: System
- Problem Description:
Firmware error was detected in trying to
copy the core cells control structure.. The crc value stored in the core
cell's structure did not match the calculated value. This indicates that a
write to the structure happened without updating the stored crc value. Cause /
Action:
Cause: Firmware/software problem Action: Check for
firmware/software updates Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6599
- Severity: WARNING
- Event Summary: NVM crc checksum failure detected during the
processing of the scroll flag.
- Event Class: System
- Problem Description:
Firmware detected a problem during the
processing of the display scroll flag. The crc value stored in the structure
did not match the calculated value. This indicates that a write to the
structure happened without updating the stored crc value. The partition will
be reset.
- Cause / Action: Cause: Firmware/software problem Action: Check for
firmware/os updates Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6600
- Severity: WARNING
- Event Summary: NVM crc error detected in processing of the display
of the boot id information.
- Event Class: System
- Problem Description:
Firmware detected an error in processing of
displaying the boot id information. The crc value stored in the cell status
structure did not match the calculated value. This indicates that a write to
the structure happened without updating the stored crc value. The partition
will be reset.
- Cause / Action: Cause: Firmware/software problem Action: Check for
firmware/os update Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6601
- Severity: WARNING
- Event Summary: NVM crc failure detected while trying to set the
boot id for the cell
- Event Class: System
- Problem Description:
Firmware detected an error while trying to
set the boot id for the cell. The crc value stored in the cell status
structure did not match the calculated value. The partition will be reset.
- Cause / Action: Cause: Firmware/software problem Action: Check for
firmware/os update Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6602
- Severity: WARNING
- Event Summary: NVM crc error was detected during the processing of
the boot console handler
- Event Class: System
- Problem Description:
NVM memory problem was detected during the
processing of the boot console handler. The computed crc value for the control
structure did not match the stored value. The partition will be reset.
- Cause / Action: Cause: Firmware/software problem Action: check for
updated firmware/os version Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6603
- Severity: WARNING
- Event Summary: NVM crc error was detected during the retrieval of
the pdh tests area.
- Event Class: System
- Problem Description:
Firmware detected an error during the
retrieval of the path tests area. The calculated crc value used to protect the
nvm control structure did not match the stored value. The cell will be reset.
- Cause / Action: Cause: Firmware/software error Action: Check for
new firmware/os version Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6604
- Severity: WARNING
- Event Summary: NVM corruption was detected during the display of
the fast boot settings
- Event Class: System
- Problem Description:
Firmware detected an error in the nvm
control structure during the display of the fast boot settings. The calculated
crc value of the structure did not match the stored value. The partition will
be reset.
- Cause / Action: Cause: Firmware/software problem Action: Check for
new version of firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6605
- Severity: WARNING
- Event Summary: Nvm corruption was detected during reading of the
rendezvous timeout value for the monarch processor
- Event Class: System
- Problem Description:
Firmware detected an error during the read
of the rendezvous timeout structure for the monarch processor. The crc
calculated for the rendezvous timeout structure did not match the stored
value. The cell will be reset.
- Cause / Action: Cause: Firmware/software problem Action: Check for
new version of firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6606
- Severity: WARNING
- Event Summary: Firmware detected an nvm crc error during the read
of the rendezvous timeout value
- Event Class: System
- Problem Description:
Firmware detected a crc error during the
read of the rendezvous timeout structure. The calculated crc value of the
rendezvous timeout structure did not match the stored value. The cell will be
reset.
- Cause / Action: Cause: Firmware/software error Action: Check for
new version of firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6607
- Severity: WARNING
- Event Summary: Firmware detected during early self tests that the
timeout structure has been corrupted.
- Event Class: System
- Problem Description:
Firmware detected during early self tests
that the rendezvous timeout structure has been corrupted. The calculated crc
value of the rendezvous timeout structure does not match the stored value. The
cell will be reset.
- Cause / Action: Cause: Firmware/software problem Action: Check for
new version of firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6608
- Severity: WARNING
- Event Summary: Firmware detected during late selftest that the
rendezvous timeout structure has been corrupted.
- Event Class: System
- Problem Description:
Firmware detected an error during late
self tests that the rendezvous timeout structure has been corrupted. The
calculated crc value of the structure did not match the stored value. The cell
will be reset.
- Cause / Action: Cause: Firmware/software problem Action: Check for
new version of firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6609
- Severity: WARNING
- Event Summary: Firmware detected during the load of the csr launch
values an error in the launch value nvm structure.
- Event Class: System
- Problem Description:
Firmware detected an error during the load
of the nvm launch values from a remote cell. The calculated crc value did not
match the stored value.
- Cause / Action: Cause: Firmware/software problem Action: Check for
new version of firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6610
- Severity: WARNING
- Event Summary: Firmware detected during retrieval of the
manufacturing mode an error in the cached nvm structure
- Event Class: System
- Problem Description:
Firmware detected during the retrieval of
the manufacturing mode an error with the nvm cached value structure. The
calculated crc value of the structure did not match the stored value. Cause /
Action:
Cause: Firmware/software error Action: Check for new version of
firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6611
- Severity: WARNING
- Event Summary: Firmware detected during update of the manufacturing
mode that the nvm cached value structure has been corrupted
- Event Class: System
- Problem Description:
Firmware detected during the update of the
manufacturing mode that the nvm cached value structure has been corrupted. The
calculated crc value of the structure did not match the stored value.
- Cause / Action: Cause: Firmware/software error Action: Check for
new version of firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6612
- Severity: WARNING
- Event Summary: Firmware detected a crc failure while trying to
retrieve the boot id.
- Event Class: System
- Problem Description:
Firmware detected an error while trying to
retrieve the boot id value within the cell status structure. The calculated crc
value of the cell status structure does not match the stored value within the
structure.
- Cause / Action: Cause: Firmware/software error Action: Check for
new version of firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6613
- Severity: WARNING
- Event Summary: Firmware detected an error while trying to retrieve
the local boot id
- Event Class: System
- Problem Description:
Firmware detected an error while trying to
retrieve the local boot id value within the cell status structure. The
calculated crc value of the structure did not match the stored value. Cause /
Action:
Cause: Firmware/software error Action: Check for new version of
firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6614
- Severity: WARNING
- Event Summary: Firmware detected an error while setting the local
boot id
- Event Class: System
- Problem Description:
Firmware detected an error while trying to
set the cpu boot id value within the nvm cell status structure. The calculated
crc value of the structure does not match the stored value. Cause /
Action:
Cause: Firmware/software error Action: Check for new version of
firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6615
- Severity: WARNING
- Event Summary: Firmware detected that the cell status nvm structure
has been corrupted.
- Event Class: System
- Problem Description:
Firmware detected that the nvm cell status
structure has been corrupted. The calculated crc value of the structure does
not match the stored value.
- Cause / Action:
Cause: Firmware/software error Action: Check
for new version of firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6616
- Severity: WARNING
- Event Summary: Firmware detected during verification that CPUs were
deconfigured that the nvm control structure has been corrupted
- Event Class: System
- Problem Description:
Firmware has detected during cpu
deconfiguration verification that the control structure has been corrupted.
The calculated crc value of the structure does not match the stored value. The
cell will be reset.
- Cause / Action: Cause: Firmware/software error Action: Check for
new version of firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6617
- Severity: WARNING
- Event Summary: Firmware detected an error in testing the sting 4Kb
scratch ram error. Data indicates the start of this area.
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
sting 4 Kb area. The data value is the starting address of this 4 Kb area. The
following chassis code indicates the ending address of this 4 Kb area. The
cell will halt.
- Cause / Action: Cause: Scratch ram hardware failure Action: Check
hardware for problems Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6618
- Severity: WARNING
- Event Summary: Firmware detected an error while testing the 4 Kb
sting area. Data value contains the end of this area.
- Event Class: System
- Problem Description:
Firmware detected an error while testing
the 4 Kb sting area. The associated data value contains the end of this
structure. The previous event code indicates the start of this area. The cell
will halt.
- Cause / Action: Cause: Hardware failure Action: Check hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6619
- Severity: WARNING
- Event Summary: Firmware has detected an error in testing the cell
status area. Data contains the starting address of this area
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
cell status nvm area. The associated data field contains the start of this
area. The following event id data contains the end of this area. The cell will
halt.
- Cause / Action: Cause: Hardware failure Action: Check hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6620
- Severity: WARNING
- Event Summary: Firmware detected an error in testing the nvm cell
status area. Data field contains the ending address of this area.
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
nvm cell status area. The data field contains the ending address of this area.
The previous event code data contains the starting address of this area. The
cell will halt.
- Cause / Action: Cause: Hardware failure Action: Check Hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6621
- Severity: WARNING
- Event Summary: Firmware detected an error in testing the cell state
nvm area. Data field contains the starting address of this area.
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
cell state nvm area. The data field contains the starting address of this
area. The following event id data contains the ending address of this area.
The cell will halt.
- Cause / Action: Cause: Hardware Failure Action: Check Hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6622
- Severity: WARNING
- Event Summary: Firmware detected an error in testing the cell state
nvm area. Data field contains the ending address of this area.
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
cell state nvm area. The data field contains the ending address of this area.
The previous event id code contains the starting address of this area. The
cell will halt.
- Cause / Action: Cause: Hardware failure Action: Check Hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6623
- Severity: WARNING
- Event Summary: Firmware detected an error in testing the cached nvm
area. Data field contains the starting address of this area.
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
cached nvm area. The data field contains the starting address of this area.
The following event id data contains the ending address of this area. The cell
will halt.
- Cause / Action: Cause: Hardware failure Action: Check Hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6624
- Severity: WARNING
- Event Summary: Firmware detected an error testing the cache nvm
area. Data field contains the ending address of this area.
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
cached nvm area. The data field contains the ending address of this area. The
previous event id data contains the starting address of this area. The cell
will halt.
- Cause / Action: Cause: Hardware failure Action: Check Hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6625
- Severity: WARNING
- Event Summary: Firmware detected an error testing the control
structure nvm area. Data field contains the starting address of the area.
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
nvm control structure. The data field contains the starting address of the
area. The following event id code contains the ending address of the area. The
cell will halt.
- Cause / Action: Cause: Hardware failure Action: Check Hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6626
- Severity: WARNING
- Event Summary: Firmware detected an error in testing the nvm
control structure area. Data field contains the ending address of this area.
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
nvm control structure area. The data field contains the ending address of this
area. The previous event id data contains the starting address of this area.
The cell will halt.
- Cause / Action: Cause: Hardware Failure Action: Check Hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6627
- Severity: WARNING
- Event Summary: Firmware detected an error in testing the nvm
rendezvous timeout structure. Data field contains starting address of area.
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
nvm rendezvous timeout area. The data field contains the starting address of
this area. The following event id data contains the ending address of this
area. The cell will halt.
- Cause / Action: Cause: Hardware Failure Action: Check Hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6628
- Severity: WARNING
- Event Summary: Firmware detected an error testing the rendezvous
timeout structure. Data field contains the ending address of this area.
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
nvm rendezvous timeout structure. The data field indicates the ending address
of this area. The previous event id data indicates the starting address of
this area. The cell will halt.
- Cause / Action: Cause: Hardware Failure Action: Check Hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6629
- Severity: WARNING
- Event Summary: Firmware has detected an error in testing the nvm
pdce run time structure. Data field contains starting address of area.
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
nvm pdce run time area. The data field contains the starting address of the
area. The following event id data contains the end of the data area. The cell
will halt.
- Cause / Action: Cause: Hardware failure Action: Check Hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6630
- Severity: WARNING
- Event Summary: Firmware detected an error in testing the nvm pdce
run time area. Data field contains the ending address of this area.
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
nvm pdce run time area. The data field indicates the starting address of this
area. The previous event id data indicates the starting address of this area.
The cell will halt.
- Cause / Action: Cause: Hardware failure Action: Check hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6631
- Severity: WARNING
- Event Summary: Firmware detected an error in testing the nvm csr
structure. Data field contains the starting address of this area.
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
nvm csr area. The data field contains the start of this area. The following
event id data contains the ending address of this area. the cell will halt.
- Cause / Action: Cause: Hardware Failure Action: Check hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6632
- Severity: WARNING
- Event Summary: Firmware detected an error in testing the nvm csr
area. Data field contains the ending address of this area.
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
nvm csr area. The data field contains the ending address of this area. The
previous event id data contains the starting address of this area. The cell
will halt.
- Cause / Action: Cause: Hardware Failure Action: Check Hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6633
- Severity: WARNING
- Event Summary: Firmware detected an error in testing the nvm fabric
area. Data field contains the starting address of this area
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
nvm fabric area. The data field contains the starting address of this area.
The following event id data contains the ending address of this area. The cell
will halt.
- Cause / Action: Cause: Hardware Failure Action: Check the hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6634
- Severity: WARNING
- Event Summary: Firmware detected an error in testing the nvm fabric
area. Data field contains the ending address of this area.
- Event Class: System
- Problem Description:
Firmware detected an error in testing the
nvm fabric area. The data field contains the ending address of this area. The
previous event id data contains the ending starting address of this area. The
cell will halt.
- Cause / Action: Cause: Hardware Failure Action: Check Hardware
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6635
- Severity: WARNING
- Event Summary: Error was detected during initialization/checking of
nvm structures
- Event Class: System
- Problem Description:
Major error was detected during
initialization/checking of nvm structures. Other event codes indicate type of
failure. Cell will halt.
- Cause / Action: Cause: Firmware errors Action: Check for new
version of firmware/os Contact HP Support Cause: Hardware Failure Action:
Check Hardware Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6636
- Severity: WARNING
- Event Summary: Firmware detected an error during checking of the
nvm structures.
- Event Class: System
- Problem Description:
Firmware detected an error during
validation of nvm structures. Previous event ids specify the failing
structures. The cell will be reset.
- Cause / Action:
Cause: Firmware/software failure Action: Check
for new versions of firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6637
- Severity: WARNING
- Event Summary: NVM crc failure was detected when trying to retrieve
the nvm control structure.
- Event Class: System
- Problem Description:
Firmware detected an error in the nvm
control structure. The calculated crc value of the structure did not match the
stored value. The cell will be reset.
- Cause / Action:
Cause: Firmware/software problem Action: Check
for new version of firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6638
- Severity: WARNING
- Event Summary: Firmware detected an nvm crc failure when trying to
retrieve the manufacturing mode in nvm.
- Event Class: System
- Problem Description:
Firmware detected an error in trying to
retrieve the manufacturing mode. The calculated crc value of the structure did
not match the stored value. The cell will be reset.
- Cause / Action:
Cause: Firmware/software problem Action: Check
for a newer version of the firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6639
- Severity: WARNING
- Event Summary: Firmware detected an nvm error when trying to
retrieve the nvm flags.
- Event Class: System
- Problem Description:
Firmware detected an error trying to
retrieve the nvm flags. The calculated crc value for the control structure did
not match the stored value. The cell will be reset.
- Cause / Action:
Cause: Firmware/software problem Action: Check
for newer version of the firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6640
- Severity: WARNING
- Event Summary: Firmware detected an error when trying to retrieve
the pdce run time data.
- Event Class: System
- Problem Description:
Firmware detected an error when trying to
retrieve the nvm pdce run time data. The calculated crc value of the pdce run
time structure did not match the stored value.
- Cause / Action:
Cause: Firmware/software problem Action: Check
for a newer version of the firmware/os Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6641
- Severity: WARNING
- Event Summary: CRC mismatch problem was detected in the mfg mode
structure while executing the PDC RDR proc.
- Event Class: System
- Problem Description:
NVM crc problem was detected in the mfg
mode structure while executing the PDC RDR proc. The calculated crc value did
not match the value stored in the structure. An error return status will be
returned to the calling function.
- Cause / Action:
Cause: firmware/software error Action: Check
for firmware/OS updates Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6642
- Severity: WARNING
- Event Summary: CRC mismatch in nvm control structure while trying
to mark cpu for deallocation.
- Event Class: System
- Problem Description:
NVM crc problem was detected with the nvm
control structure while trying to mark a cpu for deallocation. An error status
will be returned to the caller.
- Cause / Action:
Cause: firmware/software problem Action: Check
for firmware/OS updates Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6643
- Severity: WARNING
- Event Summary: Firmware detected a crc error in the nvm control
structure while performing memory.
- Event Class: System
- Problem Description:
Firmware detected an error in the nvm
control structure while performing memory discovery. The calculated crc value
of the structure did not match the stored value. The cell will be result.
- Cause / Action: Cause: Firmware/software problem Action: Check for
updated firmware/OS Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6644
- Severity: WARNING
- Event Summary: Firmware detected a crc failure while enabling
output of event ids.
- Event Class: System
- Problem Description:
Firmware detected an error in the nvm check
control structure while enabling output of event ids. The calculated crc value
of the structure did not match the stored crc value. The cell will be reset.
- Cause / Action: Cause: Firmware/software problem Action: Check for
updated firmware/OS Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6645
- Severity: MAJOR
- Event Summary: Cell OLD failed during SBA flush
- Event Class: System
- Problem Description:
During the sba flush operation portion of a
cell ola, an error was reported indicating that the operation failed. The data
field will contain the hb Id of the sba that tried to be flushed. A call
complete error will be returned to the calling program.
- Cause / Action:
Cause: I/O failure Action: Call HP support
indicating failing I/O
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6646
- Severity: MAJOR
- Event Summary: Memory flush failure detected during cell old
- Event Class: System
- Problem Description:
Failure detected while performing cell old
operation. The failure was during the flush of memory. The data field will
indicate the cell performing the flush operation. Call complete error will be
returned to the calling program.
- Cause / Action: Cause: PDC or hardware failure Action: Call HP
Support indicating failure
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6647
- Severity: MINOR
- Event Summary: Error happened when trying to update cellmap
external data
- Event Class: System
- Problem Description:
Error happened during cell ola operation in
trying to update the cell map external data. The data field contains the
status return information. A failure should be returned causing the pd to
crash.
- Cause / Action: Cause: Hardware Problem Action: Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6648
- Severity: MINOR
- Event Summary: Error happened trying to update the cell map
external data
- Event Class: System
- Problem Description:
A failure happened in trying to update the
cell map external data during a cell old operation. The data field contains
the returned status of making a call to update this data. A failure status
should be returned to the calling module of the cell old operation.
- Cause / Action: Cause: Hardware problem Action: Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6649
- Severity: MINOR
- Event Summary: Error was detected with cell state structure
- Event Class: System
- Problem Description:
An nvm checksum failure was detected in the
cell state structure while attempting to a cell add command. An error will be
returned to the calling program.
- Cause / Action:
Cause: Hardware/Pdc problem caused the nvm
structure to be corrupted Action: Contact PDC support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6650
- Severity: MINOR
- Event Summary: Checksum failure detected in cell state structure
- Event Class: System
- Problem Description:
Unable to access cell state structure in
nvm. The data field contains the returned status. A failure return code will
be returned.
- Cause / Action: Cause: Hardware Problem Action: Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6651
- Severity: MINOR
- Event Summary: Problem accessing cell state structure of a remote
cell
- Event Class: System
- Problem Description:
Error happened trying to access the cell
state structure of a cell. The data field is the physical location for which
the access was attempted. An error returned code will be returned to the
calling procedure.
- Cause / Action: Cause: Hardware Problem Action: Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6652
- Severity: MINOR
- Event Summary: Checksum error was detected in control structure
- Event Class: System
- Problem Description:
An error was detected during cpu
deconfiguration. The checksum of the control structure was checked and
reported to be invalid. The data field contains the physical location of the
cell trying to be accessed.
- Cause / Action: Cause: Hardware/Firmware problem Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6653
- Severity: MINOR
- Event Summary: Corruption detected in cell statue structure
- Event Class: System
- Problem Description:
Error was detected in the cell state
structure of a remote cell. The computed checksum did not match the checksum
store in the structure. An error return value will be returned to the calling
module
- Cause / Action: Cause: Hardware/firmware problem Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6654
- Severity: MINOR
- Event Summary: Problem accessing control structure of remote cell
- Event Class: System
- Problem Description:
Problem occurred trying to access the
control structure of a remote cell. The data field contains the physical
location of the cell that tried to be accessed. An error status will be
returned to the calling module.
- Cause / Action: Cause: Hardware/Firmware problem Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6655
- Severity: WARNING
- Event Summary: During monitor of flush instruction, cpu state
returned invalid state value
- Event Class: System
- Problem Description:
During memory flush for the ol'd of a cell,
the check of the cpu state returned an unexpected state value. The data field
contains the returned state. The routine will return a error return value.
- Cause / Action: Cause: Firmware problem Action: Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6656
- Severity: WARNING
- Event Summary: Problem discovered in setting the reo flush state
during cell ol'd
- Event Class: System
- Problem Description:
Failure detected in setting the initial
state of the reo flush processing during the cell ol'd operation. The data
field contains the returned state. The procedure will return a error status.
- Cause / Action: Cause: System firmware/hardware problem Action:
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6657
- Severity: WARNING
- Event Summary: Problem detected in initializing the memory flush
processing during cell ol'd
- Event Class: System
- Problem Description:
Problem detected in initializing the memory
flush processing during cell ol'd. The data field contains the returned error
status. The procedure will return an error code to the calling procedure.
- Cause / Action: Cause: System firmware/Hardware Problem Action:
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6658
- Severity: WARNING
- Event Summary: Failure detected in waking up CPUs for memory flush
operation
- Event Class: System
- Problem Description:
Failure detected in attempting to wake up
cells to perform memory flush operation during cell ol'd. The data field
contains the physical location of the cpu that was attempted to be awakened.
An error status will be returned to the call module.
- Cause / Action:
Cause: System firmware/hardware problem Action:
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6659
- Severity: WARNING
- Event Summary: No active CPUs were found to perform memory flush on
ol'd cell
- Event Class: System
- Problem Description:
During the cell ol'd operation, no CPUs
were found on the cell being ol'd to perform the memory flush operation. The
data field contains the physical location of the cell being ol'd. An error
return status will be returned to the calling module.
- Cause / Action:
Cause: System firmware/hardware problem Action:
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6660
- Severity: FATAL
- Event Summary: Error trying to set cell reset blocking bit
- Event Class: System
- Problem Description:
Error was detected in trying to set the
reset blocking bit in ICM. The data field contains the set of cells whose
reset blocking bits were trying to be reset. The pd will be reset.
- Cause / Action:
Cause: Hardware/System firmware error Action:
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6661
- Severity: FATAL
- Event Summary: Error happened trying to set the reset blocking bit
during cell ola
- Event Class: System
- Problem Description:
An error was detected during the operation
of setting the reset blocking bit in ICM for a cell that is being added during
a cell ola operation. The data field contains the status being returned. The
ola cell will be reset for reconfiguration. An error status will be returned
to the calling procedure.
- Cause / Action: Cause: Hardware/System firmware failure. Action:
Contact HP Support.
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6662
- Severity: MINOR
- Event Summary: Fabric Error Discovered in post processing while
performing cell add
- Event Class: System
- Problem Description:
Error was found in fabric post processing
code when adding a cell to the pd. An error status return will be returned to
the caller. The event data specifies the physical location of the cell being
added.
- Cause / Action: Cause: Fabric problem with cell Action: Check for
other failure chassis codes Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6663
- Severity: MINOR
- Event Summary: Cell Add failed due to fabric error in
pre-processing
- Event Class: System
- Problem Description:
Error was detected during the processing of
adding a cell to the pd. An error status will be returned to the calling
module. The event data indicates the physical location of the cell being added
- Cause / Action: Cause: Fabric Error Action: Check for additional
error events Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6664
- Severity: WARNING
- Event Summary: Failure in initialization of Fabric Code
- Event Class: System
- Problem Description:
Failure was detected in initialization of
the fabric during cell add. An error status will be returned to the calling
module. The data field contains the physical location of the cell being added
- Cause / Action: Cause: Fabric error Action: Check for other error
events Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6665
- Severity: CRITICAL
- Event Summary: Error was detected in getting the current state of
the ola process
- Event Class: System
- Problem Description:
Error was detected in getting the current
state of the cell add process. An error will be returned to the calling module
indicating a call complete error. The data field contains the physical
location of the cell being added
- Cause / Action: Cause: System Firmware Problem Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6666
- Severity: CRITICAL
- Event Summary: Error was detected in trying to set the state of the
add cell operation
- Event Class: System
- Problem Description:
Error was detected in trying to set the
current state of the cell add processing. This event is preceded by the
BOOT_OLA_ERR_SET_POLL_FLAG indicating the status of getting the address of the
state flag. The data field contains the value trying to be set.
- Cause / Action:
Cause: System firmware bug Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6667
- Severity: FATAL
- Event Summary: Invalid state of the cell ola processing was
detected
- Event Class: System
- Problem Description:
An invalid state was detected in the
process of adding a cell to the pd. The data field indicates the cell being
added. The cell being added will be reset for reconfiguration. The call
complete error status will be returned to the calling module.
- Cause / Action:
Cause: System Firmware error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6668
- Severity: FATAL
- Event Summary: Error was detected in attempting to set the next
state of the cell add process
- Event Class: System
- Problem Description:
Error was detected in setting the next
state of the cell add process. The cell being added will be reset for
reconfiguration. The call complete error status will be returned to the
calling module. The data field indicates the physical location of the cell
being added
- Cause / Action: Cause: System firmware error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6669
- Severity: FATAL
- Event Summary: Error was detected in the determining the state of
the cell add process
- Event Class: System
- Problem Description:
Error was detected in the process of adding
a cell to the pd. The state indicated was not a valid state. The preceding
event indicates the cell being added. A call complete error status will be
returned to the calling module
- Cause / Action:
Cause: System firmware error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6670
- Severity: MINOR
- Event Summary: Slave State Invalid
- Event Class: System
- Problem Description:
System Firmware has detected that the slave
state of a cpu is invalid. The cell will be halted. The data field contains
the state of the cpu.
- Cause / Action: Cause: System Firmware problem Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6671
- Severity: WARNING
- Event Summary: Invalid State returned from checking slave state
- Event Class: System
- Problem Description:
During the check of the slave states for
the CPUs, the slave state indicated an invalid state. The cell will be halted.
The data field contains the invalid state value. The following event indicates
the cpu being checked.
- Cause / Action: Cause: System Firmware Error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6672
- Severity: WARNING
- Event Summary: An invalid cpu state for a cpu was detected
- Event Class: System
- Problem Description:
An invalid slave state was detected for the
cpu indicated in the event. The previous event indicates the invalid state.
The cell will be halted.
- Cause / Action: Cause: System Firmware error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6673
- Severity: WARNING
- Event Summary: Invalid Vector address detected in wakeup structure
- Event Class: System
- Problem Description:
Error was detected in the vector address
passed into the address used for waking up a cpu. The cell will be halted
- Cause / Action: Cause: System Firmware Error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6674
- Severity: WARNING
- Event Summary: Sleep Wakeup Counters Invalid
- Event Class: System
- Problem Description:
System Firmware detected a mismatch between
the sleep counter and the wakeup counter. This event indicates the sleep
counter value and the following event indicates the wakeup counter value. The
cell will be halted.
- Cause / Action: Cause: System Firmware Error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6675
- Severity: WARNING
- Event Summary: Wakeup Counter Problem Detected
- Event Class: System
- Problem Description:
Error was detected during wakeup of a cpu.
The wakeup counter did not match the sleep counter. This event indicates the
wakeup counter. The previous event indicated the sleep counter value. The cell
will be halted.
- Cause / Action: Cause: System Firmware error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6676
- Severity: WARNING
- Event Summary: Invalid wakeup vector address
- Event Class: System
- Problem Description:
An invalid vector address was detected in
Waking up Monarch Processor. This event indicates the invalid vector address.
The cell will be halted
- Cause / Action: Cause: System Firmware Error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6677
- Severity: WARNING
- Event Summary: Invalid Counters during Monarch Wakeup
- Event Class: System
- Problem Description:
An error was detected during waking up the
monarch processor. The sleep and wakeup counters did not match. This event
indicates the sleep counter value and the next event indicates the wakeup
counter value. The cell will be halted.
- Cause / Action:
Cause: System Firmware error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6678
- Severity: WARNING
- Event Summary: Invalid counters were detected during Monarch Wakeup
- Event Class: System
- Problem Description:
Error was detected during Monarch Wakeup.
The sleep and wakeup counters did not match. This event indicates the wakeup
counter value, and the previous event indicates the sleep counter value. The
cell will be halted.
- Cause / Action: Cause: System Firmware Error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6679
- Severity: WARNING
- Event Summary: Invalid cpu state detected for monarch cpu
- Event Class: System
- Problem Description:
Invalid cpu state detected for monarch cpu
during wakeup. This event indicates the invalid state. The next event
indicates the monarch cpu with the invalid state. This cell will be halted.
- Cause / Action: Cause: System Firmware error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6680
- Severity: WARNING
- Event Summary: Wakeup problem indicated for Monarch Cpu
- Event Class: System
- Problem Description:
Error was detected in monarch cpu state.
This event indicates the monarch cpu number. The previous event indicates the
invalid state. This cell will be halted.
- Cause / Action: Cause: System Firmware Error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6681
- Severity: WARNING
- Event Summary: Error detected in checking slave sleep vectors
- Event Class: System
- Problem Description:
Error was detected in waiting for the
slaves to sleep. The detected cpu state was invalid. The event indicates the
invalid state. The next event indicates the cpu. The cell will be halted.
- Cause / Action: Cause: System Firmware Error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6682
- Severity: WARNING
- Event Summary: Error was detected in waiting for slaves to sleep
- Event Class: System
- Problem Description:
Error was detected in waiting for the
slaves to sleep. This event indicates the cpu with the invalid state. The
previous event indicates the invalid state. The cell will halt.
- Cause / Action:
Cause: System Firmware Error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6683
- Severity: WARNING
- Event Summary: Error getting semaphore during read of control data
- Event Class: System
- Problem Description:
Error was detected in trying to get a
semaphore during read of control information. The data field contains the
status return from the call to the get semaphore routine.
- Cause / Action:
Cause: Firmware Problem Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6684
- Severity: WARNING
- Event Summary: Error was detected releasing a semaphore during read
of control data
- Event Class: System
- Problem Description:
Error was detected releasing a semaphore
during the read of control data information. The data field contains the
return status from trying to release the semaphore
- Cause / Action:
Cause: Firmware Problem Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6685
- Severity: WARNING
- Event Summary: Error was detected locking semaphore during write of
control data
- Event Class: System
- Problem Description:
Error was detected locking a semaphore
during the writing of control data information. The data field contains the
status information of trying to lock the semaphore
- Cause / Action:
Cause: Firmware Error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6686
- Severity: WARNING
- Event Summary: Error detecting releasing semaphore during writing
of control information
- Event Class: System
- Problem Description:
Error was detecting in releasing a
semaphore during the write of control information. The data field contains the
status return from trying to release the semaphore.
- Cause / Action:
Cause: Firmware Error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6687
- Severity: WARNING
- Event Summary: Error was detected locking a semaphore during write
of remote control data
- Event Class: System
- Problem Description:
Error was detected locking a semaphore
during writing of control data information to a remote cell. The data field
contains the return status from trying to lock the semaphore. The following
event id contains the physical location of the remote cell.
- Cause / Action:
Cause: Firmware/Hardware Error Action: Contact
HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6688
- Severity: WARNING
- Event Summary: Error was detected locking a semaphore during remote
write of control data.
- Event Class: System
- Problem Description:
Error was detected trying to lock a
semaphore during write of the control data on a remote cell. The data field
contains the cell physical location of the remote cell. The previous event
contains the return status from trying to lock the semaphore.
- Cause / Action:
Cause: Firmware/Hardware Failure Action:
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6689
- Severity: WARNING
- Event Summary: Error detected releasing a semaphore during write of
control data to remote cell
- Event Class: System
- Problem Description:
Error was detected in unlocking a semaphore
during the write of control data ot a remote cell. The data field contains the
status returned from trying to release the semaphore. The following event
contains the cell physical location of the cell being accessed.
- Cause / Action: Cause: Firmware/Hardware Error Action: Contact HP
Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6690
- Severity: WARNING
- Event Summary: Problem releasing Hardware SM4 held by deconfigured
cpu
- Event Class: System
- Problem Description:
System firmware detected an error while
trying to unlock a sm4 held by a cpu that it being deconfigured. The data
field contains the return status. The cell will be halted.
- Cause / Action:
Cause: Hardware/System Firmware Problem Action:
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6691
- Severity: WARNING
- Event Summary: Failure in OLD during Fabric Cell OLD Processing
- Event Class: System
- Problem Description:
Error was discovered while performing cell
old fabric processing. The data value will indicate the return status of the
cell old fabric processing. An error return status will be returned to the OS.
- Cause / Action: Cause: Hardware Problem/Firmware Error Action:
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6692
- Severity: WARNING
- Event Summary: Error detected during cell OLA while trying to get
cpu hversion
- Event Class: System
- Problem Description:
Error was detected while trying to get the
cpu hversion. The data field contains the returned status from trying to get
the hversion. The call to the ola processing will return an error indication
- Cause / Action: Cause: Hardware Problem/Firmware problem Action:
Contact HP Support
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6693
- Severity: CRITICAL
- Event Summary: A hardware semaphore dead lock condition has been
detected
- Event Class: System
- Problem Description:
This is an internal SW error
- Cause / Action:
Cause: A CPU has tried to acquired the same
semaphore twice which resulted in a deadlock. This is an internal SFW error
and is not recoverable. Action: Collect the entire FPL, SEL, console logs and
forward them to the SFW team. Also note all activities around the time of
this event. The entire logs must analyzed by the SFW team before further
actions can be taken. The system must be cold reset to resume normal operation
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6694
- Severity: CRITICAL
- Event Summary: A semaphore dead lock condition has been detected
- Event Class: System
- Problem Description:
This is an internal SFW error
- Cause / Action:
Cause: A CPU has tried to acquired the same
semaphore twice which resulted in a deadlock. This is an internal SFW error
and is not recoverable. Action: Collect the entire FPL, SEL, console logs and
forward them to the SFW team. Also note all activities around the time of
this event. The entire logs must analyzed by the SFW team before further
actions can be taken. The system must be cold reset to resume normal operation
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6695
- Severity: CRITICAL
- Event Summary: A semaphore dead lock condition has been detected
- Event Class: System
- Problem Description:
This is an internal SFW error
- Cause / Action:
Cause: A CPU has tried to acquired the same
semaphore twice which resulted in a deadlock. This is an internal SFW error
and is not recoverable. Action: Collect the entire FPL, SEL, console logs and
forward them to the SFW team. Also note all activities around the time of
this event. The entire logs must analyzed by the SFW team before further
actions can be taken. The system must be cold reset to resume normal operation
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 6696
- Severity: CRITICAL
- Event Summary: A semaphore dead lock condition has been detected
- Event Class: System
- Problem Description:
This is an internal SFW error
- Cause / Action:
Cause: A CPU has tried to acquired the same
semaphore twice which resulted in a deadlock. This is an internal SFW error
and is not recoverable. Action: Collect the entire FPL, SEL, console logs and
forward them to the SFW team. Also note all activities around the time of
this event. The entire logs must analyzed by the SFW team before further
actions can be taken. The system must be cold reset to resume normal operation
- Automated Recovery: None
- Event Generation Threshold: 1 occurrence
Event 8039
- WBEM Severity: Critical
- Event Summary: SMI training has failed
- Event Description: SMI training has failed.
Data field contains the physical location of the memory controller.
- Probable Cause: Transmit Failure
- Event Category: System Hardware
- Event Sub-Category: Unknown
- Cause 1: SMI (Scalable Memory Interconnect)
link training has failed.
- Recommended Action
1: If seating is not an issue, replace the EMB (Extended Memory
Buffer), followed by the CPU. If not resolved, call support.
Event 8040
- WBEM Severity: Critical
- Event Summary: DIMM at field location has
been previously deallocated and remains
deallocated
- Event Description: DIMM at field location
has been previously deallocated and remains
deallocated.
- Probable Cause: Transmit Failure
- Event Category: Memory
- Event Sub-Category: DIMM
- Cause 1: A DIMM has been
deallocated due to failure during memory
init sequence. The most probable cause is a faulty DIMM. Other possible
causes could be due to SMI failure, DDR3 training failure, or bad SMB
(Scalable Memory Buffer) initialization.
- Recommended Action
1: The first action will be to replace the DIMM. If that does not
solve the issue, replace CPU. If problem still continues to exist,
replace the system board.
Event 8044
- WBEM Severity: Degraded/Warning
- Event Summary: Multi-blade conjoined
rendezvous has failed
- Event Description: Multi-blade conjoined
rendezvous has failed
- Probable Cause: SBL (Scalable Blade Link)
or CPU
- Event Category: System Hardware
- Event Sub-Category: Unknown
- Cause 1: Event occurs when
rendezvous among all known logical processors
does not occur. All processors need to
checkin with the monarch processor.
- Recommended Action
1: Check if the SBL (Scalable Blade Link) connector is damaged.
Replace if damaged. If not, check, then
replace processor. If problem persists, contact Support.
Event 8045
- WBEM Severity: Information
- Event Summary: Unable to fully allocate
memory to optimize interleaving ranges
- Event Description: Unable to fully allocate
memory to optimize interleaving ranges.
- Probable Cause: Unknown
- Event Category: Memory
- Event Sub-Category: DIMM
- Cause 1: Memory configuration is likely to
be incorrect. Mixed DIMMs or possible
deallocated DIMMs
can cause this issue.
- Recommended Action
1: Check memory configuration. If not optimal loading, replace the
affected DIMMs. If problem persists, refer
to the Service Guide for proper DIMM load order.
Event 8046
- WBEM Severity: Degraded/Warning
- Event Summary: Expected blades failed to
rendezvous at memory checkpoint
- Event Description: Expected processor
failed to rendezvous at memory checkpoint. Data field displays which
processors checked in.
- Probable Cause: SBL (Scalable Blade Link)
or CPU
- Event Category: System Hardware
- Event Sub-Category: Unknown
- Cause 1: Processors failed to rendezvous at
memory checkpoint
- Recommended Action
1: Reboot. If problem persists, replace SBL (Scalable Blade Link),
if applicable. If this does not resolve, then replace processor, then
contact Support.
Event 8047
- WBEM Severity: Degraded/Warning
- Event Summary: Expected blades failed to
rendezvous at memory test checkpoint
- Event Description: Expected blades failed
to rendezvous at memory test checkpoint. The data represents the
processor thread(s) that has rendezvous with the system. If bit0 is set
to 0, it means thread 0 has is missing. If bit is set to 0, it means
thread is missing. In the 9300 series processors, each set of 8 threads
belongs to one processor in the system.
- Probable Cause: Transmit Failure
- Event Category: Processor
- Event Sub-Category: Checkpoint
- Cause 1: Expected processors failed to
rendezvous at memory test checkpoint
- Recommended Action
1: Reseat the SBL and Reboot. If problem still exists, then replace
the processor belonging to the missing thread (see event description for
data mask). If problem persists, then contact Support.
Event 8066
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: System topology (CPUs and
IOHs) discovered is not a supported
topology.
- Event Description: System topology (CPUs
and IOHs) discovered is not a supported
topology.
- Probable Cause: Transmit Failure
- Event Category: System Hardware
- Event Sub-Category: Unknown
- Cause 1: A QPI hardware error may have
occurred.
- Recommended Action
1: Try rebooting the server. If problem persists contact support.
Event 8067
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: Could not find default QPI
data.
- Event Description: Could not find default
QPI data.
- Probable Cause: Transmit Failure
- Event Category: System Hardware
- Event Sub-Category: Unknown
- Cause 1: A hardware error may have
occurred.
- Recommended Action
1: Try rebooting the server. If problem persists contact support.
Event 8068
- WBEM Severity: Degraded/Warning
- Event Summary: Detected a CPU QPI link's
operation is degraded.
- Event Description: A QPI link on the
processor is degraded. An additional QPI_CPU_LINK_DEGRADED event or
QPI_IOH_LINK_DEGRADED event will be logged to indicate the physical
location information for the processor or IOH on the other side of the
degraded QPI link.
- Probable Cause: QPI link error
- Event Category: System Interconnect
- Event Sub-Category: QPI link
- Cause 1: QPI link hardware has been
detected operating with degraded performance.
- Recommended Action
1: Reboot. If problem persists contact support.
Event 8080
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: The CPUs across the system
or SBL domain must all be the same type.
- Event Description: The CPUs across the
system/SBL domain must all be the same part number.
- Probable Cause: mismatched hardware
- Event Category: Processor
- Event Sub-Category: part number
- Cause 1: The part
number of CPUs in the system/SBL domain are not the same.
- Recommended Action
1: The CPUs in the system or SBL domain must be the same part
number. Check the FRU data using the DF command at the CM> menu. The
S/SPEC should match and the core stepping should have a delta no greater
than 1 across all CPUs.
Event 8083
- WBEM Severity: Critical
- Event Summary: One or more of the blades in
the domain could not communicate.
- Event Description: One or more of the
blades in the domain could not communicate. This affects power control.
- Probable Cause: Peer to Peer communication
failure
- Event Category: blade to blade (internal to
enclosure) network
- Event Sub-Category: Unknown
- Cause 1: Connection problem between blade
and the enclosure
- Recommended Action
1: iLO will
attempt to correct communication issues. If this fails verify FW is at a
supported revision and matched across the
blades. Consider cycling stby power to the
affected blades.
Event 8084
- WBEM Severity: Degraded/Warning
- Event Summary: The Mezzanine IO Cards must
be the same in all blades.
- Event Description: The Mezzanine IO Cards
must be the same in all blades.
- Probable Cause: loading of
mezz should be consistent across blades
- Event Category: iLO
- Event Sub-Category: Domain
- Cause 1: The Mezzanine IO Cards in all
blades of the Domain are different
- Recommended Action
1: The Mezzanine IO Cards must be the same in all blades of the
Domain.
Event 8090
- WBEM Severity: Degraded/Warning
- Event Summary: This event was
originated as a Framework Assert Status Code.
- Event Description: Results from Framework
code assert. Framework Code is some of the code HP integrates into the
SFW ROM.
- Probable Cause: System Firmware Defect
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: System firmware has detected an
internal error.
- Recommended Action
1: Look at previous events and ASCII information in this event for
more information. If problem persists, contact support.
Event 8103
- WBEM Severity: Critical
- Event Summary: SFW ROM versions across the
partition are mismatched.
- Event Description: A CPU Socket's SFW ROM
has a different version and date stamp than the System Monarch's SFW
ROM.
- Probable Cause: Firmware mismatch
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: One or more processor System
Firmware ROM version is mismatched with the Monarch CPU.
- Recommended Action
1: Update SFW. If the failure persists on the same CPU, replace the
system board.
Event 8261
- WBEM Severity: Critical
- Event Summary: IOH persistent error
condition detected.
- Event Description: A persistent error could
not be cleared in the IOH.
- Probable Cause:
I/O Device Error
- Event Category: Failed to clear error
- Event Sub-Category:
PCIe Link
- Cause 1: A persistent error condition could
not be cleared on an IOH Root Port, may be caused by a faulty PCI
express card.
- Recommended Action
1: Replace the PCI express card, verify
the latest supported System Firmware bundle is installed.
Event 8262
- WBEM Severity: Critical
- Event Summary: Generic FRU has header
checksum error
- Event Description: System Firmware finds
that FRU data read from shared memory which MFW writes to is corrupt.
- Probable Cause: Underlying Resource
Unavailable
- Event Category: System Firmware
- Event Sub-Category: Data checksum error
- Cause 1: 1. FRU not properly programmed 2.
Manageability FW may not be working correctly
- Recommended Action
1: Review the event log for related MFW events to determine if there
is a FRU read problem. Try a MFW reset via web or text interface. If
this fails, verify all the firmware versions are at a supported revision
level and do a full power cycle of the
system. If this still fails, it is likely the FRU requires replacement.
Event 8263
- WBEM Severity: Degraded/Warning
- Event Summary: MFW is posting a Data Object
Table which is one revision below what SFW expects
- Event Description: MFW posts a Data Object
Table in Shared memory when it boots. SFW uses this table to get all the
system data it needs. SFW is designed to not fail with a table which is
1 revision old, but not with older ones. However there is no reason for
customers to ever run with old tables, so this should be fixed.
- Probable Cause: Manageability FW is old
revision
- Event Category: System Firmware
- Event Sub-Category: Shared
Memory Compatibility
- Cause 1: Manageability FW is not at the
current supported revision for this SFW.
- Recommended Action
1: Update Manageability FW.
Event 8264
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: System Firmware will not
work correctly with the shared memory table loaded
- Event Description: The Manageability FW
Shared memory table has a date code which is not compatible with this
SFW
- Probable Cause: Incorrect Manageability
Firmware Version
- Event Category: System Firmware
- Event Sub-Category: Shared memory
compatibility
- Cause 1: The System Firmware has detected
that Manageability FW has loaded an incompatible shared memory table.
- Recommended Action
1: Ensure the system FW versions are at a supported revision level.
Reset MFW via text or Web Interface .
Event 8321
- WBEM Severity: Degraded/Warning
- Event Summary: CPU Late Test failed due to
PAL Test Info call failed
- Event Description: CPU Late Test failed due
to PAL Test Info call failed. Data field contains physical location of
the processor.
- Probable Cause: Transmit Failure
- Event Category: System Hardware
- Event Sub-Category: Processor
- Cause 1: A SFW or CPU error may have
occurred.
- Recommended Action
1: If the problem persists replace the processor.
Event 8322
- WBEM Severity: Degraded/Warning
- Event Summary: The TPM DOT information has
a bad checksum.
- Event Description: The TPM DOT information
is corrupted. The checksum does not match with the data in the table.
- Probable Cause: Transmit Failure
- Event Category: System Firmware
- Event Sub-Category: TPM
- Cause 1: The System Firmware has detected a
bad checksum in the TPM shared memory table.
- Recommended Action
1: Review the event log for related MFW events to determine if there
is an NVM (Non-Volatile Memory) problem. Try a MFW reset via web or text
interface. If this fails, verify all the firmware versions are at a
supported revision level and do a full power
cycle of the system.
Event 8323
- WBEM Severity: Degraded/Warning
- Event Summary: The TPM mode at next boot
command could not be cleared in the TPM DOT table.
- Event Description: The TPM mode at next
boot command could not be cleared in the TPM DOT table.
- Probable Cause: Unknown
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: The system could not clear the
command to change the TPM state at next boot in the NVM (Non-Volatile
Memory) possibly due to an error on manageability firmware/hardware
- Recommended Action
1: Review the event log for related MFW events to determine if there
is an NVM (Non-Volatile Memory) problem. Try a MFW reset via web or text
interface. If this fails, verify all the firmware versions are at a
supported revision level and do a full power cycle of the system.
Event 8324
- WBEM Severity: Degraded/Warning
- Event Summary: The TPM DOT checksum could
not be recalculated.
- Event Description: The TPM DOT checksum
could not be recalculated.
- Probable Cause: Unknown
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: The System Firmware has detected a
bad checksum in the TPM shared memory table and failed to fix the
checksum in Non-Volatile Memory. Manageability may not be able to update
NVM.
- Recommended Action
1: Review the event log for related MFW events to determine if there
is an NVM (Non-Volatile Memory) problem. Try a MFW reset via web or text
interface. If this fails, verify all the firmware versions are at a
supported revision level and do a full power cycle of the system.
Event 8325
- WBEM Severity: Degraded/Warning
- Event Summary: The TPM DOT information
could not be updated with the TPM presence indication
- Event Description: The TPM DOT information
could not be updated with the TPM presence indication
- Probable Cause: Unknown
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: The system could not initialize
the value of the TPM presence flag in NVM (Non-Volatile Memory) possibly
due to an error on manageability
- Recommended Action
1: Review the event log for related MFW events to determine if there
is an NVM (Non-Volatile Memory) problem. Try a MFW reset via web or text
interface. If this fails, verify all the firmware versions are at a
supported revision level and do a full power
cycle of the system.
Event 8326
- WBEM Severity: Degraded/Warning
- Event Summary: The TPM DOT could not be
updated with the TPM absent flag.
- Event Description: The TPM DOT information
could not be updated to indicate the absence of the TPM
- Probable Cause: Unknown
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: The system could not initialize
the value of the TPM presence flag to Absent in NVM (Non-Volatile
Memory) possibly due to an error on Manageability
- Recommended Action
1: Review the event log for related MFW events to determine if there
is an NVM (Non-Volatile Memory) problem. Try a MFW reset via web or text
interface. If this fails, verify all the firmware versions are at a
supported revision level and do a full power
cycle of the system.
Event 8327
- WBEM Severity: Degraded/Warning
- Event Summary: The TPM DOT information
could not be initialized successfully.
- Event Description: The TPM DOT information
could not be initialized successfully.
- Probable Cause: Unknown
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: The System Firmware has detected
an uninitialized shared memory table for the
TPM and failed to initialize it. This can be caused by an error with
Manageability firmware/hardware.
- Recommended Action
1: Review the event log for related MFW events to determine if there
is an NVM (Non-Volatile Memory) problem. Try a MFW reset via web or text
interface. If this fails, verify all the firmware versions are at a
supported revision level and do a full power
cycle of the system.
Event 8328
- WBEM Severity: Degraded/Warning
- Event Summary: The TPM DOT information does
not appear to be initialized or is corrupted.
- Event Description: The TPM DOT information
does not appear to be initialized or is corrupted.
- Probable Cause: Unknown
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: The System Firmware detected that
Manageability FW has loaded an invalid shared memory table for the TPM
- Recommended Action
1: Review the event log for related MFW events to determine if there
is an NVM (Non-Volatile Memory) problem. Try a MFW reset via web or text
interface. If this fails, verify all the firmware versions are at a
supported revision level and do a full power
cycle of the system.
Event 8329
- WBEM Severity: Degraded/Warning
- Event Summary: The TPM DOT information is
not initialized.
- Event Description: The TPM DOT information
is not initialized.
- Probable Cause: Unknown
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: The System Firmware has detected
an uninitialized shared memory table for the
TPM. This can be caused by manageability firmware/hardware.
- Recommended Action
1: The system will try to initialize the memory table. If the error
persists, review the event log for related MFW events to determine if
there is an NVM (Non-Volatile Memory) problem. Try a MFW reset via web
or text interface. If this fails, verify all the firmware versions are
at a supported revision level and do a full power
cycle of the system. Replace the battery.
Event 8330
- WBEM Severity: Degraded/Warning
- Event Summary: The TPM related information
could not be written to the Non-Volatile Memory.
- Event Description: The TPM related
information could not be written to the
Non-Volatile Memory.
- Probable Cause: Unknown
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: The system could not save the TPM
configuration to NVM (Non-Volatile Memory) This
can be caused by manageability firmware/hardware.
- Recommended Action
1: Review the event log for related MFW events to determine if there
is an NVM (Non-Volatile Memory) problem. Try a MFW reset via web or text
interface. If this fails, verify all the firmware versions are at a
supported revision level and do a full power
cycle of the system.
Event 8331
- WBEM Severity: Degraded/Warning
- Event Summary: The TPM DOT table could not
initialize the TPM mode at next boot element
- Event Description: The TPM DOT table could
not be initialized with the TPM mode at next boot element. An error has
occurred.
- Probable Cause: Unknown
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: The system could not
initialized the value of TPM state at next
boot in NVM (Non-Volatile Memory). This can be caused by manageability
firmware/hardware.
- Recommended Action
1: Review the event log for related MFW events to determine if there
is an NVM (Non-Volatile Memory) problem. Try a MFW reset via web or text
interface. If this fails, verify all the firmware versions are at a
supported revision level and do a full power
cycle of the system.
Event 8332
- WBEM Severity: Degraded/Warning
- Event Summary: The command to change the
TPM state has failed.
- Event Description: The command to change
the TPM state has failed.
- Probable Cause: Unknown
- Event Category: System Firmware Security
- Event Sub-Category: Unknown
- Cause 1: The request to change the TPM
state has failed. An error occurred with the TPM
- Recommended Action
1: Review the event log for TPM related messages. Reboot the system.
Replace the TPM board.
Event 8333
- WBEM Severity: Degraded/Warning
- Event Summary: The system reset failed when
the TPM code attempted to change its status
- Event Description: The system reset request
failed when the TPM code attempted to change its status (enable,
disable, reset)
- Probable Cause: Unknown
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: The reset command failed to reset
the system.
- Recommended Action
1: Reboot the system. Verify that the system is at the latest
firmware revision.
Event 8449
- WBEM Severity: Major
- Event Summary: One or more processor
threads failed to start boot
- Event Description: One or more processor
threads failed to start boot. See the following
EVN_CPU_PHYSICAL_LOCATION for the CPU Physical Location.
- Probable Cause: Processor Problem (Internal
Machine Error)
- Event Category: Processor
- Event Sub-Category:
- Cause 1: A CPU error occurred.
- Recommended Action
1: Replace the failed CPU, as indicated by the physical location. If
the failure persists, contact support.
Event 8480
- WBEM Severity: Degraded/Warning
- Event Summary: Detected
a IOH QPI link's operation is degraded.
- Event Description: A QPI link on the IOH is
degraded. An additional QPI_CPU_LINK_DEGRADED event will be logged to
indicate the physical location information for the CPU on the other side
of the degraded QPI link.
- Probable Cause: QPI link error
- Event Category: System Interconnect
- Event Sub-Category: QPI link
- Cause 1: QPI link hardware has been
detected operating with degraded performance.
- Recommended Action
1: Reboot. If problem persists contact support.
Event 8495
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: The system has been issued
an INIT. All activity will be halted.
- Event Description: The system has been
issued an INIT, (similar to PA-RISC TOC). This means all system
processing and I/O activity will be irrecoverably halted and the
computer system will be restarted.
- Probable Cause: User requested INIT
- Event Category: iLO
- Event Sub-Category: INIT
- Cause 1: A user has issued an INIT
interrupt to the system.
- Recommended Action
1: None. The system will restart on its own.
Event 8502
- WBEM Severity: Degraded/Warning
- Event Summary: The EM has determined that
power must be shed and has asserted Power Alert.
- Event Description: The EM has determined
that power must be shed and has asserted the Power Alert signal.
Throttling is likely to occur in order for the blade to shed power.
- Probable Cause: Power Problem
- Event Category: Other Power
- Event Sub-Category: Enclosure PSU
- Cause 1: Typically caused by a power supply
failure reducing the power pool available to the EM to distribute to the
blades.
- Recommended Action
1: Identify any failed power supplies and replace them.
Event 8503
- WBEM Severity: Degraded/Warning
- Event Summary:
iLO entering special mode.
- Event Description:
iLO entering a special mode.
- Probable Cause: special running mode
- Event Category: iLO
- Event Sub-Category: running mode
- Cause 1:
iLO entering special mode. The system
is not operating in a normal mode. Some protection features may be
disabled in this mode.
- Recommended Action
1: The special mode must be exited to ensure proper operation of the
server.
Event 8509
- WBEM Severity: Degraded/Warning
- Event Summary: A firmware update has
failed.
- Event Description: A firmware update has
failed.
- Probable Cause: a requested firmware update
failed
- Event Category: iLO
- Event Sub-Category: firmware update
- Cause 1: An attempted firmware update has
failed with an error.
- Recommended Action
1: Check to make sure that the bits are accessible and system power
is off (if required). Try the update again.
Event 8510
- WBEM Severity: Critical
- Event Summary: A failure has occurred while
attempting to update the iLO
ROM.
- Event Description: A failure has occurred
while attempting to update the
iLO ROM. The ROM may be in a corrupt state. The error code
included in data field is for developer use only.
- Probable Cause: FLASH device or interface
(SPI bus) failure
- Event Category: Other Storage Device
- Event Sub-Category: error in read/write of
FLASH over SPI bus
- Cause 1: Something has interfered with
firmware update while it attempted to update the bits in the
iLO ROM.
- Recommended Action
1: DO NOT REMOVE POWER FROM THE SYSTEM. DO NOT RESET
iLO. Attempt the
update again or attempt update to original version. Refer to recipe
notes to verify no procedural issues.
Event 8511
- WBEM Severity: Critical
- Event Summary: A failure has occurred while
attempting to update the SFW ROM.
- Event Description: A failure has
occurred while attempting to update the
System Firmware (SFW) ROM. The ROM may be in a corrupt state. The error
code included in data field is for developer use only.
- Probable Cause: FLASH device or interface
(SPI bus) failure
- Event Category: Other Storage Device
- Event Sub-Category: error in read/write of
FLASH over SPI bus
- Cause 1: Something has interfered with
firmware update while it attempted to update the bits in the SFW ROM.
The Aux ROM and either of the direct attach SFW ROMs or any combination
there of may be affected.
- Recommended Action
1: Attempt the update again. If this fails you may want to reset
iLO or remove
stby power from the system and try the
update again.
Event 8512
- WBEM Severity: Critical
- Event Summary: A failure has occurred while
attempting to update the FPGA ROM.
- Event Description: A failure has
occurred while attempting to update the FPGA
ROM. The ROM may be in a corrupt state. The error code included in data
field is for developer use only.
- Probable Cause:
ROM device or interface
failure
- Event Category: Other Storage Device
- Event Sub-Category: error in read/write of
ROM over interface
- Cause 1: Something has interfered with
firmware update while it attempted to update the bits in the FPGA ROM.
- Recommended Action
1: Do not remove power from the system. Do not reset
iLO. Attempt the
update again.
Event 8513
- WBEM Severity: Critical
- Event Summary: A failure has occurred while
attempting to update the Power Monitor PIC.
- Event Description: A failure has
occurred while attempting to update the Power
Monitor PIC. The PIC may be in a corrupt state. The error code included
in data field is for developer use only.
- Probable Cause: PIC device or interface
(i2c) failure
- Event Category: Other Storage Device
- Event Sub-Category: Error during read/write
to PIC (ROM space)
- Cause 1: Something has interfered with
firmware update while it attempted to update the bits in the Power
Monitor PIC.
- Recommended Action
1: Attempt the update again. If this fails try resetting
iLO and attempt
the update again.
Event 8514
- WBEM Severity: Degraded/Warning
- Event Summary: Failed to save non-volatile
EFI variables into NVM
- Event Description: System firmware was
unable to save the active non-volatile EFI variables to persistent
storage. The event data field represents an internal return code.
- Probable Cause: MFW Performance Problem
- Event Category: Support Firmware
- Event Sub-Category: Unknown
- Cause 1: An error occurred in the interface
between system firmware and manageability firmware.
- Recommended Action
1: Reboot the server and verify that the system board (Monarch Blade
system board if a multi-blade server) is functioning properly. If not,
replace the system board. Check system logs to see if other NVM errors
are related to this same system board. If so, replace the system board.
Verify system and manageability firmware are at their supported
revisions, and if not, update firmware to the latest supported
revisions. If problems persist, contact Support.
Event 8515
- WBEM Severity: Critical
- Event Summary: The system has been placed
in security override mode.
- Event Description: Security override is
set. Security enforcement is disabled. This mode
will timeout in 15 minutes or less. To Manually exit
this mode go to the Access Settings web page.
- Probable Cause: User initiated
security override.
- Event Category: iLO
- Event Sub-Category: security
- Cause 1: A user has asserted the physical
presence button for a duration greater than 8
seconds and less than 12 seconds.
- Recommended Action
1: This mode will timeout in 15 minutes
or less. To manually exit this mode go to the
Access Settings web page. You can also press the physical presence
button for less than 4 seconds to reset
iLO and clear this mode.
Event 8516
- WBEM Severity: Critical
- Event Summary: The system is entering TPM
physical presence mode.
- Event Description: The system is entering
TPM physical presence mode.
- Probable Cause: User initiated via physical
presence button
- Event Category: iLO
- Event Sub-Category: TPM
- Cause 1: A user has asserted the TPM
physical presence button for a duration of greater than 4 seconds and
less than 8 seconds.
- Recommended Action
1: No action is required. This event indicates that the system was
placed in a mode where restricted commands can now be executed by the
TPM. Asserting the physical presence button for less than 4 seconds will
reset iLO and
clear the TPM physical presence mode.
Event 8517
- WBEM Severity: Critical
- Event Summary: The TPM did not accept the
start-up request
- Event Description: The TPM did not
accept the start-up request.
- Probable Cause: TPM is defective or
tampering has occurred
- Event Category: System HW Security
- Event Sub-Category: Unknown
- Cause 1: The TPM can be defective or has
detected a tampering attempt.
- Recommended Action
1: Verify for possible tampering. Replace the board hosting the TPM
chip.
Event 8519
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: The thread has received an
interruption during boot.
- Event Description: Thread has received an
unexpected interruption during SFW boot and at EFI environment.
- Probable Cause: Unknown
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: Firmware or EFI applications or
Hardware Errors.
- Recommended Action
1: Review the interruption log on the console. Reboot the system. If
problem persists, contact support.
Event 8523
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: ACPI parser error due to a
missing AML identifier
- Event Description: The ACPI subsystem
encountered a missing AML identifier while parsing the DSDT table. This
will likely prevent the Operating System from booting successfully.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Subsystem
- Cause 1: The ACPI subsystem could not find
the CPU AML identifier while parsing the DSDT table.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8524
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: Failed to acquire platform
specific data in ACPI
- Event Description: Could not acquire
platform data needed by the ACPI subsystem.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI subsystem
- Cause 1: ACPI received an invalid pointer
to platform data. Possible causes for this are a corrupted System
Firmware image or an error in the memory subsystem.
- Recommended Action
1: Ensure SFW component is at supported level. Check for errors in
the memory subsystem if the problem persists.
Event 8525
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: ACPI parser error due to a
missing AML identifier
- Event Description: The ACPI subsystem
encountered a missing AML identifier while parsing the DSDT table. This
will likely prevent the Operating System from booting successfully.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Subsystem
- Cause 1: The ACPI subsystem could not find
the HPET AML identifier while parsing the DSDT table.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8526
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: ACPI parser error due to a
missing AML identifier
- Event Description: The ACPI subsystem
encountered a missing AML identifier while parsing the DSDT table. This
will likely prevent the Operating System from booting successfully.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Subsystem
- Cause 1: The ACPI subsystem could not find
the ICH AML identifier while parsing the DSDT table.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8527
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: Failed to acquire platform
specific data in ACPI
- Event Description: System failed to acquire
I/O data in the ACPI subsystem.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: Corrupted SFW image.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8528
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: ACPI parser error due to a
missing AML identifier
- Event Description: The ACPI subsystem
encountered a missing AML identifier while parsing the DSDT table. This
will likely prevent the Operating System from booting successfully.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Subsystem
- Cause 1: The ACPI subsystem could not find
the IPMI AML identifier while parsing the DSDT table.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8529
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: The ACPI subsystem failed to
load the DSDT table from flash device.
- Event Description: The ACPI subsystem
failed to load the DSDT table from flash device.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Subsystem
- Cause 1: The ACPI subsystem failed to load
the DSDT table from flash device.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8530
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: System firmware protocol
failed while executing ACPI subsystem code
- Event Description: System firmware protocol
failed while executing ACPI subsystem code. This affects system
resources communicated to the Operating System.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI
- Cause 1: The failure is possibly related to
using a corrupted System Firmware binary.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8531
- WBEM Severity: Major
- Event Summary: The ACPI subsystem failed to
load the SSDT table from the flash device.
- Event Description: The ACPI subsystem
failed to load the SSDT table from the flash device. This can cause the
Operating System to lose some system resources.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Subsystem
- Cause 1: The ACPI could not load the SSDT
table from the flash device.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8532
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: ACPI parser error due to a
missing AML identifier
- Event Description: The ACPI subsystem
encountered a missing AML identifier while parsing the DSDT table. This
will likely prevent the Operating System from booting successfully.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Subsystem
- Cause 1: The ACPI subsystem could not find
the memory AML identifier while parsing the DSDT table.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8533
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: Failed to acquire platform
specific data in ACPI
- Event Description: System failed to acquire
memory data in the ACPI subsystem.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: memory range
- Cause 1: Corrupted SFW image.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8535
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: Failed to acquire platform
specific data in ACPI
- Event Description: System failed to acquire
partition information in the ACPI subsystem.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: Invalid SFW image.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8536
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: ACPI parser error due to a
missing platform AML identifier
- Event Description: The ACPI subsystem
encountered a missing AML identifier while parsing the DSDT table. This
will likely prevent the Operating System from booting successfully.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI subsystem
- Cause 1: The ACPI subsystem could not find
the Platform AML identifier while parsing
the DSDT table.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8537
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: ACPI parser error due to a
missing AML identifier
- Event Description: The ACPI subsystem
encountered a missing AML identifier while parsing the DSDT table. This
will likely prevent the Operating System from booting successfully.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Subsystem
- Cause 1: The ACPI subsystem could not find
the Power Management AML identifier while parsing the DSDT table.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8538
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: ACPI parser error due to a
missing AML identifier
- Event Description: The ACPI subsystem
encountered a missing AML identifier while parsing the DSDT table. This
will likely prevent the Operating System from booting successfully.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Subsystem
- Cause 1: The ACPI subsystem could not find
the root-complex AML identifier while parsing the DSDT table.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8539
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: ACPI parser error due to a
missing AML identifier
- Event Description: The ACPI subsystem
encountered a missing AML identifier while parsing the DSDT table. This
will likely prevent the Operating System from booting successfully.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Subsystem
- Cause 1: The ACPI subsystem could not find
the root-port AML identifier while parsing the DSDT table.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8540
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: ACPI parser error due to a
missing AML identifier
- Event Description: The ACPI subsystem
encountered a missing AML identifier while parsing the DSDT table. This
will likely prevent the Operating System from booting successfully.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Subsystem
- Cause 1: The ACPI subsystem could not find
the Serial Port AML identifier while parsing the DSDT table.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8541
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: ACPI parser error due to a
missing AML identifier
- Event Description: The ACPI subsystem
encountered a missing AML identifier while parsing the DSDT table. This
will likely prevent the Operating System from booting successfully.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Subsystem
- Cause 1: The ACPI subsystem could not find
the Server AML identifier while parsing the DSDT table.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8542
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: ACPI failed to set its
tables
- Event Description: The ACPI tables could
not be set. This prevents the Operating System from consuming ACPI
resources properly.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Subsystem
- Cause 1: The ACPI driver in firmware failed
to set the tables.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8543
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: ACPI parser error due to a
missing AML identifier
- Event Description: The ACPI subsystem
encountered a missing AML identifier while parsing the DSDT table. This
will likely prevent the Operating System from booting successfully.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Subsystem
- Cause 1: The ACPI subsystem could not find
the slot AML identifier while parsing the DSDT table.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8544
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: ACPI parser error due to a
missing AML identifier
- Event Description: The ACPI subsystem
encountered a missing AML identifier while parsing the DSDT table. This
will likely prevent the Operating System from booting successfully.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Subsystem
- Cause 1: The ACPI subsystem could not find
the Platform AML identifier while parsing
the DSDT table.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 8546
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: Failed to acquire platform
specific data in ACPI
- Event Description: System failed to acquire
Trusted Platform Module data in the ACPI subsystem.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: Invalid SFW image.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 9071
- WBEM Severity: Degraded/Warning
- Event Summary: Fans are entering blowout
mode.
- Event Description: Fans are entering
blowout mode. See data field for more information.
- Probable Cause: Sensor Failure
- Event Category: Other Cooling
- Event Sub-Category: temp, fan, lid sensors
- Cause 1: A sensor may have been unreadable
meaning that it could not be read on the i2c bus. See the
data field for the sensor that caused the
error forcing fans into blowout mode.
- Recommended Action
1: Investigate the sensor implicated in the
data field to determine the cause of failure. If a temperature
sensor can't be read try AC cycling the complex; if a fan sensor can't
be read try reseating the fan and AC cycling the complex;
If none of this helps try replacing the FRU
associated with sensor.
Event 9799
- WBEM Severity: Major
- Event Summary: Virtual Connect failed to
write data into a FRU storage device
- Event Description: After changing FRU data
in memory, system firmware attempted to replicate the changes to the
physical FRU, but received an unexpected error. The event data field
encodes internal information, including the following: Second byte:
Blade number within server, 0-n. Fourth byte: FRU ID.
- Probable Cause: MFW Performance Problem
- Event Category: Support Firmware
- Event Sub-Category: Unknown
- Cause 1: An error occurred in the interface
between system firmware and manageability firmware.
- Recommended Action
1: Reboot the server and verify that the system board or
mezz card containing this FRU is functioning
properly. If not, replace the failing hardware. Check system logs to see
if other errors are related to this same FRU. If so, replace the system
board or mezz card containing this FRU.
Verify system and manageability firmware are at their supported
revisions, and if not, update firmware to the latest supported
revisions. If problems persist, contact Support.
Event 9801
- WBEM Severity: Major
- Event Summary: Virtual Connect failed to
save a backup of non-volatile EFI variables into NVM
- Event Description: System firmware was
unable to save a backup of the active non-volatile EFI variables to
persistent storage. If the server is removed from Virtual Connect
service later, it will not be possible to restore the original
variables. The event data field represents an internal return code.
- Probable Cause: MFW Performance Problem
- Event Category: Support Firmware
- Event Sub-Category: Unknown
- Cause 1: An error occurred in the interface
between system firmware and manageability firmware.
- Recommended Action
1: Reboot the server and verify that the system board (Monarch Blade
system board if a multi-blade server) is functioning properly. If not,
replace the system board. Check system logs to see if other NVM errors
are related to this same system board. If so, replace the system board.
Verify system and manageability firmware are at their supported
revisions, and if not, update firmware to the latest supported
revisions. If problems persist, contact Support.
Event 9802
- WBEM Severity: Major
- Event Summary: Virtual Connect failed to
save non-volatile EFI variables into NVM
- Event Description: System firmware was
unable to save the active non-volatile EFI variables to persistent
storage. The event data field represents an internal return code.
- Probable Cause: MFW Performance Problem
- Event Category: Support Firmware
- Event Sub-Category: Unknown
- Cause 1: An error occurred in the interface
between system firmware and manageability firmware.
- Recommended Action
1: Reboot the server and verify that the system board (Monarch Blade
system board if a multi-blade server) is functioning properly. If not,
replace the system board. Check system logs to see if other NVM errors
are related to this same system board. If so, replace the system board.
Verify system and manageability firmware are at their supported
revisions, and if not, update firmware to the latest supported
revisions. If problems persist, contact Support.
Event 9803
- WBEM Severity: Major
- Event Summary: Virtual Connect failed to
erase the EFI variables backup from NVM
- Event Description: System firmware was
unable to erase a backup of the EFI variables present in persistent
storage. The same backup will again be restored at the next reboot,
overwriting any new variables produced during the current boot cycle.
The event data field represents an internal return code.
- Probable Cause: MFW Performance Problem
- Event Category: Support Firmware
- Event Sub-Category: Unknown
- Cause 1: An error occurred in the interface
between system firmware and manageability firmware.
- Recommended Action
1: Reboot the server and verify that the system board (Monarch Blade
system board if a multi-blade server) is functioning properly. If not,
replace the system board. Check system logs to see if other NVM errors
are related to this same system board. If so, replace the system board.
Verify system and manageability firmware are at their supported
revisions, and if not, update firmware to the latest supported
revisions. If problems persist, contact Support.
Event 9838
- WBEM Severity: Major
- Event Summary: Virtual Connect failed to
save FRU data into NVM
- Event Description: After changing FRU data
in memory, system firmware attempted to replicate the changes to NVM,
but received an unexpected error. The event data field encodes internal
information, including the following: Second byte: Blade number within
server, 0-n. Fourth byte: FRU ID.
- Probable Cause: MFW Performance Problem
- Event Category: Support Firmware
- Event Sub-Category: Unknown
- Cause 1: An error occurred in the interface
between system firmware and manageability firmware.
- Recommended Action
1: Reboot the server and verify that the system board or
mezz card containing this FRU is functioning
properly. If not, replace the system board containing this FRU. Check
system logs to see if other NVM errors are related to this same system
board. If so, replace the system board containing this FRU. Verify
system and manageability firmware are at their supported revisions, and
if not, update firmware to the latest supported revisions. If problems
persist, contact Support.
Event 9862
- WBEM Severity: Critical
- Event Summary: The
iLO peer to peer LAN communication
between MMP and AMP was lost.
- Event Description: The
iLO peer to peer LAN communication
between a Monarch MP (MMP) and an Auxiliary
MP (AMP) has been lost.
- Probable Cause: iLO
Peer to Peer LAN communication
- Event Category: iLO
- Event Sub-Category: Conjoined
- Cause 1: The
iLO peer to peer LAN communication
between a Monarch MP (MMP) and an Auxiliary
MP (AMP) has been lost.
- Recommended Action
1: The management system will attempt to repair itself. If the
management system does not recover refer to the Service Guide to
determine how to reset iLO
without a user interface.
Event 9863
- WBEM Severity: Critical
- Event Summary: The
iLO peer to peer LAN communication
between MMP and AMP was not established.
- Event Description: The
iLO peer to peer LAN communication
between Monarch MP (MMP) and an Auxiliary MP
(AMP) could not be established.
- Probable Cause: iLO
Peer to Peer LAN communication
- Event Category: Enclosure internal network
- Event Sub-Category: Unknown
- Cause 1: The
iLO peer to peer LAN communication
between Monarch MP (MMP) and an Auxiliary MP
(AMP) could not be established.
- Recommended Action
1: The management system will attempt to repair itself. If the
management system does not recover refer to the Service Guide to
determine how to reset iLO
without a user interface.
Event 9864
- WBEM Severity: Critical
- Event Summary: The
iLO peer to peer LAN communication
between DMP and AMP was lost.
- Event Description: The
iLO peer to peer LAN communication
between a Domain MP (DMP) and an Auxiliary MP (AMP) has been lost.
- Probable Cause: iLO
Peer to Peer LAN communication
- Event Category: iLO
- Event Sub-Category: Conjoined
- Cause 1: The
iLO peer to peer LAN communication
between a Domain MP (DMP) and an Auxiliary MP (AMP) has been lost.
- Recommended Action
1: The management system will attempt to repair itself. If the
management system does not recover refer to the Service Guide to
determine how to reset iLO
without a user interface.
Event 9865
- WBEM Severity: Critical
- Event Summary: The
iLO peer to peer LAN communication
between DMP and AMP was not established.
- Event Description: The
iLO peer to peer LAN communication
between Domain MP (DMP) and an Auxiliary MP (AMP) could not be
established.
- Probable Cause: iLO
Peer to Peer LAN communication
- Event Category: iLO
- Event Sub-Category: conjoined
- Cause 1: The
iLO peer to peer LAN communication
between Domain MP (DMP) and an Auxiliary MP (AMP) could not be
established.
- Recommended Action
1: The management system will attempt to repair itself. If the
management system does not recover refer to the Service Guide to
determine how to reset iLO
without a user interface.
Event 9866
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: Insufficient power; more
power supplies may be needed.
- Event Description: Insufficient power; more
power supplies may be needed.
- Probable Cause: Power Problem
- Event Category: System Power
- Event Sub-Category: Unknown
- Cause 1: Power request to OA denied with
the reason insufficient power. Either not enough
BPS, or too much power draw from other blades installed in
enclosure. Consult OA to determine if there is enough room to add BPS.
- Recommended Action
1: Install more power supply units, or reduce the current draw on
the power system. Power Cap is one method that could be used to reduce
current draw. You may want to verify that none of your BPS
have failed.
Event 9867
- WBEM Severity: Critical
- Event Summary: A Socket Monarch Thread has
stopped making forward progress.
- Event Description: The socket monarch
thread has stopped making forward progress. It may be in an infinite
loop or the thread itself (hardware) has failed.
- Probable Cause: Timeout
- Event Category: Processor
- Event Sub-Category: Processor HW Failure or
Firmware Fault
- Cause 1: Unexpected Firmware or Hardware
Errors.
- Recommended Action
1: AC power cycle the system. If the problem still occurs, contact
support.
Event 9868
- WBEM Severity: Critical
- Event Summary: The Partition Monarch Thread
has stopped making forward progress.
- Event Description: The socket monarch
thread has stopped making forward progress. It may be in an infinite
loop or the thread itself (hardware) has failed.
- Probable Cause: Timeout
- Event Category: Processor
- Event Sub-Category: Processor HW failure or
Firmware Fault.
- Cause 1: Unexpected Firmware or Hardware
Errors.
- Recommended Action
1: AC power cycle the system. If the problem still occurs, contact
Support.
Event 9869
- WBEM Severity: Critical
- Event Summary: The processor module that
contains the Monarch Thread was deconfigured.
- Event Description: The processor module
that contains the Monarch Thread that is not making forward progress has
been deconfigured.
- Probable Cause: Timeout
- Event Category: Processor
- Event Sub-Category: Processor Hardware
Failure or Firmware Fault.
- Cause 1: Unexpected Firmware or Hardware
Error has been encountered.
- Recommended Action
1: AC power cycle the system. If the problem still occurs, contact
Support for help.
Event 9870
- WBEM Severity: Critical
- Event Summary: The ACPI subsystem
encountered an unknown platform type.
- Event Description: The ACPI
subsystem failed to recognize the platform
type of the server.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: NVM
- Cause 1: ACPI failed to recognize the
platform type of the server. This is likely due to corrupted NVM data.
- Recommended Action
1: Verify that the System Firmware is at its supported revision, and
if not, update firmware to the latest supported version. If problem
persists contact support.
Event 9871
- WBEM Severity: Major
- Event Summary: Invalid partition size
encountered by the ACPI subsystem.
- Event Description: An incorrect partition
size was communicated to the ACPI module.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI
- Cause 1: Possible data corruption in
non-volatile memory subsystem.
- Recommended Action
1: Reset the system. Verify that the System Firmware is at its
supported revision, and if not, update firmware to the latest supported
version. If problem persists contact support.
Event 9874
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: System firmware failed to
publish ACPI tables
- Event Description: System firmware failed
to publish ACPI resources to the Operating System
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Module
- Cause 1: The ACPI subsystem could not
publish ACPI tables due to a System Firmware error.
- Recommended Action
1: Verify that the System Firmware is at its supported revision, and
if not, update firmware to the latest supported version. If problem
persists contact support.
Event 9875
- WBEM Severity: Major
- Event Summary: software memory test failed
- Event Description: Memory test failed. Data
field contains the physical address that failed.
- Probable Cause: Memory test failure
- Event Category: Memory
- Event Sub-Category: DIMM
- Cause 1: Unexpected data accessed from the
memory address during firmware memory test.
- Recommended Action
1: The first action will be check the
system event logs for de-allocated DIMM(s). Next reseat the DIMM(s). If
the problem persists, then replace the DIMM(s) with supported DIMM(s) -
See Service Guide. If that does not solve the issue, replace CPU. If
problem still continues to exist, replace the system board.
Event 9876
- WBEM Severity: Major
- Event Summary: Scalable memory buffer (SMB)
link initialization failed
- Event Description: SMB (Scalable Memory
Buffer) link initialization failed. Data field is the physical location
of the failed SMB.
- Probable Cause: CPU memory controller
- Event Category: Memory
- Event Sub-Category: memory controller
- Cause 1: SMB (Scalable Memory Buffer) link
initialization failed.
- Recommended Action
1: The first action will be to replace the CPU. If that does not
solve the issue, reseat and/or replace with supported DIMM(s) associated
with the SMB - See Service Guide. If problem still continues to exist,
replace the system board.
Event 9877
- WBEM Severity: Major
- Event Summary: Scalable memory buffer (SMB)
channel training failed
- Event Description: Scalable memory buffer
(SMB) channel training failed. Data field contains the physical location
of the failed SMB.
- Probable Cause: CPU memory controller
- Event Category: Memory
- Event Sub-Category: memory controller
- Cause 1: Scalable memory buffer (SMB)
channel training failed.
- Recommended Action
1: The first action will be to reset the CPU. If that does not solve
the issue, replace CPU. If problem still continues to exist, replace the
system board.
Event 9878
- WBEM Severity: Major
- Event Summary: Failed
outbound read access to scalable memory buffer (SMB) due to
timeout
- Event Description: Failed
outbound read access to scalable memory
buffer (SMB) due to timeout. Data field is the physical location of the
SMB that CPU attempted access to.
- Probable Cause: CPU memory controller
- Event Category: Memory
- Event Sub-Category: memory controller
- Cause 1: Failed
outland read access to scalable memory buffer (SMB) due to
timeout.
- Recommended Action
1: The first action will be to replace the CPU. If problem still
continues to exist, replace the system board.
Event 9879
- WBEM Severity: Major
- Event Summary: Failed
outbound write access to scalable memory buffer (SMB) due to
timeout.
- Event Description: Failed
outbound write access to scalable memory
buffer (SMB). Data field is the physical location of the SMB that CPU
attempted access to.
- Probable Cause: CPU memory controller
- Event Category: Memory
- Event Sub-Category: memory controller
- Cause 1: Failed
outbound write access to memory controller due to timeout
- Recommended Action
1: The first action will be to replace the CPU. If problem still
continues to exist, replace the system board.
Event 9880
- WBEM Severity: Major
- Event Summary: Memory build-in self test
(MBIST) failed
- Event Description: Memory Build-in self
test failed. Data field is the physical location of the Scalable memory
buffer (SMB) that failed.
- Probable Cause: System Hardware
- Event Category: Memory
- Event Sub-Category: DIMM
- Cause 1: Memory Build-in self test failed
- Recommended Action
1: The first action will be to check the SEL for the de-allocated
DIMM(s). Next, reseat the failing DIMM(s). If that does not solve the
issue, replace the DIMM(s) with supported DIMM(s) - See Service Guide.
If problem still continues to exist, replace the system board.
Event 9881
- WBEM Severity: Major
- Event Summary: Memory build-in self test
failed with timeout
- Event Description: Memory built-in self
test failed with timeout. Data field is the physical location of the
Scalable memory buffer (SMB) that failed.
- Probable Cause: Hardware
- Event Category: Memory
- Event Sub-Category: DIMM
- Cause 1: Memory built-in self test failed
with timeout
- Recommended Action
1: The first action will be to check the SEL for the de-allocated
DIMM(s). Next, reseat the failing DIMM(s). If that does not solve the
issue, replace the DIMM(s) with supported DIMM(s) - See Service Guide.
If problem still continues to exist, replace the system board.
Event 9882
- WBEM Severity: Major
- Event Summary: DIMM rank mismatch or rank
not found during memory initialization
- Event Description: DIMM rank mismatch or
invalid data. Data field is the physical location of the failing DIMM.
- Probable Cause: Hardware
- Event Category: Memory
- Event Sub-Category: DIMM
- Cause 1: The DIMM rank data is invalid or
does not match with the other DIMM pair.
- Recommended Action
1: The first action will be to reseat the DIMM. If problem persists,
replace DIMM with supported DIMM type (see Service Guide). If problem
still continues to exist, contact support.
Event 9883
- WBEM Severity: Major
- Event Summary: Unable to
acquire communications link to scalable
memory buffer (SMB)
- Event Description: Failed to acquire
communication link to scalable memory buffer (SMB). Data field value is
the physical location of the SMB.
- Probable Cause: Hardware
- Event Category: System Hardware
- Event Sub-Category: CPU
- Cause 1: Failed to acquire communication
link to scalable memory buffer (SMB).
- Recommended Action
1: The first action will be to replace the CPU. If problem still
continues to exist, replace the system board.
Event 9884
- WBEM Severity: Degraded/Warning
- Event Summary: Invalid count reported for
number of supported CPU p-states
- Event Description: An invalid count has
been reported for the number of supported CPU p-states.
- Probable Cause: Processor Problem (Internal
Machine Error)
- Event Category: Processor
- Event Sub-Category: PAL/Power-Management
- Cause 1: The processor is reporting invalid
data about its power management information.
- Recommended Action
1: Verify that the System Firmware is at its supported revision, and
if not, update firmware to the latest supported version. Replace the
processor module. If problem persists contact support.
Event 9890
- WBEM Severity: Degraded/Warning
- Event Summary: Inbound
access to memory controller timed out
- Event Description: Failed
inbound access to scalable memory buffer
(SMB) due to timeout. Data field is the physical location of the SMB
that CPU attempted access to.
- Probable Cause: Hardware
- Event Category: Memory
- Event Sub-Category: memory controller
- Cause 1: Failed inbound
access to scalable memory buffer (SMB) due to timeout.
- Recommended Action
1: The first action will be to replace the CPU. If problem still
continues to exist, replace the system board.
Event 9891
- WBEM Severity: Degraded/Warning
- Event Summary: Frequency ratio for DIMM is
incompatible
- Event Description: Memory frequency ratio
is not supported by system platform. Data field is the physical location
of DIMM that failed.
- Probable Cause: Hardware
- Event Category: Memory
- Event Sub-Category: DIMM
- Cause 1: Memory frequency ratio is not
supported by system platform.
- Recommended Action
1: The first action will be to reseat the DIMM. If problem persists,
replace DIMM with supported DIMM type (see Service Guide). If problem
still continues to exist, contact support.
Event 9892
- WBEM Severity: Degraded/Warning
- Event Summary: Reached unexpected code
paths in memory initialization code
- Event Description: Reached unexpected code
paths in memory initialization code.
- Probable Cause: Hardware
- Event Category: System Hardware
- Event Sub-Category: SFW flash
- Cause 1: Reached unexpected code paths in
memory initialization code.
- Recommended Action
1: Ensure SFW component is at supported level. If problem persists,
contact support.
Event 9893
- WBEM Severity: Degraded/Warning
- Event Summary: LAI expander detected
- Event Description: Unsupported memory LAI
expander detected
- Probable Cause: Hardware
- Event Category: System Hardware
- Event Sub-Category: System Board
- Cause 1: Unsupported memory LAI expander
detected
- Recommended Action
1: Contact support.
Event 9894
- WBEM Severity: Degraded/Warning
- Event Summary: Pre-initialization of memory
controller failed
- Event Description: Failed to initialize
internal communications engine on CPU. Field value is the physical
location of the CPU's scalable memory buffer (SMB) that failed.
- Probable Cause: Hardware
- Event Category: Processor
- Event Sub-Category: memory controller
- Cause 1: Failed to initialize internal
communications engine on CPU.
- Recommended Action
1: The first action will be to check SEL for de-allocated DIMM(s).
Reseat and/or replace the DIMM(s). If that does not solve the issue,
replace CPU. If problem still continues to exist, replace the system
board.
Event 9895
- WBEM Severity: Degraded/Warning
- Event Summary: DDR channel training failed
- Event Description: DDR training failure.
Data field is the physical location of the failed DDR channel
- Probable Cause: Hardware
- Event Category: Memory
- Event Sub-Category: DIMM
- Cause 1: DDR3 training failure.
- Recommended Action
1: The first action will be to check SEL for de-allocated DIMM(s).
Reseat and/or replace the DIMM(s). If that does not solve the issue,
replace CPU. If problem still continues to exist, replace the system
board.
Event 9896
- WBEM Severity: Degraded/Warning
- Event Summary: DDR calibration for commands
failed
- Event Description: DDR calibration of
command failed. Data field is the physical location of the scalable
memory buffer (SMB) that failed.
- Probable Cause: Hardware
- Event Category: Memory
- Event Sub-Category: DIMM
- Cause 1: DDR calibration of command failed
- Recommended Action
1: The first action will be to check SEL for de-allocated DIMM(s).
Reseat and/or replace the DIMM(s). If that does not solve the issue,
replace CPU. If problem still continues to exist, replace the system
board.
Event 9897
- WBEM Severity: Degraded/Warning
- Event Summary: Invalid DIMM detected
- Event Description: Invalid DIMM data found. Data field is the
physical location of the failed DIMM
- Probable Cause: Hardware
- Event Category: Memory
- Event Sub-Category: DIMM
- Cause 1: Invalid DIMM data found
- Recommended Action
1: The first action will be to reseat the DIMM. If that does not
solve the issue, replace DIMM with supported DIMM type (See Service
Guide). If problem still continues to exist, contact support.
Event 9898
- WBEM Severity: Degraded/Warning
- Event Summary: The primary system variables
are uninitialized.
- Event Description: The primary system
variables are uninitialized.
- Probable Cause: System Variables
uninitialized
- Event Category: Primary System Variables
EEPROM
- Event Sub-Category: Unknown
- Cause 1: The primary system variables are
uninitialized.
- Recommended Action
1: The system variables must be initialized. Please run SYSSET to
initialize them.
Event 9899
- WBEM Severity: Degraded/Warning
- Event Summary: integrated Lights Out
firmware battery failure or NVRAM state change.
- Event Description: Integrated Lights Out
detected improper data in NVRAM (bad checksums.) Either the NVRAM layout
changed, or the integrated Lights Out Battery
may not be maintaining the data through A/C power cycles.
- Probable Cause: Battery Failure
- Event Category: Support Firmware
- Event Sub-Category: Unknown
- Cause 1: Determine if the firmware was
recently upgraded. This is often the reason for the NVRAM to change. If
not, and the A/C power has been removed, than it's possible the battery
is indeed going bad and would need to be replaced.
- Recommended Action
1: Verify current supported version of firmware. Replace battery.
Event 9900
- WBEM Severity: Degraded/Warning
- Event Summary: integrated Light Out
firmware software error.
- Event Description: Integrated Lights Out
detected a software error and is logging an event. The data represents
data associated with the error seen.
- Probable Cause: Software Error
- Event Category: Support Firmware
- Event Sub-Category: Unknown
- Cause 1: A software error was detected and
is being logged. The internal data is connected to the location and
module where the error occurred. The Forward Progress Log may receive
additional (lower alert level) event entries with more data associated
with this event.
- Recommended Action
1: Verify that the most recent version of firmware is installed.
Please send this notification to
Superdome2_fw_problem_report@groups.hp.com
Event 9901
- WBEM Severity: Degraded/Warning
- Event Summary: Power was forced to ON
without permission from the Onboard Administrator.
- Event Description: Power was forced to ON
without permission from the Onboard Administrator.
- Probable Cause: Power Problem
- Event Category: Other Power
- Event Sub-Category: OA power request
skipped.
- Cause 1: A forced power on occurred.
Usually this is performed when a normal power on request is denied. The
Onboard Administrator may now be running in a non-redundant or
over-budget power configuration.
- Recommended Action
1: Make sure Onboard Administrator has the appropriate number and
configuration of power supplies. Reduce power required by the blade (for
instance, remove some memory or a CPU.) Or, if there are powered-on
blades that are not in use, power them off.
Event 9902
- WBEM Severity: Degraded/Warning
- Event Summary: Integrated Lights Out unable
to communicate with OA after multiple retries.
- Event Description: Integrated Lights Out
has repeatedly tried to communicate with the Onboard Administrator and
has not received any response.
- Probable Cause: OA fault, reset, removed or
busy
- Event Category: Support Hardware
- Event Sub-Category: OA
- Cause 1:
iLO has repeatedly requested
communication with the Onboard Administrator and has not received a
response. This may be because the EM has a fault condition, or is busy,
being reset, or removed.
- Recommended Action
1: The communication loss may have been temporary and nothing needs
to be done if the communication is now working. Check the Fault LED on
the enclosure manager. Reset the enclosure manager. Reset
iLO. Replace the
enclosure manager.
Event 9903
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: Power could not turn on
because CPUs in the system or SBL Domain are mismatched.
- Event Description: The system/SBL domain
has CPUs of multiple part numbers installed
and was unable to turn on as a result.
- Probable Cause: Consistency check
- Event Category: Processor
- Event Sub-Category: mismatch
- Cause 1: The system/SBL domain has CPUs of
different S/SPEC and/or a delta greater than 1 in core stepping.
- Recommended Action
1: Determine which CPU part number you want to keep, and remove
those that don't match. DF from the CM> menu will display S/SPEC and
stepping to help identify mismatch. When the CPUs all match try to power
on again.
Event 9904
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: The installed SBL is
incompatible for the enclosure type, or blade bay.
- Event Description: The installed SBL is
incompatible with the current enclosure type or blade bay.
- Probable Cause: Consistency check
- Event Category: SBL
- Event Sub-Category: mismatch
- Cause 1: Each SBL2/SBL4 is constructed for
the precise width of the enclosure bays. SBL2-7000, SBL2E-7000, and
SBL4-7000 are designed for the c7000 enclosure. SBL2-3000, SBL2E-3000,
and SBL4-3000 are designed for the c3000 enclosure. The SBL2-7000 and
SBL2-3000 must be installed with the monarch blade in bay 1, 3, 5, or 7.
The SBL2E-7000 and SBL2E-3000 must be installed with the monarch blade
in bay 2, 4, or 6. The SBL4 must be installed with the monarch blade in
bay 1 or 5. This event is being generated because the installed SBL does
not match the environment.
- Recommended Action
1: Change your installation to a supported configuration.
Event 9905
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: Power on failed due to an
SBL (Scalable Blade Link) mismatch
- Event Description: Power on failed
because the installed SBL (Scalable Blade
Link) is incompatible with the enclosure type or blade bay.
- Probable Cause: Consistency check
- Event Category: SBL
- Event Sub-Category: mismatch
- Cause 1: Each SBL2/SBL4 is constructed for
the precise width of the enclosure bays. SBL2-7000, SBL2E-7000, and
SBL4-7000 are designed for the c7000 enclosure. SBL2-3000, SBL2E-3000,
and SBL4-3000 are designed for the c3000 enclosure. The SBL2-7000 and
SBL2-3000 must be installed with the monarch blade in bay 1, 3, 5, or 7.
The SBL2E-7000 and SBL2E-3000 must be installed with the monarch blade
in bay 2, 4, or 6. The SBL4 must be installed with the monarch blade in
bay 1 or 5. This event is being generated because the installed SBL does
not match the environment.
- Recommended Action
1: Change your installation to a supported configuration. Then
attempt power on again.
Event 9906
- WBEM Severity: Critical
- Event Summary: Firmware on the system is
mismatched
- Event Description: The system has
mismatched firmware on it.
- Probable Cause: Firmware
- Event Category: revision
- Event Sub-Category: mismatched
- Cause 1: One of the FW types within the
system has a mismatch across blades.
- Recommended Action
1: Use the SR command from the CM> menu or look in the
iLO web GUI to
identify the mismatched FW and bring it up to date with the rest of the
system.
Event 9907
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: Power failed to turn on due
to mismatched firmware
- Event Description: Power failed to turn on
due to a firmware mismatch.
- Probable Cause: firmware
- Event Category: revision
- Event Sub-Category: mismatch
- Cause 1: The revision of one of the ROMs in
the system does not match the revision of its counterparts. One of the
FW types within the system has a mismatch.
- Recommended Action
1: Use the iLO
SR command from the CM> menu or look in the iLO
web GUI to identify the mismatched FW and bring it up to date with the
rest of the system.
Event 9908
- WBEM Severity: Critical
- Event Summary: The CPU power pod is no
longer powering the CPU.
- Event Description: The CPU power pod is no
longer powering the CPU.
- Probable Cause: Power Problem
- Event Category: Processor Power
- Event Sub-Category: Unknown
- Cause 1: The CPU power pod is no longer
providing power to the CPU. It may not be plugged in to the
system board, or it may have failed.
- Recommended Action
1: If not plugged into the system board, plug it in. Check that the
connector is not loose. If not then you should replace the FRU.
Event 9909
- WBEM Severity: Critical
- Event Summary: This blade has failed to
show signs of boot.
- Event Description: The blade logging the
event has failed to boot.
- Probable Cause: Processor Problem (Internal
Machine Error)
- Event Category: Processor
- Event Sub-Category: Unknown
- Cause 1: Something has caused the blade to
fail to boot. No BOOT_START event was detected from this blade. Could be
the result of a corrupted SFW ROM, a failing processor, or a board
failure of the QPI link.
- Recommended Action
1: At the next convenient opportunity power
cycle the system power to all blades that are configured as part
of a system with the failed blade. Look for other errors that may have
been logged. Reflash the SFW.
Event 9910
- WBEM Severity: Degraded/Warning
- Event Summary: ICH Mezzanine card is
missing
- Event Description: ICH Mezzanine card is
missing.
- Probable Cause: communications error
- Event Category: communication to ICH MEZZ
FRU
- Event Sub-Category: Unknown
- Cause 1: The communications to the ICH MEZZ
may have failed. Otherwise the ICH MEZZ is not present.
- Recommended Action
1: The system will fail to power on if there is no ICH MEZZ on the
monarch blade. Please validate proper loading of the ICH MEZZ card if a
pc on fails.
Event 9911
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: The system failed to power
on upon request.
- Event Description: The system failed to
power on.
- Probable Cause: Power Problem
- Event Category: System Power
- Event Sub-Category: Unknown
- Cause 1: Power on was denied because there
was a failure in the power on sequence. This probably points towards HW
or communication synchronization issues.
- Recommended Action
1: Reset iLO
and try again. If this doesn't work remove STBY power and try again.
Event 9912
- WBEM Severity: Degraded/Warning
- Event Summary: A blade failed to power on
upon request.
- Event Description: A blade failed to power
on.
- Probable Cause: Power Problem
- Event Category: System Power
- Event Sub-Category: Unknown
- Cause 1: A blade has failed to power on.
- Recommended Action
1: -
Event 9913
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: System failed to power on
because the monarch blade failed to power on.
- Event Description: System failed to power
on because the monarch blade failed to power on.
- Probable Cause: Power Problem
- Event Category: System Power
- Event Sub-Category: Monarch
- Cause 1: A failure on the Monarch has
resulted in a failure to power on. As a result the system will be
powered off.
- Recommended Action
1: Try power on again. Reset
iLO and try power on again. Remove
STBY power and then try power on again.
Event 9914
- WBEM Severity: Critical
- Event Summary: A previously powered on
blade has lost power.
- Event Description: A previously powered on
blade has lost power.
- Probable Cause: Power Problem
- Event Category: System Power
- Event Sub-Category: blade local
- Cause 1: The blade indicated by the event
has unexpectedly lost power.
- Recommended Action
1: At the next available opportunity power off the system and power
on again. Monitor the blade to make sure this event doesn't happen
again.
Event 9915
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: The system has been powered
off due to a power failure.
- Event Description: The system has been
powered off due to a power failure.
- Probable Cause: Power Problem
- Event Category: System Power
- Event Sub-Category: Unknown
- Cause 1: One or more blades had a power
failure. As a result the system control is powering off the entire
partition.
- Recommended Action
1: Try power on again. If it fails try with a reset to
iLO and a removal
of STBY power. If system powers back on properly monitor to make sure
this doesn't occur again.
Event 9916
- WBEM Severity: Degraded/Warning
- Event Summary: The system variables are
mismatched.
- Event Description: The system variables are
mismatched
- Probable Cause: mismatch in
SYSVARs EEPROM
- Event Category: EEPROM
- Event Sub-Category: SYSVARS
- Cause 1: The contents in the system
variables EEPROMs don't match. This
indicates that the system variables have been programmed, but they don't
match.
- Recommended Action
1: Run the SYSSET command to bring the system variables
EERPOMs into alignment.
Event 9917
- WBEM Severity: Critical
- Event Summary: The Power Monitor is in
programming mode
- Event Description: The Power Monitor is
left in programming mode. Power protection features are disabled while
Power Monitor remains in this state.
- Probable Cause: powered off in middle of
power monitor update
- Event Category: iLO
- Event Sub-Category: power
- Cause 1: The most likely cause is that the
standby power was removed (blade removed from enclosure) while in the
middle of updating the power monitory PIC. In this mode power protection
is not enabled.
- Recommended Action
1: Install current supported version of the Complex firmware (which
includes power monitor firmware).
Event 9918
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: Power failed to turn on
because there is no SBL (Scalable Blade Link) attached.
- Event Description: Power failed to turn on
because there is no SBL (Scalable Blade Link) attached. Power on is not
allowed without an SBL attached.
- Probable Cause: SBL or blade
- Event Category: System Power
- Event Sub-Category: SBL
- Cause 1: Power can not be turned on without
an attached SBL.
- Recommended Action
1: Attach an appropriate SBL if you want to power on. If an SBL is
already attached remove and inspect its pins. Also, inspect the blade
side connector. If there are no bent pins and the connector looks ok
reseat the SBL and try again.
Event 9919
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: Power on denied by the OA
with an UNKNOWN error code.
- Event Description: The OA has denied power
to the SVB (sever blade) with an unknown error code. Any data included
in the data field is a return code for
developer debug.
- Probable Cause: OA FW mismatch to
iLO FW
- Event Category: System Power
- Event Sub-Category: peer to peer
communication
- Cause 1:
iLO was denied power by the OA with an
error code that was unknown to
iLO.
- Recommended Action
1: Refer to the recipe documentation to make sure that your OA
version is compatible with the
iLO FW you are using. Try resetting OA.
Event 9920
- WBEM Severity: Degraded/Warning
- Event Summary: Indicates an error has
occurred while trying to reset SAS controllers.
- Event Description: Indicates which of
several errors has occurred while trying to reset PMC SAS controllers.
- Probable Cause: Adapter/Card Error
- Event Category: Device Firmware
- Event Sub-Category: Unknown
- Cause 1: An error has occurred while trying
to reset the SAS controller after a GMCA or during I/O discovery.
- Recommended Action
1: Replace drive. Update firmware on SAS controller. Verify that the
system firmware is at a supported revision. If problem persists, contact
support.
Event 9921
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: Power not turned on. The
iLO peer to peer
LAN communication is not functional.
- Event Description: The power will not be
turned on. The iLO
peer to peer LAN communication is not fully functioning.
- Probable Cause: probably the internal
network is having issues
- Event Category: System Power
- Event Sub-Category: SBL
- Cause 1: The
iLO peer to peer LAN communication is
not fully functioning. Could be caused by issues with the internal LAN
or iLO code.
- Recommended Action
1: Review the logs to determine details of failure. The management
system will attempt to repair itself. If the management system does not
recover refer to the Service Guide to determine how to reset
iLO without a
user interface.
Event 9922
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: The system will experience a
hard reset as a result of a watchdog timeout.
- Event Description: The system will
experience a hard reset as a result of a watchdog timeout.
- Probable Cause: watchdog timeout
- Event Category: Software Application
- Event Sub-Category: OS hang
- Cause 1: The OS Watchdog has timed out.
- Recommended Action
1: System will be reset. Determine why the OS did not reset the
watchdog. Consider changing the action to "no action" for debug. Look
for other unusual OS activity.
Event 9923
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: The system will power down
as a result of a watchdog timeout.
- Event Description: The system will power
down as a result of a watchdog timeout.
- Probable Cause: watchdog timeout
- Event Category: Software Application
- Event Sub-Category: OS hang
- Cause 1: The OS Watchdog has timed out.
- Recommended Action
1: Review logs and power system back on if no concerning events are
observed. Determine why OS failed to reset the watchdog. Consider
changing action to "no action" for debug. Look for other unusual OS
activity.
Event 9924
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: The system will power cycle
as a result of a watchdog timeout.
- Event Description: The system will power
cycle as a result of a watchdog timeout.
- Probable Cause: watchdog timeout
- Event Category: Software Application
- Event Sub-Category: OS hang
- Cause 1: The OS Watchdog has timed out.
- Recommended Action
1: None. System will power back on in a moment.
Event 9925
- WBEM Severity: Critical
- Event Summary: Unable to retrieve error
information after a fatal error.
- Event Description: Unable to retrieve error
information after a fatal error.
- Probable Cause: fatal error without valid
registers for HW setup
- Event Category: System Firmware
- Event Sub-Category: Unknown
- Cause 1: A fatal error has occurred. In
attempting to configure hardware for error information recovery the
valid signature wasn’t found in the fatal error registers.
- Recommended Action
1: Review logs for indicators of what might have happened, and
monitor system health.
Event 9926
- WBEM Severity: Critical
- Event Summary: Failed to initialize shared
memory.
- Event Description: Failed to initialize
shared memory.
- Probable Cause: RAM access or FW/RTOS
- Event Category: iLO
- Event Sub-Category: shared memory
- Cause 1: Either the hardware (RAM) did not
respond to the initialization requests, or there was a FW issue during
setup of the shared memory task.
- Recommended Action
1: Try resetting iLO
with an xd->R command from the
iLO CM> prompt.
Event 9927
- WBEM Severity: Critical
- Event Summary: The Display Board is not
responding and may be missing.
- Event Description: The Display Board is not
responding and may be missing.
- Probable Cause: missing
FRU
- Event Category: FRU
- Event Sub-Category: none
- Cause 1: The Display Board is not
responding. It is either missing or loose. This board includes sensors
such as the altimeter and a thermal sensor so operation may be altered.
- Recommended Action
1: Check if the board is present. If not replace it. If it is,
remove it and re-insert it to ensure proper connectivity.
Event 9928
- WBEM Severity: Critical
- Event Summary: Indicates an
inconsistency in bus numbering has been
discovered.
- Event Description: Indicates an
inconsistency in bus numbering has been discovered.
- Probable Cause: Hardware
- Event Category: System Hardware
- Event Sub-Category: Unknown
- Cause 1: The bus number
returned by
GetSystemSocketsInfo() does not
match what is expected.
- Recommended Action
1: Ensure all systems in the enclosure are conjoined as required and
functioning. Verify the SBL is installed and seated properly. If problem
persists, contact support.
Event 9929
- WBEM Severity: Critical
- Event Summary: A power supply has failed.
Physical Location is included in data field.
- Event Description: A power supply has
failed. A Physical Location is included in the
data field.
- Probable Cause: Power Supply Failure
- Event Category: System Power
- Event Sub-Category: Unknown
- Cause 1: A power supply has failed. The
connection could be loose or there could be a hardware failure.
- Recommended Action
1: Identify the failing supply from the physical location included
in the alert. Check connections. If no loose connections are found
replace the power supply.
Event 9930
- WBEM Severity: Critical
- Event Summary: A fan has failed.
- Event Description: A fan has failed. Check
the data field for Physical Location.
- Probable Cause: Fan Failure
- Event Category: System Hardware
- Event Sub-Category: Unknown
- Cause 1: A connection may be bad. Otherwise
a fan has probably failed.
- Recommended Action
1: Identify the Physical Location indicated
in the data field. Check connections and fan
health. If connections are ok replace failed fan.
Event 9985
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: System Firmware internal
error detected resulting in execution of ASSERT macro
- Event Description: This event is generated
when an unexpected assert condition is encountered in System Firmware.
The following events indicate which module generated the condition and
the line number where it occurred.
- Probable Cause: Unknown
- Event Category: System Firmware
- Event Sub-Category: ASSERT
- Cause 1: The cause is unknown.
- Recommended Action
1: Examine previous events and ASCII information following this
event for more information. If problem persists, contact support.
Event 9986
- WBEM Severity: Degraded/Warning
- Event Summary: System Firmware internal
error detected resulting in execution of ASSERT macro
- Event Description: This event is generated
when an unexpected assert condition is encountered in System Firmware.
The following events indicate which module generated the condition and
the line number where it occurred.
- Probable Cause: Unknown
- Event Category: System Firmware
- Event Sub-Category: ASSERT
- Cause 1: The cause is unknown.
- Recommended Action
1: Examine previous events and ASCII information following this
event for more information. If problem persists, contact support.
Event 9990
- WBEM Severity: Major
- Event Summary: Thread has failed Early
Self-test and is now operating with degraded performance
- Event Description: Thread has failed Early
Self-test and is now operating with degraded performance. See the
subsequent BOOT_CPU_PHYSICAL_LOCATION for physical location of
cpu with degraded
performance.
- Probable Cause: Processor Problem (Internal
Machine Error)
- Event Category: Processor
- Event Sub-Category: Unknown
- Cause 1: Thread has failed early self-test.
- Recommended Action
1: Reboot the system. If the problem persists, contact support for
help to replace the CPU.
Event 10065
- WBEM Severity: Minor
- Event Summary: Failed to notify
manageability of a change to the real-time-clock device.
- Event Description: The EFI subsystem could
not notify manageability of a change to the real-time-clock device.
- Probable Cause: Transmit Failure
- Event Category: System Firmware
- Event Sub-Category: SFW/MFW communication
channel
- Cause 1: The EFI subsystem could not notify
manageability of a change to the real-time-clock device.
- Recommended Action
1: Verify that the System Firmware is at its supported revision, and
if not, update firmware to the latest supported version. Restart the
system. If problem persists contact support.
Event 10067
- WBEM Severity: Critical
- Event Summary: ACPI parser error due to a
missing AML identifier
- Event Description: The ACPI subsystem
encountered a missing AML identifier while parsing the DSDT table. This
could also prevent the Operating System from booting successfully. This
could cause the Operating System to crash after it has booted
successfully.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: ACPI Subsystem
- Cause 1: The ACPI subsystem could not find
the doorbell AML identifier while parsing the DSDT table.
- Recommended Action
1: Ensure SFW component is at supported level.
Event 10072
- WBEM Severity: Critical
- Event Summary: Memory Configuration
rendevous failed
- Event Description: Memory configuration
rendevous failed. Mark the start of the
Memory Configuration rendevous point Data
format: 0xAABB AA = Blade Number BB = Cpu
Number
- Probable Cause: System
- Event Category: System Hardware
- Event Sub-Category: NVRAM
- Cause 1: Read from non-volatile memory
failed.
- Recommended Action
1: Reset the system and attempt to reboot again. If problem
persists, contact support.
Event 10073
- WBEM Severity: Degraded/Warning
- Event Summary:
iLO was unable to initialize user
accounts.
- Event Description:
iLO was unable to initialize user
accounts.
- Probable Cause: bad data in EEPROM or
checksum failed
- Event Category: iLO
- Event Sub-Category: Users
- Cause 1:
iLO was unable to read the default
user account data from EEPROM. The default user data contained in the
EEPROM was likely bad, or the checksum failed.
- Recommended Action
1: There is no factory default password at this point- all passwords
will be refused. Use the physical presence button to force
iLO into security
override mode. This will allow you to log into
iLO without a user account. Use the UC
command or web GUI to setup iLO
user account(s) again.
Event 10166
- WBEM Severity: Degraded/Warning
- Event Summary: SFW discarded one or more in
progress SFW events before they got logged by
iLO.
- Event Description: SFW events were
discarded to speed up SFW startup. Event queues have been re-initialized
and restarted.
- Probable Cause: degraded
iLO or SFW,
perhaps an MCA loop.
- Event Category: iLO
- Event Sub-Category: Logs
- Cause 1: The SFW event queue overflowed and
remained that way for a duration of time.
This could result from poor iLO
queue draining (logging) performance, or could be related to the number
of event being issued by SFW in a short period of time.
- Recommended Action
1: None.
Event 10172
- WBEM Severity: Degraded/Warning
- Event Summary: The
iLO config
was unexpectedly defaulted. Settings may need to be re-entered.
- Event Description: The
iLO config
was unexpectedly reset to default values. Settings may need to be
re-entered.
- Probable Cause: Software Error
- Event Category: iLO
- Event Sub-Category:
config/NVRAM
- Cause 1: Some bad data in the
iLO
config area of NVRAM was detected and as a
result config was defaulted. It is not
believe that this was caused by a battery
failure or user action.
- Recommended Action
1: Re-enter any lost configuration data.
Event Number: 10175
- WBEM Severity: Degraded/Warning
- Event Summary: The local blade failed to
power off upon request.
- Event Description: The local blade failed
to power off upon request.
- Probable Cause: Power Problem
- Event Category: System Power
- Event Sub-Category: Unknown
- Cause 1: This blade has failed to power
off.
- Recommended Action
1: Try resetting iLO
and attempting power off again.
Event 10176
- WBEM Severity: Degraded/Warning
- Event Summary: The system has failed to
power off upon request.
- Event Description: The system has failed to
power off upon request.
- Probable Cause: Power Problem
- Event Category: System Power
- Event Sub-Category: Unknown
- Cause 1: the rack or one or more blades
have failed to power off. The overall power off action has failed as a
result.
- Recommended Action
1: reset iLO
and attempt power off again.
Event 10223
- WBEM Severity:
- Event Summary: A data table stored in
non-volatile RAM is invalid.
- Event Description: The DOT table saved in
NVRAM is out of date or corrupt. All firmware in the system should be
updated and NVRAM may need to be cleared.
- Probable Cause: Software Error
- Event Category: System Firmware
- Event Sub-Category: DOT table stored in NVRAM
- Cause 1: Manageability or System FW
revisions are out of date or corrupted
- Recommended Action
1: Ensure that all firmware in the system is up to date.
Event 10250
- WBEM Severity: Degraded/Warning
- Event Summary: Unable to retrieve
information for restoring the state of PCIe
after a GMCA
- Event Description: Indicates which of
several errors have occurred while trying to retrieve
PCIe state save information from the DOT.
Some attempts are made to recover from these errors, so they don't
necessarily indicate a critical failure. If
PCIe state save information could not be
restored after all recovery attempts exhausted, the OS may not be able
to dump to disk, and architected PCIe errors
will not be logged.
- Probable Cause: NVRAM corruption or SFW-MFW
communication error
- Event Category: System Firmware
- Event Sub-Category: Data Object Table
- Cause 1: An error has prevented system
firmware from accessing the I/O state save area of the DOT. Possible
causes are data corruption of NVRAM or communication error between SFW
and MFW.
- Recommended Action
1: Ensure system firmware recipe is accurate and up-to-date.
Event 10359
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: The system has lost AC
power.
- Event Description: The system has lost AC
power. This will result in a loss of system power and standby power
unless the AC power is restored.
- Probable Cause: Power Supply Failure
- Event Category: System Power
- Event Sub-Category: Unknown
- Cause 1: Most likely cause is removal of an
AC power cord, or a bulk power supply failure.
- Recommended Action
1: Make sure that the AC Power cord is properly installed, and the
bulk power supply is functional
Event 10366
- WBEM Severity: Degraded/Warning
- Event Summary: The Power Button was
asserted prior to changing the system power.
- Event Description: The Power Button was
asserted prior to changing the system power.
- Probable Cause: Power Problem
- Event Category: System Power
- Event Sub-Category: power button asserted
or stuck
- Cause 1: The power button must be released
before remotely controlling system power. The button may be stuck.
- Recommended Action
1: Inspect the power button and correct anything which may affect
its proper operation.
Event 10384
- WBEM Severity: Degraded/Warning
- Event Summary: ICH failed to respond to an
i2c query
- Event Description: The ICH failed
respond to an i2c request.
- Probable Cause: ICH device may have failed
- Event Category: ICH
mezz
- Event Sub-Category: Unknown
- Cause 1: The ICH device failed to respond
to an i2c request. It is probable that the ICH device is in a failed
state.
- Recommended Action
1: Investigate the ICH MEZZ card to determine if there has been a
failure.
Event 10405
- WBEM Severity: Critical
- Event Summary: The chassis has been open
for an extended period of time.
- Event Description: The chassis has been
open for an extended period of time.
- Probable Cause: Enclosure Door Open
- Event Category: System Cooling
- Event Sub-Category: Unknown
- Cause 1: The chassis
door or top cover is open. More
specifically it has been open for an extended period of time.
- Recommended Action
1: Close the door or top cover.
Event 10407
- WBEM Severity: Fatal/Nonrecoverable
- Event Summary: Indicates the API for
nPar reset during a Global Machine Check
Abort failed.
- Event Description: The
nPartition will remain in an MCA state and requires manual
intervention to reboot it back up using the ParCon
commands.
- Probable Cause: Software Error
- Event Category: Support Firmware
- Event Sub-Category: Unknown
- Cause 1: Firmware API call failed
- Recommended Action
1: Use ParCon commands to reboot the
partition.
Top
of Page
Last updated: Aug 3, 2010