Before running a system recovery procedure, it is important
to identify and fix the root cause of the hardware issues.
Identifying and fixing the root cause can help recover
a system, if these are the faults that are causing the system to fail.
The following are common issues which can be easily resolved:
- The node has been powered off or the power cords were unplugged.
- Check the node status
of every node canister that is part of this system. Resolve all hardware
errors except node error 578 or node error 550.
- All nodes must be reporting either a node error
578 or a node error 550. These error codes indicate that the system
has lost its configuration data. If any nodes report anything other
than these error codes, do not perform a recovery. You can encounter
situations where non-configuration nodes report other node errors,
such as a 550 node error. The 550 error can also indicate that a node
is not able to join a system.
- If any nodes show a node error 550, record
the error data that is associated with the 550 error from the service
assistant.
Note: If after resolving all these scenarios, half or greater
than half of the nodes are reporting node error 578, it is appropriate
to run the recovery procedure. Call the Lenovo Support Center for further assistance.
- For any nodes that are reporting a node error 550, ensure that
all the missing hardware that is identified by these errors is powered
on and connected without faults. If you
cannot contact the service assistant from any node, isolate the problems
by using the LED indicators.
- If you have not been able to restart the system, and if any node
other than the current node is reporting node error 550 or 578, you
must remove system data from those nodes. This action acknowledges
the data loss and puts the nodes into the required candidate state.