Troubleshooting

Troubleshooting procedures help you diagnose problems.

Best practices for troubleshooting
Taking advantage of certain configuration options, and ensuring vital system access information has been recorded, makes the process of troubleshooting easier.
Battery operation for control enclosures
Each node canister in the control enclosure caches critical data and holds state information in volatile memory.
Understanding the medium errors and bad blocks
A storage system returns a medium error response to a host when it is unable to successfully read a block. The system response to a host read follows this behavior.
Resolving insufficient memory with a distributed array
If the creation of a distributed array fails due to a lack of memory, complete the following procedure to resolve the insufficient memory.
RAID write response time
This function means that the RAID software layer, where redundancy exists to do so, can prevent drive bad behavior from having an unlimited impact on I/O performance. In addition, the system tries to avoid immediately committing to an array rebuild due to a brief offline event from a single drive, while there is full redundancy.
User interfaces for servicing your system
The system provides several user interfaces to troubleshoot, recover, or maintain your system. The interfaces provide various sets of facilities to help resolve situations that you might encounter.
Starting statistics collection
The system collects statistics over an interval and creates files that can be viewed.
Event reporting
Events that are detected are saved in an event log. As soon as an entry is made in this event log, the condition is analyzed. If any service activity is required, a notification is sent, if you set up notifications.
Debugging and performance-monitoring statistics for offloaded data transfer
Offloaded data transfer (ODX) infrastructure captures debug and monitoring information for specific ODX modules and makes it available in global and local views.
Resolving a problem
Described here are some procedures to help resolve fault conditions that might exist on your system. A basic understanding of the system concepts is required.
Recover system procedure
The recover system procedure recovers the entire storage system if the system state is lost from all control enclosure node canisters. The procedure re-creates the storage system by using saved configuration data. The recovery might not be able to restore all volume data. This procedure is also known as Tier 3 (T3) recovery.
Backing up and restoring the system configuration
You can back up and restore the configuration data for the system after preliminary tasks are completed.
Servicing storage systems
Storage systems that are supported for attachment to the system are designed with redundant components and access paths to enable concurrent maintenance. Hosts have continuous access to their data during component failure and replacement.
Removing and replacing parts
You can remove and replace customer-replaceable units (CRUs) in control enclosures or expansion enclosures.

Parent topic: Lenovo Storage V3700 V2/V5030 Series

Give feedback

Send a link to this topic