To use data reduction technologies on the system, you need to create a data reduction pool, create thin-provisioned or compressed volumes, and map these volumes to hosts that support SCSI unmap commands.
Data reduction can increase storage efficiency and performance and reduce storage costs, especially for flash storage. Data reduction reduces the amount of data that is stored on external storage systems and internal drives by reclaiming previously used storage resources that are no longer needed by host systems. To estimate potential capacity savings that data reduction technologies can provide on the system, use the Data Reduction Estimator Tool (DRET). This tool analyzes existing user workloads which are being migrated to a new system. The tool scans target workloads on all attached storage arrays, consolidates these results, and generates an estimate of potential data reduction savings for the entire system.
Go to https://www-945.ibm.com/support/fixcentral/ to find the tool and its readme. Data reduction is only supported Lenovo Storage V5030 and Lenovo Storage V5030F systems.
The system supports data reduction pools, which can contain thin-provisioned or compressed volumes. Data reduction pools also support additional capacity savings on thin-provisioned and compressed volumes by supporting data deduplication. When deduplication is specified for a thin-provisioned or compressed volume, duplicate versions of data are eliminated and not written to storage, thus saving additional capacity. Data reduction pools also contain specific volumes that track when space is freed from hosts and possible unused capacity that can be collected and reused within the storage pool. When space is freed from hosts, the process is called unmapping. Unmap is a set of SCSI commands that hosts use to indicate that allocated capacity is no longer required on a target volume. The freed space can be collected and reused on the system without the reallocation of capacity on the storage. The pool can also reclaim unused capacity in a data reduction pool and redistribute it to free extents. Reclaimable capacity is unused capacity that is created when data is overwritten, volumes are deleted, or when data is marked as unneeded by a host by using the SCSI unmap command. When a host no longer needs the data that is stored on a volume, the host system using SCSI unmap commands to release that storage from the volume. When these volumes are in data reduction pools, that space becomes reclaimable capacity and is monitored and collected and eventually redistributed back to the pool for use by the system. In the management GUI, reclaimable capacity is added to the available capacity for the data reduction pool. For standard pools, available capacity does not include any reclaimable capacity. In the command line interface, lsmdiskgrp command displays the different values that apply to data reduction and standard pools. For data reduction pools, the value for reclaimable_capacity indicates the amount of unused capacity that is available after data is reduced in the pool. Unlike with the management GUI, reclaimable_capacity is not included in the free_capacity value that is displayed in the lsmdiskgrp. Reclaimable capacity is collected as metadata and is also stored in the data reduction pool, thus using storage on the external storage system. The system periodically returns this capacity back to the pool, however, the system can use up 85% of the available logical capacity with reclaimable data, which can generate out-of-space warnings on the external storage system incorrectly. When creating data reduction pools, ensure that 15% of the total capacity that is allocated is reserved for these operations. Reclaimable capacity can be used for other volumes, which more efficiently uses existing storage resources. Monitor physical capacity of data reduction pools in the management GUI by selecting . In the command-line interface, use the lsmdiskgrp command to display the physical capacity of a data reduction pool.
chsystem -hostunmap on
mkmdiskgrp -name pool_name -ext extent_size -datareduction yesWhere pool_name is the name of the pool and extent_size is the extent size of the pool. Data reduction pools are created as parent pools only, not child pools.
mkvolume -name name -pool storage_pool_name -size disk_size -compressed
mkvolume -name name -pool storage_pool_name -size disk_size -thinWhere name is the name of the new volume, storage_pool_name is the name of the data reduction pool, and disk_size is the capacity of the volume.
mkvdiskhostmap -host host_namevdisk_nameWhere host_name is the name of the host and vdisk_name is the name of the volume.