Virtualization

Virtualization is a concept that applies to many areas of the information technology industry.

For data storage, virtualization includes the creation of a pool of storage that contains several disk systems. These systems can be supplied from various vendors. The pool can be split into volumes that are visible to the host systems that use them. Therefore, volumes can use mixed back-end storage and provide a common way to manage a storage area network (SAN).

Historically, the term virtual storage described the virtual memory techniques that were used in operating systems. However, the term storage virtualization describes the shift from managing physical volumes of data to logical volumes of data. This shift can be made on several levels of the components of storage networks. Virtualization separates the representation of storage between the operating system and its users from the actual physical storage components. Mainframe computers have used this technique for many years through methods such as system-managed storage and products like the IBM Data Facility Storage Management Subsystem (DFSMS). Virtualization can be applied at the following four main levels.
At the server level
Manages volumes on the operating systems servers. An increase in the amount of logical storage over physical storage is suitable for environments that do not have storage networks.
At the storage device level
Uses RAID to create disk systems. This type of virtualization can range from simple RAID controllers to advanced volume management such as that provided by the IBM DS8000. The Virtual Tape Server (VTS) is another example of virtualization at the device level.
At the fabric level
Enables storage pools to be independent of the servers and the physical components that make up the storage pools. One management interface can be used to manage different storage systems without affecting the servers. The system implements virtualization at the fabric level.
At the file system level
Provides the highest benefit because data is shared, allocated, and protected at the data level rather than the volume level.

Virtualization is a radical departure from traditional storage management. In traditional storage management, storage is attached directly to a host system, which controls storage management. SANs introduced the principle of networks of storage, but storage is still primarily created and maintained at the RAID system level. Multiple RAID controllers of different types require knowledge of, and software that is specific to, the specific hardware. Virtualization provides a central point of control for disk creation and maintenance.

One problem area that virtualization addresses is unused capacity. Before virtualization, individual host systems each had their own storage, which wasted unused storage capacity. Using virtualization, storage is pooled so that jobs from any attached system that need large amounts of storage capacity can use it as needed. Virtualization makes it easier to regulate the amount of available storage without having to use host system resources or to turn storage devices off and on to add or remove capacity. Virtualization also provides the capability to move storage between storage systems transparently to host systems.

Types of virtualization

Virtualization can be either asymmetrical or symmetrical. Levels of virtualization provides a diagram of the levels of virtualization. The system implements fabric-level virtualization. Within the context of the system, virtualization refers to symmetric fabric-level virtualization.
Figure 1. Levels of virtualization
Levels of virtualization
Asymmetric
A virtualization engine is outside the data path and generate a metadata style service. The system does not use asymmetric virtualization.
Symmetric
A virtualization engine sits in the data path and presents disks to the hosts, but hides the physical storage from the hosts. Therefore, advanced functions, such as cache and Copy Services, can be implemented in the engine itself.

Virtualization at any level provides benefits. When several levels are combined, the benefits of those levels can also be combined. For example, you can combine benefits by attaching a RAID controller to a virtualization engine that provides virtual volumes for a virtual file system.