Hierarchical storage management

Hierarchical storage management

Hierarchical storage management (HSM) is a data storage technique which automatically moves data between high-cost and low-cost storage media. HSM systems exist because high-speed storage devices, such as hard disk drive arrays, are more expensive (per byte stored) than slower devices, such as optical discs and magnetic tape drives. While it would be ideal to have all data available on high-speed devices all the time, this is prohibitively expensive for many organizations. Instead, HSM systems store the bulk of the enterprise's data on slower devices, and then copy data to faster disk drives when needed. In effect, HSM turns the fast disk drives into caches for the slower mass storage devices. The HSM system monitors the way data is used and makes best guesses as to which data can safely be moved to slower devices and which data should stay on the fast devices.

In a typical HSM scenario, data files which are frequently used are stored on disk drives, but are eventually migrated to tape if they are not used for a certain period of time, typically a few months. If a user does reuse a file which is on tape, it is automatically moved back to disk storage. The advantage is that the total amount of stored data can be much larger than the capacity of the disk storage available, but since only rarely-used files are on tape, most users will usually not notice any slowdown.

HSM is sometimes referred to as tiered storage.

HSM (originally DFHSM, now DFSMShsm) was first[citation needed] implemented by IBM on their mainframe computers to reduce the cost of data storage, and to simplify the retrieval of data from slower media. The user would not need to know where the data was stored and how to get it back; the computer would retrieve the data automatically. The only difference to the user was the speed at which data was returned.

Later, IBM ported HSM to its AIX operating system, and then to other Unix-like operating systems such as Solaris, HP-UX and Linux.

HSM was also implemented on the DEC VAX/VMS systems and the Alpha/VMS systems. The first implementation date should be readily determined from the VMS System Implementation Manuals or the VMS Product Description Brochures.

Recently, the development of Serial ATA (SATA) disks has created a significant market for three-stage HSM: files are migrated from high-performance Fibre Channel Storage Area Network devices to somewhat slower but much cheaper SATA disks arrays totalling several terabytes or more, and then eventually from the SATA disks to tape.

The newest development in HSM is with hard disk drives and flash memory, with flash memory being over 30 times faster than disks, but disks being considerably cheaper.

Conceptually, HSM is analogous to the cache found in most computer CPUs, where small amounts of expensive SRAM memory running at very high speeds is used to store frequently used data, but the least recently used data is evicted to the slower but much larger main DRAM memory when new data has to be loaded.

In practice, HSM is typically performed by dedicated software, such as IBM Tivoli Storage Manager, CommVault [1], VERITAS Enterprise Vault, Sun Microsystems SAMFS/QFS, Quantum StorNext, or EMC Legato OTG DiskXtender.

Contents

Use Cases

HSM is often used for deep archival storage of data to be held long term at low cost. Automated tape robots can silo large quantities of data efficiently with low power consumption.

Some HSM software products allow the user to place portions of data files on high-speed disk cache and the rest on tape. This is used in applications that stream video over the internet -- the initial portion of a video is delivered immediately from disk while a robot finds, mounts and streams the rest of the file to the end user. Such a system greatly reduces disk cost for large content provision systems.

Tiered storage

Tiered storage is a data storage environment consisting of two or more kinds of storage delineated by differences in at least one of these four attributes: Price, Performance, Capacity and Function.

Any significant difference in one or more of the four defining attributes can be sufficient to justify a separate storage tier.

Examples:

  • Disk and Tape: Two separate storage tiers identified by differences in all four defining attributes.
  • Old technology disk and new technology disk: Two separate storage tiers identified by differences in one or more of the attributes.
  • High performing disk storage and less expensive, slower disk of the same capacity and function: Two separate tiers.
  • Identical Enterprise class disk configured to utilize different functions such as RAID level or replication: A separate storage tier for each set of unique functions.

Note: Storage Tiers are NOT delineated by differences in vendor, architecture, or geometry except where those differences result in clear changes to Price, Performance, Capacity and Function.

See also

Implementations


Wikimedia Foundation. 2010.

Игры ⚽ Нужно решить контрольную?

Look at other dictionaries:

  • Hierarchical Storage Management — Pour les articles homonymes, voir HSM. Hiérarchie du Stockage Hierarchical Storage Management (HSM), désigne un système permettant le stockage de données de façon hiérarchique …   Wikipédia en Français

  • Hierarchical Storage Management —    Abbreviated HSM. A combination of several types of file storage systems, managed by intelligent software.    In HSM, data is moved from one type of storage to another depending on how frequently the data is accessed. Active data is held on… …   Dictionary of networking

  • Hierarchical Storage Management —    (HSM)    Managing files spanning several levels of storage devices to balance overall performance with cost …   IT glossary of terms, acronyms and abbreviations

  • Storage management — can refer to one of the following:* ADSTAR Distributed Storage Manager * Automatic Storage Management * Enterprise storage management * Hierarchical storage management * IBM Tivoli Storage Manager * OpenView Storage Area Manager * Storage… …   Wikipedia

  • Content storage management — (CSM) is a technique for the evolution of traditional media archive technology used by media companies and content owners to store and protect valuable file based media assets. CSM solutions focus on active management of content and media assets… …   Wikipedia

  • IBM Tivoli Storage Manager — ADSM redirects here. For the stock exchange, see Abu Dhabi Securities Market. IBM Tivoli Storage Manager Developer(s) IBM Stable release 6.3 / October 2011 Operating system cross platform …   Wikipedia

  • Digital asset management — (DAM) consists of management tasks and decisions surrounding the ingestion, annotation, cataloguing, storage, retrieval and distribution of digital assets. Digital photographs, animations, videos and music exemplify the target areas of media… …   Wikipedia

  • Computer data storage — 1 GB of SDRAM mounted in a personal computer. An example of primary storage …   Wikipedia

  • Information Lifecycle Management — NOTOC Information Lifecycle Management refers to a wide ranging set of strategies for administering storage systems on computing devices. Specifically, four categories of storage strategies may be considered under the auspices of ILM.PolicyILM… …   Wikipedia

  • IBM Tivoli Storage Manager (TSM) — IBM Tivoli Storage Manager IBM Tivoli Storage Manager …   Wikipédia en Français

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”