|
ABSTRACT:
Data archives have become a core element of the storage infrastructure. Archives today have two purposes: They hold a vast spectrum of data that does not require frequent access, and they ensure that more relevant data is retained (and then deleted) to meet regulatory compliance needs.
Whereas storage area network (SAN) storage emphasizes performance, archival storage relies on low-cost, high-capacity SATA drives and employs a combination of RAID and traditional backups to guard disks against failure. Some archives are little more than "dumb" disk arrays, but the more sophisticated archives provide data deduplication for single-instance storage, robust power conservation features and immutability for data that may be needed as evidence in litigation.
The hardware used for archival storage is only part of the challenge. Software plays a central role in a wide variety of archiving tasks, from optimizing and organizing access to email records to supporting data indexing and searching across millions of files to setting policies for file handling, which sets the stage for data migration and retention in an archive.
|
| |
 |
| |
AUTHOR:
Stephen J. Bigelow
|