RSS
Follow us on:
  • Recommend:
  • 0 Comments
  • Print

RAID Made Easy

Confused by the difference between RAID 0 and RAID 1+0? Our primer on the basics of data redundancy will help you sort out which type of RAID will best safeguard your data.

What is RAID, why do you need it, and what are all those mode numbers that are constantly bandied about? RAID stands for "redundant array of independent disks," and you may or may not need it depending on your data-storage requirements.

The biggest gain from using RAID is protection against drive failure--which, according to Google and other experts, happens a lot more often than hard-drive manufacturers like to admit. (Note that the word array is included in the acronym, so saying "RAID array," as a lot of people do, is redundant. Clearly, storage folks have a strange sense of humor.)

In the old days, when the fastest and largest hard drives carried a very heavy premium (faster drives still do, though not nearly to the same degree), RAID was created to combine multiple, less-expensive drives into a single, higher-capacity and/or faster volume. Redundancy, also known as fault tolerance or failover protection, was included so that the loss of one drive wouldn't render an entire array and its data useless.

As such, RAID has several levels, or methods by which the drives are ganged together, with data distributed across the drives. The RAID levels are commonly referred to by number. The three most common levels in the consumer and small-office markets are RAID 0, RAID 1, and RAID 5, which I'll cover first along with other common options such as JBOD ("just a bunch of disks"), Microsoft's RAID-like Drive Extender, and RAID-virtualization technologies such as those from Drobo, Netgear, Synology, and Seagate.

Most RAID modes don't require that you employ drives of equal size, but they'll use only the capacity on each drive that equals the capacity of the smallest drive in the array--that is, if you mix a 500GB drive with a 1TB drive, the setup will treat both as 500GB drives.

Keep in mind that RAID's data redundancy is a hedge only against data loss due to drive failure, and a way to keep you working until you can replace the bad drive. RAID offers no protection against data lost to malware, theft, or natural disaster, and it's certainly no substitute for proper backup practices.

Common RAID Modes

RAID 0

Picture the 0 in "RAID 0" as a race track, and you're on to its primary purpose--speed. RAID 0 distributes data across multiple drives (for example, block A goes to/from drive 1, block B goes to/from drive 2), which permits the increased write and read speeds. This approach is sometimes referred to as striping, and other modes (as you'll see later) employ the technique as well.

Always remember, however, that RAID 0 offers zero protection against drive failure, as no duplicate or parity information is written. Hence, when a drive fails, you end up with a puzzle that's missing pieces. In such a situation, your data is lost, unless of course you want to spend a very large sum of money trying to recover it.

RAID 1

RAID 1 creates a mirror of your data across two drives--that is, the array writes and reads the very same data to pairs of drives. The drives are equal partners; should either fail, you can continue working with the good one until you can replace the bad one. RAID 1 is the simplest, easiest method to create a failover disk storage subsystem. It costs you a whopping 50 percent of your total available drive capacity, however--for example, two 1TB drives in a mirrored array nets you 1TB of usable space, not 2TB.

You may have as many pairs of mirrored drives as your RAID controller allows. And in the unlikely event that said consumer-grade data traffic cop supports duplex reading, RAID 1 can provide an increase in read speeds by fetching blocks alternately from each drive.

RAID 5

This RAID mode offers both speed and data redundancy for light to medium use (home offices or small to medium-size businesses). RAID 5 writes data to and reads from multiple disks, and distributes parity data across all the disks in the array. Parity data is a smaller amount of data derived mathematically from a larger set that can accurately describe that larger amount of data, and thus can be used to restore it.

RAID 5 uses approximately one-third of the available space for parity information, and requires a minimum of three disks to implement. Its reading from multiple disks means that it's pretty fast compared with consumer setups that might process one or two reads simultaneously, though performance can suffer greatly when it's processing multiple reads in a server situation. Also, since parity information is distributed across all the drives, any drive can fail without causing the entire array to fail.

Next: RAID Alternatives and Other RAID Options

Would you recommend this story? YES NO

  • Recommend:
  • 0 Comments
  • Print
Comments

Subscribe to the Digital Gear Review Newsletter - weekly

See All Newsletters »

Subscribe to the Digital Gear Review Newsletter - weekly

See All Newsletters »
Today's Special Offers