NEVER Use A RAID As Your Backup System!


Photographers acquire A LOT of images, and that translates to terabytes of data that need to be saved and protected.

Their archive and backup schemes are just as diverse as their shooting styles, ranging from RAID systems like a Drobo, to triple-redundant drives, to no backup at all.

One of the first things to remember is that an archive and a backup are NOT the same.

Backing Up Your Data

To borrow a quote from a fellow photographer’s blog, “Repeat after me three times: RAID is not backup. Period.”

For those who have never heard of it, RAID stands for “Redundant Array of Independent Disks” or “Redundant Array of Inexpensive Disks.” And for those who use the phrase “RAID array,” that’s redundant.

The concept of a RAID is to combine multiple, less-expensive drives into a single, higher-capacity and/or faster volume. It is designed for redundancy so that the array and its data remain usable WHEN (NOT IF) a drive fails. The terms 1-disk and 2-disk redundancy refer to the number of drives that can fail while the array remains usable.

There are many different types of RAID configurations:

RAID 0: Its primary purpose is faster performance. RAID 0 spreads the data across multiple drives (for example, block A is on drive 1 and block B is on drive 2), which permits increased write and read speeds. This is called striping.
RAID 0 offers no protection against drive failure, since this mode does not write any duplicate or parity information.
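
To make striping a little more concrete, here is a minimal Python sketch. The function name and the four-block example are made up for illustration; a real RAID controller does this at a much lower level with fixed-size stripes.

```python
def stripe(blocks, num_drives):
    """Distribute blocks round-robin across drives, RAID 0 style."""
    drives = [[] for _ in range(num_drives)]
    for index, block in enumerate(blocks):
        drives[index % num_drives].append(block)
    return drives


# Hypothetical file split into four blocks, striped over two drives
drives = stripe(["A", "B", "C", "D"], num_drives=2)
print(drives)  # [['A', 'C'], ['B', 'D']]
```

Notice that each drive holds only every other block, with no duplicate anywhere. Lose either drive and the file is incomplete, which is why RAID 0 gives you speed but no protection.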

RAID 1: This mode writes the same data to a pair of drives, which is called mirroring. If either drive fails, you can continue working with the other until you replace the bad one.

RAID 5: This mode is about both speed and redundancy. RAID 5 writes and reads from multiple disks, and it distributes parity data across all the disks in the array. Parity data is extra information computed mathematically from the data blocks; if one block is lost, it can be recalculated from the remaining blocks and the parity. Since parity information is distributed across all the drives, any single drive can fail without causing the entire array to fail.

RAID 5 needs a minimum of three disks to implement. Since data is read from multiple disks, performance can improve under RAID 5. This makes RAID 5 a good fit for video editing systems.
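
For the curious, parity in RAID 5 is commonly computed with XOR. The Python sketch below shows the idea under simplifying assumptions: the block contents are made up, and the parity sits in one place instead of being rotated across the drives the way a real RAID 5 does it.

```python
from functools import reduce


def xor_blocks(blocks):
    """XOR equal-length byte blocks together, one byte position at a time."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


# Hypothetical blocks stored on three data drives
data_blocks = [b"RAW1", b"RAW2", b"RAW3"]
parity = xor_blocks(data_blocks)

# Simulate losing the second drive, then rebuild its block
# from the two surviving blocks plus the parity block.
rebuilt = xor_blocks([data_blocks[0], data_blocks[2], parity])
print(rebuilt)  # b'RAW2'
```

The rebuild has to read every surviving block and recalculate the missing one, which hints at why rebuilding a large array takes far longer than a simple copy.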


Other options include RAID 6 or RAID 10, but they aren’t often found in consumer-level RAID units. RAID levels 2, 3, and 4 are not commonly used anymore.
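
If you want to compare levels side by side, the usual rules of thumb for usable capacity and guaranteed fault tolerance can be put into a small Python sketch. The function name is hypothetical, it assumes identical drives, and it treats RAID 10 as striped two-drive mirrors; actual products (a Drobo in particular) may use their own proprietary layouts.

```python
MIN_DRIVES = {0: 2, 1: 2, 5: 3, 6: 4, 10: 4}


def raid_summary(level, num_drives, drive_tb):
    """Return (usable capacity in TB, drive failures the array is guaranteed to survive)."""
    if level not in MIN_DRIVES:
        raise ValueError(f"unsupported RAID level: {level}")
    if num_drives < MIN_DRIVES[level]:
        raise ValueError(f"RAID {level} needs at least {MIN_DRIVES[level]} drives")
    if level == 10 and num_drives % 2:
        raise ValueError("RAID 10 needs an even number of drives")

    if level == 0:
        return num_drives * drive_tb, 0        # striping only, no redundancy
    if level == 1:
        return drive_tb, num_drives - 1        # every drive holds a full copy
    if level == 5:
        return (num_drives - 1) * drive_tb, 1  # one drive's worth of parity
    if level == 6:
        return (num_drives - 2) * drive_tb, 2  # two drives' worth of parity
    return (num_drives // 2) * drive_tb, 1     # RAID 10: striped mirrored pairs


print(raid_summary(5, 4, 2))   # (6, 1)
print(raid_summary(1, 2, 3))   # (3, 1)
```

None of these numbers say anything about deleted or corrupted files, which is the real issue discussed next.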

The problem with considering a RAID as your backup is that it doesn’t protect you from file deletion or from corruption caused by applications, the operating system, or viruses.

So if you accidentally delete a file, it will instantly be removed from both mirrored copies. If your disk is corrupted by a software bug or virus, the corruption is written to both mirrored copies simultaneously.
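
Here is a toy Python model of a two-drive mirror that shows the difference. The class and file names are invented for the example and this is in no way how a real controller is written; it only illustrates that a mirror survives a dead drive but faithfully copies a deletion.

```python
class Mirror:
    """Toy RAID 1: every write and delete is applied to all working drives."""

    def __init__(self):
        self.disks = [{}, {}]  # two mirrored drives, modeled as dictionaries

    def _live(self):
        return [disk for disk in self.disks if disk is not None]

    def write(self, name, data):
        for disk in self._live():
            disk[name] = data

    def delete(self, name):
        for disk in self._live():
            disk.pop(name, None)

    def fail(self, index):
        self.disks[index] = None  # simulate a dead drive

    def read(self, name):
        for disk in self._live():
            if name in disk:
                return disk[name]
        return None


array = Mirror()
array.write("wedding_001.raw", b"raw data")
array.write("wedding_002.raw", b"more raw data")

array.delete("wedding_002.raw")        # accidental deletion
print(array.read("wedding_002.raw"))   # None: gone from both copies

array.fail(0)                          # one drive dies
print(array.read("wedding_001.raw"))   # b'raw data': the mirror did its job
```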

Having all the drives in one box that is being served by one power supply and controller has its problems too. A bad enough power surge will probably fry all disks in the RAID. If your house burns down… well, you get the point.

A RAID is still a single device and because of that, also a single point of failure.

None of this means you should not use a RAID. Many photographers I know love the Drobo system. This is fine. JUST BACK IT UP! (I have never used a Drobo, but for another photographer’s opinion on Drobo see Scott Kelby’s post here: scottkelby.com/2012/im-done-with-drobo)

A BACKUP needs to be a complete and recoverable copy of your data that resides on a separate hard drive, possibly even a RAID. Just DO NOT USE SOFTWARE THAT MIRRORS THE PRIMARY DRIVE TO THE BACKUP or you will run into the same problems described above for RAID 1. Proper backup software will perform a full backup and then hourly or daily backups of changed files.
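
To show what “a full backup and then backups of changed files” buys you, here is a minimal Python sketch. It is only an illustration of the idea, not how Time Machine or any other product is actually implemented; the class, the in-memory “disk” and the file names are all assumptions made for the example.

```python
import hashlib


def digest(data):
    """Fingerprint file contents so changes can be detected."""
    return hashlib.sha256(data).hexdigest()


class VersionedBackup:
    def __init__(self):
        self.snapshots = []  # one dict per run: files that were new or changed
        self.known = {}      # file name -> digest of the last version backed up

    def run(self, source):
        """Back up only new or changed files; never remove anything already saved."""
        changed = {name: data for name, data in source.items()
                   if self.known.get(name) != digest(data)}
        for name, data in changed.items():
            self.known[name] = digest(data)
        self.snapshots.append(changed)
        return sorted(changed)

    def restore(self, name):
        """Return the most recent saved version, even if the original was deleted."""
        for snapshot in reversed(self.snapshots):
            if name in snapshot:
                return snapshot[name]
        return None


work_disk = {"img_001.raw": b"raw data 1", "img_002.raw": b"raw data 2"}
backup = VersionedBackup()
print(backup.run(work_disk))          # first run is a full backup: both files

del work_disk["img_002.raw"]          # accidental deletion
work_disk["img_001.raw"] = b"edited"
print(backup.run(work_disk))          # ['img_001.raw']: only the changed file

print(backup.restore("img_002.raw"))  # b'raw data 2': still recoverable
```

A mirror would have deleted img_002.raw from the copy as well. Because this kind of backup only adds to its history, the deleted file is still there to restore.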

My operating system and work disk (containing the current year’s photography) are backed up daily using Apple Time Machine software and a SEPARATE 3-terabyte drive. The drive is also plugged into its own surge protector. This software does not mirror the primary drives; it backs up new and changed files. This gives you the opportunity to go back and recover something that may have been accidentally deleted.

The work disk contains ALL RAW files from the current year.

Images that are worked up for publication are exported from Adobe Lightroom and stored on my Photoshelter Archive. I trust Photoshelter and their geographically redundant archive to protect those images. If disaster were to strike, I could still export the images again from the backed up Lightroom archive.

My ARCHIVE of RAW images is stored on a separate drive that contains the last two years’ work. These images are also backed up on the primary backup drive.


Every year I rotate the oldest year off to a small portable drive. For these backups of the archives, I use Western Digital My Passport 2-terabyte drives. They are small and easily portable for off-site storage.

Basically everything exists in two or three places.

Whatever method you use for backing up and archiving, make sure that your data is stored redundantly and housed in more than one place. That is the only way to keep it safe.
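
If you like to double-check, the “two or three places” rule is easy to audit. Below is a small Python sketch that lists files existing in fewer than two locations. The location names and file names are hypothetical, and in practice you would build the lists from your actual drives.

```python
def under_protected(locations, minimum_copies=2):
    """Return the files that exist in fewer than `minimum_copies` locations."""
    counts = {}
    for files in locations.values():
        for name in set(files):
            counts[name] = counts.get(name, 0) + 1
    return sorted(name for name, copies in counts.items() if copies < minimum_copies)


locations = {
    "work disk": ["2013_001.raw", "2013_002.raw"],
    "backup drive": ["2013_001.raw"],
    "off-site portable": [],
}
print(under_protected(locations))  # ['2013_002.raw']
```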

If anyone has any questions, feel free to ask!


7 thoughts on “NEVER Use A RAID As Your Backup System!”

  1. Looking at your diagram, am I correct that you have four separate drives coming off your desktop: one for the system software, one for work files, a third for images for the last two years and then a large 3TB drive holding everything? If so, mind if I ask what type of drives, sizes and enclosures you’re using for each of those. It also looks like you’ve foregone RAID. Are you using Time Machine for backups or making bootable backups with something like SuperDuper?

    • Hi PJ

      There are actually 5 drives attached to the desktop machine; 4 of them are internal to the tower (the system disk 500GB, the work disk 2TB, and 2 scratch disks 1TB each). The 3TB backup is external. All drives are Western Digital.

      I have a bootable backup of the system disk, which is simply another internal drive that is not installed. I can pop it into a docking station that is generally used to diagnose disks if I need to boot from it. Of course, since the system disk has nothing on it but the system and the applications, I can also do a pretty fast recovery of that drive from TIME MACHINE if needed.

      Yes I use TIME MACHINE.

      The problem with the RAID 1 as your main drive is this. If you delete a file, it is deleted on the mirrored copy as well. Same issue with corruption. The drives in the enclosure mirror themselves in real time. The only thing a RAID 1 protects against is a crashed drive.

      RAID 5 would offer better protection, and the RAID 5 draw is speed. But truly, unless you are doing a lot of video or sound work, I don’t see the need. AND even though you can recover from a crash, you should talk to people who have had to do so: IT CAN BE PAINFULLY SLOW to rebuild the failed drive. The rebuild recalculates the lost data from parity; it is not a simple copy. Of course this array would also have to be backed up. Parity will not bring back a single deleted file from a RAID 5, because the deletion is written to the array like any other change. Its main purpose is speed and the ability to rebuild a lost drive.

      My laptop is backed up with TIME MACHINE to a portable drive. Here I am mainly concerned about the SYSTEM files and other things like accounting that I have on the laptop. I transfer daily work off each night, so I am not too worried about that. Usually by that time, worked up images are on the way to the client and have been sent to Photoshelter.

      Yes the small WD My Passport Drives are great for long term archive storage. I simply buy 2 at the end of every year. If you look for deals at Staples or Office Depot you can usually get one for about $120.00.

  2. Thanks for this article Pete – PERFECT timing for me and I plan to adapt this myself. I’ve had a Drobo, it was a nightmare, finally turning into a brick. As I rethink my backup system, I’ve gotten myself totally confused with RAID 0, 1, 5, etc. I embrace the concept of backup being more than drive failure replacement.

    If I use Time Machine for my main backup, what [software] do you use for your on-site and off-site backups? I thought that TM could only back up a working computer. Might you elaborate a bit on whether you drag and drop, or maybe use another program for the archived backups?

    Thanks,

    Greg

    • Hi Greg. I do not have an off site backup for the general system. The image files from each year are exported from Lightroom as a catalog to separate externals for off site. And of course all of the “worked up” images are stored on PhotoShelter.

  3. Pete,

    I have just spent the best part of the day looking for a longer term solution to back up my photography. I have learnt a lot about the pros and cons of RAID and proprietary systems, and then found your article, the best on the net for my purpose. It also turns out that it is pretty much what I am already doing, though I am doing it manually. What software do you use for your differential backups? (Possibly Time Machine? I am on a PC, so if that is the case I will need to explore some more.)

  4. Use a Seagate 1 TB USB drive; the built-in software allows you to choose the time of day and daily schedule of the backups to the drive. I will be using a 6-drive RAID 10 solution for a t620 enterprise server. I will be using the Seagate for a backup of the SQL database. The Seagate will be left in a fireproof safe with a hole in the back for wiring.
