Data Storage - eWeek




Five 'Dirty Little Secrets' to Know When Buying a Data Archive





  Table of Contents:
  1. Five 'Dirty Little Secrets' to Know When Buying a Data Archive
  2. Dirty Little Secrets 3-5

It turns out there are several so-called dirty little secrets that not every vendor will tell you ahead of time about archiving products. There are five categories of these "secrets": scalability, data protection, performance, data migration and energy efficiency. If you're in the market, you need to read this first.

Five 'Dirty Little Secrets' to Know When Buying a Data Archive - Dirty Little Secrets 3-5
( Page 2 of 2 )

Dirty Little Secret No. 3: Data protection. The existence of the commodity hardware "back door."

Many archive companies allow a customer to use "commodity" hardware—standard storage arrays from companies such as Dell, Hewlett-Packard, EMC and IBM—along with their software as part of the "system solution."

"They then will claim that those archives are secure and that they maintain data protection and integrity," Woolery said. "But there is a huge back door, and that is this: Because it's commodity storage, it's anybody's storage, and you can't control that storage. The software is not tied directly to the hardware."

An example of this: An administrator could delete all data at the hardware level, effectively bypassing the software-based security. A good archiving solution should manage this at both the hardware and software layer.

"Someone can go in there and delete a complete RAID set, if they want to, without the application security being able to stop him," Woolery said. "The software doesn't know you deleted the RAID set, and because it's commodity storage, the software and hardware are not necessarily linked all the way down to the hardware level. That's a lot of work to do if you're using anybody's commodity storage."

Dirty Little Secret No. 4: Data migration: When an archive is moved, the files can become orphaned and the entire process could become exceedingly slow.

When most archive applications ingest a file, they create a CAS address at the application level, not the archive level. As a result, the application is the only thing that knows how to find that file.

"So, let's say down the line you want to change your application, or your archive system is growing old and you want to upgrade it and create a new one, then you must migrate that data off the old system," Woolery said.

"The problem is, you have to migrate it off, but you have to do it back through the old application—or you orphan the data. Let's say it's an e-mail database [such as Exchange]. So that means that while you're backing it out during that migration, that e-mail application will not be able to be used by the company."

And we're usually talking about a lot of objects, especially when it comes to e-mail.

"This will take longer than hours," Woolery said. "It could take days. Can you afford to have your e-mail app down for days? It's not possible. What if you're a hospital with lots of image files, which are big files? Due to HIPAA regulations, these files must be kept for seven years. That's a huge amount of data to migrate, and that's why it's so difficult. Vendors won't tell you this ahead of time."

Dirty Little Secret No. 5: Energy efficiency. Not the best in most archive systems.

Since archiving systems usually have one database running across all the system disk drives, it is very difficult to be energy-efficient.

"If you request one file in the archive and it's on one disk, you will have to spin up every single disk in that system to get that one file," Woolery said.

"If you have 100 disks and there's one small file you need to access, or a diagnostic you need to perform, they all spin up if you have a single database. Whether it's a large, robust DB or a small one, it still requires spinning up all 100 disks."

Seems like something the archive industry should be trying to remedy, don't you think? In fact, all five of these issues perhaps should be researched a little closer.


 




 
 
>>> More Data Storage Articles          >>> More By Chris Preimesberger
 

FEATURED SPONSOR MESSAGE

Microsoft Sponsored Resource Center

Increase Your Microsoft Office 365 Knowledge! Dig inside this suite of cloud-based collaboration tools.

Watch the video >>

Brought to you by





Advertisement
eWEEK Quick LInks

 
Close this advertisement