Virtualization Technology: 10 Things You Need to Know Now About Data Deduplication

 
 
By Chris Preimesberger  |  Posted 2009-10-09
 
 
 

10 Things You Need to Know Now About Data Deduplication

 

10 Things You Need to Know Now About Data Deduplication

Scalability

Avoid managing multiple deduplication repositories by deploying a deduplication solution that will scale a single repository as needed.

Scalability

Data availability

Make sure your deduplicated data is available: look for disk mirroring and appliance clustering to guarantee immediate availability for fast recovery from a failure.

Data availability

Integration with backup

Deduplication should be tightly integrated with backup/archiving processes; ideally you should be able to install the deduplication system on the same appliance as your backup application.

Integration with backup

Open architecture

The deduplication system should be open and able to run on the server/appliance of your choice; deduplication can be deployed as a storage appliance, gateway appliance or even a software appliance. Choose what fits into your environment and answers your needs.

Open architecture

Ease of implementation

The solution should be easy to implement and non-disruptive to your existing infrastructure.

Ease of implementation

Evaluate time to recovery

A good deduplication system will not degrade the restore time required for restructuring data.

Evaluate time to recovery

Total cost of ownership

Remember that the cost of a deduplication system is both material (hardware and software) and management cost; factor in both when evaluating your ROI.

Total cost of ownership

Virtual deployment

Don't forget that deduplication can be deployed on a virtual appliance, saving you hardware cost and allowing you to take full advantage of your virtual infrastructure.

Virtual deployment

Dedupe ratio

Don't focus on the theoretical dedupe ratio that vendors publish; the ratio is contingent on many factors unique to each deployment.

Dedupe ratio

Inline or post-process?

Choose the solution that gives you the best performance to support your backup window.

Inline or post-process?

Rocket Fuel