Close
  • Latest News
  • Artificial Intelligence
  • Big Data and Analytics
  • Cloud
  • Networking
  • Cybersecurity
  • Applications
  • IT Management
  • Storage
  • Sponsored
  • Mobile
  • Small Business
  • Development
  • Database
  • Servers
  • Android
  • Apple
  • Innovation
  • Blogs
  • PC Hardware
  • Reviews
  • Search Engines
  • Virtualization
Read Down
Sign in
Close
Welcome!Log into your account
Forgot your password?
Read Down
Password recovery
Recover your password
Close
Search
Logo
Logo
  • Latest News
  • Artificial Intelligence
  • Big Data and Analytics
  • Cloud
  • Networking
  • Cybersecurity
  • Applications
  • IT Management
  • Storage
  • Sponsored
  • Mobile
  • Small Business
  • Development
  • Database
  • Servers
  • Android
  • Apple
  • Innovation
  • Blogs
  • PC Hardware
  • Reviews
  • Search Engines
  • Virtualization
More
    Home Database
    • Database

    What is Data Deduplication?

    By
    eWEEK EDITORS
    -
    July 23, 2007
    Share
    Facebook
    Twitter
    Linkedin

      Q: How does data deduplication work?
      A: Data deduplication is based on the fact that in any enterprise where you are storing and backing up data there is a tremendous amount of content the occurs more than once. Its more efficient to eliminate or deduplicate those occurrences rather than store them in multiple places. Deduplication vendors use a variety of different algorithms. Some use hash algorithms like SHA-1, others do bit-by-bit comparison. But it boils down to examining the blocks of data in a backup stream and replacing duplicated instances with pointers to a unique instance.

      Q: What do data deduplication products look like?
      A: Typically its an appliance that can sit either in-band or out-of-band. If its in-band, then it analyzes and deduplicates the backup stream while its being sent to backup storage (for example, to a virtual tape library or VTL). If its out-of-band, it analyzes and rewrites the data after its been written to the backup device. In either case, the goal is to remove duplicate data while changing as little as possible in your existing infrastructure, all you do is deploy the appliance.

      Q: What kind of applications does deduplication work best with?
      A: It can work with either file-oriented or block-oriented applications. It really depends on which applications that particular vendors product is targeting. But you need to keep in mind that it isnt suited for data thats already been compressed or encrypted, because that will reduce the number of pattern matches the deduplication algorithm can detect. Typically you would do encryption after deduplication, not before.

      Q: What are the main benefits of deduplication?
      A: Well, contrary to what you might think, the most important benefit isnt really saving storage space, but the fact that you need to send less data to backup in the first place. That can save you a lot of time and bandwidth.

      Q: Just how much data redundancy can be eliminated with deduplication?
      A: It varies tremendously of course. In the best case, you can get a compression ratio of 20-to-1. In other words, a 20 terabyte backup would be reduced to just one terabyte. About 10% of the data deduplication users we talk to get this kind of ratio. But this is definitely something you need to test for yourself with your own data before you buy a deduplication appliance.

      Q: What are some of the vendors of data deduplication gear?
      A: Data Domain and Diligent Technologies are two of the leaving private independent vendors. EMC acquired a well-known company called Avamar. Network Appliance, Symantec and FalconStor also have solutions.

      eWEEK EDITORS
      eWeek editors publish top thought leaders and leading experts in emerging technology across a wide variety of Enterprise B2B sectors. Our focus is providing actionable information for today’s technology decision makers.
      Get the Free Newsletter!
      Subscribe to Daily Tech Insider for top news, trends & analysis
      This email address is invalid.
      Get the Free Newsletter!
      Subscribe to Daily Tech Insider for top news, trends & analysis
      This email address is invalid.

      MOST POPULAR ARTICLES

      Latest News

      Zeus Kerravala on Networking: Multicloud, 5G, and...

      James Maguire - December 16, 2022 0
      I spoke with Zeus Kerravala, industry analyst at ZK Research, about the rapid changes in enterprise networking, as tech advances and digital transformation prompt...
      Read more
      Applications

      Datadog President Amit Agarwal on Trends in...

      James Maguire - November 11, 2022 0
      I spoke with Amit Agarwal, President of Datadog, about infrastructure observability, from current trends to key challenges to the future of this rapidly growing...
      Read more
      IT Management

      Intuit’s Nhung Ho on AI for the...

      James Maguire - May 13, 2022 0
      I spoke with Nhung Ho, Vice President of AI at Intuit, about adoption of AI in the small and medium-sized business market, and how...
      Read more
      Applications

      Kyndryl’s Nicolas Sekkaki on Handling AI and...

      James Maguire - November 9, 2022 0
      I spoke with Nicolas Sekkaki, Group Practice Leader for Applications, Data and AI at Kyndryl, about how companies can boost both their AI and...
      Read more
      Cloud

      IGEL CEO Jed Ayres on Edge and...

      James Maguire - June 14, 2022 0
      I spoke with Jed Ayres, CEO of IGEL, about the endpoint sector, and an open source OS for the cloud; we also spoke about...
      Read more
      Logo

      eWeek has the latest technology news and analysis, buying guides, and product reviews for IT professionals and technology buyers. The site’s focus is on innovative solutions and covering in-depth technical content. eWeek stays on the cutting edge of technology news and IT trends through interviews and expert analysis. Gain insight from top innovators and thought leaders in the fields of IT, business, enterprise software, startups, and more.

      Facebook
      Linkedin
      RSS
      Twitter
      Youtube

      Advertisers

      Advertise with TechnologyAdvice on eWeek and our other IT-focused platforms.

      Advertise with Us

      Menu

      • About eWeek
      • Subscribe to our Newsletter
      • Latest News

      Our Brands

      • Privacy Policy
      • Terms
      • About
      • Contact
      • Advertise
      • Sitemap
      • California – Do Not Sell My Information

      Property of TechnologyAdvice.
      © 2022 TechnologyAdvice. All Rights Reserved

      Advertiser Disclosure: Some of the products that appear on this site are from companies from which TechnologyAdvice receives compensation. This compensation may impact how and where products appear on this site including, for example, the order in which they appear. TechnologyAdvice does not include all companies or all types of products available in the marketplace.

      ×