What Is Data Reduction?

Data reduction is a capacity optimisation technique in which data is reduced to its simplest possible form to free up capacity on a storage device. There are many ways to reduce data, but the idea is simple: fit as much data as possible into the physical storage you already have.

In this article, we’ll dive into the basics of data reduction to help you better evaluate storage vendors.

Benefits of Data Reduction

The main benefit of data reduction is pretty straightforward: The more data you can fit into a terabyte of disk space, the less capacity you’ll need to purchase. Data reduction can:

  • Save energy
  • Reduce your physical storage costs
  • Decrease your data centre footprint

Data reduction greatly increases the efficiency of a storage system and directly impacts your total spend on capacity.
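The arithmetic behind that claim can be sketched in a few lines. This is an illustrative calculation only: the function names and the 100 TB figure are assumptions made for the example, and the ratios are not measured results from any product.

```python
def physical_capacity_needed(logical_tb: float, reduction_ratio: float) -> float:
    """Physical capacity (TB) needed to hold `logical_tb` of data.

    `reduction_ratio` is logical size divided by stored size,
    so 4.0 means a 4:1 data reduction ratio.
    """
    if reduction_ratio <= 0:
        raise ValueError("reduction_ratio must be positive")
    return logical_tb / reduction_ratio


def space_saved_percent(reduction_ratio: float) -> float:
    """Percentage of physical capacity saved at a given reduction ratio."""
    if reduction_ratio <= 0:
        raise ValueError("reduction_ratio must be positive")
    return (1 - 1 / reduction_ratio) * 100


if __name__ == "__main__":
    logical_tb = 100  # hypothetical data set size
    for ratio in (1.0, 2.0, 4.0):
        needed = physical_capacity_needed(logical_tb, ratio)
        saved = space_saved_percent(ratio)
        print(f"{ratio}:1 -> {needed} TB physical, {saved}% saved")
```

Note that the savings flatten as the ratio grows: going from 1:1 to 2:1 halves the capacity you need, while going from 2:1 to 4:1 saves only a further quarter of the original footprint.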

All-Flash Arrays: Bringing the Benefits of Flash Memory to the Data Centre

As you might guess, simply swapping your HDDs for SSDs is enough to increase the speed and performance of your NAS and SAN solutions. The benefits of an all-flash array are the same as the benefits of flash memory itself:

  • Speed: Faster memory read-write and access times lead to improved speed and performance. The best all-flash arrays leverage NVMe over Fabrics (NVMe-oF) to maximise data transfer speeds and minimise latency throughout a SAN.
  • Portability: SSDs are significantly smaller than HDDs. On a purely physical basis, flash memory has the advantage in capacity per unit of space. On a cost-per-capacity basis, flash memory is quickly closing the gap with HDD solutions.
  • Durability: The lack of physical moving parts makes SSDs inherently less vulnerable to drops and shocks than their spinning-disk counterparts.

Data Compression vs. Data Deduplication

Data-reduction techniques can be broadly categorised into two main types:

  • Data compression: This bit-rate reduction technique involves encoding information using fewer bits of data. Compression algorithms can be lossy (some information is discarded, reducing the resolution of the data) or lossless (statistical redundancy is removed, so the original information is fully preserved).
  • Data deduplication: Also known as dedupe, this process involves eliminating duplicate copies of data within a storage volume or across the entire storage system (cross-volume dedupe). It uses pattern recognition to identify redundant data and replace it with references to a single saved copy.

In practice, you can employ a combination of techniques from both categories to reduce data in your system.
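As a rough illustration of combining the two categories, the sketch below deduplicates data in fixed-size blocks and then applies lossless compression to the unique blocks that remain. It makes several assumptions that are not from the article: a fixed 4KB block size, SHA-256 digests to detect duplicates (one common approach, not necessarily what any particular vendor does), and Python's standard `zlib` module for compression. The function names are hypothetical.

```python
import hashlib
import zlib

BLOCK_SIZE = 4096  # illustrative fixed block size chosen for this sketch


def dedupe(data: bytes, block_size: int = BLOCK_SIZE):
    """Split data into fixed-size blocks and keep one copy of each unique block.

    Returns (store, refs): `store` maps a digest to the single saved copy,
    and `refs` is the ordered list of digests needed to rebuild the data.
    """
    store = {}
    refs = []
    for offset in range(0, len(data), block_size):
        block = data[offset:offset + block_size]
        digest = hashlib.sha256(block).digest()
        store.setdefault(digest, block)  # save the block only the first time
        refs.append(digest)              # every occurrence becomes a reference
    return store, refs


def rebuild(store, refs) -> bytes:
    """Reassemble the original data by following the references."""
    return b"".join(store[digest] for digest in refs)


def compress_store(store):
    """Losslessly compress each unique block."""
    return {digest: zlib.compress(block) for digest, block in store.items()}


if __name__ == "__main__":
    # Three identical blocks followed by one different block.
    data = (b"A" * BLOCK_SIZE) * 3 + bytes(range(256)) * 16
    store, refs = dedupe(data)
    compressed = compress_store(store)
    stored_bytes = sum(len(c) for c in compressed.values())
    print(f"blocks written: {len(refs)}, unique blocks kept: {len(store)}")
    print(f"logical bytes: {len(data)}, stored bytes: {stored_bytes}")
    assert rebuild(store, refs) == data  # dedupe loses nothing
```

Because the compression here is lossless, decompressing each stored block and following the references reproduces the input byte for byte. A lossy scheme would not give that guarantee.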

How Pure Storage Delivers on Data Reduction

Pure Storage® Purity Reduce uses five different data-reduction technologies to save space in its all-flash arrays:

  • Pattern removal: Purity Reduce identifies and removes repetitive binary patterns to reduce the volume of data to be processed by the dedupe scanner and compression engine. 
  • 512B aligned variable dedupe: A high-performance inline deduplication process with a variable block-size range of 4-32KB ensures only unique blocks of data are saved on flash.
  • Inline compression: Purity Reduce uses an append-only write layout and variable addressing to remove the wasted space fixed-block architectures introduce.
  • Deep reduction: Inline compression is followed by heavier-weight compression algorithms post-process to further increase space savings. 
  • Copy reduction: Copies made on FlashArray™ use only metadata. Purity provides instant pre-deduplicated copies of data for xCopy commands, snapshots, replication, and clones.
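Two of the ideas in this list, pattern removal and metadata-only copies, can be illustrated with a toy model. This is not Purity's implementation, and nothing here reflects its internals. The `BlockStore` and `Volume` classes, the 4KB block size, and the rule that a "pattern" is a block made of one repeated byte are all assumptions made for the sketch.

```python
from dataclasses import dataclass, field

BLOCK_SIZE = 4096  # illustrative block size for this toy model


@dataclass
class BlockStore:
    """Shared physical storage: block id -> bytes."""
    blocks: dict = field(default_factory=dict)

    def put(self, block: bytes) -> int:
        block_id = len(self.blocks)
        self.blocks[block_id] = block
        return block_id


@dataclass
class Volume:
    """A volume is only metadata: an ordered map of entries into the store."""
    store: BlockStore
    block_map: list = field(default_factory=list)

    def write(self, data: bytes) -> None:
        for offset in range(0, len(data), BLOCK_SIZE):
            block = data[offset:offset + BLOCK_SIZE]
            if block == block[:1] * len(block):
                # Pattern removal: a block of one repeated byte is recorded
                # in metadata and never reaches the block store.
                self.block_map.append(("pattern", block[0], len(block)))
            else:
                self.block_map.append(("stored", self.store.put(block)))

    def read(self) -> bytes:
        parts = []
        for entry in self.block_map:
            if entry[0] == "pattern":
                _, byte, length = entry
                parts.append(bytes([byte]) * length)
            else:
                parts.append(self.store.blocks[entry[1]])
        return b"".join(parts)

    def clone(self) -> "Volume":
        # Copy reduction: duplicate the metadata, share the stored blocks.
        return Volume(self.store, list(self.block_map))


if __name__ == "__main__":
    store = BlockStore()
    vol = Volume(store)
    vol.write(b"\x00" * (2 * BLOCK_SIZE) + bytes(range(256)) * 16)
    snap = vol.clone()
    print(f"blocks in store after write and clone: {len(store.blocks)}")
```

In this model the clone costs no additional stored blocks, and later writes to the original volume append new entries without disturbing what the clone sees.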

Purity Reduce delivers the most granular and complete data reduction ratios in the flash storage industry.

Data reduction works on a wide variety of applications and data types, but the only way to know how it functions on your data is to try it.
