Evaluate Pure Storage

Data Reduction: essential for a cost-effective flash array.

5-10x reduction

graphic showing the strength of dedupe

  • CompressionCompression
  • DeduplicationDeduplication
  • Pattern RemovalPattern Removal
  • Thin ProvisioningThin Provisioning

The FlashArray employs high-performance inline data reduction techniques including deduplication, compression, and thin provisioning to dramatically reduce the data footprint, allowing the logical size of the FlashArray to be 5-10X larger than its actual raw physical size, thereby driving down the $/GB usable. But don't take our word for it: check out the dedupe ticker below with real-time customer data reduction results.


Average Data Reduction Rate

(Deduplication + Compression Only)

5.79:1

5.79 = 2.70 (comp.) x 2.15 (dup.)
Useful for comparing vs. other storage
arrays with thin provisioning enabled.

  • 5.67
  • 3.90
  • 4.76
  • 11.77
  • 6.78
  • 4.72
  • 4.99
  • 2.13
  • 3.28
  • 6.57
  • 1.69
  • 3.31
  • 4.21
  • 4.82
  • 3.94
  • 3.36
  • 5.77
  • 4.94
  • 3.12
  • 4.53
  • 3.27
  • 18.41
  • 3.10
  • 6.14
  • 4.86
  • 2.74
  • 6.49
  • 4.71
  • 8.57
  • 5.21
  • 9.39
  • 5.19
  • 2.84
  • 4.00
  • 18.82
  • 6.50
  • 4.70
  • 4.98
  • 4.19
  • 3.75
  • 6.36
  • 3.18
  • 5.11
  • 3.45
  • 6.12
  • 3.70
  • 3.90
  • 4.30
  • 6.82
  • 6.59

Dedupe Ticker

Real-time Data Reduction Results

  • >20.00
  • >20.00
  • >20.00
  • 8.16
  • >20.00
  • >20.00
  • >20.00
  • 9.84
  • 6.78
  • 8.58
  • >20.00
  • 6.16
  • 8.21
  • 17.32
  • 5.16
  • >20.00
  • 11.60
  • 4.47
  • >20.00
  • 4.83
  • 16.77
  • 13.17
  • 11.84
  • 13.72
  • 11.44
  • 6.43
  • 13.92
  • >20.00
  • 14.25
  • >20.00
  • >20.00
  • 9.55
  • 3.97
  • 19.03
  • >20.00
  • 4.33
  • 5.71
  • 11.57
  • 15.85
  • 7.32
  • 6.63
  • 5.66
  • >20.00
  • 8.05
  • 8.16
  • 10.61
  • 11.23
  • >20.00
  • 10.09
  • >20.00

Average Total Reduction

(Including Thin Provisioning)

12.72:1

Useful for comparing vs. other disk
arrays without thin provisioning,
including most other flash products.

Averaged across EVERY Pure Storage
FlashArray, updated live 24x7.

How is data reduction calculated?

Last Updated: 07:35:25AM PDT


Think everyone’s got data reduction?

Not so fast, Jack. Here’s the skinny:

* As of May 2013

  • Flash Appliance/Array
  • Deduplication
  • Compression
  • Thin
    Provisioning
  • Geometry
  • Inline
    (vs. post-process)
Pure Storage
FlashArray
Variable
512byte - 32K
Violin
Memory
N/A N/A
EMC
XtremIO
Fixed
4 Kilobyte
IBM
TMS+SVC
N/A

Data
Reduction

CTO Deep Dive: Data Reduction

How much will my data reduce?

Data reduction works on a wide variety of applications and data types, but the only way to know how it functions on your data is to try it. The averages to the right are what we find typical for our most common use cases. If you want to analyze your data, try our PureSize tool, or request a trial deployment.

Virtual server environments

VMware or Hyper-V, consolidated virtual server environments with mixed applications.

4-6 to 1

Database environments

OLTP or OLAP, even databases get surprising amounts of data reduction.

2-4 to 1

Virtual desktop (VDI) environments

Virtual desktops (both persistent and non persistent) are one of the most reducible workloads in the datacenter.

5-10 to 1


How is Pure Storage data reduction different?

It's so fundamental to the FlashArray you can't even turn it off.

High performance.

Deduplication and compression in a flash array have to be fast. The FlashArray's core architecture was designed to support data reduction. All of our performance benchmarks are taken with data reduction turned–on.

Global and inline.

Other data reduction solutions dedupe as a post-process, and operate only inside a single SSD, LUN, card, or volume. Partitioning your data set dramatically reduces the savings from deduplication. Pure Storage dedupe is inline and global across the entire array.

Accelerates flash writes.

With flash storage, reading is easy; writing is the hard part. Inline deduplication and compression allow the FlashArray to avoid 70-90% of the writes it would otherwise do to flash, dramatically increasing the array's write bandwidth and extending the life of the underlying flash.


  • 512-byte granularity.

    8x smaller than the other guys.

    512-byte granularity

    In the world of deduplication, size matters. The smaller "chunk size" you use to look for duplicates, the more effective you will be at reducing data. But chunk size is a trade-off, as smaller chunks require more processing and metadata. Pure Storage detects duplicates down to a 512-byte chunk size, which has two advantages: substantially higher deduplication (typically 3-5X better than the coarse-grained alternatives), and that fine-grained geometry also offers better alignment with application data layouts.

  • Only possible with 100% flash.

    Enables high-performance dedupe
    and compression.

    At Pure Storage we realized early on that we couldn't deliver our breakthrough data deduplication algorithms if there was any mechanical disk in our array. Hear Pure Storage CTO and co-founder, John Colgrove, talk about how the FlashArray's architecture enables high-performance data reduction.

Real customer results say a lot (and only store a little).

  • Logo Mattersight7.2 - to - 1
  • Logo Yodle6.2 - to - 1
  • Logo Siemens5.7 - to - 1
  • Logo Fenwick4.9 - to - 1
  • Logo City of Davenport6.5 - to - 1
  • Logo TripPak4.1 - to - 1