Big Data Explained

Big Data Explained

What Is Big Data?

Today’s businesses collect vast amounts of data from a variety of sources that must often be analysed in real time. Big data refers to data that is too big, too fast, or too complex to process using traditional techniques.

The Three V's of Big Data

While the concept of big data has been around for a long time, industry analyst Doug Laney was the first to coin the three Vs of big data in 2001, which are:

  • Volume: The quantity of data that must be processed (usually a lot— gigabytes, exabytes, or more)
  • Variety: The wide-ranging types of data, both structured and unstructured, streaming from many different sources
  • Velocity: The speed at which new data is streaming into your system

Some data experts extend the definition to four, five, or more Vs. The fourth and fifth V are:

  • Veracity: The quality of the data with respect to its accuracy, precision, and reliability
  • Value: The value the data provides—what is it worth to your business? 

While the list can go all the way up to 42 Vs, these five are the most commonly used to define big data.

The Benefits of Hosting Big Data on All-Flash Arrays

The benefits of using all-flash storage for big data include:

  • Higher velocities (55-180 IOPS for HDDs vs. 3K-40K IOPS with SSDs)
  • Massive parallelism with over 64K queues for I/O operations
  • NVMe performance and reliability

Test Drive FlashBlade

Experience a self-service instance of Pure1® to manage Pure FlashBlade™, the industry's most advanced solution delivering native scale-out file and object storage.

Why Choose Pure Storage for Your Big Data Needs?

The relative volume, variety, and velocity of big data is constantly changing. If you want your data to stay big and fast, you’ll want to make sure you’re consistently investing in the latest storage technologies. Advances in flash memory have made it possible to deliver custom all-flash storage solutions for all your data tiers. Here’s how Pure Storage® can help power your big data analytics pipeline: 

  • All the benefits of all-flash arrays 
  • Consolidation into a unified, performant data hub that can handle high-throughput data streaming from a variety of sources
  • Truly non-disruptive Evergreen™ upgrades with zero downtimes or data migrations
  • A simplified data management system that combines cloud economics with on-premises control and efficiency
  • Fast and efficient scale-out flash storage with FlashBlade®
800-379-7873 +44 20 3870 2633 +43 720882474 +32 (0) 7 84 80 560 +33 9 75 18 86 78 +49 89 12089 253 +353 1 485 4307 +39 02 9475 9422 +31 (0) 20 201 49 65 +46-101 38 93 22 +45 2856 6610 +47 2195 4481 +351 210 006 108 +966112118066 +27 87551 7857 +34 51 889 8963 +41 31 52 80 624 +90 850 390 21 64 +971 4 5513176 +7 916 716 7308 +65 3158 0960 +603 2298 7123 +66 (0) 2624 0641 +84 43267 3630 +62 21235 84628 +852 3750 7835 +82 2 6001-3330 +886 2 8729 2111 +61 1800 983 289 +64 21 536 736 +55 11 2655-7370 +52 55 9171-1375 +56 2 2368-4581 +57 1 383-2387