The 4 Vs of Big Data

VOLUME OF BIG DATA

The size of the data sets that need to be analyzed and processed.

EXAMPLE:

All credit card transactions made within Europe on a single day.

VELOCITY OF BIG DATA

The speed at which data is generated, at a pace that requires distinct (distributed) processing techniques.

EXAMPLE:

Twitter messages or Facebook posts.

Because of these characteristics of the data, the knowledge domain that deals with the storage, processing, and analysis of these data sets has been labeled Big Data.

VARIETY OF BIG DATA

The range of data types involved, which generally fall into one of three categories: structured, semi-structured, and unstructured data. This variety frequently requires distinct processing capabilities and specialist algorithms.

EXAMPLE:

CCTV audio and video files that are generated at various locations in a city.
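To make the three categories concrete, here is a minimal Python sketch that handles one tiny sample of each type with a different standard-library tool. All sample values (card IDs, user names, the camera sentence) are hypothetical and exist only for illustration.

```python
import csv
import io
import json
import re

# Structured: fixed columns, every row has the same fields.
structured = "card_id,amount\nA1,19.99\nB2,5.00\n"
rows = list(csv.DictReader(io.StringIO(structured)))

# Semi-structured: self-describing, but fields may vary per record.
semi_structured = '{"user": "ana", "tags": ["news"], "geo": {"city": "Madrid"}}'
record = json.loads(semi_structured)

# Unstructured: free text with no schema; needs ad hoc extraction.
unstructured = "Camera 7 recorded loud noise near Main Street at 22:15."
times = re.findall(r"\b\d{2}:\d{2}\b", unstructured)

print(rows[0]["amount"], record["geo"]["city"], times)
```

Each sample needs its own parsing approach, which is the practical meaning of "distinct processing capabilities".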

VERACITY OF BIG DATA

Refers to the quality of the data that is being analyzed.

High-veracity data

Has many records that are valuable to analyze and that contribute in a meaningful way to the overall results.

Low-veracity data

Contains a high percentage of meaningless data.

Non-valuable data

Referred to as noise.

EXAMPLE:

Data from a medical experiment or trial.
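One rough way to put a number on veracity is the share of records that count as noise. The sketch below assumes a caller-supplied rule for what makes a record valuable. The function name `noise_ratio`, the sample readings, and the rule itself are hypothetical, not part of any standard definition.

```python
def noise_ratio(records, is_valuable):
    """Return the fraction of records that the predicate marks as not valuable."""
    records = list(records)
    if not records:
        return 0.0
    noisy = sum(1 for r in records if not is_valuable(r))
    return noisy / len(records)


# Hypothetical readings; None and non-positive values are treated as noise here.
readings = [36.6, None, 37.1, -1.0, 36.9]
ratio = noise_ratio(readings, lambda x: x is not None and x > 0)
print(ratio)
```

A lower ratio suggests higher veracity under the chosen rule. What counts as "valuable" depends entirely on the analysis being performed.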