Is big data really that “big“ ?

Subinraj
2 min readSep 17, 2020

In the modern world the term “big data” is bigger than what you actually think, So what exactly is big data ? and why is it such a big deal ? well lets find out

Big data is a mixture of structured, semi structured and unstructured data collected by organizations that can be mined for information and used in machine learning projects, predictive modeling and other various applications. When it comes to big data google is arguably the undisputed champion, There is a wide talk about google being a spy or google knows everything about their users they does all this by the help of big data analysis the exact term for that process is called “Big data analytics”

So as we’ve seen in the definition of big data which basically states big data is collection of data according to my findings on average we produce a humongous amount of 2500000 Terabytes of data per day which is a probably increasing at very high rate per each considering the fact most of countries are affected by covid-19 and number of internet users increased terrifically.

The Six V’s Of Big Data

Basically there are three V’s of big data but since the world is growing at each second three more V’s has been added to make it six V’s for big data which are as follows

  • Volume : The amount of data from myriad sources
  • Variety : The types of data : structured , semi-structured and unstructured
  • Velocity : The speed at which big data is generated
  • Veracity : The degree to which big data can be trusted
  • Value : The business value of the data collected
  • Variability : The ways in which the big data can be used and formatted

These six V’s of big data literally shows us the six different factors that matters in big data

Currently there are 4.57 billion active internet users around the world which is more than 50% of the entire world population mind blowing isn’t it ?

If we split up the 2500000 Terabytes of data usage per day we can approximately get the following information from various websites

  • Google processes over 3.5 billion search queries every day
  • 350 million photos are uploaded to Facebook each day.
  • Every day, 306.4 billion emails are sent
  • Facebook generates 4 petabytes of data every day
  • 65 billion messages are sent on WhatsApp
  • 5 million Tweets are made

So in the above split up we’ve seen few top tier websites/apps which a normal netizen uses which consists of google,facebook,whatsapp,twitter etc other than these there are popular websites like youtube , instagram which uses much higher amount of data all of these companies uses big data for their uninterpreted services

--

--