Posts

Showing posts from January, 2020

Introduction to Big Data

What is Big Data?

As the name suggests, Big Data refers to data sets that are much larger in volume than conventional data sets and can't be processed using traditional data processing software. This data is collected by organizations and used for machine learning projects, predictive modeling and other such applications. The 3 V's associated with Big Data give us a better understanding of the concept. They are:

Volume - In the case of Big Data, the volume of data being processed is very high. It can range anywhere from tens of terabytes for some companies to hundreds of petabytes for others.

Velocity - The rate at which data is received is called velocity. Velocity is high in the case of Big Data, as it deals with data generated in real time or near real time, in scenarios that require real-time evaluation.

Variety - As opposed to traditional data sets, which are structured, Big Data mostly consists of data which is mostl...