Wednesday, March 11, 2026

Big Data 1 - What is Big Data

What is Big Data?

Big Data is a term used to describe extremely large and complex data sets that cannot be processed by traditional data management systems. This data can be categorised as structured, unstructured or semi-structured:

Structured data is data that can be organised neatly into the tables and spreadsheets of a database. It includes things like dates, email addresses, names, phone numbers, prices and much more. This type of data is easily processed by machine learning and other data management tools.

Unstructured data is anything that cannot be easily categorised and put into a table or spreadsheet. This type of data is complex and common in big data, making up an estimated 80-90% of all collected data. Examples range from the text of emails and social media posts to photos and smartphone activity.

Semi-structured data cannot be categorised as easily as structured data. However, it is more flexible than structured data and uses metadata such as tags and markers to help it fit into structured data sets.
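To make the three categories a bit more concrete, here is a small Python sketch. The customer "Ada", the field names and the sample values are all made up for illustration; the point is only to show how the same kind of information looks in each form.

```python
import csv
import io
import json

# Structured: fixed columns, every row has the same fields (like a table).
structured_csv = "name,email,price\nAda,ada@example.com,9.99\n"
rows = list(csv.DictReader(io.StringIO(structured_csv)))

# Semi-structured: keys and tags describe the values, but records
# do not have to share the same shape.
semi_structured = json.loads(
    '{"name": "Ada", "tags": ["customer", "uk"], "note": "prefers email"}'
)

# Unstructured: free text with no predefined fields at all.
unstructured = "Ada emailed to say the parcel arrived late but she loved the product."

print(rows[0]["price"])          # read a value by column name
print(semi_structured["tags"])   # read a value by key; the metadata guides us
print(unstructured)              # the text needs further processing to extract meaning
```

The structured and semi-structured records can be queried directly by column or key, while the unstructured sentence would need extra analysis before it could be placed in a table.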

The 5 Vs of Big Data

The 5 Vs of Big Data are the five main characteristics that define big data.

Volume - The word 'big' in big data is there for a reason. These days data is generated by almost everything we do in our daily lives, from browsing social media to going to the shops. This all contributes to huge amounts of data that cannot be processed easily.

Velocity - The speed at which data is generated and moves around is so fast that data often needs to be analysed in real time for organisations to make the most of it. 

Variety - Data can be almost anything and everything, so it is important that data is sorted according to its type, such as structured or unstructured.

Veracity - How reliable and accurate the data is. Because of the volume and variety in big data, the quality data that businesses need should be filtered from any poor-quality, inaccurate data.

Value - The reason so much data is collected is its value. Data should be valuable to the company or organisation that collects it, so that they can analyse it and use that information to improve their business, platform or product.
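As a rough illustration of the veracity point above, the Python sketch below filters poor-quality records out of a small data set. The `is_reliable` function, the field names and the quality rules are hypothetical; a real organisation would define its own checks.

```python
def is_reliable(record):
    """Return True if the record passes some basic (hypothetical) quality checks."""
    email = record.get("email") or ""
    price = record.get("price")
    has_valid_email = "@" in email
    has_valid_price = isinstance(price, (int, float)) and price >= 0
    return has_valid_email and has_valid_price


# Made-up sample records, some of which are incomplete or inaccurate.
records = [
    {"email": "ada@example.com", "price": 9.99},
    {"email": "", "price": 4.50},                # missing email
    {"email": "bob@example.com", "price": -1},   # impossible price
    {"email": "eve@example.com", "price": None}, # missing price
]

# Keep only the records that pass every check.
clean = [r for r in records if is_reliable(r)]
print(len(clean), "of", len(records), "records kept")
```

Only the records that pass every check are kept for analysis, which is the kind of filtering that helps the remaining data hold its value.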

https://www.mongodb.com/resources/basics/big-data-explained

https://www.ibm.com/think/topics/structured-vs-unstructured-data

https://cloud.google.com/learn/what-is-big-data

https://www.ibm.com/think/topics/big-data

3 comments:

  1. Really nice blog. Very professional looking.

  2. This is a clear and well-organized explanation of Big Data. The breakdown of structured, unstructured, and semi-structured data helps readers understand the differences, and the 5 Vs section effectively highlights the key characteristics that make Big Data unique and valuable.

  3. Such a well-written article, really enjoyed reading it.

