Big Data Analysis and Distributed Programming module (AC51048)

Explore how large-scale data is stored, processed, and analysed using modern distributed systems

Credits: 20
Module code: AC51048
Level: 5
Semester: Semester 2
School: School of Science and Engineering
Discipline: Computing

Big data underpins much of the modern world, from social media and healthcare to finance and scientific research. As data volumes continue to grow, the ability to manage, process, and extract value from massive datasets has become a key skill for computing professionals.

In this module, you’ll explore what makes data “big”, learn about the tools and technologies that make large-scale processing possible, and examine real-world applications of distributed systems in industry.

What you will learn

In this module, you will:

  • define and analyse the characteristics of big data
  • explore the requirements and challenges of managing large-scale datasets
  • study distributed computing frameworks used to process and analyse big data
  • investigate how programming languages can be applied to handle data-intensive tasks
  • evaluate industry use cases and best practices in big data analytics

By the end of this module, you will be able to:

  • compare and contrast different programming approaches for manipulating big data
  • implement algorithms and techniques to analyse complex datasets
  • discuss the use of big data in real-world, industry-driven contexts
  • apply critical thinking and problem-solving skills to large-scale data challenges

Assignments / assessment

  • coursework project (40%)
  • final written exam (60%)

Teaching methods / timetable

You will learn through a combination of workshops and hands-on practical sessions. These include demonstrations, coding activities, and opportunities to apply distributed programming techniques to real-world datasets.

Courses

This module is available on the following courses: