Big Data Analysis and Distributed Programming module (AC51048)
Explore how large-scale data is stored, processed, and analysed using modern distributed systems
Big data underpins decision-making in fields ranging from social media and healthcare to finance and scientific research. As data volumes continue to grow, understanding how to manage, process, and extract value from massive datasets has become a key skill for computing professionals.
In this module, you’ll explore what makes data “big”, learn about the tools and technologies that make large-scale processing possible, and examine real-world applications of distributed systems in industry.
What you will learn
In this module, you will:
- define and analyse the characteristics of big data
- explore the requirements and challenges of managing large-scale datasets
- study distributed computing frameworks used to process and analyse big data
- investigate how programming languages can be applied to handle data-intensive tasks
- evaluate industry use cases and best practices in big data analytics
By the end of this module, you will be able to:
- compare and contrast different programming approaches for manipulating big data
- implement algorithms and techniques to analyse complex datasets
- discuss the use of big data in real-world, industry-driven contexts
- apply critical thinking and problem-solving skills to large-scale data challenges
Assignments / assessment
- coursework project (40%)
- final written exam (60%)
Teaching methods / timetable
You will learn through a combination of workshops and hands-on practical sessions. These include demonstrations, coding activities, and opportunities to apply distributed programming techniques to real-world datasets.
Courses
This module is available on the following courses: