Big Data Analysis module (AC51011)
About the Module
This module looks at how we can store, manipulate and analyse big data. We define big data essentially as data that is high in volume, is captured at high velocity and contains high variety (in terms of structured and unstructured parts). We will look into the tools and techniques both for storage of this data, such as distributed databases and filesystems, and for processing this data, such as Apache Hadoop, Apache Spark, Apache Hive etc. A part of the module is also dedicated to programming in Scala, a functional language with built in concurrency that allows easy and rapid development of parallel data analytics.
There are 20 SCQF points available on this module.
|4||Introduction to Big Data|
|5||Infrastructure for Big Data Analytics|
|8||Big Data Processing Frameworks|
|9||Concurrent and Distributed Functional Programming|
|10||Advanced Data Processing Frameworks|
Assessment and Coursework
Coursework counts for 40% of the final module mark.
Two class tests count for 40% of the final module mark.
The final 20% of the final mark is covered by weekly quizzes.
Marking criteria are provided on My Dundee for all assignments so that you know what we are looking for when we are marking your coursework. Please ensure that you refer to these when completing assignments.
|Title||Week Given||Week Due||Effort Expected (hours)||Value (%)|
|Big Data Processing||8||12||20||20|
All course material is available on My Dundee. This includes copies of lecture materials, practical exercises, and assignments. The reading list for this module can be accessed from My Dundee and provides recommended materials for completing the module.