||The term ``big data'' is often used to describe vast collections of semi-structured data in the range of tera- or even petabytes.
Companies like Google and Amazon illustrate that mining and analyzing such collections can enable entirely new applications.
The lecture provides an overview of motivations to analyze big data and introduces techniques needed in the process.
This includes introductions to scripting languages, NoSQL databases, and Map/Reduce systems, which are accompanied by practical exercises.
Module Informatik 1 (Concepts of Computer Science + Programming Course 1)
Students have to pass at least 50% of the weekly theoretical and practical assignments
and a written exam at the end of the semester.
Storage Area Networks and Distributed File Systems
For implementation work, students will learn and use the Python programming language.
The students know and understand the basic concepts for dealing with very large data sets and are able to apply them in small projects.