636-0018-00L  Data Mining I

SemesterAutumn Semester 2018
LecturersK. M. Borgwardt
Periodicityyearly recurring course
Language of instructionEnglish


AbstractData Mining, the search for statistical dependencies in large databases, is of utmost important in modern society, in particular in biological and medical research. This course provides an introduction to the key problems, concepts, and algorithms in data mining, and the applications of data mining in computational biology.
ObjectiveThe goal of this course is that the participants gain an understanding of data mining problems and algorithms to solve these problems, in particular in biological and medical applications.
ContentThe goal of the field of data mining is to find patterns and statistical dependencies in large databases, to gain an understanding of the underlying system from which the data were obtained. In computational biology, data mining contributes to the analysis of vast experimental data generated by high-throughput technologies, and thereby enables the generation of new hypotheses.

In this course, we will present the algorithmic foundations of data mining and its applications in computational biology. The course will feature an introduction to popular data mining problems and algorithms, reaching from classification via clustering to feature selection. This course is intended for both students who are interested in applying data mining algorithms and students who would like to gain an understanding of the key algorithmic concepts in data mining.

Tentative list of topics:

1. Distance functions
2. Classification
3. Clustering
4. Feature Selection
Lecture notesCourse material will be provided in form of slides.
LiteratureWill be provided during the course.
Prerequisites / NoticeBasic understanding of mathematics, as taught in basic mathematics courses at the Bachelor's level.