# Search result: Catalogue data in Spring Semester 2020

Statistics Master The following courses belong to the curriculum of the Master's Programme in Statistics. The corresponding credits do not count as external credits even for course units where an enrolment at ETH Zurich is not possible. | ||||||

Specialization Areas and Electives | ||||||

Statistical and Mathematical Courses | ||||||

Number | Title | Type | ECTS | Hours | Lecturers | |
---|---|---|---|---|---|---|

636-0702-00L | Statistical Models in Computational Biology | W | 6 credits | 2V + 1U + 2A | N. Beerenwinkel | |

Abstract | The course offers an introduction to graphical models and their application to complex biological systems. Graphical models combine a statistical methodology with efficient algorithms for inference in settings of high dimension and uncertainty. The unifying graphical model framework is developed and used to examine several classical and topical computational biology methods. | |||||

Objective | The goal of this course is to establish the common language of graphical models for applications in computational biology and to see this methodology at work for several real-world data sets. | |||||

Content | Graphical models are a marriage between probability theory and graph theory. They combine the notion of probabilities with efficient algorithms for inference among many random variables. Graphical models play an important role in computational biology, because they explicitly address two features that are inherent to biological systems: complexity and uncertainty. We will develop the basic theory and the common underlying formalism of graphical models and discuss several computational biology applications. Topics covered include conditional independence, Bayesian networks, Markov random fields, Gaussian graphical models, EM algorithm, junction tree algorithm, model selection, Dirichlet process mixture, causality, the pair hidden Markov model for sequence alignment, probabilistic phylogenetic models, phylo-HMMs, microarray experiments and gene regulatory networks, protein interaction networks, learning from perturbation experiments, time series data and dynamic Bayesian networks. Some of the biological applications will be explored in small data analysis problems as part of the exercises. | |||||

Lecture notes | no | |||||

Literature | - Airoldi EM (2007) Getting started in probabilistic graphical models. PLoS Comput Biol 3(12): e252. doi:10.1371/journal.pcbi.0030252 - Bishop CM. Pattern Recognition and Machine Learning. Springer, 2007. - Durbin R, Eddy S, Krogh A, Mitchinson G. Biological Sequence Analysis. Cambridge university Press, 2004 | |||||

701-0104-00L | Statistical Modelling of Spatial Data | W | 3 credits | 2G | A. J. Papritz | |

Abstract | In environmental sciences one often deals with spatial data. When analysing such data the focus is either on exploring their structure (dependence on explanatory variables, autocorrelation) and/or on spatial prediction. The course provides an introduction to geostatistical methods that are useful for such analyses. | |||||

Objective | The course will provide an overview of the basic concepts and stochastic models that are used to model spatial data. In addition, participants will learn a number of geostatistical techniques and acquire familiarity with R software that is useful for analyzing spatial data. | |||||

Content | After an introductory discussion of the types of problems and the kind of data that arise in environmental research, an introduction into linear geostatistics (models: stationary and intrinsic random processes, modelling large-scale spatial patterns by linear regression, modelling autocorrelation by variogram; kriging: mean square prediction of spatial data) will be taught. The lectures will be complemented by data analyses that the participants have to do themselves. | |||||

Lecture notes | Slides, descriptions of the problems for the data analyses and solutions to them will be provided. | |||||

Literature | P.J. Diggle & P.J. Ribeiro Jr. 2007. Model-based Geostatistics. Springer. Bivand, R. S., Pebesma, E. J. & Gómez-Rubio, V. 2013. Applied Spatial Data Analysis with R. Springer. | |||||

Prerequisites / Notice | Familiarity with linear regression analysis (e.g. equivalent to the first part of the course 401-0649-00L Applied Statistical Regression) and with the software R (e.g. 401-6215-00L Using R for Data Analysis and Graphics (Part I), 401-6217-00L Using R for Data Analysis and Graphics (Part II)) are required for attending the course. | |||||

401-6222-00L | Robust and Nonlinear Regression | W | 2 credits | 1V + 1U | A. F. Ruckstuhl | |

Abstract | In a first part, the basic ideas of robust fitting techniques are explained theoretically and practically using regression models and explorative multivariate analysis. The second part addresses the challenges of fitting nonlinear regression functions and finding reliable confidence intervals. | |||||

Objective | Participants are familiar with common robust fitting methods for the linear regression models as well as for exploratory multivariate analysis and are able to assess their suitability for the data at hand. They know the challenges that arise in fitting of nonlinear regression functions, and know the difference between classical and profile based methods to determine confidence intervals. They can apply the discussed methods in practise by using the statistics software R. | |||||

Content | Robust fitting: influence function, breakdown point, regression M-estimation, regression MM-estimation, robust inference, covariance estimation with high breakdown point, application in principal component analysis and linear discriminant analysis. Nonlinear regression: the nonlinear regression model, estimation methods, approximate tests and confidence intervals, estimation methods, profile t plot, profile traces, parameter transformation, prediction and calibration | |||||

Lecture notes | Lecture notes are available | |||||

Prerequisites / Notice | It is a block course on three Mondays in June | |||||

401-8618-00L | Statistical Methods in Epidemiology (University of Zurich)No enrolment to this course at ETH Zurich. Book the corresponding module directly at UZH. UZH Module Code: STA408 Mind the enrolment deadlines at UZH: https://www.uzh.ch/cmsssl/en/studies/application/mobilitaet.html | W | 5 credits | 3G | University lecturers | |

Abstract | Analysis of case-control and cohort studies. The most relevant measures of effect (odds and rate ratios) are introduced, and methods for adjusting for confounders (Mantel-Haenszel, regression) are thoroughly discussed. Advanced topics such as measurement error and propensity score adjustments are also covered. We will outline statistical methods for case-crossover and case series studies etc. | |||||

Objective | ||||||

401-4626-00L | Advanced Statistical Modelling: Mixed Models | W | 4 credits | 2V | M. Mächler | |

Abstract | Mixed Models = (*| generalized| non-) linear Mixed-effects Models, extend traditional regression models by adding "random effect" terms. In applications, such models are called "hierarchical models", "repeated measures" or "split plot designs". Mixed models are widely used and appropriate in an aera of complex data measured from living creatures from biology to human sciences. | |||||

Objective | - Becoming aware how mixed models are more realistic and more powerful in many cases than traditional ("fixed-effects only") regression models. - Learning to fit such models to data correctly, critically interpreting results for such model fits, and hence learning to work the creative cycle of responsible statistical data analysis: "fit -> interpret & diagnose -> modify the fit -> interpret & ...." - Becoming aware of computational and methodological limitations of these models, even when using state-of-the art software. | |||||

Content | The lecture will build on various examples, use R and notably the `lme4` package, to illustrate concepts. The relevant R scripts are made available online. Inference (significance of factors, confidence intervals) will focus on the more realistic *un*balanced situation where classical (ANOVA, sum of squares etc) methods are known to be deficient. Hence, Maximum Likelihood (ML) and its variant, "REML", will be used for estimation and inference. | |||||

Lecture notes | We will work with an unfinished book proposal from Prof Douglas Bates, Wisconsin, USA which itself is a mixture of theory and worked R code examples. These lecture notes and all R scripts are made available from https://github.com/mmaechler/MEMo | |||||

Literature | (see web page and lecture notes) | |||||

Prerequisites / Notice | - We assume a good working knowledge about multiple linear regression ("the general linear model') and an intermediate (not beginner's) knowledge about model based statistics (estimation, confidence intervals,..). Typically this means at least two classes of (math based) statistics, say 1. Intro to probability and statistics 2. (Applied) regression including Matrix-Vector notation Y = X b + E - Basic (1 semester) "Matrix calculus" / linear algebra is also assumed. - If familiarity with [R](https://www.r-project.org/) is not given, it should be acquired during the course (by the student on own initiative). | |||||

447-6236-00L | Statistics for Survival Data | W | 2 credits | 1V + 1U | A. Hauser | |

Abstract | The primary purpose of a survival analysis is to model and analyze time-to-event data; that is, data that have as a principal endpoint the length of time for an event to occur. This block course introduces the field of survival analysis without getting too embroiled in the theoretical technicalities. | |||||

Objective | Presented here are some frequently used parametric models and methods, including accelerated failure time models; and the newer nonparametric procedures which include the Kaplan-Meier estimate of survival and the Cox proportional hazards regression model. The statistical tools treated are applicable to data from medical clinical trials, public health, epidemiology, engineering, economics, psychology, and demography as well. | |||||

Content | The primary purpose of a survival analysis is to model and analyze time-to-event data; that is, data that have as a principal endpoint the length of time for an event to occur. Such events are generally referred to as "failures." Some examples are time until an electrical component fails, time to first recurrence of a tumor (i.e., length of remission) after initial treatment, time to death, time to the learning of a skill, and promotion times for employees. In these examples we can see that it is possible that a "failure" time will not be observed either by deliberate design or due to random censoring. This occurs, for example, if a patient is still alive at the end of a clinical trial period or has moved away. The necessity of obtaining methods of analysis that accommodate censoring is the primary reason for developing specialized models and procedures for failure time data. Survival analysis is the modern name given to the collection of statistical procedures which accommodate time-to-event censored data. Prior to these new procedures, incomplete data were treated as missing data and omitted from the analysis. This resulted in the loss of the partial information obtained and in introducing serious systematic error (bias) in estimated quantities. This, of course, lowers the efficacy of the study. The procedures discussed here avoid bias and are more powerful as they utilize the partial information available on a subject or item. This block course introduces the field of survival analysis without getting too embroiled in the theoretical technicalities. Models for failure times describe either the survivor function or hazard rate and their dependence on explanatory variables. Presented here are some frequently used parametric models and methods, including accelerated failure time models; and the newer nonparametric procedures which include the Kaplan-Meier estimate of survival and the Cox proportional hazards regression model. The statistical tools treated are applicable to data from medical clinical trials, public health, epidemiology, engineering, economics, psychology, and demography as well. | |||||

401-8628-00L | Survival Analysis (University of Zurich)No enrolment to this course at ETH Zurich. Book the corresponding module directly at UZH. UZH Module Code: STA425 Mind the enrolment deadlines at UZH: http://www.uzh.ch/studies/application/mobilitaet_en.html | W | 3 credits | 1.5G | University lecturers | |

Abstract | The analysis of survival times, or in more general terms, the analysis of time to event variables is concerned with models for censored observations. Because we cannot always wait until the event of interest actually happens, the methods discussed here are required for an appropriate handling of incomplete observations where we only know that the event of interest did not happen within ... | |||||

Objective | ||||||

Content | During the course, we will study the most important methods and models for censored data, including - general concepts of censoring, - simple summary statistics, - estimation of survival curves, - frequentist inference for two and more groups, and - regression models for censored observations | |||||

Application Areas Students select one area of application and look for suitable courses in which quantitative methods and modeling play a role. They need the consent by the Advisor (http://stat.ethz.ch/~kalisch/) that the chosen courses are eligible in the category "Application Areas". For the category assignment of eligible courses keep the choice "no category" and take contact with the Study Administration Office (www.math.ethz.ch/studiensekretariat/staff/ekuenti) after having received the credits. The Study Administration Office needs the Advisor's consent. | ||||||

Seminar or Semester Paper | ||||||

Number | Title | Type | ECTS | Hours | Lecturers | |

401-4620-00L | Statistics Lab Number of participants limited to 27. | W | 6 credits | 2S | M. Kalisch, M. H. Maathuis, M. Mächler, L. Meier, N. Meinshausen | |

Abstract | "Statistics Lab" is an Applied Statistics Workshop in Data Analysis. It provides a learning environment in a realistic setting. Students lead a regular consulting session at the Seminar für Statistik (SfS). After the session, the statistical data analysis is carried out and a written report and results are presented to the client. The project is also presented in the course's seminar. | |||||

Objective | - gain initial experience in the consultancy process - carry out a consultancy session and produce a report - apply theoretical knowledge to an applied problem After the course, students will have practical knowledge about statistical consulting. They will have determined the scientific problem and its context, enquired the design of the experiment or data collection, and selected the appropriate methods to tackle the problem. They will have deepened their statistical knowledge, and applied their theoretical knowledge to the problem. They will have gathered experience in explaining the relevant mathematical and software issues to a client. They will have performed a statistical analysis using R (or SPSS). They improve their skills in writing a report and presenting statistical issues in a talk. | |||||

Content | Students participate in consulting meetings at the SfS. Several consulting dates are available for student participation. These are arranged individually. -During the first meeting the student mainly observes and participates in the discussion. During the second meeting (with a different client), the student leads the meeting. The member of the consulting team is overseeing (and contributing to) the meeting. -After the meeting, the student performs the recommended analysis, produces a report and presents the results to the client. -Finally, the student presents the case in the weekly course seminar in a talk. All students are required to attend the seminar regularly. | |||||

Lecture notes | n/a | |||||

Literature | The required literature will depend on the specific statistical problem under investigation. Some introductory material can be found below. | |||||

Prerequisites / Notice | Prerequisites: Sound knowledge in basic statistical methods, especially regression and, if possible, analysis of variance. Basic experience in Data Analysis with R. | |||||

401-3630-04L | Semester Paper Successful participation in the course unit 401-2000-00L Scientific Works in Mathematics is required. For more information, see www.math.ethz.ch/intranet/students/study-administration/theses.html | W | 4 credits | 6A | Supervisors | |

Abstract | Semester papers serve to delve into a problem in statistics and to study it with the appropriate methods or to compile and clearly exhibit a case study of a statistical evaluation. | |||||

Objective | ||||||

401-3630-94L | Semester Paper Successful participation in the course unit 401-2000-00L Scientific Works in Mathematics is required. For more information, see www.math.ethz.ch/intranet/students/study-administration/theses.html | W | 4 credits | 6A | Supervisors | |

Abstract | Semester papers serve to delve into a problem in statistics and to study it with the appropriate methods or to compile and clearly exhibit a case study of a statistical evaluation. | |||||

Objective | ||||||

401-3630-06L | Semester Paper Successful participation in the course unit 401-2000-00L Scientific Works in Mathematics is required. For more information, see www.math.ethz.ch/intranet/students/study-administration/theses.html | W | 6 credits | 9A | Supervisors | |

Abstract | Semester papers serve to delve into a problem in statistics and to study it with the appropriate methods or to compile and clearly exhibit a case study of a statistical evaluation. | |||||

Objective | ||||||

401-3620-20L | Student Seminar in Statistics: Inference in Non-Classical Regression Models Number of participants limited to 24. Mainly for students from the Mathematics Bachelor and Master Programmes who, in addition to the introductory course unit 401-2604-00L Probability and Statistics, have heard at least one core or elective course in statistics. Also offered in the Master Programmes Statistics resp. Data Science. | W | 4 credits | 2S | F. Balabdaoui | |

Abstract | Review of some non-standard regression models and the statistical properties of estimation methods in such models. | |||||

Objective | The main goal is the students get to discover some less known regression models which either generalize the well-known linear model (for example monotone regression) or violate some of the most fundamental assumptions (as in shuffled or unlinked regression models). | |||||

Content | Linear regression is one of the most used models for prediction and hence one of the most understood in statistical literature. However, linearity might too simplistic to capture the actual relationship between some response and given covariates. Also, there are many real data problems where linearity is plausible but the actual pairing between the observed covariates and responses is completely lost or at partially. In this seminar, we review some of the non-classical regression models and the statistical properties of the estimation methods considered by well-known statisticians and machine learners. This will encompass: 1. Monotone regression 2. Single index model 3. Unlinked regression 4. Partially unlinked regression | |||||

Lecture notes | No script is necessary for this seminar | |||||

Literature | In the following is the material that will read and studied by each pair of students (all the items listed below are available through the ETH electronic library or arXiv): 1. Chapter 2 from the book "Nonparametric estimation under shape constraints" by P. Groeneboom and G. Jongbloed, 2014, Cambridge University Press 2. "Nonparametric shape-restricted regression" by A. Guntuoyina and B. Sen, 2018, Statistical Science, Volume 33, 568-594 3. "Asymptotic distributions for two estimators of the single index model" by Y. Xia, 2006, Econometric Theory, Volume 22, 1112-1137 4. "Least squares estimation in the monotone single index model" by F. Balabdaoui, C. Durot and H. K. Jankowski, Journal of Bernoulli, 2019, Volume 4B, 3276-3310 5. "Least angle regression" by B. Efron, T. Hastie, I. Johnstone, and R. Tibshirani, 2004, Annals of Statsitics, Volume 32, 407-499. 6. "Sharp thresholds for high dimensional and noisy sparsity recovery using l1-constrained quadratic programming (Lasso)" by M. Wainwright, 2009, IEEE transactions in Information Theory, Volume 55, 1-19 7."Denoising linear models with permuted data" by A. Pananjady, M. Wainwright and T. A. Courtade and , 2017, IEEE International Symposium on Information Theory, 446-450. 8. "Linear regression with shuffled data: statistical and computation limits of permutation recovery" by A. Pananjady, M. Wainwright and T. A. Courtade , 2018, IEEE transactions in Information Theory, Volume 64, 3286-3300 9. "Linear regression without correspondence" by D. Hsu, K. Shi and X. Sun, 2017, NIPS 10. "A pseudo-likelihood approach to linear regression with partially shuffled data" by M. Slawski, G. Diao, E. Ben-David, 2019, arXiv. 11. "Uncoupled isotonic regression via minimum Wasserstein deconvolution" by P. Rigollet and J. Weed, 2019, Information and Inference, Volume 00, 1-27 | |||||

401-3940-20L | Student Seminar in Mathematics and Data: Optimization of Random Functions Number of participants limited to 12. | W | 4 credits | 2S | A. Bandeira | |

Abstract | More information at course webpage: https://people.math.ethz.ch/~abandeira/Spring2020.StudentSeminar.html | |||||

Objective | ||||||

363-1100-00L | Risk Case Study Challenge Does not take place this semester. | W | 3 credits | 2S | A. Bommier, S. Feuerriegel | |

Abstract | This seminar provides master students at ETH with the challenging opportunity of working on a real risk modelling and risk management case in close collaboration with a Risk Center Partner Company. For the Spring 2019 Edition the Partner will be Zurich Insurance Group. | |||||

Objective | Students work on a real risk-related case of a business relevant topic provided by experts from Risk Center partners. While gaining substantial insights into the risk modeling and management of the industry, students explore the case or problem on their own, working in teams, and develop possible solutions. The cases allow students to use logical problem solving skills with emphasis on evidence and application and involve the integration of scientific knowledge. Typically, the risk-related cases can be complex, cover ambiguities, and may be addressed in more than one way. During the seminar students visit the partners’ headquarters, conduct interviews with members of the management team as well as internal and external experts, and present their results. | |||||

Content | Get a basic understanding of o The insurance and reinsurance business o Risk management and risk modelling o The role of operational risk management Get in contact with industry experts and conduct interviews on the topic. Conduct a small empirical study and present findings to the company | |||||

Prerequisites / Notice | Please apply for this course via the official website (www.riskcenter.ethz.ch/education/lectures/risk-case-study-challenge-.html). Apply no later than February 15, 2019. The number of participants is limited to 14. | |||||

GESS Science in Perspective Two credits are needed from the "Science in Perspective" programme with language courses excluded if three credits from language courses have already been recognised for the Bachelor's degree. see Link (Eight credits must be acquired in this category: normally six during the Bachelor’s degree programme, and two during the Master’s degree programme. A maximum of three credits from language courses from the range of the Language Center of the University of Zurich and ETH Zurich may be recognised. In addition, only advanced courses (level B2 upwards) in the European languages English, French, Italian and Spanish are recognised. German language courses are recognised from level C2 upwards.) | ||||||

» see Science in Perspective: Type A: Enhancement of Reflection Capability | ||||||

» Recommended Science in Perspective (Type B) for D-MATH | ||||||

» see Science in Perspective: Language Courses ETH/UZH | ||||||

Master's Thesis | ||||||

Number | Title | Type | ECTS | Hours | Lecturers | |

401-2000-00L | Scientific Works in MathematicsTarget audience: Third year Bachelor students; Master students who cannot document to have received an adequate training in working scientifically. | O | 0 credits | Ö. Imamoglu, E. Kowalski | ||

Abstract | Introduction to scientific writing for students with focus on publication standards and ethical issues, especially in the case of citations (references to works of others.) | |||||

Objective | Learn the basic standards of scientific works in mathematics. | |||||

Content | - Types of mathematical works - Publication standards in pure and applied mathematics - Data handling - Ethical issues - Citation guidelines | |||||

Lecture notes | Moodle of the Mathematics Library: https://moodle-app2.let.ethz.ch/course/view.php?id=519 | |||||

Prerequisites / Notice | Directive Link | |||||

401-2000-01L | Lunch Sessions – Thesis Basics for Mathematics StudentsDetails and registration for the optional MathBib training course: https://www.math.ethz.ch/mathbib-schulungen | Z | 0 credits | Speakers | ||

Abstract | Optional course "Recherchieren in der Mathematik" (held in German) by the Mathematics Library. | |||||

Objective | ||||||

401-4990-02L | Master's Thesis Only students who fulfil the following criteria are allowed to begin with their Master's thesis: a. successful completion of the Bachelor's programme; b. fulfilling of any additional requirements necessary to gain admission to the Master's programme; c. They have acquired at least 16 credits in the category ‘Core courses‘. Successful participation in the course unit 401-2000-00L Scientific Works in Mathematics is required. For more information, see www.math.ethz.ch/intranet/students/study-administration/theses.html | O | 30 credits | 57D | Supervisors | |

Abstract | The master's thesis concludes the study programme. Thesis work should prove the students' ability to independent, structured and scientific working. | |||||

Objective | Die Studierenden sollen mit der Master-Arbeit, die den Abschluss des Studiengangs bildet, ihre Fähigkeit zu selbständiger, strukturierter und wissenschaftlicher Tätigkeit unter Beweis stellen. |

- Page 2 of 3 All