Introduction to Bayesian Data Analysis

Bayesian data analysis is increasingly becoming the tool of choice for many data-analysis problems.

This free course on Bayesian data analysis will teach you basic ideas about random variables and probability distributions, Bayes' rule, and its application in simple data analysis problems. You will learn to use the R package brms (which is a front-end for the probabilistic programming language Stan). The focus will be on regression modeling, culminating in a brief introduction to hierarchical models (otherwise known as mixed or multilevel models).

This course is appropriate for anyone familiar with the programming language R and for anyone who has done some frequentist data analysis (e.g., linear modeling and/or linear mixed modeling) in the past.

自三月 13, 2023起开始自学
语言: English
Advanced, Beginner, Big Data and AI

课程信息

Introduction: Why are Bayesian methods important for data analysts?

Here are some of the advantages of Bayesian methods over the standard frequentist approach used in data analysis:

  • Prior knowledge/expertise can be incorporated into the data analysis
  • Models can be flexibly specified to reflect the assumed generative process
  • The results of the analysis – the posterior distributions of the parameters of interest – have an intuitive interpretation
  • Hypothesis testing can be carried out in a more meaningful manner than the standard used null hypothesis significance testing

Prerequisites: Who is this course for?

We assume the following in this course:

  • Basic familiarity with the programming language R, openHPI offers a free R course for Beginners (in German)
  • Experience with data analysis using linear models
  • It is helpful (but not necessary) to have had some exposure to linear mixed models using the R library lme4
  • High-school mathematics (pre-calculus)
  • Some basic concepts from probability theory (sum and product rule, conditional probability)

This course is not appropriate for participants who don't know R programming and who have no experience at all with data analysis.

Course outcomes: What will you learn from this course?

  • Some basic ideas relating to random variables
  • Some fundamental properties of probability distributions
  • Application of Bayes' rule in data analysis
  • The concept of likelihood and its role in Bayesian statistical modeling
  • Bayesian regression models using brms (a front-end for Stan)
  • How to visualize and interpret prior and posterior distributions
  • How to generate prior and posterior predictive distributions for evaluating models
  • How to interpret the results of simple regression models

After completing this course, you will be in a good position to learn how to use more advanced Bayesian methods, such as hierarchical models, finite mixture models, multinomial processing tree models, measurement error models, etc.

Course structure: How do you plan this course?

This four-week course consists of

  • A series of weekly video lectures
  • Self-tests, weekly homework, and programming tasks
  • Supplementary reading materials

We expect a weekly time commitment of 5-10 hours to complete the course, depending on your prior knowledge.

Recommended reading: Textbook

The course follows the structure of an online textbook, which will be published by CRC Press soon. You can view the textbook here.

你将学到什么

  • Bayesian statistics
  • Data analysis
  • Bayesian regression models using brms

本课程适用于谁

  • Students
  • Researchers
  • Data Analysts
  • Scientists
  • Anyone who wishes to do data analysis

课程内容

  • Week 0 - Initial Setup:

    Please install the latest versions of R and RStudio, rstan, brms, and other necessary packages in R. In order to get the most out of this course, please read the textbook chapters 1-4 (the textbook link is provided below) as the course progresses. Each chapter belongs to the corresponding week in this course.
  • Week 1 - Introduction:

    Learn the foundational ideas about random variables and probability distributions. Reading: Chapter 1 of the textbook.
  • Week 2 - Bayesian data analysis:

    Understand Bayes' rule, derive the posterior using Bayes' rule; visualize the prior, likelihood, and posterior; distinguish between the prior, likelihood, and posterior; incorporate prior knowledge into the analysis. Reading: Chapter 2.
  • Week 3 - Computational Bayesian data analysis:

    Derive the posterior through sampling; build a simple linear regression model using brms; visualize prior predictive distributions, perform sensitivity analysis and posterior predictive checks. Reading: Chapter 3.
  • Week 4 - Bayesian regression and hierarchical models:

    Perform simple linear regressions using the normal and binomial likelihoods to answer the following research questions: (i) Does attentional load affect pupil size? (ii) Does trial id affect response times? (iii) Does set size affect recall accuracy? Take a brief look-ahead at linear mixed models. Reading: Chapter 4 and up to section 5.3 of Chapter 5.
  • Final Exam:

    Final Exam
  • I like, I wish:

    Please provide your feedback on the course.

订阅本课程

该课程是免费的。 只需在openHPI上注册一个帐户并参加课程!
现在注册吧

Learners

Current
Today
4,413
Course End
3月 13 2023
3,598
Course Start
1月 25 2023
2,737

评分

本课程已由303位用户进行了五分制评分,平均得分为4.15

证书要求

  • 课程证书 授予者需要至少取得课程总分的百分之 50%
  • 参与证明 授予者需要至少学习了所有课程资料的百分之 50%

欲知详情,请访问证书指南.

该课程提供者

Prof. Dr. Shravan Vasishth

Shravan Vasishth is professor of linguistics at the University of Potsdam, Germany. His background is in Statistics, Computer Science, Linguistics, and Japanese. He is a chartered statistician with the Royal Statistical Society, UK. For more details about him and his research, see vasishth.github.io.

Dr. Anna Laurinavichyute

I am a cognitive scientist with a PhD in cognitive science from the University of Potsdam, Germany. I am interested in data science and computational cognitive modeling.