Workflow Diary Example

Author

Your Name

Published

April 22, 2023

Getting Started

How to use this template

This workflow diary template will be used for you to record your experiences, attempts, and results while you apply the Bayesian workflow steps during each week of the seminar. Each section of the template corresponds to a step in the workflow that you will be applying, and specifies a suggested minimum ‘goal’ that you should aim to complete. How you decide to implement the workflow steps and achieve these goals is completely up to you.

There are no ‘wrong’ answers here, and you are not being graded on whether you implement the workflow steps ‘correctly’. This diary should be a record of your learning and experience with each workflow step, and will evolve as you learn more about the Bayesian workflow. If there are any issues or questions that you have, please record them in the diary so that we can discuss them during the in-person seminar sessions.

Remember that you should be able to re-run, modify, or extend any of your code if requested during the interactive presentations.

Setting up your Environment

There are a number of packages that you will need throughout this course. If you have any issues completing the setup steps below, please let us know.

Stan: `cmdstanr`, `cmdstan`, and `brms`

For estimating Bayesian models in R, we recommend the cmdstanr package. You should be familiar with this if you have previously completed BDA. You can install cmdstanr, if it is not already installed, by running the following:

# The 'cmdstanr' package is not available on CRAN, so you will need to install it from GitHub:
if (!requireNamespace("remotes", quietly = TRUE)) {
  install.packages("remotes")
}
if (!requireNamespace("cmdstanr", quietly = TRUE)) {
  remotes::install_github("stan-dev/cmdstanr")
}

The cmdstanr package is simply a wrapper/interface for the CmdStan command-line interface to Stan, which also needs to be installed. You can run the following code to check whether CmdStan is installed and up-to-date, and install it if necessary:

curr_ver <- cmdstanr::cmdstan_version(error_on_NA = FALSE)
if (is.null(curr_ver) || curr_ver < cmdstanr:::latest_released_version()) {
  cmdstanr::check_cmdstan_toolchain(fix = TRUE)
  cmdstanr::install_cmdstan()
}
rm(curr_ver)

A more user-friendly interface to cmdstanr is the brms package, which provides a formula-based interface to specify Bayesian models. You can install brms by running:

if (!requireNamespace("brms", quietly = TRUE)) {
  install.packages("brms")
}

Post-Processing and Diagnostics

There are also some packages that you are likely to find useful for post-processing and diagnosing your models. These include bayesplot, posterior, loo, and priorsense. The first three would have been installed with brms above, and you can install priorsense by running the following:

# The 'priorsense' package is not available on CRAN
# so you will need to install it from GitHub:
if (!requireNamespace("priorsense", quietly = TRUE)) {
  remotes::install_github("n-kall/priorsense")
}

Bayesian Workflow

Loading Data and Preprocessing

Use this section of the diary for loading your dataset of choice and performing any necessary preprocessing. This could include cleaning the data, transforming variables, or creating new variables. Remember that you should be able to re-run or modify this code if needed during the interactive presentations.

# Your code here