Computer labs

The goal of these labs is to introduce you to, and build up your proficiency with, R and RStudio. You’ll be using these throughout the course, both to learn the statistical concepts discussed in the lectures and also to analyze real data and come to informed conclusions. To straighten out which is which:

The R language is the standard statistical tool used by most statisticians at universities. One reason data scientists and statisticians like to use R is that all known statistical techniques are available in R. Whenever someone develops a new statistical technique, one of the first things they do is produce an R package so that the technique becomes available in R. The reason they do this for R rather than for one of the commercial alternatives is that R is open source and freely available to all, and of course that the previous methods on which the new method builds are already available in R.

Feeling comfortable using R is not only important for this module and any further statistics modules you may take at the Department of Mathematics of the University of York, it can also be an important factor for your future career (see the article “R skills attract the highest salaries”. Even though R is specially designed for statistics, it is consistently in the list of the top ten most important programming languages compiled by the IEEE spectrum magazine.

As the labs progress, you are encouraged to explore beyond what the labs dictate; a willingness to experiment will make you a much better programmer.

Assessment

Important

The five main labs (imaginatively named “Lab 1” to “Lab 5”) count for credit: your best 4 out of 5 will marks will count for 20% of the module mark.

Each lab will have an accompanying quiz. As you work through each lab you will find places where you are asked to perform a calculation and then enter your mark in the appropriate quiz.

You can have two attempts at each quiz; the mark from your best attempt will become your final grade for that lab.

The Intro lab does not count for credit, but you should attempt this in the first week of the semester to make sure that:

  • you can successfully access R
  • you know how to enter answers in the accompanying quiz.