Biostatistics Student Association Computing Workshop
Statistical programming in R
These notes cover some aspects of:
- Downloading data in R
- Managing large desktop-sized files (a few GB in size)
- Assessing parsing errors from
readr
- Manipulating data quickly with
dplyr
anddata.table
(draft) - Implementing the split, apply, combine approach with
dplyr
, and reshaping and plotting withreshape2
andggplot2
- Plotting with
ggvis
I might also put together notes on visualizing missing data with pheatmap
and mi
.
To run tutorials 1-4, you’ll either need to replicate the directory structure on your computer or change dataPath
(at the top of each tutorial) to wherever you downloaded the data. One way to replicate the directory structure is to clone the repository. If you clone the repository, you can run the code directly from the markdown files.
Thanks, and happy computing!