Bioinformatics on Big Data: Cloud Computing on the Human Genome (2016)

Course Objectives

A poster announcing this workshop can be found here

Several big data genomics projects, including the ICGC, are deciding to host their data in the Cloud and to provide access to configurable virtual machines (VM) with which to compute on this data (thereby removing the need to purchase and maintain your own compute cluster). Similarly, many labs are moving to renting compute time from various cloud providers. Analysis of a single genome or a smaller selected subset differs from analysis of multiple genomes, particularly in the compute infrastructure required.

To navigate through working in this new compute space, the CBW has developed a 2-day course providing an introduction to security and privacy issues related to working on human genome data and the processes necessary to access such data. After reviewing cloud computing infrastructure, the workshop will also provide a hands-on introduction to launching and configuring your own virtual machine (VM), accessing cloud-based data sets, and how to scale up the number of VMs to meet your analysis needs. Customizing VMs with your own tools and cloud-computing best practices will also be discussed.

Participants will gain practical experience and skills to be able to:

  • Launch their own virtual machine (VM)
  • Configure a VM with prepackaged tools
  • Pull in data sets from Cloud repositories
  • Follow best practices in data and workflow management
  • Customize a VM with their own tools
  • Scale up their VM to meet their analysis needs

Course Material

Open Access LogoCanadian Bioinformatics Workshops promotes open access. Past workshop content is available under a Creative Commons License.

Module 1 - Introduction to Cloud Computing and Virtual Machines (Faculty: Francis Ouellette)

application/pdf iconPDF (1MB) | application/vnd.openxmlformats-officedocument.presentationml.presentation iconPPT (10MB) |  iconYouTube (0KB)

Module 2 - Ethics of data usage & security best practices (2016) (Faculty: Mark Phillips)

application/pdf iconPDF (5MB) | application/vnd.openxmlformats-officedocument.presentationml.presentation iconPPT (4MB) |  iconYouTube (0KB)

Module 3 - Working Reproducibly in the Cloud (Faculty: George Mihaiescu)

application/pdf iconPDF (10MB) | application/vnd.openxmlformats-officedocument.presentationml.presentation iconPPT (9MB) |  iconYouTube (0KB)

Module 4 - Sharing and Scaling a VM (2016) (Faculty: George Mihaiescu)

application/pdf iconPDF (1MB) | application/vnd.openxmlformats-officedocument.presentationml.presentation iconPPT (3MB) |  iconYouTube (0KB)

Module 5 - Big Data Analysis in the Cloud (2016) (Faculty: Christina Yung, Faculty: Solomon Shorser)

application/pdf iconPDF (3MB) | application/vnd.openxmlformats-officedocument.presentationml.presentation iconPPT (3MB) |  iconYouTube (0KB)