High-throughput sequencing of Chromatin-Immunoprecipitated libraries (ChIP-seq) and of bisulfite converted DNA (WGBS) have become increasingly common and have largely supplanted microarrays for chromatin and DNA methylation profiling. When processed appropriately, ChIP-seq data provides base-pair resolution representations of transcription factor DNA-binding events and nucleosome (histone) modifications genome-wide. Similarly, WGBS can provide a quantitative genome wide profile of cytosine methylation.
The CBW has developed a 4-day course providing an introduction to histone ChIP-seq and WGBS data analysis followed by integrated tutorials demonstrating the use of open source ChIP-Seq and WGBS analysis packages. The tutorials are designed as self-contained units that include example data and detailed instructions for installation of all required bioinformatics tools (FASTQC, BWA, MACS2, samtools, Picard, BisSNP). The course also includes an overview of integrative epigenomic tools that have been developed to explore ChIP-Seq and WGBS data together with other epigenomic datasets such as RNA-seq, DHS-seq and ATAC-seq.
This is a distributed workshop. It is held simultaneously at two locations, with faculty and TAs present at both venues. Lectures are broadcast back and forth between sites. When applying, please choose the location where you would like to attend.
Participants will gain practical experience and skills to be able to:
- Align ChIP-seq and WGBS sequence data to a reference genome (required)
- Identify narrow and broad peaks from ChIP-seq data
- Identify methylated levels from WGBS data
- Visualize and summarize the output of ChIP-Seq and WGBS analyses
- Explore integrative tools for epigenomic data sets
- Discover existing online resources to facilitate epigenomic data analysis
Graduates, postgraduates and PIs working with or about to embark on an analysis of epigenomic data and in particular of ChIP-Seq and Whole-Genome Bisulfite Sequencing (WGBS) experiments. Attendees may be familiar with some aspect of ChIP-Seq or WGBS data analysis or have no direct experience. We will be working on human data sets and only covering methods that require a reference genome.
Basic familiarity with Linux environment and S, R, or Matlab.
You will also require your own laptop computer. Minimum requirements: 1024×768 screen resolution, 1.5GHz CPU, 2GB RAM, 10GB free disk space, recent versions of Windows, Mac OS X or Linux (Most computers purchased in the past 3-4 years likely meet these requirements).
This workshop requires participants to complete pre-workshop tasks and readings.
Module 1: Introduction to ChIP Sequencing and Analysis (Martin Hirst)
- Basic introduction to biology of ChIP-seq
- Experimental design and analysis considerations
- Commonly asked questions
Lab Practical (Edmund Su):
- Configure, launch, and connect to a cloud instance
- Introduction to the test data
- Run FastQC
- Examine and understand the format of raw FastQ files
- Obtain reference genomes (fasta)
- Perform pre-alignment QC
Module 2: ChIP-Seq Alignment, Peak Calling, and Visualization (Martin Hirst)
- Use of BWA and reference genome formating
- Introduction to the BAM format
- Basic manipulation of BAMs with samtools, Picard, etc.
- Use of MACS3 to call narrow and broad peaks
- Visualization of ChIP-seq alignments and peaks with IGV
Lab Practical (Edmund Su):
- Run BWA with suitable parameters
- Use samtools to explore the features of the SAM/BAM format and perform basic manipulation of these alignment files (view, sort, index, manipulate headers, extract data, etc.)
- Use MACS3 to call narrow and broad peaks.
- Visualize results in IGV.
Module 3 : ChIP-Seq differential analysis (Edmund Su)
- Calling differential and shared peaks
- Utilizing triplicates for differential analysis
- Common pitfalls and false positives
Lab Practical (Edmund Su):
- Performing differential peak analysis with Bedtools
- Performing advance differential peak analysis using Bedtools/Deeptools/R
- Performing differential peak analysis with DiffBind
Module 4: Introduction to WGBS and Analysis (Guillaume Bourque)
- Basic introduction to biology of WGBS
- Experimental design and analysis considerations
- Perform in silico conversion of the reference genome
- Convert reads and map onto the genome
- Pileup mapped reads and call methylation profile
Lab Practical (Jose Hector Galvez):
- Prepare in silico converted reference genome and reads
- Map reads using Bismark
- Post-process reads using samtools
- Call methylation profiles
- Explore and visualize results in IGV
- Perform differential methylation analyses using the methylkit and DSS libraries
Module 5: Online Resources and Downstream Analysis (David Bujold)
- Explore existing epigenomic data resources such as from the International Human Epigenome Consortium, and ENCODE
- Cover existing data sharing standards
- GO terms enrichment using GREAT
- Motifs in ChIP-seq peaks using HOMER
Lab Practical (David Bujold)
- Explore available reference epigenomic data sets using the IHEC Data Portal and other resources
- Look for motifs (using HOMER)
- GO enrichment (using GREAT)
- Introduction to Galaxy
Duration: 4 days
Start: Aug 26, 2024
End: Aug 29, 2024
Status: Cancelled
Workshop Ended
Canadian Bioinformatics Workshops promotes open access. Past workshop content is available under a Creative Commons License.
Posted on: