Easier analysis of methylation array data

Methylation Array Data Analysis Tips

Since the release of the first Illumina Infinium Methylation BeadChips, the user community has been instrumental in their widespread adoption by developing software packages for advanced methylation data analysis. While Illumina software is used by core laboratories for basic quality control, third party Bioconductor packages offer the most functionality for downstream analysis.

For BeadChip processing laboratories

 

DRAGEN Array Methylation QC

The cloud-based DRAGEN Array Methylation QC software delivers high-throughput and quantitative reporting of control metrics for Infinium Methylation microarrays. Read more about the sample QC methods used to determine data quality.

 

GenomeStudio Methylation Module and BeadArray Controls Reporter

The GenomeStudio Methylation Module can be used for basic QC of methylation beadchips. The Controls Dashboard in GenomeStudio is used to visualize sample-independent and sample-dependent controls, whereas the BeadArray Controls reporter (BACR) provides a quantitative analysis of controls for fast results. 

  DRAGEN Array
Methylation QC 
GenomeStudio
Methylation Module
BeadArray Controls Reporter
Deployment Cloud–based
Graphical user interface
Installation on local hardware
Graphical user interface
Installation on local hardware
Graphical user interface
Key uses High-throughput, quality control analysis Visual quality check Quantitative quality check
QC capabilities Algorithm-based analysis of 21 quantitative control metrics with adjustable thresholds
Data summary plots
Proportion of CG probes passing with an adjustable p-value threshold
Control plots (requires manual visual evaluation)
Number of probes passing
Algorithm-based analysis of 21 quantitative control metrics with adjustable thresholds
Access Included with array. Accessed on BaseSpace Sequence Hub. Compute and storage iCredit charges apply. Refer to the user guide Included with array.
Download from support site
Included with array.
Download from support site
BeadChip compatibility All methylation arrays* All methylation arrays Infinium MethylationEPIC

* Availability of recommended thresholds and control probes varies

SeSAMe provides end-to-end data analysis of Infinium Methylation BeadChips including advanced QC, updated normalization techniques, differential methylation analysis, and visualization capabilities.

The following video tutorial series, led by SeSAMe developer Wanding Zhou, provides step-by-step tutorials to familiarize new users with data analysis on SeSAMe:

Installing SeSAMe

In this video you’ll learn how to install SeSAMe to perform data analysis for the Infinium DNA methylation beadchip. All the scripts and links can be found on this SeSAMe Installation Github page. If you haven't installed R on your computer yet, please do so before watching this video.

Pre-processing Infinium Methylation data

This video tutorial will show you how to process IDATs into DNA methylation level data, or the beta values. This tutorial uses two public datasets from Gene Expression Omnibus or GEO. You’ll learn how to read in the signal intensity data, perform quality control, assess results, and more.

Modeling differential methylation

In this video, we’ll go over some of the linear modeling-based frameworks for analyzing differential DNA methylation. You’ll learn how to load packages and data, what to consider and check for prior to modeling, perform linear modeling, and investigate biological questions following test results.

Inferring sample metadata

This video tutorial will demonstrate how to use the SeSAMe software to infer sample metadata. This metadata can be sex, age, DNA copy number or cell fraction, or other metadata. This tutorial demonstrates various inferences to provide a broad understanding of the process.

Additional information and full documentation can be found on the SeSAMe Bioconductor page.

Minfi is a comprehensive package for methylation data analysis developed by Kasper Hansen. Github packages may be available to support the use of minfi with newer Infinium Methylation BeadChips. Visit the minfi Bioconductor page for documentation including user guides and installation instructions. Archived tutorial videos using 450K data can be found here, and an introduction video by Kasper Hansen can be found here.

Bioconductor hosts a suite of publicly available software programs to analyze Infinium methylation array data.

The table below provides a few examples of analysis packages and their function:

Software package Function
ChAMP Comprehensive R package for Epigenome-Wide Association Study (EWAS), providing pre-processing, differential calling, GSEA and interactive visualization
Rnbeads End-to-end methylation array analysis: includes quality control, data preprocessing, data tracks & tables, exploratory analysis, and differential methylation
Conumee Performs copy number variation (CNV) analysis using Illumina 450K or EPIC Methylation Arrays
wateRmelon Provides a set of tools for importing, quality control, and normalizing Illumina DNA methylation array data
bumphunter Detects differentially methylated regions in EWAS based on ‘bump hunting’ statistical method

For additional information and methylation array data processing packages, visit Bioconductor.

The Columbia Epigenetics Boot Camp offers an intensive hands-on training on data analysis techniques for methylation arrays, and provides an overview of considerations when designing DNA methylation studies.

Interested in receiving newsletters, case studies, and information on genomic analysis techniques? Enter your email address.