Ludwig Geistlinger, Marcel Ramos, Sehyun Oh, and Levi Waldron
CUNY School of Public Health 55 W 125th St, New York, NY 10027
Ludwig_Geistlinger@hms.harvard.edu
Marcel.Ramos@sph.cuny.edu
Sehyun.Oh@sph.cuny.edu
Levi.Waldron@sph.cuny.edu
This workshop gives an overview of Bioconductor solutions for the analysis of copy number variation (CNV) data. The workshop introduces Bioconductor core data structures for efficient representation, access, and manipulation of CNV data, and how to use these containers for structured downstream analysis of CNVs and integration with gene expression and quantitative phenotypes. Participants will be provided with code and hands-on practice for a comprehensive set of typical analysis steps including exploratory analysis, summarizing individual CNV calls across a population, overlap analysis with functional genomic regions and regulatory elements, expression quantitative trait loci (eQTL) analysis, and genome-wide association analysis (GWAS) with quantitative phenotypes. As an advanced application example, the workshop also introduces allele-specific absolute copy number analysis and how it is incorporated in cancer genomic analysis for the estimation of tumor characteristics such as tumor purity and ploidy.
Basic knowledge of R syntax
Familiarity with the SummarizedExperiment class
Familiarity with the GenomicRanges class
Familiarity with high-throughput genomic assays such as microarrays and next-generation sequencing
Familiarity with the biological definition of single nucleotide polymorphism (SNP) and copy number variation (CNV)
Activity | Time |
---|---|
Overview | 5m |
Data representation and manipulation | 20m |
Integrative downstream analysis (eQTL, GWAS, …) | 20m |
Allele-specific CN analysis in cancer | 15m |
GRangesList
and RaggedExperiment
to represent, access, and manipulate CNV dataGenomicRanges
/ RaggedExperiment
/ CNVRanger
architecture