Center for Statistical Genomics

Overview

Research on the genetic epidemiology and clinical course of cancer is increasingly challenged by big data and high complexity. We aim to develop novel statistical methods that address some of the major problems facing cancer genetic epidemiologists in the “omic” era and to illustrate their use for discovery, characterization, and prediction in various cancer studies. These methods address a wide range of analysis challenges, including feature selection, mediation, interaction, and characterization, all in the context of integrating prior biological knowledge with epidemiological or clinical data. Our goal is to provide tools for evaluating the impact of potential preventive or therapeutic interventions based on modifiable risk factors.

The majority of this work was funded by the National Cancer Institute #P01CA196569.


Project 1: Integration of Omic Data to Estimate Mediation or Latent Structures

This project will develop statistical approaches leveraging latent structures or mediating relationships for the integration of multiple omics data to better understand gene-to-phenotype relationships. The methods will be applicable to either individual-level data or summary statistics, and they will have a direct impact on applied investigations by facilitating a better understanding of potential mechanisms driving underlying cancer etiology.

Project 2: Integration of Omic Data in the Analysis of Gene × Environment Interaction

The availability of high-volume “omic” data, including gene expression, metabolome, methylation, and microbiome data, provides new opportunities to identify gene × environment (G×E) and omic × E interactions. This project will develop statistical methods that leverage omic data to improve power for identifying novel interactions and to clarify the biological mechanisms by which genes and exposures affect cancer outcomes.

Project 3: Statistical Methods for Genome Characterization

Understanding the role that genes play in life is a key issue in biomedical sciences, yet the overwhelming majority of sequences in public databases remain uncharacterized. Functional annotation is important for a variety of downstream analyses of genetic data, yet experimental characterization of function remains costly and slow. This project therefore proposes three Aims focused on improving our understanding of functional genomics, thereby allowing better translation of data to knowledge impacting human health.


Core A: Administrative Core

The administrative core provides scientific oversight, enhances communication among investigators, and supports all of the activities of this program. It works proactively to ensure synergy across the Research Projects and Cores around the theme of “integrative genomics” that is the focus of the program. The core is led by Jim Gauderman, PhD, and Kimberly Siegmund, PhD.

Funded by the National Cancer Institute, grant P01 CA196569-07.
