17 Overview

AnVIL provides a secure, scalable computing platform for controlled access data. Hosting over 7 petabytes of data across 113 dbGaP accessions and 103 consent codes, AnVIL inverts the model of genomics data sharing by providing tools such as Jupyter, RStudio, Galaxy, and WDL Workflows. This AnVIL Demo introduces datasets from the MAGE and 1000 Genomes Project (1KGP) projects to showcase notable features of working in AnVIL including how to import data and export results between these various tools.

This demonstration will specifically explore workspaces, noting how cloned workspaces differ from the original; run an analysis in a Jupyter Notebook, run an analysis using RStudio, and submit a workflow in Galaxy. All analyses will showcase human genetic variation concepts and results from the MAGE and 1KGP datasets.

17.1 Learning Objectives

  1. Explore MAGE Workspace
  2. Analysis with Jupyter/Terminal
  3. Bioconductor with RStudio
  4. Workflows with Galaxy