Chapter 14 Using Galaxy on AnVIL

In the next few steps, you will walk through how to get set up to use Galaxy on the AnVIL platform. AnVIL is centered around different “Workspaces”. Each Workspace functions almost like a mini code laboratory - it is a place where data can be examined, stored, and analyzed. The first thing we want to do is to copy or “clone” a Workspace to create a space for you to experiment.

Use a web browser to go to the AnVIL website. In the browser type:

anvil.terra.bio

Tip At this point, it might make things easier to open up a new window in your browser and split your screen. That way, you can follow along with this guide on one side and execute the steps on the other.

Your instructor will give you information on which workspace you should clone. After logging in, click “View Workspaces”. Select the “Public” tab. In the top search bar type the activity workspace.

Clone the workspace by clicking the teardrop button (teardrop button). And selecting “Clone”.

Screenshot showing the teardrop button. The button has been clicked revealing the "clone" option. The Clone option and the teardrop button are highlighted.

In the first box, give your Workspace clone a name by adding an underscore (“_”) and your name. For example, “SARS-CoV-2-Genome_Ava_Hoffman”. Next, select the Billing project provided by your instructor. Leave the bottom two boxes as-is and click “CLONE WORKSPACE”.

Screenshot showing the "clone a workspace" popout. The Workspace name, Billing Project, and Clone Workspace button have been filled in and highlighted.

14.1 Video overview on using Galaxy

Here is a video tutorial that describes the basics of using Galaxy on AnVIL.

14.1.1 Objectives

  • Start compute for your Galaxy on AnVIL
  • Run tool to quality control sequencing reads
  • Stop compute to minimize expenses

14.1.2 Slides

The slides for this tutorial are are located here.

14.2 Starting Galaxy

Galaxy is a free, relatively easy to use bioinformatics implementation package. It changes command line programs into GUI based programs and is a great tool for performing bioinformatics analysis without having to update software or worry too much about coding. In order to use Galaxy, we need to create a cloud environment. This is like quickly renting a few computers from Google as the engine to power our Galaxy analysis.

Currently, you will need to use Chrome or Safari as your browser for Galaxy cloud environments to work.

In your new Workspace, click on the “ANALYSES” tab. Next, click on “START”. You should see a popup window on the right side of the screen. Click on the Galaxy logo to proceed.

Screenshot of the Workspace Notebooks tab. The notebook tab name and the plus button that starts a cloud environment for Galaxy have been highlighted,

Click on “NEXT” and “CREATE” to keep all settings as-is.

The CREATE button among cloud environments has been highlighted.

Click on the Galaxy icon.

The Galaxy icon appears if the environment has been successfully launched.

You will see that the environment is still being set up.

The status of the cloud computing environment shows that it is still being set up.

This will take 8-10 minutes. When it is done, click “Open”. You might need to refresh the page.

The Provisioning status text has changed to "Launch Galaxy" indicating the cloud environment is ready to use.

Remember that you can refresh your browser or navigate away at any time. This is because the connection to the environment is in the cloud, not on your personal computer.

You can also follow along with the first ~2 minutes of this video to start Galaxy on AnVIL.

14.4 Deleting Galaxy

Once you are done with your activity, you’ll need to shut down your Galaxy cloud environment. This frees up the cloud resources for others and minimizes computing cost. The following steps will delete your work, so make sure you are completely finished at this point. Otherwise, you will have to repeat your work from the previous steps.

Return to AnVIL, and find the Galaxy logo that shows your cloud environment is running. Click on this logo.

Screenshot of the Workspace menu. The currently running Galaxy cloud environment logo on the right sidebar is highlighted.

Next, click on “Settings”. Click on “Delete Environment”.

Screenshot of the cloud environment pop out menu. The "Delete Environment" button is highlighted.

Finally, select “Delete everything, including persistent disk”. Make sure you are done with the activity and then click “Delete”.

Screenshot of the cloud environment pop out menu. The “Delete everything, including persistent disk” radio button has been checked and is highlighted. The “Delete” button is highlighted.