Machine Learning with Galaxy

Open Galaxy

Please use this link to access the Galaxy platform specifically for our activity.

Note: this link will only be active for this workshop, but you can continue to use Galaxy resources for free by visiting usegalaxy.org.

Set up Galaxy’s history pane

If you have files in your history already, use the + button on the top right of the history pane to Create new history.

If needed (e.g., because there are already files in your history), create a new history

Click the pencil button next to “Unnamed history”.

Pencil button for renaming the history

Fill in the name with something descriptive/appropriate.

Replace 'Unnamed history' with a descriptive name for the history.

Click “Save”.

Click 'Save' to save the new history name

Now that our History pane is empty and named a descriptive title, we’ll need to load data.

Data upload

Copy these links:

https://zenodo.org/records/13885908/files/Chowell_train_Response.tsv
https://zenodo.org/records/13885908/files/Chowell_test_Response.tsv

In Galaxy, click the “Upload” button in the top left of the page.

'Upload' button for data upload highlighted in the upper left corner

This will open an interactive panel for data upload.

Interactive panel for data upload

Click the “Paste/Fetch Data” button in the middle of the bottom stretch of options.

The Paste/Fetch data button is highlighted within the bottom stretch of options

Paste the copied URLs into the middle box.

paste the copied URLs into the middle highlighted box

Click the blue “Start” button in the bottom stretch of options.

The 'Start' button in the middle of the bottom stretch of options is highlighted.

Click the “Close” button at the end of the bottom stretch of options.

The 'Close' button in the bottom stretch of options is highlighted.

Using PyCaret Model Comparison Tool

On the top left of the page, the tool pane has a search bar. Copy and paste PyCaret Model Comparison into the tool search.

Tool search bar where to copy and paste the name of the tool 'Pycaret Model Comparison'

Select PyCaret Model Comparison.

Select PyCaret Model Comparison from the options listed below the tool search bar

In the middle pane, if the Train Dataset (CSV or TSV) is not already filled in with “Chowell_train_Response.tsv”, click the down arrow and select it.

Select the training dataset for the Train Dataset option

For the Test Dataset (CSV or TSV) option, select “Chowell_test_Response.tsv”.

Select the test dataset for the Test Dataset option

For the Select the target column option, select “c22: Response”.

Select the 'Response' column as the target column

Under the Task section, verify that “classification” is selected.

Verify that 'classification' is selected for the Task option

For the Only Select Classification Models if you don't want to compare all models, select the following:

     Logistic Regression
     Decision Tree Classifier
     Random Forest Classifier

Select Logistic Regression, Decision Tree Classifier, and Random Forest Classifier from the list for the 'Only Select Classification Models if you do not want to compare all models option.'

Under the Customize Default Settings?section, switch the “No” to a “Yes”.

Switch the No to Yes under the Customize Default Settings? Option

For the Cross Validation Folds option, decrease the number of folds to use for cross-validation from 10 to 2.

Switch 10 to 2 under the Cross Validation Folds option

Click the blue >Run Tool button. This will add the job to the queue and add the output files to the top of the history pane.

Click the Run Tool button to submit the job so that pycaret will compare the classification models using the provided data

Click the eye icon () next to the output titled PyCaret Model Comparison Comparisons results on data 2 and data 1 to inspect the output (which will be opened in the middle pane).

Use the eye icon to inspect the output

The output will open in the middle pane where you can inspect it.

Machine Learning with Galaxy

Activity

Overview of activity and data

Activity steps

Open Galaxy

Set up Galaxy’s history pane

Data upload

Using PyCaret Model Comparison Tool