Register for DaSL Training: Season 3
The Fred Hutch Data Science Lab (DaSL) is excited to continue its training program in the joy and practice of data science in Spring 2024! At DaSL, we believe that everyone, regardless of their educational background, can excel at data science.
All events will be held in person, with a synchronous online option and recordings available to watch. Registration is open to the Fred Hutch community. Please sign up below!
| | Beginner | Intermediate | Advanced |
|---|---|---|---|
| Programming | Intro to R | Intermediate R | |
| Rigorous Science | Introduction to Git and GitHub | | |
| Scalable Computing | Introduction to Command Line | Cluster 101 | |
Course Descriptions and Details
Introduction to R
You will learn the fundamentals of R, a statistical programming language, and use it to wrangle data for analysis and visualization. The programming skills you learn will carry over to learning more about R on your own and to other high-level languages such as Python. By the end of the class, you will reproduce an analysis from a scientific publication!
Targeted audience: Researchers who want to do more with their data analyses and visualizations. This course is appropriate for those who want to learn coding for the first time, or who have explored programming and want to focus on fundamentals in R.
Commitment: 6 weekly 1.5-hour classes, with 1-2 hours of weekly practice encouraged.
Course dates: Noon - 1:30pm PT on April 17, 24, May 1, 8, 15, 22. Register here.
Intermediate R
You will continue to learn the fundamentals of R and programming, and work on a data science project from start to finish. You will learn how to load and clean messy data, use custom R packages and functions, and effectively scale up your analysis.
Targeted audience: This course is appropriate for those who understand the basics of R data analysis and want to expand their knowledge to tackle messy data and use custom tools.
Prerequisites: Completion of Intro to R, or knowledge of how to subset vectors and data frames and perform simple analyses such as summarizing data.
Commitment: 6 weekly 1.5-hour classes, with 1-2 hours of weekly practice encouraged.
Course dates: Noon - 1:30pm PT on April 15, 22, 29, May 6, 13, 20. Register here.
Introduction to Command Line
Fluency in programming and data science requires using computer software from the Command Line, a text-based way of controlling the computer. You will go on a guided under-the-hood tour behind the graphical interface we typically use: you will learn how to interact with and manipulate files, folders, and software via the Command Line.
Targeted audience: Researchers who want to use scientific software launched from the command line, want to use a high-performance cluster computing environment, or want to use a cloud computing environment.
Commitment: A 1.5-hour workshop.
Workshop date: Noon - 1:30pm PT on April 16. Register here.
Cluster 101
Many scientific computing tasks cannot be done locally on a personal computer due to constraints in computation, data, and memory. In this workshop, you will learn how to connect to the Fred Hutch SLURM high performance cluster to transfer files, load scientific software, compute interactively, and launch jobs!
Targeted audience: Researchers who want to use Fred Hutch’s SLURM high performance cluster to run software and analysis at scale.
Prerequisites: Completion of the Intro to Command Line workshop, or demonstrated competency.
Commitment: A 1.5-hour workshop.
Workshop date: Noon - 1:30pm PT on April 30. Register here.
Introduction to Git and GitHub
You will learn how to use Git, a version control system that is the primary means of doing reproducible and collaborative research. You will use Git from the command line to document the history of your code, create different versions of your code, and collaborate with others on your code using GitHub!
Targeted audience: Researchers who want to keep track of the history of their code at a professional standard, and share it with an audience.
Prerequisites: Completion of the Intro to Command Line workshop, or demonstrated competency.
Commitment: A 1.5-hour workshop.
Workshop date: Noon - 1:30pm PT on May 14. Register here.