Resources for Building Your Data Science Skillset

This list offers material to study from and groups to study with. We will add any resources you send us. Current resources include:

HPC and Batch Computing

Scripting and Programming Languages

Python

R / RStudio

Bash / Shell Scripting / UNIX/Linux Command Line

Data Visualization

Matplotlib in-depth user guides: beginner, intermediate, and advanced sections, plus specific topics. https://matplotlib.org/tutorials/index.html

Study Groups and Special Interest Groups

General Tutorials and Overviews

Note that CCR’s Bioinformatics Training and Education Program (BTEP) has licenses for Biostars and Dataquest.io available for CCR staff. If you are interested in these but are not in CCR, please contact us at NCICBIITDataScienceTraining@mail.nih.gov and we will arrange licenses for you.

NIH Listservs

Note: You must register for an account before subscribing to these listservs.

CBIIT Cancer Data Science Seminar Series https://datascience.cancer.gov/news-events/events/data-science-seminar

Resources for Intermediate or Advanced Learners

Machine Learning