Skip to main content
CourseintermediateFree

Harvard University's CS109 Data Science

This course provides a comprehensive introduction to data science, covering topics like data wrangling, exploration, visualization, prediction, and communication. It uses Python extensively, along with various data science libraries. Lecture videos, notes, and assignments from past iterations are frequently made available to the public.

Visit resource

More resources on Statistical Software (R, Python)

WebsiteFree

Scikit-learn Official Documentation

Scikit-learn is a widely used open-source machine learning library for Python. Its documentation provides extensive user guides, tutorials, and examples for various classification, regression, clustering, and dimensionality reduction algorithms.

WebsiteFree

NumPy Official Documentation

NumPy is the fundamental package for scientific computing with Python, providing support for large, multi-dimensional arrays and matrices, along with a collection of high-level mathematical functions. Its official documentation is crucial for understanding array operations and numerical computing in Python.

WebsiteFree

Pandas Official Documentation

The definitive and comprehensive documentation for the Pandas library, an indispensable tool for data manipulation and analysis in Python. It includes user guides, API references, and tutorials for working with DataFrames and Series.

WebsiteFree

RStudio Cheat Sheets

RStudio (now Posit) provides a collection of incredibly useful, visually appealing cheat sheets for various R packages and tasks, including `ggplot2`, `dplyr`, data import, R Markdown, and more. These are excellent quick-reference guides for R users.

WebsiteFree

The Comprehensive R Archive Network (CRAN)

The official repository for R, offering not only the R software itself but also extensive documentation, manuals, FAQs, and a vast collection of packages contributed by the R community. It's the authoritative source for R-related information.

YouTubeFree

MarinStatsLectures

A YouTube channel offering a vast library of video lectures on statistics and R programming. It covers a wide range of statistical topics from basic descriptive statistics to advanced inferential methods, often demonstrating their application in R.

See all Statistical Software (R, Python) resources β†’