Python Data Science Tutorials

You use Python to explore, analyze, and visualize data with pandas, NumPy, SciPy, and Jupyter. Create clear charts with Matplotlib and Seaborn, clean messy datasets, and write tests so analyses are repeatable. Work through practical tasks like feature engineering, time series, and text processing while using virtual environments to keep tooling reliable.

Join Now: Click here to join the Real Python Newsletter and you’ll never miss another Python tutorial, course, or news update.

When you are ready to model, apply scikit-learn for classification, regression, clustering, and pipelines. For deep learning, train with Keras, TensorFlow, or PyTorch and track results. Scale workloads with Dask, store data in SQLite, PostgreSQL, and deploy predictions with FastAPI and Docker.

Browse all resources below, or commit to a guided Learning Path with progress tracking:

Learning Path

Data Science With Python Core Skills

21 Resources ⋅ Skills: Pandas, NumPy, Data Cleaning, Data Visualization, Statistics

Learning Path

Math for Data Science

5 Resources ⋅ Skills: Statistics, Correlation, Linear Regression, Logistic Regression, NumPy, SciPy, pandas, Gradient Descent

Learning Path

pandas for Data Science

15 Resources ⋅ Skills: pandas, Data Science, Data Visualization, DataFrame, GroupBy, Data Cleaning

Install pandas with python -m pip install pandas. Read files using pd.read_csv() or pd.read_parquet(), inspect with df.info() and df.describe(), and summarize with groupby() and agg().

Use scikit-learn for classical ML tasks and pipelines. Choose TensorFlow or PyTorch for deep learning, and consider XGBoost for strong tabular baselines.

Start with Matplotlib for full control and Seaborn for quick, statistical plots. Set styles, labels, and legends, and export figures with plt.savefig() for reports and dashboards.

Use Dask for pandas-like processing on larger-than-memory data, or PySpark when you need a cluster. For single-machine workflows, stream with chunksize, downcast dtypes, and store data as Parquet.

Serialize the model with joblib.dump(), load it in a FastAPI app, and expose a POST /predict endpoint. Run with Uvicorn or behind Gunicorn, and containerize with Docker for consistent releases.

A worker kneels at a Python-branded validation machine labeled VALIDATING, sorting boxes from a conveyor belt into two stacks marked PASS and REVIEW alongside wooden crates.

Validating Data With Pointblank in Python

intermediate best-practices data-science

LangGraph: Build Stateful AI Agents in Python

LangGraph Tutorial: Build Stateful AI Agents in Python

Jul 13, 2026 intermediate ai data-science

Natural Language Processing With Python's NLTK Package

Natural Language Processing With Python's NLTK Package

Jul 02, 2026 basics data-science

Real Python Podcast E300 Title Image

The Real Python Podcast – Episode #300: Maintaining Your Python Developer Instincts While Using LLM Tools

Jun 26, 2026 ai community data-science data-viz django

Using Python for Data Analysis

Python for Data Analysis: A Practical Guide

Jun 22, 2026 intermediate best-practices data-science python

Using Python for Data Analysis

Using Python for Data Analysis

Jun 22, 2026 intermediate best-practices data-science python

Develop Data Visualization Interfaces in Python With Dash

Develop Data Visualization Interfaces in Python With Dash

Jun 16, 2026 intermediate data-science data-viz

Serialize Your Data With Python

Serialize Your Data With Python

Jun 11, 2026 intermediate data-science web-dev

Embeddings and Vector Databases With ChromaDB

Embeddings and Vector Databases With ChromaDB

Jun 09, 2026 advanced ai databases data-science machine-learning

Python Data Science Artwork

Data Science With Python Core Skills

May 26, 2026 basics data-science

A person sitting on a chair, talking Python to another person who is sitting at a desk with a laptop, with a server structure behind them

Data Collection & Storage

May 26, 2026 intermediate databases data-science

Visualizing Data in Python With Seaborn

Visualizing Data in Python With Seaborn

May 26, 2026 intermediate data-science data-viz

Real Python Podcast E296 Title Image

The Real Python Podcast – Episode #296: Managing Polars Schema Issues & Profiling GitHub Users

May 22, 2026 intermediate best-practices community data-science stdlib

How to Flatten a List of Lists in Python

How to Flatten a List of Lists in Python

May 11, 2026 intermediate algorithms data-science

How to Flatten a List of Lists in Python

How to Flatten a List of Lists in Python

May 11, 2026 intermediate algorithms data-science

Real Python Podcast E294 Title Image

The Real Python Podcast – Episode #294: Declarative Charts in Python & Discerning Iterators vs Iterables

May 08, 2026 intermediate ai data-science data-viz web-scraping

ChatterBot: Build a Chatbot With Python

ChatterBot: Build a Chatbot With Python

May 06, 2026 intermediate data-science projects

Real Python Podcast E293 Title Image

The Real Python Podcast – Episode #293: Agentic Data Science Pair Programming With marimo pair

May 01, 2026 intermediate ai data-science

ChatterBot: Build a Chatbot With Python

ChatterBot: Build a Chatbot With Python

Apr 29, 2026 intermediate data-science projects

Altair: Declarative Charts With Python

Apr 22, 2026 intermediate data-science data-viz

Embeddings and Vector Databases With ChromaDB

Vector Databases and Embeddings With ChromaDB

Apr 14, 2026 advanced ai databases data-science machine-learning

Using Pandas and Python to Explore Your Dataset

Explore Your Dataset With pandas

Apr 14, 2026 basics data-science

Altair: Declarative Charts With Python

Apr 14, 2026 intermediate data-science data-viz

Embeddings and Vector Databases With ChromaDB

Vector Databases and Embeddings With ChromaDB

Apr 14, 2026 advanced ai databases data-science machine-learning

Real Python Podcast E290 Title Image

The Real Python Podcast – Episode #290: Advice on Managing Projects & Making Python Classes Friendly

Apr 10, 2026 intermediate data-science data-structures projects python

Real Python Podcast E289 Title Image

The Real Python Podcast – Episode #289: Limitations in Human and Automated Code Review

Mar 27, 2026 intermediate ai data-science django

Real Python Podcast E288 Title Image

The Real Python Podcast – Episode #288: Automate Exploratory Data Analysis & Invent Python Comprehensions

Mar 20, 2026 intermediate career data-science django python

Spyder: Your IDE for Data Science Development in Python

Spyder: Your IDE for Data Science Development in Python

Mar 16, 2026 basics data-science tools

Spyder: Your IDE for Data Science Development in Python

Spyder: Your IDE for Data Science Development in Python

Mar 05, 2026 basics data-science tools

Automate Python Data Analysis With YData Profiling

Automate Python Data Analysis With YData Profiling

Mar 02, 2026 intermediate data-science data-viz

The pandas DataFrame: Make Working With Data Delightful

The pandas DataFrame: Make Working With Data Delightful

Mar 02, 2026 intermediate data-science

Automate Python Data Analysis With YData Profiling

Automate Python Data Analysis With YData Profiling

Mar 02, 2026 intermediate data-science data-viz

Real Python Podcast E282 Title Image

The Real Python Podcast – Episode #282: Testing Python Code for Scalability & What's New in pandas 3.0

Jan 30, 2026 intermediate career data-science data-structures testing

GeoPandas Basics: Maps, Projections, and Spatial Joins

GeoPandas Basics: Maps, Projections, and Spatial Joins

Jan 26, 2026 intermediate data-science

GeoPandas Basics: Maps, Projections, and Spatial Joins

GeoPandas Basics: Maps, Projections, and Spatial Joins

Jan 26, 2026 intermediate data-science

Writing DataFrame-Agnostic Python Code With Narwhals

Writing DataFrame-Agnostic Python Code With Narwhals

Dec 15, 2025 advanced data-science python

Writing DataFrame-Agnostic Python Code With Narwhals

Writing DataFrame-Agnostic Python Code With Narwhals

Dec 15, 2025 advanced data-science python

Real Python Podcast E276 Title Image

The Real Python Podcast – Episode #276: Exploring Quantum Computing & Python Frameworks

Dec 05, 2025 intermediate data-science tools

A person meditating on the left, thinking about Python, a panda sleeping on the right, dreaming of bamboo, with a structure and a Python-themed gong in the middle between them

Introduction to pandas

Dec 02, 2025 intermediate data-science

Quantum Computing Basics With Qiskit

Quantum Computing Basics With Qiskit

Dec 01, 2025 intermediate data-science

Real Python Podcast E274 Title Image

The Real Python Podcast – Episode #274: Preparing Data Science Projects for Production

Nov 14, 2025 intermediate data-science data-viz

Real Python Podcast E273 Title Image

The Real Python Podcast – Episode #273: Advice for Writing Maintainable Python Code

Nov 07, 2025 intermediate best-practices data-science gui

Investigating Quasar Data With Polars and Interactive Marimo Notebooks

Investigating Quasar Data With Polars and Interactive marimo Notebooks

Oct 21, 2025 intermediate data-science data-viz web-dev

Polars vs pandas: What's the Difference?

Polars vs pandas: What's the Difference?

Oct 15, 2025 intermediate data-science python

Polars vs pandas: What's the Difference?

Polars vs pandas: What's the Difference?

Oct 15, 2025 intermediate data-science python

Real Python Podcast E267 Title Image

The Real Python Podcast – Episode #267: Managing Feature Flags & Comparing Python Visualization Libraries

Sep 26, 2025 intermediate best-practices data-science data-viz

How to Drop Null Values in pandas

How to Drop Null Values in pandas With .dropna()

Sep 10, 2025 basics data-science python

How to Drop Null Values in pandas

How to Drop Null Values in pandas

Sep 10, 2025 basics data-science python

←
1
2
3
4
5
→