The pandas DataFrame: Working With Data Efficiently

The pandas DataFrame is a structure that contains two-dimensional data and its corresponding labels. DataFrames are widely used in data science, machine learning, scientific computing, and many other data-intensive fields.

DataFrames are similar to SQL tables or the spreadsheets that you work with in Excel or Calc. In many cases, DataFrames are faster, easier to use, and more powerful than tables or spreadsheets because they’re an integral part of the Python and NumPy ecosystems.

In this course, you’ll learn:

  • What a pandas DataFrame is and how to create one
  • How to access, modify, add, sort, filter, and delete data
  • How to handle missing values
  • How to work with time-series data
  • How to quickly visualize data

About Cesar Aguilar

Cesar Aguilar Cesar Aguilar

Cesar is an avid Pythonista and records video tutorials for Real Python. He enjoys teaching programming to anyone who listens and seeing in others their programming "eureka" moment.

» More about Cesar

Each tutorial at Real Python is created by a team of developers so that it meets our high quality standards. The team members who worked on this tutorial are:

Participant Comments

Aldir on Feb. 25, 2022

Well done Cesar! Excelent introduction to Pandas DataFrames. Thank you!

clytn579 on Oct. 3, 2021

This was a great overview of pandas and based on what I have learned, I know where to go for more details to get even deeper knowledge.

aniketbarphe on Sept. 8, 2021

Excellent Course! 10 stars out of 10 stars. Recommended to everyone!

« Browse All Courses