Introduction

What is pandas?

Pandas is a framework, i.e. a collection of functionality, not a program on its own. It is based on the numerical mathematics framework numpy.

While numpy is geared towards highly optimized numeric calculations, pandas offers additional convenience and utility for dealing with tabular data. In exchange, it sacrifices some processing speed.
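To make the difference concrete, here is a minimal sketch that computes the same column means with plain numpy and with pandas. The data and the column names `height` and `width` are made up for illustration.

```python
import numpy as np
import pandas as pd

# Plain numpy array: rows and columns are only known by position.
values = np.array([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])
column_means_np = values.mean(axis=0)

# The same data wrapped in a DataFrame with (hypothetical) column labels.
df = pd.DataFrame(values, columns=["height", "width"])
column_means_pd = df.mean()

# The labelled result can be accessed by name instead of by position.
height_mean = column_means_pd["height"]
print(column_means_np[0], height_mean)
```

Both approaches give the same numbers. The pandas version carries the labels along, which is the kind of convenience referred to above.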

What is pandas used for?

Its main use case is data processing. This includes reading, exploring, cleaning, transforming, and visualizing data.
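The following is a small sketch of what these steps can look like in practice. The CSV content is invented for this example and is read from a string so the snippet is self-contained. In real use the data would typically come from a file. Visualization is left out because it additionally requires a plotting library such as matplotlib.

```python
import io
import pandas as pd

# Hypothetical example data; Bob's age is missing on purpose.
csv_text = """name,age,city
Alice,34,Berlin
Bob,,Hamburg
Carol,29,Berlin
"""

# Reading: parse the CSV text into a DataFrame.
df = pd.read_csv(io.StringIO(csv_text))

# Exploring: table size and number of missing values in one column.
n_rows, n_cols = df.shape
missing_ages = df["age"].isna().sum()

# Cleaning: drop the rows where the age is missing.
clean = df.dropna(subset=["age"])

# Transforming: compute the mean age per city.
mean_age_by_city = clean.groupby("city")["age"].mean()
print(mean_age_by_city)
```

Dropping incomplete rows is only one possible cleaning strategy. Depending on the data, filling in a default value may be more appropriate.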

Common areas that make use of it are:

  • Data Science
  • Machine Learning

How to get pandas?

pandas can be installed via pip or conda (see pandas on pypi.org). Make sure that its dependencies, such as numpy, are installed as well.
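After running `pip install pandas` or `conda install pandas`, a quick way to check that the installation worked is to import the package and print its version. This is only a minimal sanity check, and the version numbers shown will depend on your environment.

```python
import numpy as np
import pandas as pd

# If both imports succeed, pandas and its numpy dependency are available.
print("pandas version:", pd.__version__)
print("numpy version:", np.__version__)
```

If the import fails with a `ModuleNotFoundError`, the package was most likely installed into a different Python environment than the one running the script.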

Where to find help?

Key Points

  • pandas is a data processing framework based on numpy
  • It offers additional utility functions but sacrifices speed