Home

Awesome

Awesome pandas logo

awesome-pandas

A collection of resources for pandas (Python) and related subjects. Pull requests are very welcome!

Contents: This is an unofficial collection of resources for learning pandas, an open source Python library for data analysis. Here you will find videos, cheat-sheets, tutorials and books / papers. The curated list is divided into three parts:

  1. pandas resources - A collection of videos, cheat-sheets, tutorials and books directly related to pandas.
  2. Data analysis with Python resources - Material related to adjacent Python libraries and software such as NumPy, scipy, matplotlib, seaborn, statsmodels and Jupyter.
  3. Miscellaneous related resources - Resources related to general data analysis, Python programming, algorithms, computer science, machine learning, statistics, etc.
  4. Packages - Python packages for helping to work with Pandas.

(1) :panda_face: pandas resources

(1.1) :tv: Videos

The videos below were collected in July of 2018. They are all directly related to pandas, and the Level of a video is quantified roughly as follows:

TitleSpeakerUploaderTimeViewsYearLevel
Pandas tutorial for Data ScienceBikram Kundu-> 01:202K+2022:smiley:
Python for Data Analysis using Pandas part 1 & part 2 [repo]tommyodna2:191002019:smiley:
Data Science Best Practices with pandas [repo]Kevin MarkhamPyCon3:2310002019:smiley:
Thinking like a PandaHannah StepanekPyCon0:367002019:smiley:
Analyzing Census Data with Pandas [repo]Sergio SánchezPyCon3:156002019:smiley:
Pandas is for Everyone [repo]Daniel ChenPyCon3:186002019:smiley:
:star: Pandas From The Ground Up [repo]Brandon RhodesPyCon 20152:24910002015:smiley:
Introduction Into Pandas [repo]Daniel ChenPython Tutorial1:28460002017:smiley:
Introduction To Data Analytics With Pandas [repo]Quentin CaudronPython Tutorial1:51250002017:smiley:
Pandas for Data Analysis [repo]Daniel ChenEnthought3:45130002017:sweat_smile:
Optimizing Pandas Code [repo]Sofia HeislerPyCon 20170:29120002017:sweat_smile:
A Visual Guide To PandasJason WirthNext Day Video0:26490002015:smiley:
Analyzing and Manipulating Data with Pandas [repo]Jonathan RocherEnthought3:33220002016:smiley:
Time Series Analysis [repo]Aileen NielsenPyCon 20173:1190002017:sweat_smile:
Predicting sports winners with pandasRobert LaytonPyCon Australia0:38130002015:sweat_smile:
Pandas from the Inside [repo] [2016 talk]Stephen SimmonsPyData1:1730002017:scream:
Pandas part 1 & part 2 [repo]Joris Van den BosscheEuroSciPy3:0310002017:smiley:
Pandas: .head() to .tail() [repo]Tom AugspurgerPyData1:2630002016:sweat_smile:
Performance Pandas (london) [repo]Jeff RebackPyData0:4320002015:sweat_smile:
Performance Pandas (NYC) [repo]Jeff RebackPyData1:2630002015:sweat_smile:
Python Data Science with pandas [repo]Matt HarrisonJetBrainsTV1:0920002018:smiley:
What is the Future of Pandas [slides]Jeff RebackPyData0:3140002017:smiley:
Introduction to Python for Data Science [repo]Skipper SeaboldPyData3:183002018:smiley:
Pandas for Better (and Worse) Data Science [repo]Kevin MarkhamPyCon 20183:2130002018:smiley:

Know of a recent, good video? Send a pull request! :+1:

(1.2) :exclamation: Cheat-sheets

(1.3) :mortar_board: Tutorials

(1.4) :blue_book: Books / papers


(2) Data analysis with Python resources

(2.1) :tv: Videos

TitleSpeakerUploaderTimeViewsKeywordYearLevel
NumPy Beginner [repo]Alexandre Chabot LeClercEnthought2:4756000NumPy2016:sweat_smile:
Machine LearningAndreas Mueller & Sebastian RaschkaEnthought3:0347000sklearn2016:sweat_smile:
The Python Visualization LandscapeJake VanderPlasPyCon 20170:3321000python2017:smiley:
JupyterLab: Building Blocks for Interactive ComputingBrian GrangerEnthought0:2928000jupyter2016:smiley:
Machine Learning with Scikit Learn [repo]Andreas Mueller & Kyle KastnerEnthought3:2248000sklearn2015:sweat_smile:
Machine Learning for Time Series Data in PythonBrett NaulEnthought0:2424000cesium2016:smiley:
Computational Statistics [repo]Allen DowneyEnthought2:0510000scipy2017:sweat_smile:
Time Series Analysis [repo]Aileen NielsenPyCon 20173:119000pandas2017:sweat_smile:
Learning TensorFlowRobert LaytonPyCon Australia0:4018000tensorflow2016:sweat_smile:
JupyterHub: Deploying Jupyter NotebooksMin Ragan Kelley & Thomas KluyverPyData1:3617000jupyter2016:smiley:
Applied Time Series EconometricsJeffrey YauPyData1:3917000statsmodels2016:sweat_smile:
Machine Learning with scikit learn [repo]Andreas Mueller & Alexandre GramEnthought3:108000sklearn2017:sweat_smile:
Introduction to Numerical Computing with NumPyDillon NiederhutEnthought2:278000NumPy2017:smiley:
Dask - A Pythonic Distributed Data Science FrameworkMatthew RocklinPyCon 20170:467000dask2017:sweat_smile:
Introduction to Statistical Modeling with Python [repo]Christopher FonnesbeckPyCon 20173:197000scipy2017:sweat_smile:
Fully Convolutional Networks for Image SegmentationDaniil PakhomovEnthought0:207000scipy2017:smiley:
Exploratory data analysis in python [repo]Chloe Mawer & Jonathan WhitmorePyCon 20172:547000scipy2017:smiley:
Libraries for Deep Learning with SequencesAlex RubinsteynPyData0:4423000scipy2015:sweat_smile:
Numba - Tell Those C++ Bullies to Get Lost [repo]Gil Forsyth & Lorena BarbaEnthought2:255000numba2017:sweat_smile:
Deploying Interactive Jupyter DashboardsPhilipp RudigerEnthought0:185000jupyter2017:sweat_smile:
Data Science Using Functional PythonJoel GrusPyData0:4418000python2015:sweat_smile:
Anatomy of matplotlib [repo]Benjamin Root & Joe KingtonEnthought3:1818000matplotlib2015:sweat_smile:
Anatomy of matplotlib [repo]Benjamin RootEnthought3:024000matplotlib2017:sweat_smile:
Data Science is Software [repo]Peter Bull & Isaac SlavittEnthought2:129000jupyter2016:smiley:
Machine Learning with Scikit Learn [repo]Jake VanderPlasPyData1:3416000sklearn2015:sweat_smile:
Using Jupyter notebooks [repo]Ioanna IoannouPyCon Australia0:288000jupyter2016:sweat_smile:
Parallel Python: Analyzing Large Datasets [repo]Matthew RocklinEnthought3:057000scipy2016:scream:
Keynote: Project JupyterBrian GrangerEnthought0:487000jupyter2016:sweat_smile:
matplotlib beginner tutorial [repo]Nicolas RougierEnthought2:596000matplotlib2016:sweat_smile:
Awesome Big Data AlgorithmsTitus BrownNext Day Video0:3941000python2013:scream:
All About JupyterBrian GrangerPyData0:3911000jupyter2015:sweat_smile:
PyMC: Markov Chain Monte CarloChris FonnesbeckEnthought0:209000pyMC2014:sweat_smile:
Jupyter Advanced Topics Tutorial [repo]Jonathan Frederic & Matthias BussonierEnthought2:484000jupyter2015:scream:
Using randomness to make code much fasterRachel ThomasSF Python0:541000scipy2017:sweat_smile:
Python Profiling & PerformanceMahmoud HashemiSF Python0:281000python2016:sweat_smile:
Using List Comprehensions and Generator ExpressionsTrey HunnerPyCon 20183:213000python2018:sweat_smile:
Foundations of Numerical ComputingScott SandersonPyCon 20183:221000python2018:sweat_smile:

(2.2) :exclamation: Cheat-sheets

(2.3) :mortar_board: Tutorials

(2.4) :blue_book: Books / papers


(3) Miscellaneous related resources

(3.1) :tv: Videos

TitleSpeakerUploaderTimeViewsKeywordYearLevel
:star: So you want to be a Python expert?James PowellPyData1:5428000python2017:scream:
:star: Transforming Code into Beautiful, Idiomatic PythonRaymond HettingerNext Day Video0:48340000python2013:smiley:
:star: Builtin SuperheroesDavid BeazleyDavid Beazley0:4412000python2016:sweat_smile:
How to become a Data Scientist in 6 monthsTetiana IvanovaPyData0:56148000misc2016:smiley:
Modern DictionariesRaymond HettingerSF Python1:0744000python2016:sweat_smile:
Keynote on ConcurrencyRaymond HettingerSF Python1:1315000python2017:sweat_smile:
The Fun of ReinventionDavid BeazleyDavid Beazley0:5211000python2017:scream:
Being a Core Developer in PythonRaymond HettingerSF Python1:0219000python2016:smiley:
Visualizing Geographic DataChristopher RoachPyData0:3114000python2016:smiley:
Python's Class Development ToolkitRaymond HettingerNext Day Video0:4580000python2013:sweat_smile:
The Other Async (Threads + Async = ❤️) - YouTubeDavid BeazleyDavid Beazley0:475000python2017:scream:
Functional Programming with PythonMike MüllerNext Day Video0:2744000python2013Novice
Building a Recommendation Engine using PythonAnusua TrivediPyData0:3711000python2015Novice
Iterations of EvolutionDavid BeazleyDavid Beazley0:342000python2017Novice
"Good Enough" IS Good Enough!Alex MartelliSF Python0:534000python2016Novice
Automating Code QualityKyle KnappPyCon 20180:303000python2018:sweat_smile:

(3.2) :exclamation: Cheat-sheets

(3.3) :mortar_board: Tutorials

(3.4) :blue_book: Books / papers

The books below are perhaps of an even more general nature.


Every video is below.

TitleSpeakerUploaderTimeViewsKeywordYearLevel
How to become a Data Scientist in 6 monthsTetiana IvanovaPyData0:56148000misc2016:snake:
Introduction Into PandasDaniel ChenPython Tutorial1:2846000pandas2017:snake:
So you want to be a Python expert?James PowellPyData1:5428000python2017:snake::snake::snake:
NumPy Beginner [repo]Alexandre Chabot LeClercEnthought2:4756000NumPy2016:snake: :snake:
Introduction To Data Analytics With PandasQuentin CaudronPython Tutorial1:5125000pandas2017:snake:
Transforming Code into Beautiful, Idiomatic PythonRaymond HettingerNext Day Video0:48340000python2013:snake:
Machine LearningAndreas Mueller & Sebastian RaschkaEnthought3:0347000sklearn2016:snake: :snake:
Pandas From The Ground Up [repo]Brandon RhodesPyCon 20152:2491000pandas2015:snake: :snake:
Modern DictionariesRaymond HettingerSF Python1:0744000python2016:snake: :snake:
The Python Visualization LandscapeJake VanderPlasPyCon 20170:3321000python2017:snake:
Keynote on ConcurrencyRaymond HettingerSF Python1:1315000python2017:snake::snake:
Pandas for Data Analysis [repo]Daniel ChenEnthought3:4513000pandas2017:snake::snake:
JupyterLab: Building Blocks for Interactive ComputingBrian GrangerEnthought0:2928000jupyter2016:snake:
Optimizing Pandas Code for Speed and EfficiencySofia HeislerPyCon 20170:2912000pandas2017:snake: :snake:
A Visual Guide To PandasJason WirthNext Day Video0:2649000pandas2015:snake:
Machine Learning with Scikit Learn [repo]Andreas Mueller & Kyle KastnerEnthought3:2248000sklearn2015:snake: :snake:
Machine Learning for Time Series Data in PythonBrett NaulEnthought0:2424000cesium2016:snake:
The Fun of ReinventionDavid BeazleyDavid Beazley0:5211000python2017:snake::snake::snake:
Analyzing and Manipulating Data with Pandas [repo]Jonathan RocherEnthought3:3322000pandas2016:snake:
Computational Statistics [repo]Allen DowneyEnthought2:0510000scipy2017:snake: :snake:
Being a Core Developer in PythonRaymond HettingerSF Python1:0219000python2016:snake:
Time Series Analysis [repo]Aileen NielsenPyCon 20173:119000pandas2017:snake: :snake:
Learning TensorFlowRobert LaytonPyCon Australia0:4018000tensorflow2016:snake: :snake:
JupyterHub: Deploying Jupyter NotebooksMin Ragan Kelley & Thomas KluyverPyData1:3617000jupyter2016:snake:
Applied Time Series EconometricsJeffrey YauPyData1:3917000statsmodels2016:snake: :snake:
Machine Learning with scikit learn [repo]Andreas Mueller & Alexandre GramEnthought3:108000sklearn2017:snake: :snake:
Introduction to Numerical Computing with NumPyDillon NiederhutEnthought2:278000NumPy2017:snake:
Dask - A Pythonic Distributed Data Science FrameworkMatthew RocklinPyCon 20170:467000dask2017:snake: :snake:
Introduction to Statistical Modeling with Python [repo]Christopher FonnesbeckPyCon 20173:197000scipy2017:snake: :snake:
Fully Convolutional Networks for Image SegmentationDaniil PakhomovEnthought0:207000scipy2017:snake:
Exploratory data analysis in python [repo]Chloe Mawer & Jonathan WhitmorePyCon 20172:547000scipy2017:snake:
Visualizing Geographic DataChristopher RoachPyData0:3114000python2016:snake:
Builtin SuperheroesDavid BeazleyDavid Beazley0:4412000python2016:snake: :snake:
Python's Class Development ToolkitRaymond HettingerNext Day Video0:4580000python2013:snake: :snake:
Libraries for Deep Learning with SequencesAlex RubinsteynPyData0:4423000scipy2015:snake: :snake:
The Other Async (Threads + Async = ❤️) - YouTubeDavid BeazleyDavid Beazley0:475000python2017:snake: :snake: :snake:
Numba - Tell Those C++ Bullies to Get Lost [repo]Gil Forsyth & Lorena BarbaEnthought2:255000numba2017:snake: :snake:
Deploying Interactive Jupyter DashboardsPhilipp RudigerEnthought0:185000jupyter2017:snake: :snake:
Eyal Trabelsi - Practical Optimisations for PandasEyal TrabelsiEuropython0:455000jupyter2020:snake: :snake:
Data Science Using Functional PythonJoel GrusPyData0:4418000python2015:snake: :snake:
Pandas from the InsideStephen SimmonsPyData1:209000pandas2016:snake: :snake: :snake:
Anatomy of matplotlib [repo]Benjamin Root & Joe KingtonEnthought3:1818000matplotlib2015:snake: :snake:
Anatomy of matplotlib [repo]Benjamin RootEnthought3:024000matplotlib2017:snake: :snake:
Data Science is Software [repo]Peter Bull & Isaac SlavittEnthought2:129000jupyter2016:snake:
Machine Learning with Scikit Learn [repo]Jake VanderPlasPyData1:3416000sklearn2015Novice
Using Jupyter notebooksIoanna IoannouPyCon Australia0:288000jupyter2016Novice
Parallel Python: Analyzing Large Datasets [repo]Matthew RocklinEnthought3:057000scipy2016Novice
Functional Programming with PythonMike MüllerNext Day Video0:2744000python2013Novice
Predicting sports winners with pandas and scikit-learnRobert LaytonPyCon Australia0:3813000pandas2015Novice
Keynote: Project JupyterBrian GrangerEnthought0:487000jupyter2016Novice
matplotlib beginner tutorial [repo]Nicolas RougierEnthought2:596000matplotlib2016Novice
Awesome Big Data AlgorithmsTitus BrownNext Day Video0:3941000python2013Novice
Pandas from the InsideStephen SimmonsPyData1:173000pandas2017Novice
All About JupyterBrian GrangerPyData0:3911000jupyter2015Novice
Building a Recommendation Engine using PythonAnusua TrivediPyData0:3711000python2015Novice
Iterations of EvolutionDavid BeazleyDavid Beazley0:342000python2017Novice
"Good Enough" IS Good Enough!Alex MartelliSF Python0:534000python2016Novice
PyMC: Markov Chain Monte CarloChris FonnesbeckEnthought0:209000pyMC2014Novice
Jupyter Advanced Topics Tutorial [repo]Jonathan Frederic & Matthias BussonierEnthought2:484000jupyter2015Novice
Using randomness to make code much fasterRachel ThomasSF Python0:541000scipy2017Novice
Python Profiling & PerformanceMahmoud HashemiSF Python0:281000python2016Novice

(4) Packages