Episode 103: Becoming More Effective at Manipulating Data With Pandas

The Real Python Podcast

Mar 25, 2022 59m

Do you wonder if you’re taking the right approach when shaping data in pandas? Is your Jupyter workflow getting out of hand? This week on the show, Matt Harrison talks about his new book, “Effective Pandas: Patterns for Data Manipulation.”

Episode Sponsor: CData Software

Matt discusses working as a corporate consultant and migrating Excel users toward Python. We explore several “NumPy-isms” that beginners get stuck on. Matt shares advice about chaining operations in pandas, which some developers find controversial.

Topics:

  • 00:00:00 – Introduction
  • 00:01:32 – Working as a consultant
  • 00:03:39 – Moving from Excel to Python
  • 00:06:50 – Who is the book for?
  • 00:10:15 – Using real data for examples
  • 00:16:16 – Sponsor: CData Software
  • 00:16:58 – What are patterns for data manipulation?
  • 00:18:38 – Cleaning and preparing data
  • 00:21:33 – What concepts were you most eager to share?
  • 00:26:57 – An example of chaining operations in pandas
  • 00:33:20 – NumPy-isms and other challenges in learning pandas
  • 00:40:20 – The use of exercises throughout the book
  • 00:43:50 – Video Course Spotlight
  • 00:45:01 – Challenges of using color throughout the book
  • 00:51:40 – Avoiding the slow path in pandas
  • 00:56:03 – What are you excited about in the world of Python?
  • 00:56:58 – What would you like to learn next?
  • 00:58:16 – Effective Pandas book
  • 00:58:38 – Social connections
  • 00:58:58 – Thanks and goodbye

Show Links: