Navigating the Data with Pandas

Date:

  • This talk centers on Pandas, an essential library in the Python ecosystem for data manipulation and analysis. It starts with the two core objects: the Series, a one-dimensional labeled array, and the DataFrame, a two-dimensional table whose columns are Series. Because real data is rarely perfect, the talk covers strategies for detecting and handling missing values, along with practical tips for cleaning data. It then turns to transforming datasets with functions, regular expressions, and table operations, the skills needed to shape and prepare data for analysis. By the end of the session, participants will have a solid foundation in essential data manipulation tasks with Pandas, paving the way for further exploration in data science.
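
As a taste of the topics listed above, here is a minimal sketch that touches each one: a Series, a DataFrame, missing values, a function, a regular expression, and a table operation. The data, the column names, and the `initials` helper are invented for illustration and are not taken from the talk or its slides.

```python
import pandas as pd

# A Series is a one-dimensional labeled array; None becomes NaN here.
temps = pd.Series([21.5, None, 19.0], index=["mon", "tue", "wed"], name="temp_c")

# A DataFrame is a two-dimensional table whose columns are Series.
# (Hypothetical sample data.)
people = pd.DataFrame(
    {
        "name": ["  Ada Lovelace", "alan turing ", "Grace Hopper"],
        "phone": ["555-0101", "(555) 0102", None],
        "dept_id": [1, 2, 1],
    }
)
depts = pd.DataFrame({"dept_id": [1, 2], "dept": ["Math", "Computing"]})

# Missing data: count the gaps, then fill them with the mean of the known values.
missing_temps = int(temps.isna().sum())
temps_filled = temps.fillna(temps.mean())

# Cleaning: strip stray whitespace and normalize capitalization.
people["name"] = people["name"].str.strip().str.title()


def initials(full_name: str) -> str:
    """Hypothetical helper: first letter of each word in a name."""
    return "".join(part[0] for part in full_name.split())


# Functions: apply a plain Python function to every value in a column.
people["initials"] = people["name"].map(initials)

# Regular expressions: drop every non-digit character; missing values stay missing.
people["phone_digits"] = people["phone"].str.replace(r"\D", "", regex=True)

# Table operations: a left join attaches the department name to each person.
merged = people.merge(depts, on="dept_id", how="left")

print(temps_filled)
print(merged)
```

Filling with the mean is only one of several options; dropping the rows with `dropna()` or leaving the gaps in place can be the better choice, depending on the analysis.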

Slides