My learning journey with Pandas, the powerful Python library for data manipulation and analysis.
It contains practice notebooks and scripts where I explored different datasets, applied data-cleaning techniques, and performed meaningful analysis with Pandas. The goal is to strengthen my Python and Data Analysis skills through hands-on practice.
- 📂 Data Exploration →
read_csv,head(),tail(),info(),describe() - 🎯 Selection & Indexing →
.iloc[],.loc[], boolean filtering - 🧹 Data Cleaning →
dropna(),fillna(),drop_duplicates(),sort_values() - 📊 Aggregation & Grouping →
groupby(),sum(),mean(),count() - 🔗 Merging & Joining → SQL-style joins with
merge() - 📑 Pivot Tables → Summarizing data efficiently
- 📈 Visualization → Pandas plotting + Matplotlib (pie charts, bar plots, histograms)
- 📝 Custom Functions → Applying and creating helper functions
DataScience_salaries_2024.csv→ Job roles, salaries, and remote ratio trendsfood_order.csv→ Food order analysis & pivot tables practiceNetflix Movies and TV Shows.csv→ Cleaning, missing values handlingFortune 500 Companies US.csv→ Aggregation & grouping practice
⚠️ All datasets are publicly available and used only for educational purposes.
git clone https://github.com/Sidra-009/pandas-data-analysis.git
cd pandas-data-analysispip install pandas numpy matplotlib jupyterjupyter notebookThis repository is not a final project but a learning workspace. It reflects my continuous progress in exploring data analysis with Pandas, while building a foundation for future Data Science projects.
Sidra Saqlain 📌 Data Science Student | Enthusiastic about Data Analysis & Problem-Solving
✨ If you find this repository useful or inspiring, please ⭐ it!