This project leverages data collection, SQL preprocessing, and Power BI to deliver in-depth sales analytics and performance optimization. It includes web-scraped news data, sales trends, and insights into Power BI performance using the Performance Analyzer.
- Data Integration: Combine dataset-based and web-scraped data sources.
- Data Cleaning & Preprocessing: Perform preprocessing directly in SQL for structured analysis.
- Sales Analysis: Explore sales trends across categories, segments, regions.
- Power BI Performance Tuning: Use Performance Analyzer to optimize report speed and responsiveness.
- Interactive Dashboards: Build visually rich and interactive Power BI dashboards.
- Comprehensive Documentation: Maintain clear documentation for every step of the process.
- Collected Data from:
- Historical Sales Dataset
- Web-scraped news articles relevant to market trends
- Converted raw data into structured format using Pandas
- Uploaded structured data to SQL database
- Conducted data cleaning and transformation in SQL.
- Removed missing values, standardized data types, and normalized features.
- Exported cleaned data for downstream analytics.
- Built Power BI visualizations:
- Sales by Region, Segment, Category
- Trend over time (monthly, yearly)
- Profit and Quantity distribution
- Heatmaps and Tree Maps for profitability
- Power BI
.pbixfile is stored here.
- Used Performance Analyzer in Power BI to:
- Log load time per visual
- Optimize slow-performing visuals
- Improve report responsiveness
- Logs and results stored here for reference.
- Complete project documentation, including:
- Data flow
- Tools and technologies
- Key visual insights
- Screenshots of dashboards
- Performance tuning strategies
- Python 3.8+
- MySQL Server
- Power BI Desktop (Download here)
- Python libraries:
pip install pandas requests beautifulsoup4 mysql-connector-python
git clone https://github.com/NukaNarendra/SalesAnalysisAndOptimisingPowerBI.git
cd SalesAnalysisAndOptimisingPowerBI# Inside 'Collecting the Data' folder
python collect_data.py- Uses
pandas,requests, andBeautifulSoupto scrape news articles and merge them with sales dataset. - Converts and uploads data to SQL.
- SQL scripts are executed manually or via Python.
- Cleaned data is saved in SQL and exported to CSV for analysis.
- Open
Mining and Analysing/SalesAnalysis.pbixin Power BI Desktop. - Click Refresh to load the latest cleaned data.
- Explore dashboards with interactive filters.
- Navigate to View → Performance Analyzer in Power BI.
- Start recording and export logs.
- Logs are stored in
Collecting the Power BI Performance through Performance Analyzer/.
- Go to
Documentation of the Project/for:- Process flow diagrams
- Screenshots of each dashboard section
- Performance analysis notes
- Model explanation and improvement steps
- Sales Overview: Total Sales, Profit, Quantity
- Time Series: Monthly and Yearly breakdown
- Profit Heatmap: Region-wise performance
- Top N Analysis: Best-selling categories and products
- Performance Analyzer Results: Load times and optimizations
✔ Combined dataset and news insights.
✔ Cleaned and processed data in SQL.
✔ Power BI dashboards showing sales intelligence.
✔ Performance optimization using Power BI analyzer.
✔ Complete project documentation with visuals.
- Contributions are welcome!
Feel free to open issues or pull requests to improve the data pipeline, dashboard, or documentation.