Tech Stack: Python · SQL (SQLite) · Power BI
Timeline: Mar 2026 – Present
Analyzed HR workflows across 300 employees to identify operational bottlenecks, reduce onboarding inefficiencies, and support data-driven resource allocation decisions.
| Metric | Value |
|---|---|
| Overall Attrition Rate | 23.3% |
| Highest Attrition Dept | Sales (31.4%) |
| Avg Onboarding Duration | 23.8 days |
| Onboarding Bottleneck | IT Setup (avg 9.7 days) |
| Leave Approval Rate | 88.4% |
| Avg Performance Score | 3.38 / 5.0 |
| High Performers | 14.0% of workforce |
Builds a realistic HR dataset with 4 tables:
employees— 300 records with dept, level, salary, performance, attritiononboarding— 1,500 step-level records tracking each stage's durationleave_requests— 1,790 leave records with type, duration, approvalperformance_reviews— 492 semi-annual review records
Full Python analysis pipeline:
- Attrition analysis by department and job level
- Onboarding bottleneck detection (IT Setup = main delay)
- Leave & resource utilization by department
- Performance insights and high-performer segmentation
- Exports 5 Power BI-ready CSVs + dashboard PNG
8 structured SQL queries covering:
- Attrition rate by department
- Onboarding bottleneck per step
- Employees with slowest onboarding (top 10)
- Leave utilization & approval rate by department
- Attrition risk scoring (performance + leave cross-analysis)
- Performance trend by department
- Salary vs performance (resource allocation efficiency)
- Monthly attrition trend
Import these CSVs into Power BI for dashboards:
powerbi_kpi_summary.csv— KPI cardspowerbi_attrition_by_dept.csv— Bar chartpowerbi_onboarding_bottleneck.csv— Funnel / barpowerbi_leave_by_dept.csv— Table / matrixpowerbi_performance_by_dept.csv— Gauge / bar
# Step 1 — Install dependencies
pip install pandas numpy faker matplotlib
# Step 2 — Generate dataset
python generate_data.py
# Step 3 — Run analysis
python analyze.py
# Step 4 — Open SQL queries in DB Browser for SQLite
# File: output/hr_analytics.db
# Step 5 — Import CSVs from output/ into Power BI