Projects
A selection of my work demonstrating expertise in key data science techniques:
- I. Predictive Modeling
- II. Time Series Forecasting & Simulation
- III. Data Visualization, Business Intelligence & App Deployment
Each project showcases how I apply these analytical techniques to solve real-world problems.
I. Predictive Modeling
These projects showcase predictive modeling using algorithms like XGBoost, Random Forest, and regression to predict outcomes, quantify risk, and guide data-driven decision-making.
Credit Risk Prediction Model
- Problem: Predict credit risk to help financial institutions make informed lending decisions.
- Approach: Applied XGBoost and Random Forest with feature engineering and model evaluation.
- Technologies: Python, XGBoost, Random Forest, Azure, Power BI
- Results: Achieved high accuracy in identifying high-risk customers, enabling better lending decisions.
View on GitHub
Financial Customer Retention Analytics
- Problem: Predict customer churn and estimate lifetime value (CLV) for financial services.
- Approach: Gradient boosting models on transactional/demographic data integrated into dashboards.
- Technologies: Python, Gradient Boosting, SQL, Power BI, Azure ML
- Results: Provided actionable insights for improving customer retention and maximizing lifetime value.
View on GitHub
Predictive HR / Workforce Analytics
- Problem: Predict employee attrition to improve retention strategies.
- Approach: Random Forest and XGBoost on HR datasets; interactive dashboards for retention insights.
- Technologies: Python, XGBoost, Random Forest, SQL, Power BI, Plotly Dash
- Results: Identified high-risk employees and guided HR teams to implement effective retention strategies.
View on GitHub
II. Time Series Forecasting & Simulation
These projects demonstrate modeling of time-dependent data and using simulations to predict resource demand, trends, and scenario outcomes.
Healthcare Resource Forecasting
- Problem: Forecast patient volume and resource demand for better hospital planning.
- Approach: ARIMA + Monte Carlo simulations; automated retraining pipelines in Azure ML.
- Technologies: Python, ARIMA, Monte Carlo, SQL, Plotly Dash, Azure ML
- Results: Enabled hospital administrators to anticipate resource needs and allocate staff efficiently.
View on GitHub
Healthcare Workforce & Scheduling Optimization
- Problem: Predict staffing requirements and evaluate scheduling interventions.
- Approach: Regression and Random Forest; statistical analysis using A/B testing and bootstrap.
- Technologies: Python, Random Forest, Regression, SQL, Plotly Dash, Tableau, Azure ML
- Results: Improved staffing efficiency and optimized schedules based on predictive insights.
View on GitHub
Epidemiology: Toronto Infection Outbreaks (2016–2025)
- Problem: Predict infection outbreaks and identify hotspots.
- Approach: Prophet models for time-series forecasting; dashboards in Power BI.
- Technologies: Python, Prophet, SQL, Power BI
- Results: Provided actionable visualizations for monitoring infection trends and informing public health decisions.
View on GitHub
Financial Forecasting & Scenario Analysis
- Problem: Forecast financial KPIs and run “what-if” investment scenarios.
- Approach: Hybrid Prophet + ARIMA; Monte Carlo simulations; interactive Plotly Dash dashboards.
- Technologies: Python, Prophet, ARIMA, Monte Carlo, Plotly Dash
- Results: Allowed dynamic scenario analysis for strategic financial planning and investment evaluation.
View on GitHub
III. Data Visualization, Business Intelligence & App Deployment
These projects turn complex data into actionable insights via dashboards, apps, and automated workflows.
Operations Efficiency Dashboard
- Problem: Monitor and improve operational KPIs.
- Approach: Data cleaning and automated dashboards using Python, SQL, Power BI, and Azure Pipelines.
- Results: Streamlined reporting, enabling faster decision-making and bottleneck identification.
View on GitHub
Marketing A/B Testing Simulator
- Problem: Simulate marketing campaigns and analyze impact.
- Approach: Statistical tests (t-tests, ANOVA, bootstrap); automated reporting from Jupyter to HTML/PDF dashboards.
- Results: Provided actionable insights for marketing strategy optimization and campaign evaluation.
View on GitHub
Food Choices Streamlit App
- Problem: Explore college students’ food preferences with interactive analysis.
- Approach: Built a Streamlit app to collect, analyze, and visualize data.
- Results: Enabled users to interactively explore nutritional trends and preferences.
View on GitHub
✨ For more projects, visit my GitHub repositories