Safaa RAFIK
Portfolio

Data Analyst passionate about extracting value from data with SQL, Python, Excel, Tableau & Power BI.

Data cleaning
in SQL

In this project, I cleaned and transformed a comprehensive Nashville housing dataset in SSMS, addressing issues such as missing values, inconsistencies, and duplicate records. The goal was to prepare the data for deeper analysis, making it more reliable for future insights related to property values, sale trends, and market patterns.

Data cleaning in SQL

This project was done in MySQL, where I took on the challenge of cleaning a global dataset on layoffs, transforming raw, incomplete, and inconsistent data into a structured and actionable resource. This cleanup laid the foundation for a deeper exploration of the dataset.

Data Exploration in SQL

In this project, I used MySQL to explore a global layoffs dataset, identifying key trends and patterns in workforce reductions across different industries and time periods through SQL queries.

Full project in Excel

In this project, I analyzed a bike buyers dataset using Excel. I cleaned and transformed the data for accuracy, explored key patterns with pivot tables, and designed an interactive dashboard to visualize insights, making trends easier to interpret and supporting data-driven decisions.

Tableau Full Project

An interactive Tableau dashboard analyzing Airbnb listings to reveal pricing and revenue trends across different property sizes. The project focuses on how data aggregation (sum vs average) can lead to different business insights.

Tableau Full Project

This Tableau project explores housing trends in King County, Washington, through an interactive dashboard designed to uncover how different property features affect sale prices. It demonstrates both data storytelling and practical insights for buyers, sellers, and investors navigating the real estate market.

Tableau Full Project

In this project, I built an interactive Tableau dashboard that transformed British Airways customer reviews into meaningful visual insights, highlighting patterns in service quality and passenger satisfaction. The analysis helped identify key drivers of satisfaction and areas for potential service improvement.

Automatic File Sorter

This project automates the organization of files into specific folders based on their types (CSV, image, text). Written in Python, it scans a directory and moves files to designated folders, streamlining file management. The script is highly customizable and can be easily adapted to various file sorting needs.

Machine Learning Model

In this project, I built a machine learning model to predict whether British Airways customers' would complete their flight bookings. After feature engineering and handling class imbalance with SMOTE, I trained a Random Forest classifier that uncoverered the key factors that influence customer decisions.