Chemical Engineer graduate, pursuing MSc in Data Science
The best time to plant a tree was 20 years ago the second best time is now. - Chinese proverb
Experienced Data Science Engineer with a background in chemical engineering, passionate about building data pipelines, real-time dashboards, and machine learning solutions. Strong track record of deploying scalable analytics tools, mentoring teams, and delivering actionable insights from complex datasets.
โก Expertise: Python, SQL, Power BI, Streamlit, PySpark, Git, Redis
๐ Interests: Data Engineering, Operational Efficiency, Predictive Maintenance
๐ September 2024 โ Present
๐ October 2022 โ September 2024
โ๏ธ Developed multi-sensor ML models to predict failures in refinery rotating equipment (pumps, compressors, turbines) using vibration, current, and temperature data.
๐ Achieved 99.9% accuracy with Random Forest, surpassing literature benchmarks, and demonstrated robust feature engineering and preprocessing workflows (MATLAB + Python).
๐ Presentation slides
โฑ๏ธ An interactive web dashboard analyzing health & performance metrics from Garmin and Strava .csv
/ .fit
exports.
๐ ๏ธ Built with Streamlit to visualize pace, heart rate, cadence, and GPS tracks.
โก A lightweight backend using Flask + Redis for managing user profiles, posts, and timelines.
Designed for real-time interactions using Redis data structures (hashes, lists, sets).
Used open-source predictive maintenance dataset.
Performed EDA, data cleaning, and trained a DecisionTreeClassifier โ achieving 99.6% accuracy and 99.7% F1 score.
Simplifies solar energy production data cleaning using R.
Designed to support predictive maintenance:
Languages: Python, SQL, R, DAX, PHP, MATLAB, VBA
Tools & Platforms: Power BI, Tableau, Power Automate, Git, GitHub Copilot, MongoDB, Redis, Docker, Azure, PySpark, SQL Server
Certifications: