A Certified IBM Data Engineering Professional with extensive experience designing and maintaining scalable data pipelines and lake house environments. Expert in big data and BI tools with a proven track record of enhancing data architecture and leading cross-functional teams to align solutions with business goals. Adept at managing data projects from modeling and ETL/ELT processes to real-time streaming and machine learning deployment. A personal milestone of losing 45 kilograms and maintaining fitness through regular practice.
Title: Big Data Engineer
Project 1: Predicting Car Prices Using Apache Airflow and PySpark
Developed a data pipeline to predict car prices with automated workflow management using Airflow and added logging for monitoring.
Project 2: E-Commerce End-to-End Big Data Engineering Project
Designed a scalable ELT data pipeline for e-commerce analytics using Apache Kafka for real-time processing and Apache Spark for batch processing.