Ecommerce Market Batch ETL Pipeline
ReadyA batch ETL pipeline project that ingests transaction data, validates schemas, and produces daily revenue metrics.
Stack
Python, SQL, PostgreSQL, Docker
GitHub repository Read case study
Aspiring Data Engineer
I am focused on data engineering fundamentals: designing ETL/ELT pipelines, writing clear SQL, modeling data for analytics, and building reproducible systems. I enjoy projects that involve messy raw data, transformation logic, orchestration, and making data trustworthy.
Skills
Grouped skills make it easy to scan the stack used across pipeline, modeling, and reliability work.
Projects
A batch ETL pipeline project that ingests transaction data, validates schemas, and produces daily revenue metrics.
Stack
Python, SQL, PostgreSQL, Docker
GitHub repository Read case studyA real-time transaction monitoring and fraud detection lakehouse for simulated digital banking data.
Stack
PostgreSQL, Debezium, Kafka, PySpark, Apache Iceberg, MinIO, Nessie, dbt, Airflow, Great Expectations, Trino, Grafana, Docker
Read case studyA draft orchestration project that schedules pipeline tasks and records validation results for operational visibility.
Stack
Airflow, Python, SQL, Docker, PostgreSQL
GitHub repository Read case studyCase Studies
Draft case study for an orchestrated workflow that separates extraction, transformation, validation, and publish steps.
In progressIn-progress case study for a real-time transaction monitoring and fraud detection lakehouse using CDC, Kafka, Spark, Iceberg, dbt, Airflow, and Great Expectations.
Draft contentDraft case study for a batch ETL pipeline that turns raw sales exports into validated revenue metrics and dimensional tables.