Glowbal University Data Ingestion Pipeline
In progress An evidence-first Python ingestion and QA pipeline that standardizes international university data from official sources into product-ready review datasets.
Stack
Python, Supabase, PostgreSQL, Playwright, Serper, OpenAI, Gemini, CSV, pytest
GitHub repository
Read case study
Ecommerce Market Batch ETL Pipeline
Ready A daily Airflow pipeline that extracts e-commerce product data, validates and loads market snapshots into PostgreSQL, and supports Metabase dashboards.
Stack
Python, Apache Airflow, Pandas, Pandera, PostgreSQL, SQLAlchemy, Metabase, Docker, pytest
GitHub repository
Read case study
Formula-1 Lakehouse Analytics
In progress An ELT lakehouse analytics platform for Formula 1 historical results, race sessions, telemetry, strategy, and weather insights.
Stack
Python, Apache Airflow, MinIO, PySpark, dbt, DuckDB, Metabase, Parquet, Docker
Read case study