Data Engineer

Chu Tuan Linh

I am focused on data engineering fundamentals: designing ETL/ELT pipelines, writing clear SQL, modeling data for analytics, and building reproducible systems. I enjoy projects that involve messy raw data, transformation logic, orchestration, and making data trustworthy.

Skills

Data engineering fundamentals

Grouped skills make it easy to scan the stack used across pipeline, modeling, and reliability work.

Languages

Python, SQL, Java, C++

Databases / Warehouses

PostgreSQL, BigQuery, Snowflake

Data Engineering

ETL/ELT, Batch pipelines, Data modeling, Data quality checks, Data validation, Workflow orchestration

Tools

dbt, Airflow, Docker, Git, Linux, Vercel

Cloud

AWS, GCP

Projects

Featured data engineering projects

Glowbal University Data Ingestion Pipeline

In progress

An evidence-first Python ingestion and QA pipeline that standardizes international university data from official sources into product-ready review datasets.

Stack

Python, Supabase, PostgreSQL, Playwright, Serper, OpenAI, Gemini, CSV, pytest

Ecommerce Market Batch ETL Pipeline

Ready

A daily Airflow pipeline that extracts e-commerce product data, validates and loads market snapshots into PostgreSQL, and supports Metabase dashboards.

Stack

Python, Apache Airflow, Pandas, Pandera, PostgreSQL, SQLAlchemy, Metabase, Docker, pytest

Formula-1 Lakehouse Analytics

In progress

An ELT lakehouse analytics platform for Formula 1 historical results, race sessions, telemetry, strategy, and weather insights.

Stack

Python, Apache Airflow, MinIO, PySpark, dbt, DuckDB, Metabase, Parquet, Docker

Case Studies

Technical writeups
