Data Engineer

Chu Tuan Linh

I am focused on data engineering fundamentals: designing ETL/ELT pipelines, writing clear SQL, modeling data for analytics, and building reproducible systems. I enjoy projects that involve messy raw data, transformation logic, orchestration, and making data trustworthy.

Skills

Data engineering fundamentals

Grouped skills make it easy to scan the stack used across pipeline, modeling, and reliability work.

Languages

Python, SQL, Java, C++

Databases / Warehouses

PostgreSQL, BigQuery, Snowflake

Data Engineering

ETL/ELT, Batch pipelines, Data modeling, Data quality checks, Data validation, Workflow orchestration

Tools

dbt, Airflow, Docker, Git, Linux, Vercel

Cloud

AWS, GCP

Projects

Featured data engineering projects

View all projects

Ecommerce Market Batch ETL Pipeline

Ready

A daily Airflow pipeline that extracts e-commerce product data, validates it, loads market snapshots into PostgreSQL, and feeds Metabase dashboards.

Stack

Python, Apache Airflow, Pandas, Pandera, PostgreSQL, SQLAlchemy, Metabase, Docker, pytest

GitHub repository

Read case study

Formula-1 Lakehouse Analytics

In progress

An ELT lakehouse analytics platform for Formula 1 historical results, race sessions, telemetry, strategy, and weather insights.

Stack

Python, Apache Airflow, MinIO, PySpark, dbt, DuckDB, Metabase, Parquet, Docker

Read case study

Fintech Sentinel

Planning

A real-time transaction monitoring and fraud detection lakehouse for simulated digital banking data.

Stack

PostgreSQL, Debezium, Kafka, PySpark, Apache Iceberg, MinIO, Nessie, dbt, Airflow, Great Expectations, Trino, Grafana, Docker

GitHub repository

Read case study

Case Studies

Technical writeups

View case studies