~/projects/elt-data-platform ← projects

/projects$cat elt-data-platform.html

ELT Data Platform

SLTC · final-year research · 2025

the problem

Nine years of daily central-bank reports — locked inside unstructured PDFs. Rich economic data, completely unqueryable. The research question: can you build a platform that turns that archive into clean, structured, analyzable data, automatically and continuously?

what i built

An automated data platform orchestrated by Apache Airflow:

The whole thing runs as an orchestrated DAG, so it's repeatable and extensible — point it at new reports and it keeps going.