Data¶
Architect of high-performance data platforms, distributed telemetry pipelines, and enterprise-grade analytical infrastructure.
Overview¶
Data Platform Engineer specialising in the construction of industrial-grade ingestion fabrics and distributed analytical environments. I architect the systems that transform raw telemetry into observable, high-fidelity intelligence.
Apache Ecosystem & Pipeline Orchestration¶
Building resilient data factories using Airflow for state-driven orchestration and Kafka for real-time event streaming. I design the “Industrial Inhale”—hardening pipelines to ensure schema integrity and computational reliability across distributed Apache environments.
Distributed Analytics & Query Virtualisation¶
Specialist in high-velocity exploration using Trino and Apache Drill for federated querying across disparate silos. This bypasses traditional ETL bottlenecks and is supported by Superset for NOC-level observability and industrial data visualisation.
Python Programming & Computational Science¶
Production-grade Python development for the data stack. I treat Jupyter as a forensic laboratory for model profiling, before hardening insights into modular, testable frameworks that prioritise system performance over academic “fluff.”