I'm a data engineer specialising in cloud-native pipelines, stream processing, and infrastructure as code.
I've built scalable ingestion systems using AWS (Glue, Lambda, CDK), Apache Spark and Iceberg. My work focuses on reducing manual overhead, improving pipeline resilience, and delivering production-grade data platforms.
I'm currently open to new roles (fully remote preferred), ideally working on streaming systems, data platforms, or event-driven architectures.
🔧 Key tools & systems I’ve built:
📦 Apache Iceberg on AWS - CDK-based Infra as Code, migration tools, and SQS-powered write sequencing
📈 Behavioural Trends Pipeline - From raw event stream to backend API via Glue + Lambda + SQS
🔍 Structured Logging Layer - Lambda-native Python logger to enable clean observability
🐳 Lambda Image Optimisation - Docker multi-stage builds, automated ECR deployment, lifecycle control
🧹 Legacy Cleanup Automation - Repo refactors, CI test coverage, quality gate enforcement
🛠 Tech Stack
AWS (Lambda, Glue, S3, Athena, Kinesis, SQS, ...) · Apache Spark / Iceberg / Kafka · Airflow · Docker · Python · SQL · CI/CD · TypeScript
📬 Let's connect
💼 LinkedIn