← All posts
Tagged

Infrastructure

7 posts

Platform EngineeringInfrastructure

Supply Chain Security: The Seven-Day Delay That Protects Your Production Systems

How I protect 15+ projects across Python, JavaScript, and Rust from supply chain attacks using a three-layer defence: registry-level delays, automated PR scheduling, and lockfile discipline.

1 Apr 2026 · 14 min read
Agentic AIPlatform Engineering

Agentic Ops: Working Backwards from the Metric That Matters

Start from a single business SLA — data freshness under 60 seconds — and trace backwards through dependency trees, metadata layers, known-error memory, and automated fixes to build an AI-operated production platform.

15 Mar 2026 · 14 min read
Platform EngineeringInfrastructure

Why Gatus Is My Preferred Health Check Tool (And Why Uptime Monitoring Isn't Enough)

Uptime tools tell you a service is running. Gatus tells you the data pipeline is actually working. How I use 73 custom health checks to monitor infrastructure, data freshness, and pipeline completeness.

1 Mar 2026 · 8 min read
DatabricksPython

Benchmarking PySpark shuffle: what the metrics actually tell you

Building a benchmarking utility for shuffle and network transfer metrics in Databricks clusters.

20 Feb 2026 · 7 min read
InfrastructureAWS

Tips for AWS Professional certification exams

Practical, non-technical tips for sitting the AWS Professional certification exams, covering time management, question reading strategy, and physical endurance.

23 Nov 2019 · 4 min read
Machine LearningInfrastructure

Machine learning for large organisations

A practical approach to building an organisational machine learning pipeline that supports multiple tools, PMML-based model deployment, CI/CD practices, and A/B testing.

30 May 2015 · 4 min read
Infrastructure

Open Data Platform

Personal views on the Open Data Platform (ODP) announcement and what it means for Hadoop users, BI vendors, and the major distribution companies.

19 Feb 2015 · 5 min read