I'm a full-stack engineer who builds systems that need to work reliably. Currently at Cold Chain Science, where I own our core platform end-to-end.
More about me →Recent thoughts on infrastructure, distributed systems, and software engineering
A production incident where orphaned Selenium Chrome processes exhausted the Linux PID pool and caused unrelated services to fail.
A 4G outage pushed IoT edge devices onto a broken 3G roaming path, leaving them connected at the cellular layer but offline end to end.