The distributed systems lessons you only learn after production

The distributed systems lessons you only learn after productionProduction has a way of exposing every assumption you thought was safePress enter or click to view image in full sizePhoto by Lavi Perchik on UnsplashEvery distributed system looks correct on a whiteboard.The nodes talk to each other, the queue absorbs the spikes, the retries handle the blips, and the diagram has...
Show HN: Capn-hook for coding agents – don't grep the same mystery twice

Don't grep the same mystery twice. Persistent memory for coding agents. When your agent spends ten minutes figuring out where something lives in your codebase, capn saves the files that answer the question. The next session gets them back in one command instead of re-exploring — and the moment the underlying files change, the saved answer deletes itself. 77% fewer tokens on...
The New Software Lifecycle

The following article originally appeared on Addy Osmani’s blog and is being republished here with the author’s permission. I cowrote a Google whitepaper about how AI is changing the software lifecycle. I’m not going to summarize the whole thing. Instead, here are the handful of ideas in it I think actually matter, plus six figures you’re welcome to reuse. Google published “The...
S3 Won the Write Path, The Fight Is Now the Read Path

Object storage did not solve every storage problem, and not every workload belongs in an object store, but S3 and the object-storage model made the first storage decision simple enough to become the default. For most modern data systems, the answer to where the durable copy should live is now obvious. S3 provided cheap, durable, scalable storage with an operational...
Evolutionary Data Through Schemaboi: Achieving Forward, Backwards, and Sideways Compatibility

Seph Gentle delivered a closing talk at a local-first software conference, arguing that the future of interoperable applications depends on a more robust and sustainable data format. He framed the current landscape as a false choice between traditional local software, which remains confined to a single device, and cloud-based software, which often holds user data hostage within centralised services. To...
6× faster binary search: from compiled code to mechanical sympathy

6× faster binary search: from compiled code to mechanical sympathy by Itamar Turner-TrauringLast updated 11 Jul 2026, originally created 11 Jul 2026 How do you speed up computational Python code? A common, and useful, starting point is: Pick a good algorithm. Use a compiled language to write a Python extension. Maybe add parallelism so you can use multiple CPU cores. But what if you need more speed?...

Category: Blog

Microservices Granularity Tradeoffs

Published by Arnon Rotem-Gal-Oz on March 17, 2025

When architects and developers embrace microservices, one of the most challenging questions is: “How big should each service be?” While true, the obvious answer –…

Software architecture workshop (slides)

Software architecture workshop (slides)

Published by Arnon Rotem-Gal-Oz on November 29, 2023

The title says it all – These are slides from a session I was working on to explain the basics of software architecture based on…

pandas on spark apply_batch/transform_batch broken? (tl;dr; No – but it isn’t well documented)

pandas on spark apply_batch/transform_batch broken? (tl;dr; No – but it isn’t well documented)

Published by Arnon Rotem-Gal-Oz on October 16, 2022

Using pypark’s pandas integration via apply_batch and transform_batch is very powerful but lacking documentation can cause hard to trace bugs – hopefully my experience (below)…

Replacing Docker Desktop with hyperkit + minikube

Replacing Docker Desktop with hyperkit + minikube

Published by Arnon Rotem-Gal-Oz on September 2, 2021

Edit June 2023: Added a section on Colima MacOS is a Unix but it isn’t a Linux so, unfortunately, if/when we need to use linux-y…

Intro to Apache Spark (slides)

Published by Arnon Rotem-Gal-Oz on December 16, 2020

I gave a general overview of Apache Spark to our R&D teams. You can find the slides below