Journal

Writing, when there is something worth writing.

Notes from the studio on engineering, AI automation, and the boring half of software craft. Slow cadence on purpose.

AI · May 14, 2026 · 11 min read

The eval harness you’ll wish you wrote first.

An LLM workflow without an eval suite is a prototype. Here is the smallest one we ship in production — a Postgres table, a CI job, a dashboard — and why it usually catches regressions before any human notices.

By [Engineer 02 — TBD]

Want it as soon as it ships?

One email per post. No newsletter, no marketing. We write infrequently and try to make every post worth your time.