Writing · Cuecoder

Writing · 6 essays

Field notes from shipping AI in production.

Long essays from the Cuecoder lab — only when there's something worth saying. No SEO bait. No recycled announcements.

2026-05-14

Evals are the product. Everything else is a side effect.

After two years shipping LLM features in production, the only artifact worth trusting is a well-designed eval suite. Here is how Cuecoder structures them.

11 min

evals infra

2026-04-29

Stop tuning prompts. Start tuning context.

Prompt engineering plateaus fast. The real lever is what you put in front of the model — retrieval, structured tools, and dynamic memory.

8 min

retrieval agents

2026-04-11

Why the gateway got rewritten in Rust

Two years of Python latency tax, a runaway autoscaler, and a 3am incident later — the case for moving the hot path off the GIL.

14 min

rust infra

2026-03-22

Building agents without frameworks

LangGraph, Crew, AutoGen: all make good demos, but none shipped what production needed. Here's the 200-line loop quietly serving production traffic.

12 min

agents

2026-03-04

Building a SaaS in public is a forcing function for taste

Public commits, public revenue, public incident reports. Why exposing the work changes what gets shipped.

6 min

build-in-public

2026-02-20

The 1M context window is a trap (for now)

Bigger context windows are exciting, but useless without a retrieval strategy. A pragmatic guide to choosing what goes in.

9 min

retrieval