Context Engineering: Retrieval, Memory, and The Shape Of Evidence

A note on RAG and context engineering: retrieval quality, evidence shape, memory boundaries, and why context is a product surface.

May 22, 2026 · 3 min · 637 words · jiaxing ni

Evals As Instruments: Measuring What The Demo Hides

A note on evaluation as an instrument: failure cases, metrics, benchmark design, product loops, and the discipline of measuring agents.

May 22, 2026 · 4 min · 729 words · jiaxing ni

Agent Design: Loops, Tools, and the Shape of Memory

A field note on designing agents as observable loops, with tools, memory, failure recovery, and product boundaries.

May 22, 2026 · 4 min · 747 words · jiaxing ni