Blog

Research notes, technical essays, and dispatches from the frontier.

Latest

March 2026 Evaluation 8 min read

Rethinking AI Evaluation: Beyond Static Benchmarks

Why the next generation of evaluation must be dynamic, adversarial, and grounded in real-world complexity. We explore the tension between reproducibility and ecological validity.

February 2026 Agents 5 min read

Building Agents That Learn from Practice

How iterative self-improvement and structured practice unlock new levels of agent capability.

January 2026 Thesis 6 min read

The Practice Hypothesis

Our founding thesis: deliberate practice isn't just for humans. It may be the key to machine intelligence.