idler

Environments for frontier models.

Reinforcement learning environments that train frontier models to expert level. Every environment is graded against ground truth and grounded in real production work.

The corpus, sized by pass@k
Method · one process, from capability to graded environment
Domains · in priority order
Safety
Alignment and oversight. The first call on everything.
Defense
High-stakes capability and red-team work.
Science
Bio, pharma, and research automation.
Commerce
Agentic work grounded in real company operations.