EvSys

Roadmap

What we release today is Phase 1: the autoresearch loop. These are the pieces we're building next to turn it into the full continual-learning system.

Continual-learning autoresearch abstractions

The autoresearch loop is the engine; the next layer is the abstractions that make continual learning autonomous. As laid out in our whitepaper, the end state is a closed loop: a deployed model produces production traces; a lightweight decision agent judges whether a batch is worth learning from and, if so, triggers autoresearch - a heavier, budgeted search for the right training recipe.

Because there is no single algorithm that enables continual learning across all data - different data demands different recipes - the autoresearch agent must autonomously run experiments, recover from training collapses, and decide which checkpoint is finally worth deploying, gated by deployment evals before anything ships. We're building these abstractions directly on top of the SDK, so continual learning graduates from a config modifier into a self-driving loop.

--remote - run the whole client in a sandbox

Today you run the SDK client locally and it drives the training backend over the network. The --remote feature will run the entire client side inside an e2b sandbox - so nothing runs on your machine. No local Python environment, no local setup: launch an experiment and the whole thing executes remotely, for maximum ease of use.

Launch jobs on your own GPUs

A job orchestrator so you can launch training jobs on your own GPUs, not only a hosted Tinker-compatible service. Bring your own compute and run the same configs against it - the SDK handles scheduling and bookkeeping, the GPUs are yours.

On this page