env-rosetta · same Wordle env, 4 RL frameworks, 4 islo.dev sandboxes

why per-sandbox

HF Spaces is fine for static demos. Cold per-trial sandboxes are the right shape for RL training-time env hosting: they give you K parallel rollouts that mutate state independently, per-trial isolation, and the ability to SSH in to debug a hung env.
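A minimal sketch of the "K parallel rollouts, independent state" shape. Everything here is a stand-in: `TrialEnv`, `rollout`, and `run_parallel` are hypothetical names, and the in-memory env replaces what would be one sandbox per trial in a real setup. It illustrates the isolation property only, not the actual Wordle env API.

```python
from __future__ import annotations

from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass, field


@dataclass
class TrialEnv:
    """Hypothetical stand-in for one sandboxed Wordle env (not the real API)."""

    secret: str
    guesses: list[str] = field(default_factory=list)

    def step(self, guess: str) -> bool:
        # Mutates only this trial's state; other trials never see it.
        self.guesses.append(guess)
        return guess == self.secret


def rollout(trial_id: int, secret: str, policy: list[str]) -> tuple[int, int, bool]:
    # In a real setup this is where one sandbox per trial would be provisioned.
    env = TrialEnv(secret)
    solved = False
    for guess in policy:
        if env.step(guess):
            solved = True
            break
    return trial_id, len(env.guesses), solved


def run_parallel(secrets: list[str], policy: list[str]) -> list[tuple[int, int, bool]]:
    # One worker per trial; results come back in submission order.
    with ThreadPoolExecutor(max_workers=len(secrets)) as pool:
        futures = [pool.submit(rollout, i, s, policy) for i, s in enumerate(secrets)]
        return [f.result() for f in futures]
```

Because each rollout owns its env, a hung or corrupted trial affects only its own result; the others finish normally.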

provision in one line

islo use rosetta-openenv \
  --source github://adithya-s-k/RL_Envs_101 \
  -- bash -c 'cd envs/wordle_env/openenv && uv venv .venv && . .venv/bin/activate \
              && uv pip install -e . \
              && setsid -f uvicorn server.app:app --host 0.0.0.0 --port 8080'
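Since `setsid -f` backgrounds uvicorn, the command can return before the server is accepting connections. A hedged sketch of a readiness check a client could run first: `wait_for_port` is a hypothetical helper, and the host you pass depends on how the sandbox exposes port 8080, which is not specified here.

```python
import socket
import time


def wait_for_port(host: str, port: int, timeout: float = 30.0, interval: float = 0.5) -> bool:
    """Poll until a TCP connect to (host, port) succeeds or the timeout expires."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            # A successful connect means something is listening on the port.
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            # Connection refused or timed out; back off briefly and retry.
            time.sleep(interval)
    return False
```

This only confirms the port is open, not that the app behind it is healthy.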

not yet

Adithya's Jupyter agent env (real Python code-exec) hard-depends on e2b-code-interpreter. Swapping E2B for islo isn't mechanical: it means writing an IsloSandbox class that matches E2BSandbox.run_code and plumbing it through envs/jupyter_env/<framework>/e2b_sandbox.py in all 4 frameworks. That's the real "islo replaces E2B" story. It's Tier 2 and is sketched in POST.md.
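A rough sketch of the adapter shape this implies, under loud assumptions: the `run_code(code: str) -> str` signature is a guess (the real E2BSandbox.run_code may take or return something different), `CodeSandbox` and `run_cell` are hypothetical names, and no real islo API is called. The `execute` callable is a placeholder for whatever actually runs code inside an islo sandbox; `local_execute` is an in-process stand-in for testing only.

```python
from __future__ import annotations

import contextlib
import io
from typing import Callable, Protocol


class CodeSandbox(Protocol):
    """Hypothetical shared interface; the real run_code signature may differ."""

    def run_code(self, code: str) -> str: ...


class IsloSandbox:
    """Adapter sketch. `execute` is a placeholder transport, not an islo API."""

    def __init__(self, execute: Callable[[str], str]) -> None:
        self._execute = execute

    def run_code(self, code: str) -> str:
        # Delegate to the injected transport and return its captured output.
        return self._execute(code)


def local_execute(code: str) -> str:
    """Test-only transport: runs code in-process and returns its stdout."""
    buffer = io.StringIO()
    with contextlib.redirect_stdout(buffer):
        exec(code, {})
    return buffer.getvalue()


def run_cell(sandbox: CodeSandbox, code: str) -> str:
    # Framework code depends only on the interface, not on E2B or islo.
    return sandbox.run_code(code)
```

If the four framework adapters call the sandbox only through one method like this, the swap reduces to constructing a different object; the work is in matching the real return type and error behavior.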

credits

Envs & framework adapters — @adithya-s-k
Sandbox infra — islo.dev
Pattern — unity-loop · pokeloop · meta-harness-on-islo