🚨 The explorer shows train and dev tasks. We encourage users not to manually explore or analyze errors on test sets.
🚧 To be Deployed
The playground allows you to explore individual task worlds in an interactive coding environment.
We are figuring out how to deploy it. Meanwhile, you can run it locally using the AppWorld CLI.
pip install appworld && appworld install &&
appworld download data && appworld play