Skip to content
#

ai-safety-gridworlds

Here are 7 public repositories matching this topic...

Safety challenges for RL and LLM agents' ability to learn and properly apply biologically and economically aligned utility functions. The benchmarks are implemented in a gridworld-based environment. The environments are relatively simple, just as much complexity is added as is necessary to illustrate the relevant safety and performance aspects.

  • Updated Apr 17, 2026
  • Python

Improve this page

Add a description, image, and links to the ai-safety-gridworlds topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ai-safety-gridworlds topic, visit your repo's landing page and select "manage topics."

Learn more