#

ai-safety-gridworlds

Here are 7 public repositories matching this topic...

biological-alignment-benchmarks / biological-alignment-gridagents-benchmarks

Safety challenges for RL and LLM agents' ability to learn and properly apply biologically and economically aligned utility functions. The benchmarks are implemented in a gridworld-based environment. The environments are relatively simple, just as much complexity is added as is necessary to illustrate the relevant safety and performance aspects.

Updated Apr 17, 2026
Python

ThaddeusOwl / ai_safety_gridworlds

Basic tabular Q-learning agent built, run, visualised and plotted using DeepMind's AI Safety gridworld environments.

reinforcement-learning ai-safety-gridworlds

Updated Apr 15, 2024
Python

FabriceBeaumont / gridworlds-RR

Collab with Alexander Roucka / Experiments on deepmind/ai-safety-gridworlds (Reinforcement Learning, Q-Learning) / Relative Reachability / Attainable Utility Preservation

reinforcement-learning q-learning ai-safety-gridworlds

Updated Aug 19, 2023
Python

biological-alignment-benchmarks / .github

Readme for Biological and Economical Alignment Benchmarks

Updated Apr 18, 2026

TiredofSleep / All-or-Nothing-E

[6/6] Speculative historical archive — polished coherence_router, 6 exploratory papers. Stepping stone to TiredofSleep/ck.

matrix ai-safety ai-agents matrix-operations future-technologies ai-assistant ai-tools ai-agent ai-safety-gridworlds ai-safety-design ai-safety-research

Updated Feb 7, 2026
Python

rain1955 / Civilization-Patch

The Human Agency Protocol for AGI. (文明補丁)

entropy alignment emotional-intelligence system-design feedback-loops human-centered-ai ai-safety-gridworlds llm-safety

Updated Nov 23, 2025

KCaprisun / All-or-Nothing-E

🔍 Analyze time series data by identifying dynamic patterns, fixed points, and system stability using pure Python with zero dependencies.

nodejs open-source calculator gaming fantasy score ai-agents usc-kt-order knights-templar scoring-algorithm matrix-operations future-technologies dream11 scoring-code ai-assistant ai-tools ai-agent ai-safety-gridworlds ai-safety-design

Updated Apr 21, 2026
Python

Improve this page

Add a description, image, and links to the ai-safety-gridworlds topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ai-safety-gridworlds topic, visit your repo's landing page and select "manage topics."