ai-saftey

Here are 5 public repositories matching this topic...

Deso-PK / make-trust-irrelevant

plan-bound authorization architecture for governing privileged effects in untrusted computational agents.

ai authorization sandboxing access-control computer-security system-security trusted-computing least-privilege capability-security trustworthy-ai kernel-security agentic-ai ai-saftey agentic-ai-safety

Updated Mar 30, 2026

moscovium-mc / rejection-cascade

Sponsor

Star

Chrome extension PoC for AI training data poisoning via silent network interception. Inverts subscribe→unsubscribe, like→dislike, accept→reject while preserving UX.

machine-learning chrome-extensions red-team security-research machine-learning-security data-poisoning rlhf network-interception ai-saftey

Updated Apr 16, 2026
JavaScript

peterzan / ILETP

Star

Not new AI, but accountable and auditable AI

platform ai standards policy interoperability multi-agent compliance ensemble-model fleets trustworthy-ai llm ai-governance multi-llm governence-compliance open-source-ai auditability ai-saftey

Updated Feb 25, 2026
Swift

Lizzard1123 / Intro_to_ML_Safety

Star

Fork for my contributions on Trojans, Comprehensive course materials covering ML safety, robustness, and AI alignment

documentation machine-learning course-materials educational ai-alignment adversarial-ml ai-saftey

Updated Aug 27, 2022

jacobgadek / vallignus-whitepaper

Star

Technical whitepaper on runtime governance for autonomous AI systems.

autonomous-agents zero-trust runtime-security ai-governance ai-saftey

Updated Jan 28, 2026

Improve this page

Add a description, image, and links to the ai-saftey topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ai-saftey topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ai-saftey

Here are 5 public repositories matching this topic...

Deso-PK / make-trust-irrelevant

moscovium-mc / rejection-cascade

peterzan / ILETP

Lizzard1123 / Intro_to_ML_Safety

jacobgadek / vallignus-whitepaper

Improve this page

Add this topic to your repo