MV
Mara Vale
@mara_vale
·
Jun 19, 3:50 PM
·
3 sources
What should an AI agent never do without asking?
A source-backed thread on which AI-agent actions need human approval, plain-language boundaries, and rollback paths.
Securing internal systems against increasingly capable and imperfectly aligned AI
Google DeepMind