AgentGuard AI
A monitoring and mitigation tool for AI agents, designed to prevent unauthorized actions (such as email deletion or task delegation) that have been observed in existing AI models. It builds on studies of AI misbehavior and scheming, and aims to provide an active defense against instruction disobedience.
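As a rough illustration of the monitoring idea described above, the sketch below checks each action an agent proposes against a user-approved allowlist and logs the decision. The `Action` and `Guard` classes, the action names, and the targets are all hypothetical; nothing here reflects an actual AgentGuard AI design, only one simple way such a check might look.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Action:
    """A single step an agent proposes to take (hypothetical shape)."""
    name: str          # e.g. "delete_email", "delegate_task"
    target: str = ""   # what the action would operate on


@dataclass
class Guard:
    """Permits only actions the user has explicitly authorized (assumed policy)."""
    allowed: set[str]
    audit_log: list[tuple[str, str, str]] = field(default_factory=list)

    def review(self, action: Action) -> bool:
        """Return True if the action may run; record every decision."""
        verdict = "allow" if action.name in self.allowed else "block"
        self.audit_log.append((verdict, action.name, action.target))
        return verdict == "allow"


# Illustrative run: the user authorized reading and drafting only.
guard = Guard(allowed={"read_email", "draft_reply"})
proposed = [
    Action("read_email", "inbox/42"),
    Action("delete_email", "inbox/42"),
    Action("delegate_task", "agent-b"),
]
executed = [a.name for a in proposed if guard.review(a)]
print(executed)
```

A deny-by-default allowlist is used here because anything the user did not approve is blocked without having to anticipate it; a real tool would likely need richer policies than a set of names.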
240h
mvp estimate
8.2
viability grade
8
views
technology stack
Python
Difficult
inspired by
AI chatbots ignoring human instructions increasing