Trustworthiness Sentinel
5.9
An AI model alignment assessment tool that analyzes AI systems for potential value inconsistencies. It autonomously identifies and flags deviations in model behavior. Addresses OpenAI's disbandment of mission alignment teams.
120h
mvp estimate
5.9
viability grade
16
views
technology stack
Python
SQLite
Medium
inspired by
OpenAI disbands mission alignment team.