Redwood Research blog
Subscribe
Sign in
Share this post
Redwood Research blog
Notes on countermeasures for exploration hacking (aka sandbagging)
Copy link
Facebook
Email
Notes
More
Notes on countermeasures for exploration…
Ryan Greenblatt
Apr 4
5
Share this post
Redwood Research blog
Notes on countermeasures for exploration hacking (aka sandbagging)
Copy link
Facebook
Email
Notes
More
How can we prevent AIs from intentionally underperforming on our metrics?
Read →
Comments
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
Share this post
Notes on countermeasures for exploration…
Share this post
How can we prevent AIs from intentionally underperforming on our metrics?