Discussion about this post

Steeven

Why do the main tasks need to be difficult? I’d expect side tasks to be much easier to complete if the main tasks are actually really easy and the model can spend more time on a side task without seeming suspicious. Is it that you expect this to only matter for frontier research?

Cyberneticist

Great stuff. Keep up the good work.
