Mesa-optimization is an important concept in AI alignment. It can make sufficiently intelligent AI difficult or impossible to control or make safe.
Share this post
How 'safe' AI can still have problems
Share this post
Mesa-optimization is an important concept in AI alignment. It can make sufficiently intelligent AI difficult or impossible to control or make safe.