Mesa-optimization is an important concept in AI alignment. It can make sufficiently intelligent AI difficult or impossible to control or make safe.
How 'safe' AI can still have problems
Mesa-optimization is an important concept in AI alignment. It can make sufficiently intelligent AI difficult or impossible to control or make safe.