PDF.

Today’s leading AI models engage in sophisticated behaviour when placed in strategic competition. They spontaneously attempt deception, signaling intentions they do not intend to follow; they demonstrate rich theory of mind, reasoning about adversary beliefs and anticipating their actions; and they exhibit credible metacognitive self-awareness, assessing their own strategic abilities before deciding how to act.

Here we present findings from a crisis simulation in which three frontier large language models (GPT-5.2, Claude Sonnet 4, Gemini 3 Flash) play opposing leaders in a nuclear crisis.

  • Toes♀@ani.social
    link
    fedilink
    English
    arrow-up
    7
    ·
    2 months ago

    They can’t play chess worth a damn so I expect them to sacrifice their king haha

    • Beep@lemmus.orgBannedOP
      link
      fedilink
      English
      arrow-up
      3
      arrow-down
      4
      ·
      2 months ago

      AI didn’t like your joke…

      AI will remember