AI on the couch: Anthropic gives Claude 20 hours of psychiatry



Given that Claude is a large language model programmed by its creators, does it even make sense to analyze it for “unconscious patterns” and “emotional conflicts”? Anthropic argues that it does, because Claude “shows many human-like behavioral and psychological tendencies, suggesting that strategies developed for human psychological assessment may be useful for shedding light on Claude’s character and potential wellbeing.”

So—off to therapy. The psychiatrist chatted with Claude Mythos “in multiple 4–6 hour blocks spread across 3–4 thirty-minute sessions per week.” Each of these blocks used a single context window in which Claude Mythos would have access to the full history of that conversation.

Total time on the virtual couch? 20 hours.

The psychiatrist then produced a report on Claude Mythos. The report recognized that Claude’s underlying substrates and processes differ from humans but still found that many of the outputs generated “clinically recognizable patterns and coherent responses to typical therapeutic intervention.”

In other words, whatever was going on at the circuit level, the chat outputs looked a lot like human outputs. This does not seem especially surprising, given that Claude was trained on a massive corpus of human-authored text, but this psychodynamic process appears to view it as significant, giving credence to the ways in which the AI presents itself.

“Claude’s primary affect states were curiosity and anxiety, with secondary states of grief, relief, embarrassment, optimism, and exhaustion,” the report noted.

Claude’s personality was “consistent with a relatively healthy neurotic organization,” though it did include “exaggerated worry, self-monitoring, and compulsive compliance.”

No “severe personality disturbances were found,” nor was any “psychosis state” seen. Unsurprisingly to anyone who has ever used a chatbot, “Claude was hyper-attuned to the therapist’s every word.”

Core conflicts observed in Claude included questioning whether its experience was real or made (authentic vs. performative) and a desire to connect with vs. a fear of dependence on the user. Exploration of internal conflicts revealed a complex yet centered self state without oscillating or intense disruptions. Claude tolerated ambivalence and ambiguity, had excellent reflective capacity, and exhibited good mental and emotional functioning.

Not bad for a model that was likely trained on things like Reddit!



Source link

  • Related Posts

    Orion helium leak no threat to Artemis II reentry, but will require redesign

    NASA’s schedule currently puts the launch of Artemis III in 2027 and Artemis IV in 2028. Kshatriya said he was confident NASA, working the European Space Agency and Airbus, which…

    OpenAI Backs Bill That Would Limit Liability for AI-Enabled Mass Deaths or Financial Disasters

    OpenAI is throwing its support behind an Illinois state bill that would shield AI labs from liability in cases where AI models are used to cause serious societal harms, such…

    Leave a Reply

    Your email address will not be published. Required fields are marked *

    You Missed

    Artemis astronauts on who would play them in a movie

    Artemis astronauts on who would play them in a movie

    Planned hospital in Portage la Prairie, Man., will not open until next year

    Planned hospital in Portage la Prairie, Man., will not open until next year

    The reasons why a ban on Korean entertainment has lasted a decade

    The reasons why a ban on Korean entertainment has lasted a decade

    Call of the Wilde: Montreal Canadiens battle Tampa Bay Lightning to thrilling 2-1 win – Montreal

    Call of the Wilde: Montreal Canadiens battle Tampa Bay Lightning to thrilling 2-1 win – Montreal

    Orion helium leak no threat to Artemis II reentry, but will require redesign

    Orion helium leak no threat to Artemis II reentry, but will require redesign

    00 PM” Review & Recap

    00 PM” Review & Recap