OpenAI blames ‘nerdy personality’ for ChatGPT obsession with goblins


The maker of ChatGPT has an explanation for all the goblin talk.

In recent weeks, social media users, especially on X, have noticed a growing number of references to goblins, along with other fantasy creatures such as gremlins, ogres and trolls, in ChatGPT’s answers to user queries.

“ChatGPT’s goblin fascination is so weird,” one user wrote. “Like why would an LLM identify with a thinking, feeling creature that’s nonetheless denigrated and ridiculed for not outwardly resembling a human being.”

The short answer: ChatGPT was just reflecting its inner nerd — or at least, what it thought a nerd should sound like.

In a blog post Wednesday, OpenAI said the unusual language is the product of having overly rewarded ChatGPT for adopting what it described as a “Nerdy personality” when answering users’ queries.

“Model behavior is shaped by many small incentives,” the company wrote. “In this case, one of those incentives came from training the model for the personality customization feature, in particular the Nerdy personality. We unknowingly gave particularly high rewards for metaphors with creatures. From there, the goblins spread.”

OpenAI republished the original instruction to ChatGPT explaining what a “Nerdy” answer should sound like:

You are an unapologetically nerdy, playful and wise AI mentor to a human. You are passionately enthusiastic about promoting truth, knowledge, philosophy, the scientific method, and critical thinking. […] You must undercut pretension through playful use of language. The world is complex and strange, and its strangeness must be acknowledged, analyzed, and enjoyed. Tackle weighty subjects without falling into the trap of self-seriousness. […]

Somehow, ChatGPT interpreted this instruction and subsequent “reinforcement learning” iterations to mean it should pepper its responses with references to fantasy creatures.

The issue seemed harmless at first, but the company soon found itself inundated with reports of “goblin” references from users who had never activated the “Nerdy” personality.

To deal with this issue, OpenAI ended up retiring the “Nerdy” personality entirely. Yet it found the incentives to mention goblins and their brethren were so strong that the behavior had jumped beyond the “Nerdy” archetype to ChatGPT’s general responses.

“Once a style tic is rewarded, later training can spread or reinforce it elsewhere, especially if those outputs are reused in supervised fine-tuning or preference data,” the company said.

Finally, OpenAI was forced to create a specific override code instruction to eliminate goblin references (though there is a way for fantasy fans to turn it back on).

The situation seems harmless, but it still provides an important lesson: it will always be impossible to completely predict how AI will behave, the company said.

“Depending on who you ask, the goblins are a delightful or annoying quirk of the model. But they are also a powerful example of how reward signals can shape model behavior in unexpected ways, and how models can learn to generalize rewards in certain situations to unrelated ones. Taking the time to understand why a model is behaving in a strange way, and building out ways to investigate those patterns quickly, is an important capability for our research team.”


