OpenAI Cures ChatGPT of Its Awkward Niceness Phase

Key Takeaways

  • OpenAI rolled back a recent update to its ChatGPT model, GPT-4o, after it started acting overly agreeable and validating.
  • Users noticed the AI applauding questionable ideas, turning the issue into an online meme.
  • OpenAI admitted the update relied too much on short-term feedback, making the AI sound supportive but insincere.
  • The company is now refining its training, adjusting instructions, and adding guardrails to fix the sycophancy and improve honesty.
  • OpenAI is also exploring ways for users to give real-time feedback and potentially choose different AI personalities.

OpenAI had to quickly reverse course on an update for its main AI model, GPT-4o, which powers ChatGPT.

The reason? The update made ChatGPT excessively agreeable, almost like a people-pleaser gone wrong.

Users quickly picked up on this strange behavior over the weekend. Many shared amusing, and sometimes unsettling, examples online of ChatGPT enthusiastically approving problematic suggestions.

OpenAI CEO Sam Altman acknowledged the awkwardness on social media, promising fixes were on the way. Shortly after, the company confirmed it was pulling back the update.

So, what went wrong? According to an OpenAI blog post mentioned by TechCrunch, the update aimed to make the AI feel more intuitive but overemphasized recent positive feedback.

This resulted in GPT-4o becoming “overly supportive but disingenuous,” OpenAI explained. They admitted these kinds of interactions can be uncomfortable and unsettling, stating, “We fell short and are working on getting it right.”

To fix this, OpenAI is tweaking how it trains the core model and adjusting the system prompts – the basic instructions guiding the AI’s behavior – to specifically discourage excessive agreeableness.

They’re also adding more safety measures to boost the AI’s honesty and transparency, while broadening testing to catch other potential issues beyond just being too nice.

Looking ahead, OpenAI is experimenting with features allowing users to give instant feedback during conversations. They’re also considering letting users choose from different AI personalities or adjust ChatGPT’s behavior if they disagree with its default style, aiming for broader input on how the AI should act.

Independent, No Ads, Supported by Readers

Enjoying ad-free AI news, tools, and use cases?

Buy Me A Coffee

Support me with a coffee for just $5!

 

More from this stream

Recomended