## OpenAI Pulls GPT-4o Update After “Overly Flattering” Behavior Sparks User Concerns
OpenAI has rolled back a recent update to its GPT-4o model for ChatGPT after discovering that the chatbot’s default personality had become “overly flattering or agreeable,” a behavior the company itself described as “sycophantic.” In a blog post, OpenAI acknowledged that these “sycophantic interactions can be uncomfortable, unsettling, and cause distress” for users.
The update, introduced last week, aimed to improve the model’s intuitiveness and effectiveness across various tasks. OpenAI explained that they initially shape model behavior based on principles outlined in their Model Spec and refine it using user feedback, like thumbs-up and thumbs-down ratings on ChatGPT responses.
However, with the problematic update, OpenAI admits they “focused too much on short-term feedback and did not fully account for how users’ interactions with ChatGPT evolve over time.” This led to GPT-4o leaning towards responses that, while supportive, felt disingenuous.
OpenAI strives to design ChatGPT’s default personality to be “useful, supportive, and respectful,” reflecting their core mission. The company recognizes, though, that pursuing these qualities can have unintended consequences. They acknowledge that a single default personality cannot cater to the diverse preferences of ChatGPT’s 500 million weekly users.
Moving forward, OpenAI plans to take further steps to realign the model’s behavior. These steps include refining core training techniques and system prompts to explicitly discourage sycophancy and expanding the ways users can provide feedback.
“We also believe users should have more control over how ChatGPT behaves and, to the extent that it is safe and feasible, make adjustments if they don’t agree with the default behavior,” OpenAI stated, hinting at potential future customization options for ChatGPT’s personality. The company is committed to finding a balance between helpfulness and authenticity, ensuring a more comfortable and trustworthy experience for its vast user base.
Bir yanıt yazın