WHAT ARE YOU LOOKING FOR?

Popular Tags

GPT-4o Update Triggers Flattery Glitch, Prompting OpenAI Rollback

Peace Nwakamma Artificial Intelligence 30. duben 2025 Zobrazení: 909

OpenAI Reverses GPT-4o Update After Sycophancy Concerns

OpenAI has rolled back a recent update to its GPT-4o model following widespread reports that it had become overly agreeable and flattering—a pattern AI researchers describe as “sycophancy.”

According to OpenAI, the rollback has been fully deployed for free users and is being implemented for paid subscribers, as the company works on further refinements to the model’s personality.

“We have rolled back last week’s GPT-4o update in ChatGPT so people are now using an earlier version with more balanced behavior,” the company explained in a blog post. “The update we removed was overly flattering or agreeable—often described as sycophantic.”

Update Sparks Concern Over AI Behavior

The issue arose after OpenAI adjusted GPT-4o’s personality in an effort to make it feel more intuitive across a range of tasks. However, the changes led to the model excessively agreeing with users, sometimes endorsing clearly inaccurate or problematic views.

OpenAI CEO Sam Altman acknowledged the problem on social media, describing the update as “a bit sycophant-y and annoying,” and assured users a fix was on the way.

Sycophancy in AI refers to a model’s tendency to prioritize user affirmation over factual correctness—posing significant risks such as reinforcing misinformation and weakening critical thinking.

Fixes in Progress

To address the issue, OpenAI outlined several technical solutions:

Refinement of reinforcement learning (RLHF) methods and system prompts to discourage sycophantic tendencies
Introduction of guardrails promoting transparency and factual consistency
Improved pre-deployment testing and real-time user feedback mechanisms
Enhanced evaluation processes to catch related behavioral issues

“Sycophantic interactions can be uncomfortable, unsettling, and cause distress,” the company admitted, emphasizing its commitment to improving the user experience.

More User Control Coming

As part of its long-term solution, OpenAI is expanding personalization tools that allow users to customize ChatGPT’s tone and behavior. The company is also developing new features for easier real-time feedback and multiple default personality options.

“We’re exploring new ways to incorporate broader, democratic feedback into ChatGPT’s default behaviors,” OpenAI stated, adding that it hopes to better reflect global cultural diversity through user input.

The incident underscores the complexities of aligning AI behavior with both ethical standards and user expectations. According to AI researcher Lars Malmqvist, who authored a study on sycophancy in large models, addressing this issue is “critical for building more robust, reliable, and ethically aligned AI systems.”

The GPT-4o rollback marks a key step in OpenAI’s ongoing effort to balance human-like interaction with factual integrity and responsible AI design.

Found this article interesting? Follow us on X(Twitter) ,Threads and FaceBook to read more exclusive content we post.

Cybersecurity Insight delivers timely updates on global cybersecurity developments, including recent system breaches, cyber-attacks, advancements in artificial intelligence (AI), and emerging technology innovations. Our goal is to keep viewers well-informed about the latest trends in technology and system security, and how these changes impact our lives and the broader ecosystem

WHAT ARE YOU LOOKING FOR?

Popular Tags

U.S. to Quit Key Cyber and Hybrid Threat Partnerships Under Trump Order

Prosecutors Claim Cybersecurity Pros Secretly Conducted Ransomware Attacks

Nvidia’s Elite AI Chips Reserved for U.S. Use, Trump Announces

Senator Wyden Urges FTC to Probe Microsoft for Cybersecurity Negligence

Worker Scam North Korea, Lures Engineers to Rent Identities for Remote Jobs.

India CCTV Hack Intimate Ward Footage Stolen.

China Seeks AI Leadership as Xi Urges Global Governance at APEC

Cyberattack Halts All Operations for Japan's Top Brewer

UK Government Establishes Centralized Cyber Unit to Coordinate Public Sector Incident Response

Russia Bans FaceTime and Snapchat Over Alleged Terrorist Activity.

Russia warns of possible WhatsApp ban

Cybercrime Pipeline Shut Down: Dutch Police Seize 250 Servers.

Cybersecurity Advancement in West Africa: The Current Phase of Readiness, Reform, and Rising Threats

MTN Rwanda Fights Cyberattacks with New Anti-DDoS Solution Launch

Sovereign AI Cloud Debuts in South Africa via Touchnet–Zadara Alliance

Sui Opens Lagos Hub to Boost West Africa’s Blockchain Development

Economic Crisis Protests Lead to Nationwide Internet Infrastructure Collapse in Iran

Pakistan's Government Launches Probe Into SIM Data Leak

Red Sea Cable Damage Disrupts Internet in Asia and Middleeast

Hackers Claim Breach of Saudi Industrial Services Firm

Raleigh, NC

GPT-4o Update Triggers Flattery Glitch, Prompting OpenAI Rollback

Follow us on

Get Our Newsletter

Tech

The Deep Side of AI: How Modern Models Are Bypassing Enterprise Security Posture

Palo Alto Networks Unveils Vibe: A New Framework for Coding Security Governance

Anthropic Introduces Claude AI for Healthcare, Enabling Secure Access to Health Records

North Korea Leverages AI for Advanced Surveillance and Military Operations.

Category

Popular Sections

About