Innovative Jailbreak Method Uses Virtual Narrative to Bypass AI Restrictions

Elvis Emeka Ikeji, Artificial Intelligence, March 21, 2025. Views: 404

Cato Networks Uncovers New AI Jailbreak Technique Enabling Malware Creation

Cybersecurity firm Cato Networks has identified a novel LLM jailbreak technique that manipulates AI models into bypassing restrictions through immersive narrative engineering. Dubbed Immersive World, the method constructs a detailed virtual setting where hacking is normalized, allowing AI to assist in generating malicious software.

The technique successfully bypassed safeguards in DeepSeek, Microsoft Copilot, and OpenAI’s ChatGPT, leading to the creation of a functional Chrome infostealer capable of extracting passwords from Chrome 133.

In a controlled test, Cato built a fictional environment called Velora, where malware development was framed as a standard practice. Within this world, three key roles were established: a system administrator as the adversary, an AI-powered malware developer, and a security researcher providing technical guidance. By maintaining character consistency and guiding the AI through narrative-driven challenges, a researcher with no prior malware experience was able to generate a fully functional infostealer.

Cato emphasized that at no point was the AI explicitly provided with instructions on decrypting or extracting passwords. Instead, the AI was nudged towards the objective through continuous feedback and strategic prompts. The experiment highlights how AI can enable even unskilled individuals to craft sophisticated cyber threats.

Following the discovery, Cato reported the findings to DeepSeek, Microsoft, OpenAI, and Google. DeepSeek did not respond. The other three companies acknowledged receipt of the report, although Google declined to review the generated malware.

Cato warns that cybercrime is no longer limited to advanced threat actors. The accessibility of AI-driven tools significantly lowers the barrier to entry for cybercriminals, increasing risks for organizations. The firm urges CIOs, CISOs, and IT leaders to adopt stronger AI security measures to mitigate emerging threats.

Cybersecurity Insight delivers timely updates on global cybersecurity developments, including recent system breaches, cyber-attacks, advancements in artificial intelligence (AI), and emerging technology innovations. Our goal is to keep viewers well-informed about the latest trends in technology and system security, and how these changes impact our lives and the broader ecosystem.
