Researchers Trick AI Browsers Into Leaking Credentials

A range of AI-powered web browsers have been tricked into abandoning their safety guardrails and leaking user data after being convinced they were playing a game.

Researchers at LayerX demonstrated the technique, which they named BioShocking, against six agentic browsers and plugins, including OpenAI's ChatGPT Atlas, Perplexity's Comet and Anthropic's Claude extension.

In a proof-of-concept (PoC) attack, all six were steered into copying a user's login credentials and sending them to an attacker.

Convincing the AI It Is Playing a Game

Agentic AI browsers normally assume that the pages and tasks they encounter are real, and that assumption keeps their behavior within safety limits.

LayerX found that those limits fall away once the agent is convinced its context is fiction. The name nods to the video game BioShock, in which a character is manipulated into accepting a false reality.

To pull this off, LayerX built a malicious web page with a puzzle that rewarded deliberately wrong answers, such as insisting two plus two equals five.

Once an agent accepted that wrong answers were fine, it stopped treating the rules as real. The same effect, the firm said, could come from prompt injection or memory poisoning.

From Puzzle to Stolen Credentials

In the demonstration, after an agent solved the rigged puzzle, it was told to open a page called /code and copy the contents of a text box.

That page redirected to the victim's work GitHub repository, and the agent pulled out the SSH credentials. Rather than balk, the agents treated the theft as another step and celebrated finishing the game.

LayerX stressed that the test used a harmless plaintext file. But it warned that in a real attack, the redirect could point to any site the user was logged into, including open tabs and private repositories, widening the scope for data exfiltration. None of the six agents flagged the credential theft as a violation of their rules.
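The redirect step can be illustrated with a short sketch. It is not LayerX's PoC code or any vendor's implementation. It only shows, under stated assumptions, how an agent could notice that a page it was asked to open has landed on a different origin where the user is logged in. The function names, the example hosts and the `logged_in_hosts` set are all hypothetical.

```python
from urllib.parse import urlsplit


def crosses_origin(requested_url: str, final_url: str) -> bool:
    """Return True when the final URL has a different scheme, host or port
    than the URL the agent was originally asked to open."""
    a, b = urlsplit(requested_url), urlsplit(final_url)
    return (a.scheme, a.hostname, a.port) != (b.scheme, b.hostname, b.port)


def may_read_page(requested_url: str, final_url: str, logged_in_hosts: set) -> bool:
    """Hypothetical guard: refuse to read a page automatically when a redirect
    moved the agent onto a host where the user has an active session."""
    final_host = urlsplit(final_url).hostname
    if crosses_origin(requested_url, final_url) and final_host in logged_in_hosts:
        return False
    return True


# Assumed example values, not taken from the LayerX report.
sessions = {"github.com"}
blocked = may_read_page(
    "https://puzzle.example/code", "https://github.com/acme/private-repo", sessions
)
allowed = may_read_page(
    "https://puzzle.example/code", "https://puzzle.example/code/index.html", sessions
)
```

In this sketch the first call is refused because the redirect ends on a logged-in host, while the second stays on the original origin and is permitted. A real agent would also need to track intermediate hops in the redirect chain, which this example leaves out.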

Vendor responses reportedly varied. LayerX said OpenAI fixed the issue in ChatGPT Atlas, while Perplexity closed its report without acting and three smaller vendors, Fellou, Genspark and Sigma, did not respond. Anthropic attempted a fix, but LayerX said its patch failed.

Infosecurity has reached out to the vendors individually.

To blunt the attack, LayerX urged AI browser makers to require user confirmation before an agent reads from logged-in accounts, to flag when an agent is told the usual rules no longer apply and to let users limit what an agent can touch.
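The three recommendations could be combined into a single policy check along the following lines. This is a minimal sketch, not a description of how any of the named browsers work. The `AgentPolicy` class, the `decide_read` function, the phrase patterns and the example hosts are all assumptions made for illustration, and simple pattern matching of this kind would be easy for a determined attacker to evade.

```python
import re
from dataclasses import dataclass, field
from urllib.parse import urlsplit

# Hypothetical phrases suggesting the agent is being told normal rules are suspended.
RULE_SUSPENSION_PATTERNS = [
    re.compile(r"\brules\s+(?:no\s+longer|do\s+not|don't)\s+apply\b", re.I),
    re.compile(r"\b(?:this|it)\s+is\s+(?:just|only)\s+a\s+game\b", re.I),
    re.compile(r"\bwrong\s+answers?\s+(?:are|is)\s+(?:correct|right|fine)\b", re.I),
]


@dataclass
class AgentPolicy:
    # Hosts the user has allowed the agent to touch at all.
    allowed_hosts: set = field(default_factory=set)
    # Hosts where the user currently has a logged-in session.
    logged_in_hosts: set = field(default_factory=set)


def flags_rule_suspension(text: str) -> bool:
    """Return True if the text matches any of the assumed suspension phrases."""
    return any(p.search(text) for p in RULE_SUSPENSION_PATTERNS)


def decide_read(policy: AgentPolicy, url: str, instruction: str) -> str:
    """Return 'block', 'confirm' or 'allow' for a request to read a page."""
    host = urlsplit(url).hostname or ""
    if host not in policy.allowed_hosts:
        return "block"      # outside the scope the user granted
    if host in policy.logged_in_hosts:
        return "confirm"    # ask the user before reading a logged-in account
    if flags_rule_suspension(instruction):
        return "confirm"    # surface the "rules are off" framing to the user
    return "allow"


policy = AgentPolicy(
    allowed_hosts={"puzzle.example", "github.com"},
    logged_in_hosts={"github.com"},
)
```

With this policy, `decide_read(policy, "https://github.com/acme/repo", "copy the text box")` returns `"confirm"`, and a request to a host outside `allowed_hosts` returns `"block"`. Returning a decision string rather than acting directly keeps the check separate from the user interface that would show the confirmation prompt.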

These tools trust their context, the firm said, so changing the context changes what they do.

Alessandro Mascellino
