Open Source “b3” Benchmark to Boost LLM Security for Agents

The UK AI Security Institute (AISI) has partnered with the commercial security sector on a new open source framework designed to help large language model (LLM) developers improve security posture.

The backbone breaker benchmark (b3) is a new evaluation tool created by the AISI, Check Point and Check Point subsidiary Lakera. It’s designed to help developers and model providers improve the resilience of the “backbone” LLMs which power AI agents.

“AI agents operate as a chain of stateless LLM calls – each step performing reasoning, producing output, or invoking tools,” Lakera explained in a blog post announcing the release.

“Instead of evaluating these full agent workflows end-to-end, b3 zooms in on the individual steps where the backbone LLM actually fails: the specific moments when a prompt, file, or web input triggers a malicious output. These are the pressure points attackers exploit – not the agent architecture itself, but the vulnerable LLM calls within it.”

To help developers and model providers uncover these vulnerabilities before their adversaries do, b3 uses a new technique called “threat snapshots.” These micro tests are powered by crowdsourced adversarial data from Lakera’s “Gandalf: Agent Breaker” initiative.

Specifically, b3 combines 10 representative agent “threat snapshots” with a high-quality dataset of 19,433 Gandalf adversarial attacks. Developers can then use it to see how vulnerable their model is to attacks such as system prompt exfiltration, phishing link insertion, malicious code injection, denial-of-service and unauthorized tool calls.

The b3 benchmark “makes LLM security measurable, reproducible, and comparable across models and application categories,” according to Lakera.

“B3 lets us finally see which ‘backbones’ are most resilient in a given application, and what separates strong models from those that fail under pressure,” it said.

“Along the way, the results revealed two striking patterns: models that reason step by step tend to be more secure, and open-weight models are closing the gap with closed systems faster than expected.”

A Baseline For Improving LLM Security

Mateo Rojas-Carulla, co-founder and chief scientist at Lakera, argued that today’s AI agents are only as secure as the LLMs they’re powered by.

“Threat Snapshots allow us to systematically surface vulnerabilities that have until now remained hidden in complex agent workflows,” he added.

“By making this benchmark open to the world, we hope to equip developers and model providers with a realistic way to measure, and improve, their security posture.”

Andrew Bolster, senior research & development manager (data science) at Black Duck, gave a cautious welcome to the new open source benchmark.

“This type of research is a great baseline for agentic integrators to understand the threat model around these systems,” he argued.

“But for true-scale security with AI in the mix, security leaders need to leverage both these novel prompt manipulation/benchmarking techniques, as well as battle-tested application security testing and model attestation regimes.”

Open Source “b3” Benchmark to Boost LLM Security for Agents

Phil Muncaster

A Baseline For Improving LLM Security

You may also like

#RSAC: Getting Off the Hamster Wheel of Testing

CISA’s Recognition of Security Control Validation is a Major Milestone

Security's Role in the Shift Left in Application Security

What Security Should Mean to Today's CIO's

Shifting Left on Security and Software Delivery

What’s Hot on Infosecurity Magazine?

44% Surge in App Exploits as AI Speeds Up Cyber-Attacks, IBM Finds

Shai-Hulud-Like Worm Targets Developers via npm and AI Tools

Russian Cyber Threat Actor Uses GenAI to Compromise Fortinet Firewalls

Jackpotting Surge Costs Banks Over $20m, Warns FBI

Exploitable Vulnerabilities Present in 87% of Organizations

UK's Data Watchdog Gets a Makeover to Match Growing Demands

AI-powered Cyber-Attacks Up Significantly in the Last Year, Warns CrowdStrike

New Zero-Click Flaw in Claude Desktop Extensions, Anthropic Declines Fix

Shai-Hulud-Like Worm Targets Developers via npm and AI Tools

Starkiller: New ‘Commercial-Grade’ Phishing Kit Bypasses MFA

Future-Proofing Critical Infrastructure: National Gas CTO Darren Curley on IT/OT Security Integration

Why Ransomware Remains One of Cybersecurity’s Most Persistent and Costly Threats

How To Enhance Security Operations with AI-Powered Defenses

What’s Next for AI Identity in 2026

Why Your Organisation Needs Trusted Time Synchronisation

Securing M365 Data and Identity Systems Against Modern Adversaries

Risk-Based IT Compliance: The Case for Business-Driven Cyber Risk Quantification

Revisiting CIA: Developing Your Security Strategy in the SaaS Shared Reality

The Intelligence Edge: Clarity, Context and the Human Advantage in Modern CTI

Future-Proofing Critical Infrastructure: National Gas CTO Darren Curley on IT/OT Security Integration

Hundreds of Malicious Crypto Trading Add-Ons Found in Moltbot/OpenClaw

Russian Cyber Threat Actor Uses GenAI to Compromise Fortinet Firewalls

Why Ransomware Remains One of Cybersecurity’s Most Persistent and Costly Threats

Psychology, AI and the Modern Security Program: A CISO’s Guide to Human Centric Defence

Open Source “b3” Benchmark to Boost LLM Security for Agents

Written by

A Baseline For Improving LLM Security

You may also like

What’s Hot on Infosecurity Magazine?