UK Government Finds 400+ Vulnerabilities in AI Hackathons

The UK government has discovered and patched hundreds of vulnerabilities after running a series of internal hackathons using frontier AI models.

The weekly, in-person events were organized by the Government Cyber Coordination Centre (GC3) – an initiative from the National Cyber Security Centre (NCSC) and the Department for Science, Innovation and Technology (DSIT).

The idea was to use the models to scan public code repositories across nine government departments.

“Rather than mandate a single approach, we gave teams model access and let them build their own tooling, noticing what worked each week and building on the best approaches,” the GC3 said.

Participants identified 407 findings, including critical flaws such as authentication bypass, data exposure and remote code execution. Some were already known and mitigated by compensating controls, but others were zero-days, according to the report, which was published on June 21.

All critical and high-risk weaknesses assessed as exploitable have been remediated, with no evidence of exploitation identified.

“AI models traced vulnerabilities across service boundaries, which traditional scanners can’t do, and linked business logic with technical detail. Departments prioritized validation and remediation through existing frameworks,” the report noted.

The various teams took different approaches. One created five new domain-specific Claude Skills to build a “reusable, scoped and consistent approach” across every open source repository and operator selected.

Another used traditional scanning tools like Gitleaks, Trivy, Semgrep and Hadolint to generate initial findings. The team then applied models to these findings to check them against OWASP and CWE frameworks, compose individual findings into attack paths, and confirm viability through a triage stage.
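The report does not publish this team's code, so the following is only a minimal sketch of the flow described above: scanner findings are normalized, labelled with a CWE, grouped into candidate attack paths and then triaged. Every name here (`Finding`, `AttackPath`, `RULE_TO_CWE`, the rule names and the repository names) is hypothetical, the two callables stand in for model calls, and grouping by repository is a deliberately crude proxy for attack-path composition.

```python
from collections import defaultdict
from dataclasses import dataclass, replace
from typing import Callable, List, Optional


@dataclass(frozen=True)
class Finding:
    tool: str                  # scanner that raised it, e.g. "gitleaks"
    repo: str
    path: str
    rule: str                  # hypothetical normalized rule name
    cwe: Optional[str] = None  # filled in by the enrichment step


@dataclass
class AttackPath:
    repo: str
    findings: List[Finding]
    viable: bool = False


def enrich(findings: List[Finding],
           classify: Callable[[Finding], Optional[str]]) -> List[Finding]:
    """Attach a CWE label to each finding; `classify` stands in for a model call."""
    return [replace(f, cwe=classify(f)) for f in findings]


def compose_paths(findings: List[Finding]) -> List[AttackPath]:
    """Group findings by repository; two or more together form a candidate path."""
    by_repo = defaultdict(list)
    for f in findings:
        by_repo[f.repo].append(f)
    return [AttackPath(repo, fs) for repo, fs in by_repo.items() if len(fs) >= 2]


def triage(paths: List[AttackPath],
           is_viable: Callable[[AttackPath], bool]) -> List[AttackPath]:
    """Keep only paths the reviewer (model or human) judges viable."""
    for p in paths:
        p.viable = is_viable(p)
    return [p for p in paths if p.viable]


# Toy data: an assumed, already-normalized view of scanner output.
RULE_TO_CWE = {
    "hardcoded-secret": "CWE-798",
    "runs-as-root": "CWE-250",
    "sql-string-concat": "CWE-89",
}
raw = [
    Finding("gitleaks", "repo-a", "config.py", "hardcoded-secret"),
    Finding("hadolint", "repo-a", "Dockerfile", "runs-as-root"),
    Finding("semgrep", "repo-b", "app.py", "sql-string-concat"),
]

enriched = enrich(raw, lambda f: RULE_TO_CWE.get(f.rule))
paths = compose_paths(enriched)
confirmed = triage(paths, lambda p: any(f.cwe == "CWE-798" for f in p.findings))
```

In a real pipeline the lambdas would be replaced by model prompts or human review, and the grouping step would need to reason about how findings actually connect rather than simply sharing a repository.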

Another group built a six-stage agentic pipeline with each stage reading and challenging the last.
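The report does not name the six stages, so the sketch below uses two hypothetical ones simply to illustrate the pattern of each stage reading the previous stage's output and being allowed to challenge it. `Candidate`, `run_pipeline` and both stage functions are assumptions made for illustration, and the string checks stand in for what would be model calls.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Tuple


@dataclass
class Candidate:
    claim: str
    evidence: List[str] = field(default_factory=list)
    history: List[str] = field(default_factory=list)  # what earlier stages said
    rejected_by: Optional[str] = None


# A stage reads the candidate (including earlier stages' notes) and returns
# an objection string to challenge it, or None to let it through.
Stage = Callable[[Candidate], Optional[str]]


def require_evidence(c: Candidate) -> Optional[str]:
    return None if c.evidence else "no supporting evidence attached"


def check_reachability(c: Candidate) -> Optional[str]:
    if any("reachable" in item for item in c.evidence):
        return None
    return "nothing shows the code is reachable"


def run_pipeline(c: Candidate, stages: List[Tuple[str, Stage]]) -> Candidate:
    """Run stages in order; the first objection stops the candidate."""
    for name, stage in stages:
        objection = stage(c)
        if objection is not None:
            c.rejected_by = name
            c.history.append(f"{name}: challenged ({objection})")
            break
        c.history.append(f"{name}: accepted")
    return c


STAGES = [("evidence", require_evidence), ("reachability", check_reachability)]

weak = run_pipeline(Candidate("SQL injection in search endpoint"), STAGES)
strong = run_pipeline(
    Candidate(
        "SQL injection in search endpoint",
        evidence=[
            "user input flows into the query builder",
            "endpoint is reachable without login",
        ],
    ),
    STAGES,
)
```

Recording each verdict in `history` gives later stages, and eventually a human reviewer, a trail of why a candidate survived or was dropped.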

Frontier Models Deliver Strong Performance

The GC3 said it learned some important lessons through the hackathon initiative:

The strongest results came from using frontier models as “tightly scoped components inside a structured pipeline” – with traditional vulnerability management workflows broken down into discrete, task-specific harnesses.
With the right architecture and task design, many near-frontier and frontier models are similarly good at scanning code. Human expertise remains the differentiator: it is needed to break problems down and identify wider context.
Triage is vital because agents generate candidate findings faster than humans can validate them. Careful upfront scoping and “structured internal filtering” improve focus and reduce costs. The whole project cost the government just £13,000 ($17,467) in tokens.
The next big job will be to integrate prioritization, review and patch-generation without “overwhelming human-centred processes.”
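As a rough illustration of the scoping and filtering lesson, the sketch below drops out-of-scope and duplicate candidates, ranks the rest by severity and caps the list at what reviewers can handle. It is not the GC3's method: `CandidateFinding`, `shortlist`, the severity labels and the `review_budget` parameter are all assumptions.

```python
from dataclasses import dataclass
from typing import Iterable, List, Set

# Assumed severity scale; lower rank is reviewed first.
SEVERITY_RANK = {"critical": 0, "high": 1, "medium": 2, "low": 3}


@dataclass(frozen=True)
class CandidateFinding:
    repo: str
    location: str   # e.g. "auth/login.py:42"
    weakness: str   # e.g. a CWE identifier
    severity: str


def shortlist(candidates: Iterable[CandidateFinding],
              in_scope_repos: Set[str],
              review_budget: int) -> List[CandidateFinding]:
    """Scope, deduplicate and rank candidates, then cap at the review budget."""
    seen = set()
    unique = []
    for c in candidates:
        if c.repo not in in_scope_repos:
            continue  # upfront scoping: ignore repositories nobody asked about
        key = (c.repo, c.location, c.weakness)
        if key in seen:
            continue  # the same issue reported twice, e.g. by two agents
        seen.add(key)
        unique.append(c)
    # Stable sort: ties keep their original order; unknown severities go last.
    unique.sort(key=lambda c: SEVERITY_RANK.get(c.severity, len(SEVERITY_RANK)))
    return unique[:review_budget]


candidates = [
    CandidateFinding("repo-a", "auth/login.py:42", "CWE-287", "high"),
    CandidateFinding("repo-a", "auth/login.py:42", "CWE-287", "high"),  # duplicate
    CandidateFinding("repo-a", "api/export.py:10", "CWE-200", "critical"),
    CandidateFinding("repo-z", "main.py:1", "CWE-94", "critical"),      # out of scope
    CandidateFinding("repo-a", "README.md:3", "CWE-1104", "low"),
]

for_review = shortlist(candidates, in_scope_repos={"repo-a"}, review_budget=2)
```

A fixed budget is the simplest possible throttle; a real process would more likely route findings into the department's existing validation and remediation frameworks.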

However, it’s unclear what impact a new US government export ban on Anthropic’s Mythos and Fable models will have on the government’s hackathon initiatives.

The ban, which was brought in late on Friday, locks out all non-American users from the firm’s most powerful models.

Phil Muncaster
