A surge in model performance has reshaped OpenAI’s internal planning, the company revealed on Wednesday.
According to a new report, capability assessments based on capture the flag (CTF) challenges show scores climbing from 27% for GPT-5 in August 2025 to 76% for GPT-5.1-Codex-Max in November 2025.
OpenAI has warned that some upcoming systems may reach “High” capability levels on its Preparedness Framework, meaning they could eventually assist with tasks ranging from complex intrusion operations to the development of zero-day exploits.
Jon Abbott, co-founder and CEO of ThreatAware, said the warning underscores the need to focus on basic protections.
“OpenAI’s warning that new models pose ‘high’ cybersecurity risks is exactly why getting the security foundations right is absolutely critical. AI might be accelerating the pace of attacks, but our best defense will continue to be nailing the fundamentals first.”
The company also said it is preparing for that possibility by developing layers of safeguards intended to channel advanced capabilities toward defensive outcomes. OpenAI added that its main goal is to strengthen the position of security teams that remain outnumbered and under-resourced.
Strengthening Industry-Wide Understanding
To manage the dual-use risks inherent in cyber workflows, the company outlined a defense-in-depth strategy built on several components:
- Access controls, infrastructure hardening, egress controls and monitoring
- Training that steers models away from harmful requests while maintaining usefulness for education and defense
- System-wide detection tools that can block or reroute unsafe activity
- End-to-end red teaming by external specialists
“These safeguards are designed to evolve with the threat landscape,” the company said.
Abbott noted that rising capability makes long-standing threats more dangerous.
“Old-school threats, when combined with the scale and precision enabled by AI, make for a particularly toxic combination,” he explained.
“With models that can develop working zero-day remote exploits or assist with complex, stealthy intrusions, the barrier to entry for criminals has been dramatically lowered.”
OpenAI said it is coordinating with global experts to improve real-world applications of defensive AI and is preparing a trusted access program for qualifying users.
Another effort, Aardvark, an agentic security researcher, is already in private beta. It scans codebases, identifies vulnerabilities and proposes patches, and has already uncovered new CVEs in open-source projects.
OpenAI said it will also launch a Frontier Risk Council to advise on responsible capability use, with further collaboration through the Frontier Model Forum aimed at refining shared threat models and improving ecosystem-wide mitigation strategies.
Image credit: Prathmesh T / Shutterstock.com
