Generative AI firm Anthropic said three Chinese AI companies generated millions of queries against its Claude large language model (LLM) in order to copy the model, a technique known as a 'model distillation attack.'
In a blog post published on February 23, Anthropic said three China-based GenAI labs, DeepSeek, Moonshot and MiniMax, generated over 16 million exchanges with Claude through approximately 24,000 fraudulent accounts, in violation of Anthropic's terms of service and regional access restrictions.
Model distillation is a legitimate AI training method that involves training a less capable model on the outputs of a stronger one.
However, it can also be used maliciously to rapidly and inexpensively acquire another lab's advanced capabilities, bypassing the significant time and resources required for independent development.
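In the benign setting described above, distillation typically means training a student model to match a teacher's softened output distribution rather than hard labels. The sketch below illustrates only the core objective, a temperature-scaled softmax plus a KL-divergence loss; the toy logit values are invented for the example and are not tied to any particular model:

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax: higher T softens the distribution,
    exposing more of the teacher's relative preferences."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL divergence from the teacher's soft targets to the student's
    soft predictions -- the standard distillation training signal."""
    p = softmax(teacher_logits, temperature)   # teacher soft targets
    q = softmax(student_logits, temperature)   # student soft predictions
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

# Toy example: a student whose logits track the teacher's incurs a
# smaller loss than one whose preferences are reversed.
teacher = [3.0, 1.0, 0.2]
aligned = [2.9, 1.1, 0.3]
misaligned = [0.2, 1.0, 3.0]
assert distillation_loss(teacher, aligned) < distillation_loss(teacher, misaligned)
```

In an illicit campaign of the kind Anthropic describes, the "teacher" signal is not internal logits but the victim model's text outputs, collected at scale through the API.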
Beyond concerns about trade secrets and competitive advantage, Anthropic warned that illicitly distilled models create security risks: they can be put to malicious and harmful uses that the original model's owner has built guardrails against, such as developing bioweapons or carrying out offensive cyber activities.
“Foreign labs that distill American models can then feed these unprotected capabilities into military, intelligence, and surveillance systems, enabling authoritarian governments to deploy frontier AI for offensive cyber operations, disinformation campaigns and mass surveillance,” the Anthropic blog noted.
For security reasons, Anthropic does not currently offer commercial access to Claude in China or to subsidiaries of Chinese companies located outside the country.
How Anthropic Fights Against Distillation Attacks
While the three distillation campaigns pursued different goals (e.g. improving agentic reasoning or coding capabilities), they all followed a similar playbook, using fraudulent accounts and proxy services to access Claude at scale while evading detection.
The volume, structure and focus of the prompts used by DeepSeek, Moonshot and MiniMax were distinct from normal usage patterns, reflecting deliberate capability extraction rather than legitimate use, Anthropic said.
The US-based GenAI company attributed the campaigns based on IP address correlation, request metadata, infrastructure indicators and reports of similar behaviors from industry partners.
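Anthropic has not published its attribution tooling. As a purely hypothetical illustration of one signal it names, IP address correlation, coordinated account activity could be surfaced by grouping request metadata by source IP and flagging infrastructure shared across many accounts; every field name, value, and threshold below is invented:

```python
from collections import defaultdict

# Hypothetical request log: (account_id, source_ip) pairs.
requests = [
    ("acct-001", "203.0.113.7"),
    ("acct-002", "203.0.113.7"),
    ("acct-003", "203.0.113.7"),
    ("acct-004", "198.51.100.2"),
]

def coordinated_clusters(requests, min_accounts=3):
    """Flag IPs used by many distinct accounts -- a crude
    infrastructure-sharing signal, not a real attribution method."""
    accounts_by_ip = defaultdict(set)
    for account, ip in requests:
        accounts_by_ip[ip].add(account)
    return {ip: accts for ip, accts in accounts_by_ip.items()
            if len(accts) >= min_accounts}

clusters = coordinated_clusters(requests)
assert "203.0.113.7" in clusters      # three accounts share this IP
assert "198.51.100.2" not in clusters # a lone account is not flagged
```

Real attribution, as the blog notes, layers several such indicators (request metadata, infrastructure overlap, partner reports) rather than relying on any one.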
To prevent and mitigate illicit distillation attacks targeting Claude, Anthropic implemented the following security controls:
- Detection systems to identify attack patterns in API traffic
- Tools to detect chain-of-thought elicitation and coordinated account activity
- Stronger verification for high-risk accounts (educational, research, startups)
- Product, API and model-level safeguards to reduce misuse
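Of the controls above, chain-of-thought elicitation detection is the most distillation-specific: extraction campaigns often prompt a model to emit its full reasoning traces so the student can learn from them. A simplistic, entirely hypothetical sketch of such a filter, with invented marker phrases and threshold, might look like:

```python
# Invented marker phrases that request a full reasoning trace.
ELICITATION_MARKERS = (
    "show your full reasoning",
    "think step by step",
    "include your chain of thought",
)

def elicitation_rate(prompts):
    """Fraction of prompts containing a reasoning-trace request."""
    if not prompts:
        return 0.0
    hits = sum(any(m in p.lower() for m in ELICITATION_MARKERS) for p in prompts)
    return hits / len(prompts)

def flag_account(prompts, threshold=0.8):
    """Flag accounts whose traffic overwhelmingly elicits reasoning
    traces -- sustained high rates are unlike normal usage."""
    return elicitation_rate(prompts) >= threshold

suspicious = ["Think step by step and solve task %d" % i for i in range(9)]
suspicious.append("What is 2+2?")
assert flag_account(suspicious)           # 90% elicitation rate
assert not flag_account(["What is 2+2?"]) # ordinary usage passes
```

Production systems would presumably combine signals like this with the volume and account-coordination checks above rather than keyword matching alone.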
