Google Expands Content Watermarking Tool to AI-Generated Text

Google has unveiled a new method to label text as AI-generated without altering it.

This new feature, announced on May 14, has been integrated into Google DeepMind’s SynthID tool, which was already capable of identifying AI-generated images and audio clips.

The method feeds additional information into the large language model (LLM) as it generates text, in a way that is invisible to the user.

Traditionally, an LLM generates text by predicting the most probable next piece of text, one step at a time. Text is broken down into units called ‘tokens,’ which can be single characters, words or groups of words. Each candidate next token is assigned a probability score, and tokens with higher scores are more likely to be generated.

Google’s SynthID for text uses the additional information to adjust the probability score of each candidate token at the point of generation.

The resulting pattern of scores, which combines the LLM’s original token scores with SynthID’s adjusted ones, is considered the watermark.
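To make the idea of an adjusted probability score concrete, here is a minimal Python sketch. It is not Google's algorithm, whose details the article does not give. It assumes a generic keyed-bias scheme in which a secret key and the previous token select a pseudo-random subset of candidate tokens to favour. The names `keyed_bit`, `adjusted_probs`, `delta` and the key `demo-key` are all hypothetical.

```python
import hashlib
import math

def softmax(logits):
    """Turn raw token scores into probabilities that sum to 1."""
    top = max(logits.values())
    exps = {tok: math.exp(val - top) for tok, val in logits.items()}
    total = sum(exps.values())
    return {tok: e / total for tok, e in exps.items()}

def keyed_bit(key, context, token):
    """Hypothetical keyed pseudo-random bit (0 or 1) for a (context, token) pair."""
    digest = hashlib.sha256(f"{key}|{context}|{token}".encode()).digest()
    return digest[0] & 1

def adjusted_probs(logits, key, context, delta=2.0):
    """Add `delta` to the score of every token whose keyed bit is 1 (assumed scheme)."""
    biased = {tok: val + delta * keyed_bit(key, context, tok)
              for tok, val in logits.items()}
    return softmax(biased)

# Toy scores for three candidate tokens following the word "the".
logits = {"cat": 2.0, "dog": 1.8, "fish": 0.5}
base = softmax(logits)
marked = adjusted_probs(logits, key="demo-key", context="the")

for tok in logits:
    print(f"{tok}: base={base[tok]:.3f} adjusted={marked[tok]:.3f}")
```

Someone without the key sees only ordinary-looking text, while the favoured tokens are slightly over-represented across a long output.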

“This pattern of scores is compared with the expected pattern of scores for watermarked and unwatermarked text, helping SynthID detect if an AI tool generated the text or if it might come from other sources,” Google explained in a blog post.
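The comparison Google describes can be illustrated with a toy detector. The sketch below is an assumption-laden stand-in, not SynthID's detector: it assumes a keyed scheme in which each (previous token, token) pair maps to a pseudo-random bit, and a watermarking generator prefers tokens whose bit is 1. The vocabulary, the key `demo-key` and the functions `keyed_bit`, `generate` and `z_score` are hypothetical.

```python
import hashlib
import math
import random

KEY = "demo-key"                       # hypothetical secret shared with the detector
VOCAB = [f"w{i}" for i in range(32)]   # toy vocabulary

def keyed_bit(key, context, token):
    """Hypothetical keyed pseudo-random bit (0 or 1) for a (context, token) pair."""
    digest = hashlib.sha256(f"{key}|{context}|{token}".encode()).digest()
    return digest[0] & 1

def generate(length, watermark, rng):
    """Toy generator: picks among 8 random candidates, favouring bit-1 tokens if asked."""
    tokens = ["<s>"]
    for _ in range(length):
        candidates = rng.sample(VOCAB, 8)  # stand-in for the model's top candidates
        if watermark:
            favoured = [t for t in candidates if keyed_bit(KEY, tokens[-1], t)]
            candidates = favoured or candidates
        tokens.append(candidates[0])
    return tokens

def z_score(tokens, key):
    """How far the count of bit-1 tokens sits above the 50% expected by chance."""
    n = len(tokens) - 1
    hits = sum(keyed_bit(key, prev, tok) for prev, tok in zip(tokens, tokens[1:]))
    return (hits - 0.5 * n) / math.sqrt(0.25 * n)

rng = random.Random(0)
z_marked = z_score(generate(200, True, rng), KEY)
z_plain = z_score(generate(200, False, rng), KEY)
print(f"watermarked z = {z_marked:.1f}, unwatermarked z = {z_plain:.1f}")
```

In this toy setup the highest achievable score grows with the square root of the text length, so short outputs leave little room to separate watermarked from unwatermarked text.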

The tech giant explained that although this technique isn’t designed to stop motivated adversaries like cyber attackers or hackers from causing harm, “it can make it harder to use AI-generated content for malicious purposes.”

SynthID for AI-generated text has been deployed in Gemini, Google’s AI chatbot.

Limitations of SynthID for Text

Google said this AI watermarking method is more flexible than classifier-based ones, which “often only perform well on particular tasks.”

However, the SynthID text watermarking feature also has its limitations.

For instance, it works better for longer generated texts – “like when [an LLM is] prompted to generate an essay, a theater script or variations on an email” – than for prompts asking for factual responses that imply fewer variations.

Additionally, the method performs well even when the text has been mildly transformed (e.g. cropped or partly modified), but less so when it has been significantly rewritten or translated into another language.

Finally, the tech giant recommends combining this method with other AI-generated text watermarking methods.

SynthID Expands to Watermark AI-Generated Videos

In the same blog post, Google explained that SynthID can now also watermark AI-generated videos, a feature announced at Google I/O on May 14.

Building on SynthID for AI-generated images, the technique embeds a watermark directly into the pixels of every video frame, making it imperceptible to the human eye, but detectable for identification.

Google has started using SynthID to watermark every video generated by Veo, its AI video generation tool, and published on its AI video platform VideoFX.

An LLM-based Scam Call Alert

Google also announced during its I/O conference that it was testing a real-time scam alert tool for users placing a call.

Based on Gemini Nano, Google’s lightweight LLM for on-device tasks, this new feature provides real-time alerts during a call if it detects conversation patterns commonly associated with scams.

Google added that it will release more information about this new feature later this year.

Kevin Poireault
