Why Voice Authentication Should Not Be Used to Secure Critical Assets

We know voices can be duplicated with almost trivial effort, but until now I’d never found a good use case worth putting in the effort for to actually try it. Much to my surprise, when I contacted banking giant HSBC over their phone banking line recently, I was prompted to register for voice authentication.

I duly obliged, not because I thought this would improve my experience, but as I genuinely had not come across an implementation that tried to secure a critical asset (like your money) using voice authentication.

First off, before you run off and attempt to log into everyone’s bank account with their duplicated voice, you do need some preliminary info. You need two sets of information:

One of: Account number (with sort code), 16-digit card number or customer ID number
Your date of birth

The first identifier will be the more complex to get and would require a bit of a targeted approach or social engineering but you only need one of the three. The second piece of data is almost trivial nowadays if anyone has a social media presence, just look for the birthday cakes on their social media accounts and read the comments.

Once you have that information, you’re then presented with the voice authentication prompt. Now to register, you need to repeat the phrase “my voice is my password” four times which is enough for them to create a voice sample. That’s it.

When you authenticate, the phrase is the same. Yes, that means every single HSBC customer that has activated voice authentication uses the same authentication phrase “my voice is my password.” This makes it even easier.

Duplicating Someone’s Voice is Easy

So what does it take to duplicate someone’s voice? Well, it takes about five minutes. Find a public recording of them on any social media post. I took a recording of me from a security conference, sampled one minute’s worth of audio and then used Lovo.ai or speechify.com to clone the voice (both are free to use). Type in the text you want the cloned voice to say – again, everyone has the same phrase.

Then you need to play it back into the phone conversation, which is literally just having the MP3 file play on your phone when it asks you to authenticate your voice. If you wanted to automate this you could – for example the Twilio API, even under its free tier has the capacity for you to play specific files after specific prompts programmatically. Skype also has the capacity to inject audio files into calls using drag and drop and of course, all completely free.

But what is the potential damage done if you access phone banking. According to HSBC, you can do the following: check your balance, make payments, pay bills, transfer money, set up standing orders, update your details and block your card or report it stolen. Quite a lot then.

And what did the bank have to say about all this? I contacted their customer team explaining my concerns around voice cloning and voice authentication and this was their response:

"There is nothing worry about, I understand that you are concerned if your voice can be imitated. However this is not as easy as it seems to be. From my end I can assure that there cannot be any security breaches on your account. If you are still very concerned, you can have a word with our telephone banking on how the voice passwords work. Please contact our Telephone Banking Helpdesk on 03457 404 404 and from overseas +44 1226 261 010, the team works from Monday to Sunday between 8 am to 8 pm, who should be able to look into this for you."

Quite contradictory and not very reassuring considering I’d just cloned my own voice and accessed my account in about five minutes. Could the implementation be improved?

The most obvious is removing the default phrase that every HSBC customer uses. For example, randomly generating a set of words for someone to say to authenticate would work, but then you’d need a larger voice sample to make sure their voice can be recognized across the whole spectrum of the English language and this would frustrate users who would have to talk for a few minutes just to register.

This also wouldn’t defeat voice cloning since the cloned voices are generated using the same mechanism – longer sample equals better accuracy. Generating them on the fly also isn’t a barrier. For example, with Speechify there’s a short wait of around 10-15 seconds before whatever you’ve typed is generated in the cloned voice as audio. The prompt gives you a few tries to authenticate via voice so you have plenty of time to generate new cloned audio depending on what is being asked.

What’s the solution? Simply put, don’t use it. Even as a layer in multi-factor authentication it is so trivial to duplicate that it shouldn’t be relied upon to secure anything, especially your money.

Why Voice Authentication Should Not Be Used to Secure Critical Assets

Alex Haynes

Duplicating Someone’s Voice is Easy

You may also like

Why Voice Verification is the Future of Authentication

HSBC Joins Quantum-Secure Network

UK Banks Face Consumer Frustration Over Digital Identity Management

Improving Banking Experiences with Biometrics

My Voice is My Ultimate Password

What’s hot on Infosecurity Magazine?

macOS Vulnerability Could Expose User Data, Microsoft Warns

Internet Archive and Wayback Machine Resurrect After DDoS Wave

Microsoft Named Most Imitated Brand in Phishing Attacks

FIN7 Gang Hides Malware in AI “Deepnude” Sites

Instagram Rolls Out New Sextortion Protection Measures

Coffee Lovers Warned of New Starbucks Phishing Scam

Microsoft Named Most Imitated Brand in Phishing Attacks

Experts Play Down Significance of Chinese Quantum “Hack”

Casio Confirms Ransomware Outage and Data Breach

EU Adopts Cyber Resilience Act for Connected Devices

NIST Scraps Passwords Complexity and Mandatory Changes in New Guidelines

Ethical Hackers Embrace AI Tools Amid Rising Cyber Threats

New Cyber Regulations: What it Means for UK and EU Businesses

The Future of Fraud: Defending Against Advanced Account Attacks

Reinforcing Firewall Security: The Need to Adapt to Persistent Cyber Threats

Learn Key Strategies for Industrial Data Security

How to Proactively Remediate Rising Web Application Threats

Bouncing Back: Building Organizational Resilience in the Face of Cyber-Attack

#CyberMonth: Software Updates, A Double-Edged Sword for Cybersecurity Professionals

Russia's SVR Targets Zimbra, TeamCity Servers for Cyber Espionage

#CyberMonth: How to Outsmart Novel Phishing Tactics and Techniques

Ivanti: Three CSA Zero-Days Are Being Exploited in Attacks

#CyberMonth: How to Protect Your Digital Life, Six Ways to Stay Safe Online

31 New Ransomware Groups Join the Ecosystem in 12 Months

Why Voice Authentication Should Not Be Used to Secure Critical Assets

Written by

Duplicating Someone’s Voice is Easy

You may also like

What’s hot on Infosecurity Magazine?