New AI tool recreates faces solely through voice data

As deepfake technology becomes increasingly difficult to suss out online, from AI voices to celebrity lookalikes, a new tool has allowed researchers to recreate faces through voice recordings.

The era of deepfakes and artificial personas is steadily creeping up on us, one technological breakthrough at a time.

You may have seen uncanny TikTok accounts posting deepfake videos of celebrities such as Tom Cruise, or celebrity AI voice generators such as Uberduck. Now a new research tool developed at MIT recreates the face of a real person using nothing but their voice.

The results so far are fairly mixed, with some outputs getting ethnicity, gender, and face structure wrong, but there have been accurate samples that show promise for potential use in the future.

The algorithm is called Speech2Face and was part of a research paper first published in 2019. A demo is available online if you’re curious to check it out for yourself.

Faces seem to be recreated more accurately from longer audio clips, which shouldn’t come as much of a surprise. The model was trained on millions of videos from YouTube, learning ‘audio-visual and voice-face correlations’ from a wide range of samples.

It’s still a work in progress, of course, so it isn’t completely on point every time. The potential for a system that registers voices and identifies individuals quickly could be huge, particularly within legal systems and surveillance companies.

Researchers behind the tech are adamant that it is only for scientific purposes, but we know that larger companies (like Facebook, Google, Amazon, and a bunch more) are already very interested in advanced Metaverse programmes, Web 3.0, and harvesting user data. The ability to identify anyone quickly like this could be devastating in the wrong hands.

DIY Photography also points out that software like this could put the identities of influencers at risk, especially those who keep their faces hidden. TikTokers or YouTubers who make a deliberate effort to mask their identity could be discovered through audio snippets of their voices, from any clip they’ve ever posted.

Still, that’s likely far off in the future, as the algorithm is primitive at present. It seems we’ll have to accept a future where AI and deepfake technology blur the line between real and artificial, with misinformation likely to remain rampant and harder to stamp out.

Detecting identities through brief voice clips is simply another step along an inevitable path. Let’s just hope things don’t spiral out of control.

