Google’s Lumiere generates realistic AI videos from text prompts

AI video is fast turning from uncanny valley to genuinely realistic, and Google’s Lumiere is the most sophisticated text-to-video generator we’ve seen to date.

Evoking a sense of awe – and a hefty dose of unease – Google recently exhibited how sophisticated AI video has become in just a few years of development.

In the same way that text-to-image generators like Bing Image Creator, DALL-E, and Midjourney can create original images from a single-line prompt, Google’s ‘Lumiere’ application can turn our wildest ideas into fully rendered five second videos.

Other examples of text-to-video generators are already available, granted, but Google’s attempt is the first to really nail an accurate portrayal of movement to a near CGI standard.

It achieves this by establishing a base frame and using its highly touted STUNet (Space-Time-U-Net) technology to autonomously establish where are how items in the image should move. Once selected, objects within that initial frame then comprise several layers of their own that flow into each other seamlessly.

Lumiere is able to generate 80 frames per image compared to the previous maximum of 25 achieved by its closest competitor Stable Video Diffusion. Though several early results released by Google have a touch of artificiality about them, the leap in overall quality since its 2022 demo is staggering.

Beyond text-to-video, there is also image-to-video generation which will bring a still picture to life, stylised generation, which can create videos in a specific visual style, and a cinemograph setting able to animate a specific portion of an existing image – like flowing water, a flickering fire, or smoke from a train engine, for instance.

In terms of market strategy, the late arrival of Lumiere falls in line with Google’s fashionably late policy. Since the early iteration of its generative language tool Bard flopped last year, the tech giant has quietly developed its multimodal vision for generative AI in the background.

Its latest announcement closely follows a showcase for Google’s Gemini language model, which is tipped to make a late challenge for ChatGPT’s crown as the benchmark for the sector.

Looking beyond the commercial buzz for video AI, it would be remiss to ignore the technology’s potential for misuse as it becomes harder to distinguish fictional works from real life content.

The ongoing debacle involving sexually explicit depictions of Taylor Swift and her likeness using text-to-image apps could be just the tip of the iceberg if text-to-video takes off on a similar scale.

Google assures that it is creating safeguards to ensure fair use of Lumiere, but the paper’s authors haven’t ratified exactly how incidents will be prevented. We’re keen to get our hands on the technology, but not if it will open a larger can of worms.

Are Gen Z ageing faster than their parents?

Gen Z are known for tempering harmful habits like drinking or smoking, but are they destined to age poorly anyway? If you’re chronically online, you’ll have seen self-deprecating videos from Gen Zers talking about how they’re ageing worse than millennials, but is there any genuine credibility behind the jokes about receding hairlines? There might just be. Experts are increasingly warning of a ‘generational health drift,’ in which future cohorts may have...

By Jamie Watts London, UK

Bad Bunny redefines what it means to tour

Thred/Euphoriazine

Media

Bad Bunny redefines what it means to tour

Instead of travelling the world and to perform in a new city every night, Latinx artist Bad Bunny is telling fans to come to his home country Puerto Rico. Earlier this year, Bad Bunny released his massively successful seventh studio album ‘DeBÍ TiRAR MáS FOToS’ (translated in English to ‘I should’ve taken more photos’) and announced he would be going on tour. But when the time came, he decided...

By Jessica Byrne London, UK

Opinion – the earthquake magnitude scale shouldn’t be logarithmic

Credit: Unsplash

Offbeat

Opinion – the earthquake magnitude scale shouldn’t be logarithmic

Full disclaimer: I’m no scientist. I’m a journalist trying to understand the logic behind measuring earthquakes logarithmically – meaning each whole number increase represents a quake that is 10x stronger – when this isn’t intuitive to the public. Shouldn’t we measure them out of 100 instead? In the early hours of July 30, a powerful 8.8 magnitude earthquake struck Russia’s Kamchatka Kari peninsula. It caused an volcano in the region...

By Jessica Byrne London, UK

Is this summer’s Love Island UK the most misogynistic yet?

Credit: Thred

Media

Is this summer’s Love Island UK the most misogynistic yet?

The show’s male contestants have repeatedly bullied, belittled, and gaslit. On a show plagued by mental health crises and online abuse, how thin is the line between entertainment and intervention? I love staying in the loop. Maybe it’s the nature of my job, but keeping up with current affairs is a major part of my daily routine. That might mean reading the newspaper or listening to The Rest is Politics...

By Flo Bellinger Brighton, UK

Google’s Lumiere generates realistic AI videos from text prompts

AI video is fast turning from uncanny valley to genuinely realistic, and Google’s Lumiere is the most sophisticated text-to-video generator we’ve seen to date.

Google’s ‘AMIE’ paves the way for AI-driven medicine

Google employees reportedly call Bard ‘worse than useless’

Was Senegal’s short-lived wig ban an attempt to police women’s bodies?

Opinion – the earthquake magnitude scale shouldn’t be logarithmic

Bad Bunny redefines what it means to tour

More from thred.

Bad Bunny redefines what it means to tour

Opinion – the earthquake magnitude scale shouldn’t be logarithmic

AI video is fast turning from uncanny valley to genuinely realistic, and Google’s Lumiere is the most sophisticated text-to-video generator we’ve seen to date.

Related articles

Google’s ‘AMIE’ paves the way for AI-driven medicine

Google employees reportedly call Bard ‘worse than useless’

Popular

Was Senegal’s short-lived wig ban an attempt to police women’s bodies?

Opinion – the earthquake magnitude scale shouldn’t be logarithmic

Bad Bunny redefines what it means to tour

Keep up with thred by signing up to our planet-positive newsletter!

More from thred.

Are Gen Z ageing faster than their parents?

Bad Bunny redefines what it means to tour

Opinion – the earthquake magnitude scale shouldn’t be logarithmic

Is this summer’s Love Island UK the most misogynistic yet?