OpenAI’s new Sora model can generate minute-long videos from text prompts

OpenAI on Thursday announced Sora, a brand new model that generates high-definition videos up to one minute in length from text prompts. Sora, which means “sky” in Japanese, won’t be available to the general public any time soon. Instead, OpenAI is making it available to a small group of academics and researchers who will assess harm and its potential for misuse.

“Sora is able to generate complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background,” the company said on its website. “The model understands not only what the user has asked for in the prompt, but also how those things exist in the physical world.”

One of the videos generated by Sora that OpenAI shared on its website shows a couple walking through a snowy Tokyo city as cherry blossom petals and snowflakes blow around them.

Another shows realistic-looking wooly mammoths walking through a snowy meadow against a backdrop of snow-clad mountain ranges.

Prompt: “Several giant wooly mammoths approach treading through a snowy meadow, their long wooly fur lightly blows in the wind as they walk, snow covered trees and dramatic snow capped mountains in the distance, mid afternoon light with wispy clouds and a sun high in the distance… pic.twitter.com/Um5CWI18nS

— OpenAI (@OpenAI) February 15, 2024

OpenAI says that the model works as a result of “deep understanding of language,” which lets it interpret text prompts accurately. Still, like basically all AI image- and video-generators we’ve seen, Sora isn’t perfect. In one of the examples, the prompt, which asks for a video of a Dalmatian looking through a window and people “walking and cycling along the canal streets,” omits the people and the streets in the video entirely. OpenAI also warns that the model can struggle to understand cause and effect — it can generate a video of a person eating a cookie, for instance, but the cookie may not have bite marks.

Sora isn’t the first text-to-video model around. Other companies including Meta, Google and Runway, have either teased text-to-video tools or made them available to the public. Still, no other tool is currently able to generate videos as long as 60 seconds. Sora also generates entire videos at once, instead of putting them together frame-by-frame like other models, which makes sure that subjects in the video stay the same even when they go out of view temporarily.

The rise of text-to-video tools has sparked concerns over their potential to more easily create realistic-looking fake footage. “I am absolutely terrified that this kind of thing will sway a narrowly contested election,” Oren Etzioni, a professor at the University of Washington who specializes in artificial intelligence, and the founder of True Media, an organization that works to identify disinformation in political campaigns, told The New York Times. And generative AI more broadly has sparked backlash from artists and creative professionals concerned about the technology being used to replace jobs.

OpenAI’s new Sora model can generate minute-long...

Cooler Master MasterBox Q300L Micro-ATX Tower with...

ASUS TUF Gaming GT301 ZAKU II Edition ATX mid-Towe...

ASUS TUF Gaming GT501 Mid-Tower Computer Case for ...

be quiet! Pure Base 500DX Black, Mid Tower ATX cas...

ASUS ROG Strix Helios GX601 White Edition RGB Mid-...

Corsair 5000D Airflow Tempered Glass Mid-Tower ATX...

CORSAIR 7000D AIRFLOW Full-Tower ATX PC Case, Blac...

Bgears b-Voguish Gaming PC with Tempered Glass ATX...

Phanteks (PH-EC360ATG_DWT01) Eclipse P360A Ultra-f...

Corsair iCUE 4000X RGB Mid-Tower ATX PC Case ̵...

Asian Salmon Salad – Barefeet in the Kitchen

Garlic Chicken – Spend With Pennies

EASY BAKED MISSISSIPPI PORK CHOPS

Weekly Meal Plan Apr 21, 2025

Leave a reply Cancel reply

Compare items

Shopping cart