Technology

74900 readers

2307 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

L4s@hackingne.ws

423

OpenAI introduces Sora, its text-to-video AI model (www.theverge.com)

submitted 2 years ago* (last edited 2 years ago) by catculation@lemmy.zip to c/technology@lemmy.world

140 comments fedilink hide all child comments

https://openai.com/sora

Archive https://archive.is/V8Fv3

you are viewing a single comment's thread
view the rest of the comments

[–] sleepmode@lemmy.world 19 points 2 years ago (1 children)

After seeing the horrific stuff my demented friends have made dall-e barf out I’m excited and afraid at the same time.

[–] Carighan@lemmy.world 6 points 2 years ago (4 children)

The example videos are both impressive (insofar that they exist) and dreadful. Two-legged horses everywhere, lots of random half-human-half-horse hybrids, walls change materials constantly, etc.

It really feels like all this does is generate 60 DALL-E images per second and little else.

[–] archomrade@midwest.social 9 points 2 years ago

For the limitations visual AI tends to have, this is still better than what I've seen. Objects and subjects seem pretty stable from Frame to Frame, even if those objects are quite nightmarish

I think "will Smith eating spaghetti" was only like a year ago

[–] Natanael@slrpnk.net 4 points 2 years ago

This would work very well with a text adventure game, though. A lot of them are already set in fantasy worlds with cosmic horrors everywhere, so this would fit well to animate what's happening in the game

[–] Theharpyeagle@lemmy.world 2 points 2 years ago

I mean, it took a couple months for AI to mostly figure out that hand situation. Video is, I'd assume, a different beast, but I can't imagine it won't improve almost as fast.

[–] fidodo@lemmy.world 1 points 2 years ago

It will get better, but in the mean time you just manually tell the AI to try again or adjust your prompt. I don't get the negativity about it not being perfect right off the bat. When the magic wand tool originally came out, it had tons of jagged edges. That didn't make it useless, it just meant it did a good chunk of the work for you and you just needed to manually get it the rest of the way there. With stable diffusion if I get a bad hand you just inpaint and regenerate it again until it's fixed. If you don't get the composition you want, just generate parts of the scene, combine it in an image editor, then have it use it as a base image to generate on top of.

They're showing you the raw output to show off the capabilities of the base model. In practice you would review the output and manually fix anything that's broken. Sure you'll get people too lazy to even do that, but non lazy people will be able to do really impressive things with this even in its current state.