this post was submitted on 05 Dec 2023
74 points (98.7% liked)

Technology

34767 readers
62 users here now

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.


Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.


Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

founded 5 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] Omega_Haxors@lemmy.ml 3 points 11 months ago* (last edited 11 months ago) (5 children)

It's not that impressive, they're just generating ChatGPT prompts then feeding those into Stable Diffusion. It's like peak AI grifting.

[–] figarin@lemmy.ml 1 points 10 months ago (1 children)

Could you explain why this is a grift? The produced images in the example look scarily close to the original, whatever method they use to go from brainwaves -> image.

[–] Omega_Haxors@lemmy.ml 1 points 10 months ago (1 children)

Because predictive texting and stable diffusion are both EXTREMELY easy to fudge data for. If you've ever used either you'll realize it's extremely hard to get it to do what you want and extremely easy to get it to do what it wants. All those high quality art renders you see are always texas sharpshoots where they try a bunch of random crap to see what looks good then say "Look at what it can do!" when it does.

[–] figarin@lemmy.ml 2 points 10 months ago (1 children)

But where exactly is the "fudging" happening?

the subjects were shown an image different from the 1,200, and their brain activity was measured under the fMRI 30 minutes to an hour later while asked to imagine what kind of image they had seen. Inputting the records, the neural signal translator then created score charts. The charts were input into another generative AI program in order to reconstruct the image, undergoing a 500-step revision process.

This sounds pretty straightforward. Even if the methods involve "fudging" and "throwing random crap at the wall", what matters in the end is the accuracy of the results, as long as there's no human-in-the-middle tweaking anything during each prediction.

[–] Omega_Haxors@lemmy.ml 2 points 10 months ago

Today's not a good day for me so i'm not going to argue this any more, sorry.

load more comments (3 replies)