this post was submitted on 05 Oct 2023

90 points (88.8% liked)

Technology

34987 readers

234 users here now

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.

Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.

Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

founded 5 years ago

MODERATORS

MinutePhrase@lemmy.ml

[Survey] Can you tell which images are AI generated? (forms.gle)

submitted 1 year ago by popcar2@programming.dev to c/technology@lemmy.ml

51 comments fedilink hide all child comments

Hey everyone. I made a casual survey to see if people can tell the difference between human-made and AI generated art. Any responses would be appreciated, I'm curious to see how accurately people can tell the difference (especially those familiar with AI image generation)

top 50 comments

sorted by: hot top controversial new old

[–] davidgro@lemmy.world 21 points 1 year ago (1 children)

Got 9/20.

That was a good selection of images, quite tricky.

I'm proud of getting both the LEGO minifig ones correct.

[–] qdJzXuisAndVQb2@lemm.ee 6 points 1 year ago (1 children)

Another 9/20er here. I did feel like I was guessing a lot, it was almost satisfying to get such a midway score.

[–] cyberwolfie@lemmy.ml 2 points 1 year ago

I also got 9/20, feeling certain about only a handful, and completely thrown off by others. Since all questions were yes/no, expected score would be 10/20, so my score correctly reflects that I had no real idea what was AI-generated or not. I expect the average score to be close to 10/20, skewed somewhat higher by those who might have a keen eye for some telltale signs of AI-trickery.

[–] Noved@lemmy.ca 18 points 1 year ago (1 children)

The one that got me was definitely the fruits, I didn't realize that AI was able to generate decent text yet lol

[–] popcar2@programming.dev 11 points 1 year ago* (last edited 1 year ago) (1 children)

DALL-E 3 is the only model that gets text right. It usually yields consistent results but can still jumble on words if you ask it to say too much. It's a big step forward regardless.

AI generated photo of a cat saying "I'm king of the world!"

[–] davidgro@lemmy.world 7 points 1 year ago

That's incredible. I'm usually surprised when there's a single correct word.

[–] candyman337@sh.itjust.works 12 points 1 year ago (3 children)

I take issue with this because the devil is usually in the details with ai images and these are all low rez jpgs making it harder to tell with some of these.

[–] popcar2@programming.dev 8 points 1 year ago

Unfortunately it seems like google forms resizes the image to fit the forms. If I had known this before I would've used something else, but oh well. I've stretched the images as far as they can go now, which seems to be around 740x740.

[–] biddy@feddit.nl 2 points 1 year ago

True, but low rez web jpegs is a huge part of the market for images. AI will replace stock photos and that's incredibly disruptive on it's own.

[–] SzethFriendOfNimi@lemmy.world 1 points 1 year ago

Right? Are hands visible? Because those are a bear to get right

[–] astanix@lemmy.world 9 points 1 year ago

Well I got less than 50%, maybe I'm an AI. What is even real anymore!?

[–] Cwiiis@kbin.social 7 points 1 year ago (1 children)

10/20 but I'm a little annoyed that what looks exactly like a panel from Berserk is apparently AI generated... Feels like the training data might just be replicated in entirety there, either that or someone asked it to generate "Guts from Berserk" 😛

[–] Gullible@sh.itjust.works 2 points 1 year ago

You can always notice AI based on the random belts on armor clad figures. It’s a neat belt, but why did it begin and end at the center of their chest?

[–] reflex_aliens@lemmy.ml 7 points 1 year ago

3/20. I almost got everything perfectly backwards!

[–] ram@bookwormstory.social 7 points 1 year ago* (last edited 1 year ago)

12/20, most of my mistakes were in saying human-made work was AI generated.

[–] TheBlue22@lemmy.blahaj.zone 7 points 1 year ago (1 children)

10/20, got tricked by the horse in one of them, it looked really messed up, like something AI would make.

I guess the artist cant make horses

[–] mindbleach@sh.itjust.works 5 points 1 year ago

That artist also forgot reins. And the front wheels of the carriage are goofy. And the city is a bunch of squiggles. And there's a bizarre oversaturated smear of farmland in the distance. It's the sort of human drawing that makes people go 'did AI draw this?' because a year ago they would've just said 'this kinda sucks.'

The psychedelic waterfall one has the opposite problem, where the tree at left immediately had me go 'that's a robot,' but it is also how humans draw when they've done quite a lot of drugs. Anywhere besides a landscape it would be inexcusable. But there's every reason you might want to draw a tree, that way. An anime character eating ramen - not so much.

[–] mindbleach@sh.itjust.works 6 points 1 year ago

Some of these seem unfair because - if they're real images, they're images that resemble common errors, and if they're generated, they're examples of those errors being situational enough to look ambiguous. I can tell you what I'm looking at in each image. I can tell you where I've seen that misplaced or overused in a ton of generated images. But I can also tell you what humans tend to scribble out that might've been picked up by machines without me noticing, and I can explain some that-looks-suspect locations as mundane physical artifacts.

You could argue that's the point - demonstrating how far the technology has come in basically one year. But there's some cases where damn near anything is plausible, so long as it's locally sensible. Any close-up of a face might be from the "this person does not exist" kind of network, because with eight billion people on Earth, yeah, I'll believe that's a guy. But if you show me three pictures of the same alleged guy, I'm gonna know whether it's a real dude or a machine hallucination. Nature photos are similarly hard because nature's kinda anything-goes. Drawings, even moreso. There's not much difference between an AI going nuts on waterfalls because it has poor segmentation and a human who wanted to draw a clusterfuck of waterfalls.

Here's what I'm looking at in each image. Her thumb's too good behind the glass, even if her fingernails are a little weird and the bench seat's not quite the same color on either side. His glasses are the only thing that's a little off, especially the gray-looking hairs on only his right temple, even though both could be perspective. His everything's too smooth; if this isn't generated then someone airbrushed a photo to death. Sketchy lines going nowhere and multiple approximations of a shape had me assume human over computer, but the bench's third leg and janked-up shadow point to a computer or a shitty artist. This guy looks filtered instead of drawn, but it might just be scratched instead of drawn, and honestly his wonky hold on the book is less concerning than the other image's bench. Perspective's all fucked-up and I'm unsure why the mouse is in a bucket, but the most computery parts are the fine detail in distant waves and up-close spray, because the high frequency doesn't match the drawing style. Except the next image has detailed asymmetrical elements and some smoke in front that only makes sense locally so I assumed these were human / generated pairs and marked the boat one as more-likely human. Fine stripey detail and repetition are suspect, as mentioned, make enough sense in this context that the distant foliage is almost more concerning. Rough painting originally had me mark this as human, versus the previous image, but where fine details appear (e.g. bottom left corner) they don't make any sense for a human to have focused on. Either a person did a shit job drawing those horses and really scribbled out a city, or this is exactly the sort of disordered localized detail some models add. (Honestly the scale birds and bottom-left white scribble are the only things that look like 'sloppy human' versus 'sloppy computer.') God rays on craggy waterfalls are the hardest call because humans might also draw this geological uncertainty; I marked it as generated because the smaller fall to the right finishes plausibly but starts from nowhere. Soft glow forest mountains are a generated cliche at this point. Monotonic crisp layers are not. Only the English text and rounded speech-bubble tail are tells at this point. An ice cream cat seems like the kind of dumb shit you'd ask an AI to do, but this is a tough call: there's three different kinds of "strawberry" here and they're not bungled together, but the pay and cookie placement seem bizarre in light of the rim of the fish-cone, and the placement of the beads is either cause for criticism of a human artist or shockingly flexible for a network. Lego image one could go either way. But Lego image two has cliche composition, an impossibly detailed plastic scarf, and asymmetric nonsense prints on her legs. Cat one is painted with consistent brushstrokes on everything but the whiskers. Cat two is either a painting filter or a person drawing badly from a photo reference. Cat three is the same warm-glow cliche that's easy to do on a computer, and if a human did that with actual paint, bravo.

11/20.

Everything photographic, I nailed. You picked some some lackluster human art.

[–] muhyb@programming.dev 5 points 1 year ago

I got real life ones correct, almost all paintings too (10 was tricky) but charcoal drawings are kinda impossible to guess.

[–] ar0177417@lemmy.world 5 points 1 year ago (1 children)

10/20

[–] Zeragamba@lemmy.ca 1 points 1 year ago

same score here

[–] FooBarrington@lemmy.world 5 points 1 year ago

Cool test, thanks for putting this together! I got 8/20 - this essentially proved to me that for many cases, AI generation is not really distinguishable anymore.

[–] TonyTonyChopper@mander.xyz 4 points 1 year ago

12/20, these new systems are way too good at what they do. With drawings it's basically impossible to tell unless there are obvious mistakes. And the mugshots are spot on too.

[–] Traegs@lemmy.world 4 points 1 year ago

I got 12/20 and most of the ones I got wrong are the ones I was second guessing myself on.

[–] gullible@kbin.social 4 points 1 year ago

Detail plays a huge role in determining AI, and many of the pictures appear compressed which makes that criterion… difficult to consider. I’m not surprised that I got half right, regardless. The man on the bench really got me, why is his ankle thread-thin?

[–] ArtyTester@artemis.camp 4 points 1 year ago

I got all the “real life” pictures correct, but most of the drawings and paintings were in line with straight guessing.

There are things that stick out as “wrong” on a picture of life, but in a drawing or paining the question is “is this wrong or just what the artist decided to do?”

[–] starman2112@sh.itjust.works 3 points 1 year ago

Wow, I got exactly half of them right. I reasoned my way through four of them–I assume AI can't handle refraction or reflection very well, so I reasoned that 1, 2, and 12 were all fakes. On 17, the scarf is too detailed to be a Lego piece.

Every other guess was vibes based, and on my vibes-based guesses I got 6/16

[–] rbn@feddit.ch 3 points 1 year ago (1 children)

Questions 11, 12, and 20 we're all graded incorrectly as the correct answer contradicts the specified source.

Based on the automatic grading I got 12 out of 20. Based on the feedback / comments I got 15 correct.

I'm quite proud that within the human photographies I classified 100% correct but I guess it'll be impossible for me if the algorithms just improve a tiny little bit.

[–] popcar2@programming.dev 2 points 1 year ago

They were fixed after posting but that may be after you opened the link, answers should be good now.

[–] hulemy@ani.social 3 points 1 year ago

9/20 damnit, thought I'd be better at this

[–] PrimalHero@kbin.social 3 points 1 year ago

6/20 I am bad at this

[–] crowfx@crowfx.web.id 3 points 1 year ago

Got 7/20.

Those images are actually hard to distinguish for me. An eye opening experience.

[–] HipPriest@kbin.social 2 points 1 year ago

8/20 here. Proud of getting some right. Really shocked about the answers to some!

A tricky test indeed

[–] Codename_goose@sh.itjust.works 2 points 1 year ago

11/20 I feel slightly upset by the fact that it’s still a 50/50 coin flip for me which was which.

[–] Mr_Dr_Oink@lemmy.world 2 points 1 year ago

14/20

Cool survey.

Some of the similar ones i got the wrong way around, but i was quite happy with my answers for others where i was quite confident there were (to me) obvious indicators of AI and got them right.

[–] motor_spirit@lemmy.world 2 points 1 year ago

7/20 I think I'm supposed to seppuku or whatever now

[–] southernwolf@pawb.social 2 points 1 year ago (1 children)

14/20, nicely done! I will say, I maybe would have wanted to see more AI results than just from Dall-e 3, though given I still missed 6 of the, that speaks very highly of Dall-e 3's capabilities. But some midjourney and SDXL images would have made for a wider guessing selection too.

[–] MellowBright@lemmy.ml 3 points 1 year ago

I agree. I've only been using Stable Diffusion so I was surprised to see it's lack of presence. I feel like half to ones I got wrong were because I'm too used to SD's quirks.

Dall-e has apparently gotten really damn good lately. The talking fruit in particular. Being context aware enough to create accurate speech bubbles is crazy neat. One huge step closer to AI comic books.

[–] virku@lemmy.world 2 points 1 year ago (1 children)

Question 11 didn't have a correct answer.

[–] popcar2@programming.dev 3 points 1 year ago* (last edited 1 year ago)

Fixed, thanks for reporting.

[–] ma11en@lemmy.world 2 points 1 year ago (2 children)

13/20

load more comments (2 replies)

[–] KoboldCoterie@pawb.social 1 points 1 year ago

Well, I did alright on the people, but I got almost all of the landscapes incorrect. I'll also admit to guessing on a number of them; if I had to explain exactly what I was basing my answer on for ones I said were AI, I'd have missed 2 more, because I just couldn't see anything that looked off, it was just a hunch.

[–] NaturalViber@lemmy.world 1 points 1 year ago

I got 14/20. I am the real human that has gotten the high score. I am not a bot. Real human only.

[–] RocketBucket@lemmy.ml 1 points 1 year ago

15/20 - Nice survey, was quite tricky

[–] slazer2au@lemmy.world 1 points 1 year ago

11/20

I have used SD so I can see some of the ai tell tale signs but haven't used Dall-E at all.

[–] baggins@lemmy.ca 1 points 1 year ago

The link doesn't work.

[–] Danar@discuss.tchncs.de 1 points 1 year ago (1 children)

Questions 12 and 20 seem to have incorrect answers. The correct answer is "no", but the comment says they were created by DALLE-3

[–] popcar2@programming.dev 2 points 1 year ago

Fixed both right before seeing this comment, I'm really not awake enough for this :P

[–] Jimmycakes@lemmy.world 1 points 1 year ago

I got 13.

load more comments