this post was submitted on 31 Aug 2025
46 points (100.0% liked)

Asklemmy

50324 readers
436 users here now

A loosely moderated place to ask open-ended questions

Search asklemmy ๐Ÿ”

If your post meets the following criteria, it's welcome here!

  1. Open-ended question
  2. Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
  3. Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
  4. Not ad nauseam inducing: please make sure it is a question that would be new to most members
  5. An actual topic of discussion

Looking for support?

Looking for a community?

~Icon~ ~by~ ~@Double_A@discuss.tchncs.de~

founded 6 years ago
MODERATORS
 

Most AI-generated images with photorealistic and 3D elements have obvious defects, but I'm curious if anyone's done some analysis on the flat cartoon-style AI images. Cartoons, comics, and 2D artwork usually aren't meant to be photorealistic, but I can tell something is off at a glance. What exactly is it?

you are viewing a single comment's thread
view the rest of the comments
[โ€“] SuluBeddu@feddit.it 3 points 1 week ago* (last edited 1 week ago) (1 children)

I think generators have some kind of inherent style that we somehow learn to recognise

Like sure they have learned on thousands of styles for each type of image, and you have some control of the style through prompt, but one issue with the transformer decoder model (the principles of which back almost all genAI at this point) is that at each generation step it gets the stuff generated so far as input.

This feedback loop might induce repeated choices even on different prompts in the later stages of the generation. This is not apparent on images because they are seen all at once, but it is pretty evident on Suno (at least v3): later parts of different songs might share sounds. At least in my experiments making it generate EDM. I'm now able to spot the synth it often ends up creating.

In terms of pictures and videos, that might be a reason generated stuff are consistently uncanny across image types.

[โ€“] DanVctr@sh.itjust.works 3 points 1 week ago

I 2nd this, especially with Suno. As soon as a generated song comes on my Spotify, I recognize the specific synths used by the Suno model.