this post was submitted on 31 Aug 2025
46 points (100.0% liked)
Asklemmy
50324 readers
436 users here now
A loosely moderated place to ask open-ended questions
Search asklemmy ๐
If your post meets the following criteria, it's welcome here!
- Open-ended question
- Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
- Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
- Not ad nauseam inducing: please make sure it is a question that would be new to most members
- An actual topic of discussion
Looking for support?
Looking for a community?
- Lemmyverse: community search
- sub.rehab: maps old subreddits to fediverse options, marks official as such
- !lemmy411@lemmy.ca: a community for finding communities
~Icon~ ~by~ ~@Double_A@discuss.tchncs.de~
founded 6 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
I think generators have some kind of inherent style that we somehow learn to recognise
Like sure they have learned on thousands of styles for each type of image, and you have some control of the style through prompt, but one issue with the transformer decoder model (the principles of which back almost all genAI at this point) is that at each generation step it gets the stuff generated so far as input.
This feedback loop might induce repeated choices even on different prompts in the later stages of the generation. This is not apparent on images because they are seen all at once, but it is pretty evident on Suno (at least v3): later parts of different songs might share sounds. At least in my experiments making it generate EDM. I'm now able to spot the synth it often ends up creating.
In terms of pictures and videos, that might be a reason generated stuff are consistently uncanny across image types.
I 2nd this, especially with Suno. As soon as a generated song comes on my Spotify, I recognize the specific synths used by the Suno model.