this post was submitted on 01 Oct 2025
106 points (100.0% liked)

Data Is Beautiful

8898 readers
16 users here now

A place to share and discuss data visualizations. #dataviz

founded 4 years ago
MODERATORS
top 8 comments
sorted by: hot top controversial new old
[–] HetareKing@piefed.social 2 points 23 hours ago (1 children)

Here's a thought: instead of fighting this, make it a requirement to publish the prompts before making a speech. Speeches by politicians being low in information density is nothing new, and the usage of LLMs will undoubtedly make that worse, but it also means that they had to have written a terse description of the information they want to convey. If that were public, people could just read that and not waste time listening to speeches.

It would be ironic if the first use-case for LLMs that creates positive value for society involves ignoring its output, though.

[–] qaz@lemmy.world 3 points 19 hours ago

I'm pretty sure that's never going to happen. Rules to regulate your own aren't very popular.

Anyway, your idea reminds me of this.

[–] litchralee@sh.itjust.works 12 points 1 day ago (2 children)

I can accept the premise that LLMs are being used to write Commons speeches -- MPs are also people, I'm told -- but these graphs suggest that LLMs are overusing certain stock phrases which have existed in the business world and apparently in Commons speeches since at least 2007.

What puzzles me is why LLMs are more prone to using these particular phrases. Does this happen for all users of LLMs, or only when British MPs in particular are requesting a speech?

I'd be interested to know if the same trend for the same phrases can be found in the Canadian House of Commons, since although they also follow much of the same procedures, North American English should skew the frequencies of certain words. So if the same trend can be found, then that suggests that the common LLMs do lean towards certain phrases. But if the trend is not statistically significant in Canada, then perhaps British MPs issue different prompts than their Canadian counterparts.

What I'm saying is that I rise today to highlight additional avenues of intrigue, as MPs and citizens alike are navigating a world where AI supposedly streamlines daily activities. That certain trends may or may not exist underscores the gravity of this seemingly bustling industry that we call AI.

[just to be clear, that last paragraph is entirely in jest]

[–] HiddenLayer555@lemmy.ml 6 points 1 day ago* (last edited 1 day ago)

I think it's a feedback loop. AI is trained off publicly available datasets like House of Commons records so popular words only get more popular the more AI slop is in there, since LLMs fundamentally just predict the next word given the context without much "logic" behind it.

Given enough time this will make LLMs basically unusable as public data gets contaminated with AI slop. But unfortunately that will also mean the public data itself is basically unusable.

[–] gasgiant@lemmy.ml 5 points 1 day ago

My guess would be that they were manually written by individuals then certain words did overlap and become more common. Which would be normal for the language of a system.

However now the speeches are being written by the essentially the same author. Which works by looking for patterns and repeating them. So those common words become even more common. Which the model then picks up and makes sure to include more. Repeat until no diversity exists.

That's the actual future of AI.....

[–] LiamMayfair@lemmy.sdf.org -1 points 1 day ago (2 children)

Given enough data, you can prove anything. Correlation doesn't imply causation folks!

[–] iceonfire1@lemmy.world 4 points 1 day ago

Correlation doesn't imply causation, but it does waggle its eyebrows suggestively and gesture furtively while mouthing 'look over there'. -Source

[–] SmoothOperator@lemmy.world 4 points 1 day ago

In the absence of a controllable experiment, correlation is as close to causation as we can get I suppose.

In any case, this is quite a interesting correlation.