this post was submitted on 26 Oct 2024

1246 points (99.3% liked)

When corporations scrape academic papers, it's justified. When individuals do it, it's inexcusable. (lemmy.ml)

submitted 11 months ago by TheImpressiveX@lemmy.ml to c/piracy@lemmy.dbzer0.com

[–] EmbarrassedDrum@lemmy.dbzer0.com 33 points 11 months ago (1 children)

and in due time, we'll hack OpenAI and get the sources from the chat module..

I've seen a few glitches before that made ChatGPT just drop entire articles in varying languages.

[–] FaceDeer@fedia.io 24 points 11 months ago (1 children)

AI models don't actually contain the text they were trained on, except in rare cases where they've been overfit on a particular text. That's considered a training error, and a lot of work has gone into preventing it; it usually happens when a great many identical copies of the same data appear in the training set. An AI model is far too small to hold its training data: there's no way that much text can be compressed that far.
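
To put rough numbers on the size argument, here is a back-of-envelope sketch. Every figure in it is a made-up, illustrative assumption (parameter count, bytes per parameter, token count, bytes per token), not a measurement of any real model, and `required_compression_ratio` is a hypothetical helper written just for this comment.

```python
def required_compression_ratio(n_params, bytes_per_param, n_tokens, bytes_per_token):
    """Return how many times the training text would have to shrink
    to fit entirely inside the model's weights."""
    model_bytes = n_params * bytes_per_param
    corpus_bytes = n_tokens * bytes_per_token
    return corpus_bytes / model_bytes


# ASSUMED, illustrative values only (not taken from any real model):
N_PARAMS = 7_000_000_000        # 7 billion weights
BYTES_PER_PARAM = 2             # 16-bit weights
N_TOKENS = 2_000_000_000_000    # 2 trillion training tokens
BYTES_PER_TOKEN = 4             # rough average bytes of text per token

ratio = required_compression_ratio(
    N_PARAMS, BYTES_PER_PARAM, N_TOKENS, BYTES_PER_TOKEN
)
print(f"Weights would need to compress the corpus about {ratio:.0f}x")
```

Under these assumed numbers the weights would have to store the corpus at a ratio of several hundred to one, which is far beyond what general-purpose lossless text compressors typically manage. Swap in your own figures; the conclusion only changes if the corpus is much smaller or the model much larger.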

[–] EmbarrassedDrum@lemmy.dbzer0.com 8 points 11 months ago

thanks! that actually makes a lot of sense.

welp guess I was wrong. so back to .edu scraping!