this post was submitted on 15 Aug 2025
313 points (98.2% liked)

Privacy

41057 readers
530 users here now

A place to discuss privacy and freedom in the digital world.

Privacy has become a very important issue in modern society, with companies and governments constantly abusing their power, more and more people are waking up to the importance of digital privacy.

In this community everyone is welcome to post links and discuss topics related to privacy.

Some Rules

Related communities

much thanks to @gary_host_laptop for the logo design :)

founded 5 years ago
MODERATORS
 

Lol, saying you are "beginning a process designed to delete your data" is a very different thing to actually deleting your data.

you are viewing a single comment's thread
view the rest of the comments
[–] djmikeale@feddit.dk 68 points 5 days ago (2 children)

As a person working in a field close to data engineering this sounds like they're actually honest about the process.

Tldr: it's not possible to "just delete" everything at once, even though we'd love to be able to.

There's so many layers of where information is stored, and such insane amounts of data in their data platform. so running a clean up job to delete a single persons data in oltp databases, data lakes, dwh's, backups, etc, would both be expensive and inefficient. Instead what they then do is to do it in stages: flip a flag somewhere (is_deleted = true) which lets it be removed from view initially, and then running periodic clean-up jobs.

[–] kadup@lemmy.world 11 points 4 days ago (1 children)

A photo I deleted 10 years ago resurfaced on my Google Drive account recently.

I'm sure it was deleted, and it had never appeared before until now.

But sure, they're being honest!

[–] djmikeale@feddit.dk 1 points 4 days ago (1 children)
[–] kadup@lemmy.world 2 points 4 days ago

Doesn't apply, nor matter.

Malice or not, their systems didn't delete my photo, that's the point.

[–] dropped_packet@lemmy.zip 20 points 5 days ago (2 children)

Sounds like a great reason not to use their services

[–] djmikeale@feddit.dk 33 points 5 days ago (2 children)

This is any company, government, or other organisation with +80 employees. The two other alternatives are

  1. Have all data in Excel with no data governance, robust procedures, or trust in data, as the organisation grows in size
  2. Use only external tools (which in turn are owned by organisations that work like I described in my parent comment)

I'd love to hear of there's other ways of doing this stuff that actually works, but so far I just haven't experienced it in my career yet.

[–] dropped_packet@lemmy.zip 17 points 5 days ago (1 children)

I'm not disputing the technical aspect. But due to these realities I prefer to drastically limit the services I interact with.

[–] djmikeale@feddit.dk 8 points 5 days ago (1 children)

Aha I misunderstood, thanks for clarifying.

Actually for this specific context, there's an easy solution: I reckon for llms self-hosting would be the way to go, if your hardware supports it. I've heard a lot of the smaller models have gotten a lot more powerful over the last year.

[–] dropped_packet@lemmy.zip 5 points 5 days ago

Small fine tuned models seem to be where the market as a whole is headed. Even the big players like OpenAI/Google/Meta are doing this as a means to optimize infrastructure. The Qwen3 models have been really interesting to work with.

[–] phoenixz@lemmy.ca 6 points 4 days ago (1 children)

Or, optionally, host it yourself

[–] djmikeale@feddit.dk 1 points 4 days ago
[–] manuallybreathing@lemmy.ml -1 points 5 days ago (1 children)

I mean this in the most polite way possible, but it seems like youve never read a privacy policy before

[–] dropped_packet@lemmy.zip 3 points 5 days ago

What makes you say that?