this post was submitted on 01 Sep 2025
611 points (97.5% liked)
Fediverse
36627 readers
186 users here now
A community to talk about the Fediverse and all it's related services using ActivityPub (Mastodon, Lemmy, KBin, etc).
If you wanted to get help with moderating your own community then head over to !moderators@lemmy.world!
Rules
- Posts must be on topic.
- Be respectful of others.
- Cite the sources used for graphs and other statistics.
- Follow the general Lemmy.world rules.
Learn more at these websites: Join The Fediverse Wiki, Fediverse.info, Wikipedia Page, The Federation Info (Stats), FediDB (Stats), Sub Rehab (Reddit Migration)
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
In the blog post you linked, neither the author or myself came to your conclusion:
The blog post also says this:
My comment should have been clearer; what I meant when i said it is more "decentralized architecturally" I was referring to the data model part of the architecture as opposed to the physical server infrastructure currently operating it. The latter is obviously quite centralized still, but the former is designed for resilience against nodes unexpectedly (and permanently) failing.
Okay yes this makes sense. Although, honestly i think I'd prefer the AP method of doing it because BlueSky sends ALL content to all nodes, so it's MUCH less cost effective to join with a private server.
I run my own lemmy instance, so i know the data volume since 2023 has been probably like a terabyte or so. But, with BlueSky I'd have to account for the data volume of all users on the platform as a whole, bringing the data volume way up to tens of terabytes (a guess based almost entirely on nothing).
So it really boils down to yes I agree that AP has problems with data accessibility, but I'd prefer that over unnecessary data redundancy
I think this is a common misconception based on some critics' incorrect assumptions and back-of-the-envelope math. See the atproto overview for the different components involved, and then this post (from a BlueSky employee) "A Full-Network Relay for $34 a Month" for some numbers.
If I understand correctly, to run a "full nework relay" does mean to consume all of the text posts from all known servers, but not necessarily all of the media, and not necessarily to keep data you aren't interested in for any long period of time.
Also, you can run your own PDS and/or App Views without running your own relay at all. And, you can also use multiple other people's relays.
Disclaimer: I'm not an atproto expert, and I haven't set any of this up myself.
You can design an appview that crawls PDSes directly, no relay needed.
AppViewLite does that
You can actually do it like mastodon and only take on data from specific users or PDS. Wafrn does this.