this post was submitted on 06 Aug 2023
111 points (99.1% liked)
Fediverse
28481 readers
794 users here now
A community to talk about the Fediverse and all it's related services using ActivityPub (Mastodon, Lemmy, KBin, etc).
If you wanted to get help with moderating your own community then head over to [email protected]!
Rules
- Posts must be on topic.
- Be respectful of others.
- Cite the sources used for graphs and other statistics.
- Follow the general Lemmy.world rules.
Learn more at these websites: Join The Fediverse Wiki, Fediverse.info, Wikipedia Page, The Federation Info (Stats), FediDB (Stats), Sub Rehab (Reddit Migration), Search Lemmy
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
My main concern is the long-term cost of compute and storage. These instances aren't going to be free, and hopefully we can build a funding model that works.
Especially since, and correct me if I am wrong, but every instance holds all of the data for all of the other instances too? (that they are federated with).
This means there is an insane amount of redundancy no? With hundreds or thousands of servers the cost would eventually become prohibitive and need to rely on only a select few large servers and thus Lemmy doesn't 'solve' the issue it tries to in that sense.
Or, maybe it's only the bandwidth that becomes an issue and the data storage is actually minimal. If that's the case I can see more how a smaller server could afford to be part of the ecosystem. Perhaps also down the line if not already there could be a cut off point for historical data to avoid bloat.
Just the text I think. It's not nothing, but if you upload an image to your instance as part of a post, the text is copied to my instance, but with just a link to the image, so it could be worse.
To put this into perspective. Wikipedia text only is under 100gb uncompressed.
Wikipedia isn't a social platform. I suspect that their text growth was
log(n)
or something of the like. The only new text are things that are literally new or updates.Lemmy has no cap there. The amount of new text will grow in some proportion to the user base. The more users and more instances, the more text. To say nothing of duplication from cross posting when you get wonky cuts in the federation connections.
None of this is free and it's going to be a problem if Lemmy grows.
Ahhh, ok that makes far more sense actually then. Text alone isn't too bad especially if there are some optimisations available along the way.
And even then, the text data could eventually be stored in a content-addressed store (like IPFS or torrent files). This would mean that each instance could keep only its own data and let the redundant part in some cache.
not all the data afaik, but all the data for subs that it’s users are subscribed to