this post was submitted on 04 Dec 2023
622 points (95.9% liked)
Fediverse
28496 readers
613 users here now
A community to talk about the Fediverse and all it's related services using ActivityPub (Mastodon, Lemmy, KBin, etc).
If you wanted to get help with moderating your own community then head over to !moderators@lemmy.world!
Rules
- Posts must be on topic.
- Be respectful of others.
- Cite the sources used for graphs and other statistics.
- Follow the general Lemmy.world rules.
Learn more at these websites: Join The Fediverse Wiki, Fediverse.info, Wikipedia Page, The Federation Info (Stats), FediDB (Stats), Sub Rehab (Reddit Migration), Search Lemmy
founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
And this is exactly the communities that fediverser wants to bring!
Reddit's moat is not on the popular content, it's in the long tail. Reddit knows that people on /r/politics or /r/gifs are mostly to pad their numbers, but their real strength is that you can not find people to talk about Kerbal Space Program and Rain World outside of Reddit.
These "extremely niche" communities are the ones that are being held by network effects. These are the communities that I'd like to have on fediverser.network, and these are the communities that I wish we could get coordinated enough to pull away from Reddit.
alien.top was mirroring about 150 subreddits for two months, most of them of the niche type. The database of "1M comments" is taking less than 10GB of disk space. Looking at the last backup, the whole database uncompressed is 18GB. It's running on commodity hardware. Even with the mirrors making copies of the images to object storage, my object storage bill this month was a whooping $0.66.
If we focus on the long tail, it is not that expensive. And by the time that we actually start getting bigger number of users, I'm sure that we can come up with different strategies to deal with the data. We can create a common pool of resources for shared storage, we can divide the instances in "topic-based" and "user-home" (like I've been doing with communick.news and the ones on !communick_news_network@communick.news), etc.
Why shouldn't at least try to do it?
I guess if you just link the images from Reddit it's not that computationally intensive. I very much doubt that Reddit is going to let this slide if Lemmy ever gets that big though.
Because there are things to lose, and this isn't a risk-free process. I expanded more on my reasoning in my last paragraph:
The images are actually copied to the mirrored server.
It's not that simple to do that per user. You'd need:
Is the opt-out solution aggressive? Yes, no doubt. But I thought that this "aggression" was pointed to Reddit and therefore justifiable. The whole reason that this approach forces its hand to be able to get the data is because Reddit API changes was a clear sign that they want to treat the data from the users as their own. The protests were not effective against this, and showed to Reddit that they can win any conflict against dissenting mods. If Reddit tracked back on their policies and showed to be a good steward of one of the most vast amount of user data, I wouldn't be putting so much effort in this project.
If you can think of any other approach to make this work and is aligned with the clear goal of the project (make it easy for people to migrate away from Reddit, in a way that those that come here can already find their niche communities) I'm all for trying it.
That's really interesting, but why do you do that? Surely having the clients fetch the data from Reddit's servers themselves would be easier?
I hate Reddit as a platform too, but I very much disagree with this philosophically. I don't break the rules against the enemy because then the enemy would be allowed to break the rules against me. If we want to grow as a platform, we have to stay civilized. The one that fails to do that dies.
I think you misunderstood my idea about opt-out bridges. I meant that there should be a toggle for Lemmy users on Lemmy which mirrored/bridged content should be shown to them. These should be off by default, but easily changeable.
Easier? Yes. Reasonable? Not at all. Reddit wants to control all the data, the whole API fiasco started because they started to abuse their power, do you think they can trusted of stewards of social media data?
This is a fight, not a game. There are no rules. Do you think they care about rules when they started forcing moderators out of the protesting subs? Or lying about what Christian was asking during pricing negotiations? Or when they get mods working for them to do their bidding?
Let's not be naive. They will leverage anything they have to get the upper hand. We are not going to win anything by pretending there is a higher moral ground to stand on.
The moral high ground is the ONLY thing we have. Lemmy as a platform exists to be a non-evil counterpart to Reddit. It would have no purpose to exist were it not for our better ethics.
It is the ethos of decentralized platforms (which can not by its very nature be controlled by any single entity) that makes them superior, not the individuals on it.
Also, the only way to argue that what I am doing is "evil" is by accepting their premise that they own the data and that mirrors are "stealing" from them.