I’m really enjoying lemmy. I think we’ve got some growing pains in UI/UX and we’re missing some key features (like community migration and actual redundancy). But how are we going to collectively pay for this? I saw an (unverified) post that Reddit received 400M dollars from ads last year. Lemmy isn’t going to be free. Can someone with actual server experience chime in with some back of the napkin math on how expensive it would be if everyone migrated from Reddit?
It’s certainly a challenge. I run three instances on the #fediverse. Two small ones and a larger one with 450 users. I have a donation page; initially people were enthusiastic and I covered costs. Now, I have regular monthly donators but not enough to cover costs so I am subsidising it. I took the decision when I launched that it could happen and it’s my problem.
I think there will be many instances will fall away in the coming months due to costs. Especially if you are thousands of users and associated costs.
We need to come up with a new funding model, where people appreciate you get nothing for nothing. All the large corporates sell your data as advertising for revenue. The greater public do not appreciate they are selling their soul.
Is there an approximate specs per number of users guide to size a lemmy instance?
Yeah, I’d love to get an approximate sense of how much these instances cost
I haven’t seen one yet. Disk usage this morning on lemmy.world was reported at about 4GB over 11 days (probably low usage). The 100GB drive would probably fill up in 275 days or so if usage did not increment. If it’s not redundant and dies, all that content is lost.
So storage will be a huge issue for lemmy unless I’m missing something.
100GB is practically nothing nowadays.
There are people (myself included, not to brag) running home servers with literally hundreds of terabytes of data. At that ~0.3 GB/day number, I alone could host 3,500 years worth of data. Get some of those r/DataHoarders and r/HomeLab guys on here and Lemmy would never run out of space.
Considering Lemmy is absolutely tiny compared to Reddit, these aren’t numbers worth considering. Every single instance needs to mirror data, and I still don’t understand how this is supposed to scale to something a fraction the size of Reddit unless the federation is just a few enormous instances that can afford that scale. It’s not like everyone pitches in what they can – every single instance individually needs to be able to support the entire dataset and the associated synchronization traffic (for the portion that its users have requested access to).
What if there are a couple large archive mirrors and the posts on other servers have a life expectancy maybe based on time, but also engagement? Crucial posts could be stickied, but I don’t see the need for everyone to hold onto everything forever. Even in the event of a catastrophic loss of the archives, the communities could still live on and rebuild.
Yeah, that’s a good point. I imagine there’d have to be some compromise like that for smaller instances. How often do users load up content older than say a couple of weeks or a month? Could be a hinderance on the experience, hard for me to estimate (for myself) how often that happens.
That is just one instance and that is a small amount of users. Reddit has 430 million active monthly users. Let’s say 1% move over to Lemmy. According to the 4 GB per 22 days (with ~1k users per instance) for Lemmy.world, that would mean you would need 1.6 Terrabytes of storage per day to support that 1% of users. Of course this would be spread over a number of instances… but you can start to see where the problem lies…
Can content be stored somewhere like S3 instead of spinning disk? It would certainly be more robust and cheaper.
Unfortunately, you can’t equate resource usage to number of users.
500 users will use more than 5 times the resources of 100 users, because activity feeds on activity. But it gets worse than being non-linear.
If you judge you’re using as many resources as you comfortably can at 500 users and close registrations, you will soon exceed your resource capacity anyway. Why? Because users who register elsewhere will increase your federation activity and therefore resource requirements.
I do imagine it being cool if those who have desktops or servers could donate via some kind of ipfs caching that apps could reroute fetching with