Lemmy as a project has suffered all month because Lemmy.ml has not been sharing critical logs from Nginx and Lemmy's code logging itself

RoundSparrow · edit-2 2 years ago

sunaurus@lemm.ee · 2 years ago

Hey buddy, I understand you’re frustrated, but I just want to make a few points:

I have personally seen many instance admins and Lemmy contributors note many times over the past weeks that Lemmy is unoptimized and not ready for the current traffic
I have myself mentioned it several times in announcements to users of my own Lemmy instance
Lemmy maintainers have asked for help with optimization in several channels
Lemmy maintainers are clearly working hard at fixing Lemmy issues and improving performance - just look at the work that went into 0.18 - the fact that it’s far from perfect is clear to everybody, but progress is constantly being made
Lemmy maintainers have mentioned multiple times that their inboxes are full of notifications and DMs - it’s not that they’re brushing anything under the rug, it’s just that they’re not physically able to keep up with the volume of communication that is being thrown at them

I really believe that you have some useful insights and can be very helpful for Lemmy, but I’m afraid that if you take this accusatory tone and blame people for not doing enough then that will overshadow anything helpful that you’re actually saying.

Having said all that, if you would like to take a look at some stats about queries on lemm.ee (a Lemmy instance with 4k users - definitely much smaller than lemmy.ml), I have put together a spreadsheet here: https://docs.google.com/spreadsheets/d/e/2PACX-1vSPpqM6QCZYAAvnWe8p-xxN553ukRIquHw71j3nB763x7TNeqeUO-Oss51yPC7zVaT2x4jll39NCeMu/pubhtml#

RoundSparrow · edit-2 2 years ago

“Lemmy maintainers have asked for help with optimization in several channels”

I do not see them using Lemmy itself to actually discuss the problems of Lemmy. Specific to lemmy.ml and the developer relationship with this specific server, crashes (logs) are not being shared.

10 days ago: https://lemmy.ml/post/1271936

I cannot emphasize the title of the posting you are reading enough: “Lemmy as a project has suffered all month because Lemmy.ml has not been sharing critical logs from Nginx and Lemmy’s code logging itself”

Logs, logs, logs. Why were these crash logs not shared as part of the Lemmy project? When the busiest server on the whole project is not sharing its Rust code logs and crashes, what are those of us trying to work on the SQL and architecture problems supposed to do? I didn’t even report 1 in 100 of the crashes I was experiencing.

It is a peer-to-peer network, server to server, and the central hub has encouraged everyone to run out and create new servers without any concern for reporting the crashes going on within the central hub itself. I just don’t get why everyone here is defending such behavior and leadership.

What I saw was the sharing of CONCLUSIONS: that the worker count was the problem and “increase the worker count” was the fix. No, the problem is fundamental to the whole Rust application: its automatically generated SQL statements, its lack of data caching, and its lack of a proper MTA and queue for inbound and outbound federation data. Just saying that the federation worker count was the problem and making the value infinite did not in any way get to the problems that sharing the server crash logs would have exposed.

June 14, the GitHub issue on “Scaling Federation” was CLOSED by project leadership! Meanwhile, lemmy.ml was crashing for me every hour! Failing to federate with any reliability too. June 15 is when https://lemmy.ml/post/1271936 was opened, the day after this CLOSE of a GitHub issue:

The DDOS is coming from WITHIN THE HOUSE. Lemmy’s performance problems are causing federation to bring down peer servers, and the LOGS of Rust code exceptions that are being KEPT SECRET will reveal this! Sharing the logs and making a federation-wide announcement that the hub is failing on data exchange is critical, not optional.

It’s sad to me that the leadership of this project can’t just come out and openly admit that it is an “experimental” and “unstable” project, and is instead ignoring https://lemmy.ml/post/1271936 and bragging on GitHub that it is “high performance Rust”. It might have seemed high performance when you sent 8 whole test messages to 4 servers a day, but that isn’t the meaning of “high performance”. It is depressing to see such denial, and the people who believe in the “reality distortion field” around the project.