Lemmy Benchmarking - Working on tooling

vapeloki · 2 years ago

Lemmy Benchmarking - Working on tooling

RoundSparrow · 2 years ago

Also, Ratelimiting per IP is an issue.

I have concerns about Lemmy having a pattern of hiding these behaviors under the cover. In other words, people running servers having no kind of operator console to know that it is happening. Ideally to me, it would be a setting to adjust in a screen to set for an instance to disable/set threshold. If one doesn’t exist, maybe we can identify where in the code the limit is enforced and hand-edit the code.

vapeloki · 2 years ago

In the site settings, rate limiting can be configured.

But, yes there should be some way to see if limits where hit or not. Maybe this could be done via prometheus and just provide the option to gather this data outside of lemmy itself.

RoundSparrow · 2 years ago

I’m assuming Federation backdoor, API for incoming server to server transactions, doesn’t have a rate limit? But I haven’t validated that assumption.

Maybe this could be done via prometheus and just provide the option to gather this data outside of lemmy itself.

Something, from what I’ve seen so far, Lemmy has no application specific logging and just dumps everything into the system log. I really think operators need some concept of how many signups, logins, post, comments, communities they are getting per hour/day/week - which external websites are out there publishing number of communities and users.

Dessalines · 2 years ago

The docker-compose.yml file in the docker folder has a config for postgres logging. We use it to diagnose performance issues in prod. There’s a DB tag on the lemmy issue tracker, I suggest using that to track performance issues.

RoundSparrow · 2 years ago

The problem is we need to get some data out of the big sites, Beehaw, Lemmy.ml, Lemmy.world - so that we can see what it is like having far more comments, likes, federation activity, and interactive user loads.

Is someone with a big instance willing to publish their logs?

vapeloki · 2 years ago

These log settings are not very good for production. The DB would spend more time logging, then working.

But: We can simulate this kinds of load with local toolings. My first tests look quiet promising, if i would only find the bug in my old pg-exporter i wrote for mal last employer ;) (yes, it is open source).

vapeloki · edit-2 2 years ago

Maybe @dessalines@lemmy.ml can shed some light on this. Not sure if the find time, but it would be great to get some more input.

vapeloki · 2 years ago

Current workaround for the Rate limit issue: https://gitea.loki.codes/lemmy-performance/load-test/src/branch/main/pkg/instance/prepare.go#L19

RoundSparrow · 2 years ago

PoWA (PostgreSQL Workload Analyzer)… how much overhead do you think this adds to a server? run in production?

vapeloki · 2 years ago

In production, it should only be run with the remote setup. The data aggregation can get very heavy on the DB. Remote setup works fine, i have run this in large production database with nearly no impact