Lemmy_server 0.18.1 - Rust code, critical turning parameters and BOLO for symptoms, including log searches

RoundSparrow · edit-2 2 years ago

Lemmy_server 0.18.1 - Rust code, critical turning parameters and BOLO for symptoms, including log searches

qprimed · edit-2 2 years ago

following as I think (unverified) that some.of the current jerboa symptoms (specifcially crashes) may be a result of the app mishandling malformed API responses due to server overload.

qprimed · 2 years ago

quick update on this. have been playng with the jerboa v0.0.36 fdroid build and it seems to handle network errors and junk data in a much more sane way - still not ideal, but we now get error notifications instead of an outright crash.

looks like the jerboa robustness issue is slowly being addressed.

RoundSparrow · edit-2 2 years ago

What are the symptoms?

Lemmy.world was upgraded to 0.18.1-rc1 today, and a user reported this problem with an upvote failing:

I’ve seen several JSON parsing problems because a raw message is returned “Timeout”, and is that the PostgreSQL polling timeout? There is an open bug about not getting errors because the Lemmy API doesn’t return JSON on many error paths: https://github.com/LemmyNet/lemmy/issues/3366

qprimed · 2 years ago

same here. I have seen multiple examples of bad responses bubbling up through the lemmy web front end. anecdotally, these seem to come in clusters (server overload?) and seem to coincide with increased abends on the jerboa app.

if all of this is is related – and it looks more and more like it – then the underlying server side API response inconsistencies have to be resolved and any clients must handle error conditions (includng junk data and non-responses) in a sane manner.

DB performance is obciously pretty damn important for scaling and will be part and parcel of the solution, but inconsistency of API operation is a pretty fundamental issue that must (and I am sure will) get worked out.

in the meantime, lemmy client devs get free(!) servers (production no less!) to test their client error handling against :-p

RoundSparrow · 2 years ago

Another code value to get some tuning/behavior references on:

pub const FEDERATION_HTTP_FETCH_LIMIT: u32 = 50;

RoundSparrow · edit-2 2 years ago

2 days ago someone pointed out what has been driving me crazy… the Rust code looks “clean” and direct, because so many error conditions are outright ignored. Database not responding, etc, is not represented in the code.

source: https://github.com/LemmyNet/lemmy/pull/3414

Lemmy_server 0.18.1 - Rust code, critical turning parameters and BOLO for symptoms, including log searches

Lemmy_server 0.18.1 - Rust code, critical turning parameters and BOLO for symptoms, including log searches

HTTP and Database Parameters

lemmy_server behavior