Server Monitoring software recommendations

VitabytesDev@feddit.nl · 6 months ago

Server Monitoring software recommendations

Hercules@lemmy.world · 6 months ago

I think prometheus + grafana might be what you are looking for. In combination with loki grafana can also be used for viewing log messages.

Im_old@lemmy.world · 6 months ago

Absolutely this, nothing else is required. Well, maybe alertmanager if you want to receive alerts

farcaller@fstab.sh · 6 months ago

and swap Prometheus for VictoriaMertics, or your homelab ram usage becomes Prometheus ram usage.

N1ghtstalk3r@lemmy.world · 6 months ago

I’ll second this. Prometheus + Grafana is what I’m using now, but you can definitely add more extensions/monitors to get far more detail, like Loki which was suggested above.

onlinepersona@programming.dev · 6 months ago

Do both have to run on the host machine or can a remote machine execute the probes (over ssh or something).

Anti Commercial-AI license

miau@lemmy.sdf.org · 6 months ago

Grafana is just the frontend, its a dashboard for your different data sources Prometheus is the “database”, it scrapes data from your endpoints over http

Luci@lemmy.ca · 6 months ago

I use Check_MK

darkham · 6 months ago

Same here. I still don’t understand why everyone is about Grafana. I’ve tested it and checkMK is more… Everything.

Tobias@social.cybertalk.io · 6 months ago

@darkham @Luci What do you think about zabbix?

mbirth · 6 months ago

Switched from CMK to Zabbix at my previous job. Zabbix is far more comfortable and has all the same possibilities that CMK has. But you can setup everything in the web GUI and don’t need to reload anything.

Tobias@social.cybertalk.io · 6 months ago

@mbirth In Zabbix you can configure everything via web ?

mbirth · 6 months ago

Yes! And if it gets too complex for simple checkboxes and formulas, there are a few places where you can enter JavaScript into a textbox. But it’s all inside the web GUI. No need to fiddle with files on the server.

Tobias@social.cybertalk.io · 6 months ago

@mbirth I use #Zabbix in my #homelab but I wasn’t sure if I knew all the functions. So far, it’s been comfortable, yes

Dran@lemmy.world · 6 months ago

+1 for cmk. Been using it at work for an entire data center + thousands of endpoints and I also use it for my 3 server homelab. It scales beautifully at any size.

LordCrom@lemmy.world · 6 months ago

+1 for CMK. It’s built on nagios. Been using it for decades. That shit is rock solid and has never let me down.

Prometheus is metrics and grafana reports it. IMHO, better reporting and graphing, better eye candy. But also harder to setup and get right.

CMK agent works on 95% of what you want with just the agent.

Mora@pawb.social · edit-2 6 months ago

I’ve recently found Beszel and i want to use it to replace my grafana/Prometheus/node exporter stack. It seems to be a rather easy & clean solution. Sure, you can do more with grafana and Prometheus but I can’t be bothered having to learn that, when all I want is some simple monitoring.

https://github.com/henrygd/beszel

𝕸𝖔𝖘𝖘@infosec.pub · 6 months ago

We use libreNMS. Its docs state that it will do this, but we only use the uptime monitoring feature, so I can’t arrest as to how well it will monitor everything else.

Luckyfriend222@lemmy.world · 6 months ago

I use this too. When SNMP is set up there are loads of things you can monitor with LibreNMS. Much less of a learning curve than Grafana + Prometheus, although the latter probably has some nice tweaks available that SNMP does not provide.

ohlaph@lemmy.world · 6 months ago

I would use OpenTelemetry, Prometheus, and Grafana…

mumblerfish@lemmy.world · 6 months ago

Which parts are OpenTelemetry for? Is Prometheus Agent, Prometheus Server and Grafana not enough?

ohlaph@lemmy.world · 6 months ago

I like it because I use it for MELT in general. Prometheus generally does metrics and if you want to include logs, traces and events, it becomes more cumbersome. With the Otel collector, I can just update my collector configuration to point to the various services.

I’m not saying OP can’t use what you suggested, just stating what I would use.

Moonrise2473@feddit.it · 6 months ago

I like munin, it’s very limited, a bit hard to configure and doesn’t have many features but uses almost no resources

agile_squirrel · 6 months ago

You’ve already received some great suggestions. Another one is Netdata. Personally, I use glances to collect the data and Home Assistant to display the dashboard. But I only do this because I already had Home Assistant running.

model_tar_gz@lemmy.world · 6 months ago

I work for a large enterprise and build ML model monitoring pipelines fairly frequently—this will be a more in depth but similar use case to what you’re asking.

We use Grafana (visualization) and Prometheus (timeseries db)—they’re built for this use case exactly. Tons of info out there on how to build, configure, connect to your sensors, and deploy it.

Rizilia@lemmy.zip · 6 months ago

I recently start using Observium for some basic monitoring. I’m happy so far.

azron · 6 months ago

Munin is a tried and true solution. It installs on the server creates graphs and makes it easy to see a stair step graph to problems like out of memory.

I’d also highly recommend installing atop and having it collect stats every 1 to 2 minutes. You can go to a crashed server and step through what was running in a “top” like interfsce. I install atop on any server as a means for post incident diagnosis.

Kadath (she/her)@lemmy.world · 6 months ago

Glances has everything you require and it can also be self-contained.

vegetaaaaaaa@lemmy.world · 6 months ago

I use the Netdata agent (with cloud features disabled). Easy installation, FOSS, 0 configuration required, tons of metrics.

Appoxo@lemmy.dbzer0.com · 6 months ago

The agent on TrueNAS is loaded to the brim.

themoonisacheese@sh.itjust.works · 6 months ago

Can’t really go wrong with the old school nagios+thruk. The learning curve is a tad steep but it teaches you a lot of things about your systems.

Evotech@lemmy.world · 6 months ago

Nagios is really not great imo. It’s very not modern.

But if you insist on Nagios at least do like. Icinga or spmething

Avid Amoeba@lemmy.ca · 6 months ago

Prometheus, even by itself.

Evotech@lemmy.world · 6 months ago

We use the ELK stack.