Table des matières

Server Monitoring

Voir aussi: /informatique/web/webstats

À lire:

Outils

  1. top - Process Activity Command
  2. vmstat - System Activity, Hardware and System Information
  3. w - Find Out Who Is Logged on And What They Are Doing
  4. uptime - Tell How Long The System Has Been Running
  5. ps - Displays The Processes
  6. free - Memory Usage
  7. mpstat - Multiprocessor Usage (package “sysstat”)
  8. pmap - Process Memory Usage
  9. netstat - Network Statistics
  10. ss - Network Statistics
  11. iptraf - Real-time Network Statistics (package “iptraf”)
  12. tcpdump - Detailed Network Traffic Analysis
  13. strace - System Calls
  14. /Proc file system - Various Kernel Statistics
  15. lsof - list open files, network connections and much more.
  16. nmap - scan your server for open ports.
  17. ntop web based tool - ntop is the best tool to see network usage in a way similar to what top command does for processes i.e. it is network traffic monitoring software. You can see network status, protocol wise distribution of traffic for UDP, TCP, DNS, HTTP and other protocols.
  18. mtr - mtr combines the functionality of the traceroute and ping programs in a single network diagnostic tool. Finding out a bad or simply overloaded network link with Linux/UNIX oses

Présentation de ces outils:

Status page

Cachet

https://cachethq.io/

The open source status page system. Beautifully crafted, Translated, JSON API, Scheduled maintenance, Metrics, Two-factor authentication.

sysstat

Using sar you can monitor performance of various Linux subsystems (CPU, Memory, I/O..) in real time. You can also collect all performance data on an on-going basis, store them, and do historical analysis to identify bottlenecks. Sar is part of the sysstat package.

À lire:

Sur Debian

sudo apt-get install sysstat

Default settings for

Activer la collecte:

sudo vi /etc/default/sysstat
# Should sadc collect system activity informations? Valid values
# are "true" and "false". Please do not put other values, they
# will be overwritten by debconf!
ENABLED="false"

Mettre ENABLED=“true”

MRTG

Server And Network Monitoring. Nagios is a popular open source computer system and network monitoring application software. You can easily monitor all your hosts, network equipment and services. It can send alert when things go wrong and again when they get better.

FAN is “Fully Automated Nagios”. FAN goals are to provide a Nagios installation including most tools provided by the Nagios Community. FAN provides a CDRom image in the standard ISO format, making it easy to easilly install a Nagios server. Added to this, a wide bunch of tools are including to the distribution, in order to improve the user experience around Nagios.

Voir /informatique/MRTG

Nagios

Nagios

Netdata

https://github.com/firehol/netdata/wiki

netdata is a scalable, distributed, real-time, performance and health monitoring solution for Linux, FreeBSD and MacOS. It is open-source too. Out of the box, it collects 1k to 5k metrics per server per second. It is the corresponding of: top, vmstat, iostat, iotop, sar, systemd-cgtop and a dozen more console tools running in parallel. netdata is very efficient in this: the daemon needs just 1% to 3% cpu of a single core, even when it runs on IoT.

Many people view netdata as a collectd + graphite + grafana alternative, or compare it with cacti or munin. All these are really great tools, but they are not netdata. Let's see why.

My primary goal when I was designing netdata was to help us find why our systems and applications are slow or misbehaving. To provide a system that could kill the console for performance monitoring.

To do this, I decided that:

Grafana

http://grafana.org

Prometheus

Prometheus

Prometheus differs from Loki by focusing on metrics instead of logs, and delivering logs via pull, instead of push.

Grafana Loki

https://github.com/grafana/loki

Loki differs from Prometheus by focusing on logs instead of metrics, and delivering logs via push, instead of pull.

Glances

Glances est un logiciel libre (distribué sous licence LGPL) permettant de surveiller votre système d'exploitation GNU/Linux ou BSD à partir d'une interface texte. Glances utilise la librairie libstatgrab pour récupérer les informations de votre système. Il est développé en langage Python.

Shinken

A Python Nagios® Core total rewrite.

http://shinken-monitoring.org

Il existe aussi shinken, mais ça marche un peu comme ça veut et quand ça veut. Depuis, ça prend énormément de RAM pour le service rendu.

Zabbix

Zabbix

(sd-basic@ml.ovh.net) Ca marche plutôt bien mais les anciennes versions consomment énormément de CPU sur le serveur (a priori réglé dans la 1.6, mais j’ai pas encore testé)

SNM

http://snm.sourceforge.net/

CACTI

Web-based Monitoring Tool. Cacti is a complete network graphing solution designed to harness the power of RRDTool's data storage and graphing functionality. Cacti provides a fast poller, advanced graph templating, multiple data acquisition methods, and user management features out of the box. All of this is wrapped in an intuitive, easy to use interface that makes sense for LAN-sized installations up to complex networks with hundreds of devices. It can provide data about network, CPU, memory, logged in users, Apache, DNS servers and much more. See how to install and configure Cacti network graphing tool under CentOS / RHEL.

http://www.cacti.net/

Ovh

OCO

RTM

Snort

Snort SP

Snort RNA

informatique/Snort

OpenPOM

This web user interface allow you to view in a single page almost everything about nagios and icinga : alert, ack, downtime, comment. You can also interact with nagios and icinga through ack, downtime, comment, disable and reset buttons.

zenoss

http://www.zenoss.org/

icinga

https://www.icinga.org/

Services

UpTimeRobot

https://uptimerobot.com

Free plan

HetrixTools

https://hetrixtools.com

Free plan

Server Density

http://serverdensity.com/

Prix:

pingdom

http://www.pingdom.com/

newrelic

https://newrelic.com/

LogMatic

LogMatic.io