
Control System monitoring is a fundamental task: it continuously monitors the health of the Control System and related equipment (including switches, VMs, PCs, moxa, icpdas...) and provides alarms and dashboards for the early detection of malfunctions.

The monitoring system covers four different layers:

  1. network (switches)
  2. appliances (PCs, moxa, VMs, icpdas, other networking appliances)
  3. services (DBs, grafana, http, ntp, memcached, kafka, k8s)
  4. control applications (devils DCS, US CHAOS, IOC EPICS)
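As an illustration of the four layers above, the following minimal sketch models monitored targets as a small inventory grouped by layer. All names here (`Layer`, `Target`, `INVENTORY` and the example host names) are hypothetical and are not taken from the actual deployment.

```python
from dataclasses import dataclass
from enum import IntEnum


class Layer(IntEnum):
    """The four monitoring layers, numbered as in the list above."""
    NETWORK = 1
    APPLIANCE = 2
    SERVICE = 3
    CONTROL_APPLICATION = 4


@dataclass(frozen=True)
class Target:
    name: str     # hypothetical identifier of the monitored item
    layer: Layer  # which of the four layers it belongs to
    kind: str     # free-form type, e.g. "switch", "moxa", "kafka", "ioc"


def group_by_layer(targets):
    """Return {Layer: [Target, ...]}; every layer is present, even if empty."""
    grouped = {layer: [] for layer in Layer}
    for target in targets:
        grouped[target.layer].append(target)
    return grouped


# Example inventory with made-up names, one entry per layer.
INVENTORY = [
    Target("sw-example-01", Layer.NETWORK, "switch"),
    Target("moxa-example-01", Layer.APPLIANCE, "moxa"),
    Target("grafana-example", Layer.SERVICE, "grafana"),
    Target("ioc-example-01", Layer.CONTROL_APPLICATION, "ioc"),
]
```

Keeping every layer in the grouped result, even when it has no targets, would make it easy to spot a layer that is not yet covered by any check.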

Grafana will be used to centralise and normalise data coming from heterogeneous sources, to generate alarms, and to provide dashboards.

Several helper tools will be developed to feed the different Grafana data sources. These tools will be provided as dockerized microservices in order to maximise portability, reliability, and availability.

These microservices will be hosted in the CSI.
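A helper tool of this kind could be sketched as follows. This is only an illustration under stated assumptions: the page does not say which Grafana data source each tool feeds, so the sketch assumes a Prometheus-style text output, and the metric name `cs_target_up`, the functions and the target tuples are all hypothetical.

```python
import socket


def tcp_alive(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def to_prometheus_line(metric, labels, value):
    """Format one sample as 'metric{k="v",...} value'.

    Simplification: label values are assumed to contain no quotes,
    backslashes or newlines, so no escaping is performed.
    """
    label_text = ",".join(f'{k}="{v}"' for k, v in sorted(labels.items()))
    return f"{metric}{{{label_text}}} {value}"


def probe(targets, check=tcp_alive):
    """Check each (name, host, port) tuple and return one metric line per target.

    `check` is injectable so the formatting can be exercised without network access.
    """
    lines = []
    for name, host, port in targets:
        up = 1 if check(host, port) else 0
        labels = {"target": name, "port": str(port)}
        lines.append(to_prometheus_line("cs_target_up", labels, up))
    return lines
```

For example, `probe([("grafana-example", "grafana.example.org", 3000)])` would return a single line ending in `1` or `0` depending on whether the (made-up) host accepts connections. In a dockerized deployment such output would typically be exposed over HTTP for periodic collection, but that wiring is left out of this sketch.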


Activities

Open activities are tracked in Jira (INFN Ticketing System) with the query: labels = monitoring AND status != Done

Tools


Documentation

Devil Monitor (DCS): https://baltig.infn.it/lnf-da-control/dcs-monitor-alive.git