Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents


The Control System monitoring is the fundamental task that continuously monitors the healthy of the Control System and related equipments (including switch, VMs, PCs, moxa, icpdas...) and provides alarms and dashboards to early detect malfunctions.

The monitoring system covers the different 4 different layers:

  1. network (switch)
  2. appliance (PCs,moxa,VMs,icpdas, other networking appliance)
  3. services (DBs, grafana, http, ntp, memcached, kafka, k8s)
  4. control applications (devils DCS, US CHAOS, IOC EPICS)

Grafana will be used to centralise and uniform data coming from different heterogeneous sources, generate alarms and provide dashboards. 

Several helper tools will be developed to feed different data sources and provided as dockerized micro-services  in order to maximise portability, reliability, availability.

These microservices will be hosted in the CSI.


Activities

Jira
serverINFN Ticketing System
columnIdsissuekey,summary,issuetype,created,updated,duedate,assignee,reporter,priority,status,resolution
columnskey,summary,type,created,updated,due,assignee,reporter,priority,status,resolution
maximumIssues20
jqlQuerylabels = monitoring
serverId8087fedc-8816-3706-9e66-78f987f39e0c

...


Documentation



devil monitor: https://baltig.infn.it/lnf-da-control/dcs-monitor-alive.git

...