R12.1-JA-2025June05

Monitored Metrics

This topic introduces reference for monitored metrics data, including:

Server Metrics

The following table lists the key metrics on the Home Page of the Service Monitor Portal.

Key MetricsDescription
Active Web Service Connections The number of active connections established with NetworkBrain system Web service for API requests, such as login, search, and third-party API requests.
Disconnected Server CountThe number of servers that are disconnected in the system.
User Tasks (Task Engine)The number of task flows processed in Task Engine.
Data Tasks (Front Server)The number of data task groups processed in the Front Server.
MongoDB Data Size (7 days ago)The size of system data saved in MongoDB 7 days ago.
MongoDB Data Size (current)The size of system data currently saved in MongoDB.
ES Index Size (7 days ago)The size of index data created in Elasticsearch 7 days ago.
ES Index Size (current)The size of index data current created in Elasticsearch.

Monitored Service Metrics

Service Monitor Agent can monitor the following four basic service metrics:

  • CPU Utilization
  • Memory Utilization
  • Used Memory
  • Number of Threads

Besides the above basic service metrics, the following metrics related to service quality and performance can be monitored, which may vary depending on different services.

Service TypeMetric Names
NetworkBrain Services

NetworkBrain System Web

  • Number of Active Connections
  • Number of Anonymous Requests
  • Volume of Served Bytes
  • Volume of Received Bytes
  • Volume of Transferred Bytes
  • Number of Connection Attempts
  • Number of 404 Errors
Worker Server
  • Number of started tasks
  • Number of running tasks
Knowledge Cloud ProxyN/A
NetworkBrain Task Engine
  • Service Start Times
  • Service Stop Times
  • Number of Times to Enter Standby Mode
  • Number of Times to Enter Activation Mode
  • Number of Received Trigger Events
  • Number of Task Flow Start Events
  • Number of Finished Task Flows
  • Number of Tasks Pending in Task Queue
  • Number of Tasks in Memory Repository
  • Number of Tasks in Unacknowledged Queue
  • Number of Skipped Events
  • Number of blacklist machines
  • Number of heavy tasks
NetworkBrain Front Server

  • Number of Processing Tasks
  • Data Transfer Info for High-Priority Tasks
  • Data Transfer Info for Low-Priority Tasks
  • Parser Info
  • Low-Priority Data Upload Info
  • High-Priority Data Upload Info
  • Available Live Threads
  • Number of Available Non-CLI Task Info
  • Number of Available Telnet Task Info
  • Number of Available SSH Task Info
  • Number of Times to Rejoin Scheduling Queue due to Insufficient CLI Sessions
  • Number of Requests to Download SSH Key

NetworkBrain Front Server Controller
  • Number of Server Connections
  • Data Receiving Info
  • Active Front Servers
  • Number of Processing Data Task Groups
  • Number of Data Task Groups to be Stopped
  • Number of Dispatched Direct-Live-Access Tasks
  • Direct-Live-Access Info
  • DTG Result Queue Info
  • DataEngine Data Upload Queue Info
Third-party ServicesMongoDB
  • Number of Connected Clients
  • Number of Unused Connections
  • Number of Active Client Connections
  • Number of Active Client Connections with Read and Write Operations
  • Volume of Received Network Traffic
  • Volume of Sent Network Traffic
  • Volume of Held Uncompressed Data
  • Amount of Space Allocated to Collections
Elasticsearch
  • Time in Milliseconds Spent on Young-Generation Garbage Collections
  • Time in Milliseconds Spent on Old-Generation Garbage Collections
  • Number of HTTP Connections Currently Open
  • Available Disk Space
  • Number of Queued Threads for Bulk Requests
  • Number of Queued Threads for Index Requests
  • Number of Queued Threads for Query Requests
  • Number of Rejected Threads for Bulk Requests
  • Number of Rejected Threads for Index Requests
  • Number of Rejected Threads for Query Requests
Redis
  • Number of Commands Processed per Second
  • Number of All Lookups of Keys
  • Number of Failed Lookups of Keys
  • Number of Removed Keys due to Reaching the max memory Limit
  • Number of Blocked Clients While Waiting on BLPOP, BRPOP, or BRPOPLPUSH
  • Number of Slaves Connected to the Current Master
  • Number of Connected Clients
  • Number of Rejected Connections due to Reaching the max client Limit
  • Time in Seconds of the Link Being Down between Slave and Master
  • Time in Seconds since Last Interaction between Slave and Master
RabbbitMQ
  • Number of Ready Messages for All Queues
  • Number of Ready Messages for ETL Queues
  • Number of Ready Messages for RMClientCallback Queues
  • Number of Ready Messages for FSC trigger Queue
  • Number of Ready Messages for flowengine health monitor Queue
  • Number of Ready Messages for flowengine leader Queue
  • Number of Ready Messages for flowengine task status Queue
  • Number of Ready Messages for taskengine event Queue
  • Number of Ready Messages for xfagent task Queues
  • Number of Ready Messages for prepared tasks Queue
PostgreSQL
  • Number of Threads
  • Number of Running PostgreSQL Back-end Porcesses
Redis Sentinel
  • Number of Commands Processed per Second
  • Number of All Lookups of Keys
  • Number of Failed Lookups of Keys
  • Number of Removed Keys due to Reaching the max memory Limit
  • Number of Blocked Clients While Waiting on BLPOP, BRPOP, or BRPOPLPUSH
  • Number of Slaves Connected to the Current Master
  • Number of Connected Clients
  • Number of Rejected Connections due to Reaching the maxclient Limit
  • Time in Seconds of the Link Being Down between Slave and Master
  • Time in Seconds since Last Interaction between Slave and Master