This topic introduces reference for monitored metrics data, including:
Server Metrics
The following table lists the key metrics on the Home Page of the Service Monitor Portal.
Key Metrics | Description |
Active Web Service Connections | The number of active connections established with NetworkBrain system Web service for API requests, such as login, search, and third-party API requests. |
Disconnected Server Count | The number of servers that are disconnected in the system. |
User Tasks (Task Engine) | The number of task flows processed in Task Engine. |
Data Tasks (Front Server) | The number of data task groups processed in the Front Server. |
MongoDB Data Size (7 days ago) | The size of system data saved in MongoDB 7 days ago. |
MongoDB Data Size (current) | The size of system data currently saved in MongoDB. |
ES Index Size (7 days ago) | The size of index data created in Elasticsearch 7 days ago. |
ES Index Size (current) | The size of index data current created in Elasticsearch. |
Monitored Service Metrics
Service Monitor Agent can monitor the following four basic service metrics:
- CPU Utilization
- Memory Utilization
- Used Memory
- Number of Threads
Besides the above basic service metrics, the following metrics related to service quality and performance can be monitored, which may vary depending on different services.
Service Type | Metric Names |
NetworkBrain Services | NetworkBrain System Web | - Number of Active Connections
- Number of Anonymous Requests
- Volume of Served Bytes
- Volume of Received Bytes
- Volume of Transferred Bytes
- Number of Connection Attempts
- Number of 404 Errors
|
Worker Server | - Number of started tasks
- Number of running tasks
|
Knowledge Cloud Proxy | N/A |
NetworkBrain Task Engine | - Service Start Times
- Service Stop Times
- Number of Times to Enter Standby Mode
- Number of Times to Enter Activation Mode
- Number of Received Trigger Events
- Number of Task Flow Start Events
- Number of Finished Task Flows
- Number of Tasks Pending in Task Queue
- Number of Tasks in Memory Repository
- Number of Tasks in Unacknowledged Queue
- Number of Skipped Events
- Number of blacklist machines
- Number of heavy tasks
|
NetworkBrain Front Server | - Number of Processing Tasks
- Data Transfer Info for High-Priority Tasks
- Data Transfer Info for Low-Priority Tasks
- Parser Info
- Low-Priority Data Upload Info
- High-Priority Data Upload Info
- Available Live Threads
- Number of Available Non-CLI Task Info
- Number of Available Telnet Task Info
- Number of Available SSH Task Info
- Number of Times to Rejoin Scheduling Queue due to Insufficient CLI Sessions
- Number of Requests to Download SSH Key
|
NetworkBrain Front Server Controller | - Number of Server Connections
- Data Receiving Info
- Active Front Servers
- Number of Processing Data Task Groups
- Number of Data Task Groups to be Stopped
- Number of Dispatched Direct-Live-Access Tasks
- Direct-Live-Access Info
- DTG Result Queue Info
- DataEngine Data Upload Queue Info
|
Third-party Services | MongoDB | - Number of Connected Clients
- Number of Unused Connections
- Number of Active Client Connections
- Number of Active Client Connections with Read and Write Operations
- Volume of Received Network Traffic
- Volume of Sent Network Traffic
- Volume of Held Uncompressed Data
- Amount of Space Allocated to Collections
|
Elasticsearch | - Time in Milliseconds Spent on Young-Generation Garbage Collections
- Time in Milliseconds Spent on Old-Generation Garbage Collections
- Number of HTTP Connections Currently Open
- Available Disk Space
- Number of Queued Threads for Bulk Requests
- Number of Queued Threads for Index Requests
- Number of Queued Threads for Query Requests
- Number of Rejected Threads for Bulk Requests
- Number of Rejected Threads for Index Requests
- Number of Rejected Threads for Query Requests
|
Redis | - Number of Commands Processed per Second
- Number of All Lookups of Keys
- Number of Failed Lookups of Keys
- Number of Removed Keys due to Reaching the max memory Limit
- Number of Blocked Clients While Waiting on BLPOP, BRPOP, or BRPOPLPUSH
- Number of Slaves Connected to the Current Master
- Number of Connected Clients
- Number of Rejected Connections due to Reaching the max client Limit
- Time in Seconds of the Link Being Down between Slave and
Master - Time in Seconds since Last Interaction between Slave and Master
|
RabbbitMQ | - Number of Ready Messages for All Queues
- Number of Ready Messages for ETL Queues
- Number of Ready Messages for RMClientCallback Queues
- Number of Ready Messages for FSC trigger Queue
- Number of Ready Messages for flowengine health monitor Queue
- Number of Ready Messages for flowengine leader Queue
- Number of Ready Messages for flowengine task status Queue
- Number of Ready Messages for taskengine event Queue
- Number of Ready Messages for xfagent task Queues
- Number of Ready Messages for prepared tasks Queue
|
PostgreSQL
| - Number of Threads
- Number of Running PostgreSQL Back-end Porcesses
|
Redis Sentinel | - Number of Commands Processed per Second
- Number of All Lookups of Keys
- Number of Failed Lookups of Keys
- Number of Removed Keys due to Reaching the max memory Limit
- Number of Blocked Clients While Waiting on BLPOP, BRPOP, or BRPOPLPUSH
- Number of Slaves Connected to the Current Master
- Number of Connected Clients
- Number of Rejected Connections due to Reaching the maxclient Limit
- Time in Seconds of the Link Being Down between Slave and Master
- Time in Seconds since Last Interaction between Slave and Master
|