Server metrics
It guides you to the server monitoring metrics.
Category V1
agent_status_summary
This category collects metrics related to agent status every 10 seconds.
Fields
Field name | Unit | Description | Remarks |
---|---|---|---|
inActTime | Millisecond(ms) | Amount of time the agent remains inactive | - |
isActive | - | Whether the agent is active or not | true / false |
isRebooted | - | Whether the server has been recently restarted | true / false |
lastActTime | Millisecond(ms) | Time when the agent was last activated | 0 : If disabled |
oid | - | Unique IDs for each agent in the project | - |
startTime | Millisecond(ms) | Timestamp when the agent was started. | - |
status | - | Status of the current agent | - |
uptime | Second (sec) | Server's uptime | - |
The values in the status
field are distinguished as follows:
-
ok
: The server is in normal status where it is active. -
inactive
: The server is disabled. -
warning
: A warning event is in progress on the server. -
fatal
: A critical event is in progress on the server. -
paused
: The agent has paused data collection.
server_base
It is the metric for server resources.
-
Collection interval: 5 seconds
-
Statistical data: 5 minutes, 1 hour
Tags
Tag name | Description | Remarks |
---|---|---|
cpu_cores | Number of server cores | - |
host_ip | Server IP | - |
oname | Server name | - |
okindName | Server type name | - |
os | OS type (Linux or Windows) | - |
Fields
Field name | Unit | Description | Remarks |
---|---|---|---|
cpu | Percentage | All CPU utilization | Common |
cpu_c1 | Percent | CPU c1 state utilization | Windows |
cpu_c2 | Percent | CPU c2 state utilization | Windows |
cpu_c3 | Percent | CPU c3 state utilization | Windows |
cpu_ctxt | Prime number | Context switching utilization | Linux |
cpu_dpc | Percent | CPU DPC time percentage | Windows |
cpu_idle | Percent | CPU idle rate | Common |
cpu_interrupt | Percent | cpu interrupt | Linux |
cpu_iowait | Percent | CPU iowait rate | Linux |
cpu_irq | Percent | CPU IRQ utilization | Linux |
cpu_load1 | Prime number | Load average for 1 minute | Linux |
cpu_load5 | Prime number | Load average for 5 minutes | Linux |
cpu_load15 | Prime number | Load average for 15 minutes | Linux |
cpu_new_procs_forked | Positive number | OS process fork call | Linux |
cpu_procs_blocked | Positive number | Process blocked state count | Linux |
cpu_procs_running | Positive number | Process running state count | Linux |
cpu_softirq | Percent | CPU softirq utilization | Linux |
cpu_steal | Percent | CPU steal utilization | Linux |
cpu_sys | Percent | CPU system utilization | Common |
cpu_usr | Percent | CPU user utilization | Common |
epochtime | Positive number | Epoch time in which the metric was collected | Common |
memory_available | Byte | Available memory size | Common |
memory_buffers | Byte | Memory buffer size | Linux |
memory_cached | Byte | Memory cache size | Common |
memory_free | Byte | Free memory size | Common |
memory_pageFaults | Positive number | Memory page fault count | Common |
memory_pavailble | Percent | Available memory utilization | Common |
memory_pused | Percent | Memory utilization | Common |
memory_shared | Byte | Shared memory size | Linux |
memory_slab | Byte | Slab memory size | Linux |
memory_sreclaimable | Byte | Slab reclaimable memory size | Linux |
memory_sunreclaim | Byte | Slab claimable memory size | Linux |
memory_swappused | Percent | Swap utilization | Common |
memory_swaptotal | Byte | Total swap size | Common |
memory_swapused | Byte | Swap usage | Common |
memory_total | Byte | Total memory size | Common |
memory_used | Byte | Memory usage | Common |
processorQueueLength | Positive number | Process queue length | Windows |
uptime | Positive number | Server uptime after a boot (TimeStamp) | Common |
server_cpu_core
Metrics for each server CPU core
-
Collection interval: 5 seconds
-
Statistical data: -
Tags
Tag name | Description | Remarks |
---|---|---|
cpu | Server core number | - |
ip | Server IP | - |
oname | Server name | - |
os | OS type (Linux or Windows) | - |
Fields
Field name | Unit | Description | Remarks |
---|---|---|---|
idle | Percent | Core idle rate | Common |
iowait | Percent | Core iowait rate | Windows |
irq | Percent | Core IRQ utilization | Linux |
load1 | Prime number | Core load average for 1 minute | Linux |
load5 | Prime number | Core load average for 5 minutes | Linux |
load15 | Prime number | Core load average for 15 minutes | Linux |
nice | Percent | Core nice utilization | Common |
processorQueueLength | Positive number | Core process queue length | Windows |
softirq | Percent | Core softirq utilization | Linux |
steal | Percent | Core steal utilization | Linux |
sys | Percent | Core sys utilization | Linux |
usr | Percent | Core usr utilization | Linux |
server_disk
It has metrics for each file system mounted on each server.
-
Collection interval: 5 seconds
-
Statistical data: 5 minutes, 1 hour
Tags
Tag name | Description | Remarks |
---|---|---|
deviceId | Device ID | - |
deviceIdHash | Device ID hash value | - |
fileSystem | File system type | - |
mountPoint | File system's mount point | - |
mountPointHash | File system's mount point hash | - |
oname | Server name | - |
os | OS type (Linux or Windows) | - |
Fields
Field name | Unit | Description | Remarks |
---|---|---|---|
blksize | Byte | Block size | Linux |
count | Positive number | WhaTap only | Common |
freePercent | Percent | File system's free space rate | Linux |
freeSpace | Byte | File system's free space amount | Linux |
ioPercent | Percent | IO utilization In some disk types, inaccurate values can be measured. | Linux |
ioutilOrder | Positive number | WhaTap only | Linux |
queueLength | Mistake | Queue length | Common |
totalInode | Positive number | Total number of Inodes | Linux |
totalSpace | Byte | Total size of the file system | Linux |
usedPercent | Percent | File system utilization | Linux |
usedInode | Positive number | Number of used inodes | Linux |
usedInodePercent | Percent | Inode utilization | Linux |
userSpaceOrder | Positive number | WhaTap only | Common |
readBps | Byte | Bytes read per second | Common |
writeBps | Byte | Bytes written per second | Common |
readIops | Prime number | Read count per second | Common |
writeIops | Prime number | Written count per second | Common |
server_network
Network interface metrics for each server
-
Collection interval: 5 seconds
-
Statistical data: 5 minutes, 1 hour
Tags
Tag name | Description | Remarks |
---|---|---|
desc | Network interface name | - |
descHash | Network interface's hash value | - |
hwAddr | MAC address | - |
ip | Interface IP address | - |
oname | Server name | - |
os | OS type (Linux or Windows) | - |
Fields
Field name | Unit | Description | Remarks |
---|---|---|---|
bandwidthOrder | Positive number | WhaTap only | - |
count | Positive number | WhaTap only | Common |
droppedIn | Positive number | Inbound drop | - |
droppedIn | Positive number | Outbound drop | - |
errorIn | Positive number | Inbound error | - |
errorOut | Positive number | Outbound error | - |
packetIn | pps | Inbound packet | - |
packetOut | pps | Outbound packet | - |
trafficIn | bps | Inbound traffic | - |
trafficInAcct | Positive number | WhaTap only | - |
trafficOut | bps | Outbound traffic | - |
trafficOutAcct | Positive number | WhaTap only | - |
server_netstat
It provides the network protocol statistics for each server.
-
Collection interval: 5 seconds
-
Statistical data: 5 minutes, 1 hour
Tags
Tag name | Description | Remarks |
---|---|---|
oname | Server name | - |
os_arch | OS architecture name | - |
os_cpucore | Number of server CPU cores | - |
os_cpusocket | Number of server CPU sockets | - |
os_cpuvendor | Server CPU's vendor name | - |
os_ip | Server IP | - |
os_memory | Server memory size | - |
OS name type (Linux or Windows) | - | |
os_public_ip | Public server IP | - |
os_release | Server OS release name | - |
whatap_build | WhaTap agent's build hash value | - |
whatap_home | Path of the WhaTap agent Home | - |
whatap_product | WhaTap product type | - |
whatap_version | WhaTap agent version | - |
Fields
Field name | Unit | Description | Remarks |
---|---|---|---|
netstat_Close_Wait | Positive number | Number of CLOSE_WAIT state sockets | Common |
netstat_Closing | Positive number | Number of closed sockets | Common |
netstat_Established | Positive number | Number of established sockets | Common |
netstat_Fin_Wait1 | Positive number | Number of FIN_WAIT1 sockets | Common |
netstat_Fin_Wait2 | Positive number | Number of FIN_WAIT2 sockets | Common |
netstat_Last_Ack | Positive number | Number of LASK_ASK sockets | Common |
netstat_Listen | Positive number | Number of LISTEN sockets | Common |
netstat_Syn_Recv | Positive number | Number of SYN_RECV sockets | Common |
netstat_Syn_Sent | Positive number | Number of SYN_SENT sockets | Common |
netstat_Time_Wait | Positive number | Number of TIME_WAIT sockets | Common |
netstat_Unknown | Positive number | Number of unknown sockets | Common |
Server_Filedescriptors | Positive number | Number of OS file descriptors | Common |
server_process
It has the server process information.
-
Collection interval: 20 seconds
-
Statistical data: 5 minutes
Tags
Tag name | Description | Remarks |
---|---|---|
alias | Server alias | - |
cmd1 | Process execution command 1 | - |
cmd2 | Process execution parameter | - |
hash | Process hash value | - |
name | Process name | - |
oname | Server name | - |
user | Linux user name for the process | - |
Fields
Field name | Unit | Description | Remarks |
---|---|---|---|
count | Positive number | Number of processes | Common |
cpu | Percent | CPU Utilization | Common |
cpuAccumulated | Percent | WhaTap only | Common |
cpuOrder | Order | WhaTap only | Common |
memOrder | Order | WhaTap only | Common |
memory | Percent | Memory utilization | Common |
memoryAccumulated | Percent | WhaTap only | Common |
openFileDescriptors | Positive number | Number of open file descriptors | Common |
rbps | Byte | Bytes of the disk read per second | Common |
rbpsAccmulated | Byte | WhaTap only | Common |
riops | Positive number | Count read per second in the disk | Common |
riopsAccmulated | Positive number | WhaTap only | Common |
wbps | Byte | Bytes written per second in the disk | Common |
wbpsAccmulated | Byte | WhaTap only | Common |
wiops | Positive number | Count written per second in the disk | Common |
wiopsAccmulated | Positive number | WhaTap only | Common |
rss | Byte | RSS memory size | Common |
rssAccmulated | Byte | WhaTap only | Common |
Catetory V2
infra_cpu
It is a metric related to the server's CPU usage.
-
Collection interval: 2 seconds
-
Statistical data: 5 minutes, 1 hour
Fields
Field name | Description | OS Name |
---|---|---|
total | It displays the sum of the utilization in user mode (User) and kernel mode (Sys) out of the total CPU utilization. | HP-UX, AIX, Solaris, Linux, Windows |
user | It displays the sum of the utilization in user mode (User) and kernel mode (Sys) out of the total CPU utilization. | HP-UX, AIX, Solaris, Linux, Windows |
sys | It indicates the utilization in kernel mode (Sys) out of the total CPU utilization. | HP-UX, AIX, Solaris, Linux, Windows |
nice | It indicates the CPU utilization of low-priority (niced) processes. The utilization of high-priority and default processes appears in user (percent). | HP-UX, Linux |
wait | It indicates the time ratio consumed to wait for finishing the I/O operations out of the total CPU utilization. | HP-UX, AIX, Solaris, Linux |
total_wait | It indicates the sum of the utilization in user mode (User), kernel mode (Sys), and I/O waiting (wait) out of the total CPU utilization. | HP-UX, AIX, Solaris, Linux |
idle | It indicates the percentage of CPU idle time, not performing any work. | HP-UX, AIX, Solaris, Linux, Windows |
lCores | It indicates the number of logical cores allocated to the OS. | HP-UX, AIX, Solaris, Linux, Windows |
irq | It indicates the CPU utilization used to process the hardware interrupt requests. | Linux |
softirq | It indicates the CPU utilization used to process the software interrupt requests. | Linux |
steal | In a virtualized environment, it indicates the time ratio that the virtual CPU waits involuntarily while the hypervisor is servicing another process. | Linux |
procUtil | This Windows-only metric indicates the total CPU utilization for all processors in the system (Windows Task Manager's CPU utilization). | Windows |
loadavg_1m | It indicates the average load of the system over the past one minute. | HP-UX, AIX, Solaris, Linux |
loadavg_5m | It indicates the average load of the system over the past five minutes. | HP-UX, AIX, Solaris, Linux |
loadavg_15m | It indicates the average load of the system over the past 15 minutes. | HP-UX, AIX, Solaris, Linux |
infra_mem
It is a metric related to the server's memory usage.
-
Collection interval: 2 seconds
-
Statistical data: 5 minutes, 1 hour
Fields
Field name | Description | OS Name |
---|---|---|
pUsed | It indicates only the memory utilization for user processes or the kernel out of the total memory utilization (excluding buffer and cache). | HP-UX, AIX, Solaris, Linux, Windows |
used | It displays the memory usage in bytes for user processes or the kernel, excluding buffer and cache areas from the total memory. | HP-UX, AIX, Solaris, Linux, Windows |
pCache | It indicates the ratio of the area used as cache among the total memory. | HP-UX, AIX, Solaris, Linux |
cache | It displays the amount of memory used as cache among the total memory in bytes. | HP-UX, AIX, Solaris, Linux |
buffer | This Linux-only metric displays the amount of buffer used for I/O operations out of the total memory in bytes. | Linux |
pUsed_pCache | It displays the memory utilization as the sum of memory and cache areas used by user processes or the kernel. | HP-UX, AIX, Solaris, Linux |
size | It displays the total memory size allocated to the server in bytes. | HP-UX, AIX, Solaris, Linux, Windows |
available | It displays the available memory size in bytes. | Linux |
sreclaimable | It displays the size of the cache memory used by the kernel that can be reclaimed if necessary in bytes. | Linux |
sunreclaim | It displays the size of the cache memory being used by the kernel that cannot be reclaimed in bytes. | Linux |
slab | It displays the size of cache memory used by the kernel in bytes, representing the sum of sreclaimable and sunreclaim. | Linux |
pageFaults | It is the number of cases that when a process accesses memory, the corresponding data does not exist in physical memory. | Linux |
poolPaged | This Windows-only metric displays the bytes in the paged pool, which represents the memory area that can be swapped out to disk. | Windows |
poolNonPaged | This Windows-only metric displays the bytes in the nonpaged pool, which is an area that is always resident in memory. | Windows |
systemCacheResident | This Windows-only metric displays the bytes in the system cache memory. | Windows |
swapIn | It displays the number of pages per second moved from the swap (paging space) area of the disk into memory. | HP-UX, AIX, Solaris, Linux, Windows |
swapOut | It displays the number of pages per second moved from the memory to the disk swap (paging space) area. | HP-UX, AIX, Solaris, Linux, Windows |
fsIn | It displays the number of pages per second loaded into memory based on the read operation on the file system. | HP-UX, AIX, Solaris, Linux, Windows |
fsOut | It displays the number of pages per second written to disk based on the write operation on the file system. | HP-UX, AIX, Solaris, Linux, Windows |
scanned | It is the number of pages scanned per second for swap out or page replacement. | HP-UX, AIX, Solaris |
freed | It is the number of pages of free memory space created per second when a swap out occurs. | HP-UX, AIX, Solaris |
infra_swap
It is a metric related to the server's swap usage.
-
Collection interval: 2 seconds
-
Statistical data: 5 minutes, 1 hour
Fields
Field name | Description | OS Name |
---|---|---|
size | It displays the size of the total allocated swaps in bytes. | HP-UX, AIX, Solaris, Linux, Windows |
pUsed | It displays the size of the swaps in use as a percentage. | HP-UX, AIX, Solaris, Linux, Windows |
used | It displays the size of the allocated active swaps in bytes. | HP-UX, AIX, Solaris, Linux, Windows |
infra_kernel
It is a performance metric related to the server's resource usage of the system and kernel.
-
Collection interval: 2 seconds
-
Statistical data: 5 minutes, 1 hour
Fields
Field name | Description | OS Name |
---|---|---|
runq | It is the number of processes (or threads) that are ready to run but have not yet been assigned CPU. | HP-UX, AIX, Solaris, Linux, Windows |
blockq | This indicates the number of processes (or threads) in the queue for I/O, paging, and such. | HP-UX, AIX, Solaris, Linux |
pendingq | It is an AIX-only metric, indicating the number of threads waiting for response messages for I/O on raw devices. | AIX |
waitq | It indicates the number of processes (or threads) that have been swapped out but are temporarily in sleep state. | HP-UX, Solaris |
fork | It indicates the average number of forks per second. (fork: a system call that creates a new process by duplicating the existing process) | HP-UX, AIX, Solaris, Linux |
exec | It indicates the average number of exec calls per second. (exec: a system call that replaces the current process with a new program) | HP-UX, AIX, Solaris |
interrupt | It displays the number of cases per second where the CPU handled hardware interrupts. | HP-UX, AIX, Solaris, Linux, Windows |
syscall | It displays the number of cases per second where the kernel processed system calls. | HP-UX, AIX, Solaris, Windows |
ctxswitch | It displays the number of context switches per second between processes. | HP-UX, AIX, Solaris, Linux, Windows |
infra_disk
It is the metric related to the server's physical disk capacity and input/output (IO) performance.
-
Collection interval: 2 seconds
-
Statistical data: 5 minutes, 1 hour
Fields
Field name | Description | OS Name |
---|---|---|
read | It displays the total amount of data read from all physical disks on the server in bytes. | HP-UX, AIX, Solaris, Linux, Windows |
write | It displays the total amount of data written to all physical disks on the server in bytes. | HP-UX, AIX, Solaris, Linux, Windows |
total | It displays the total amount of data read and written in all physical disks on the server in bytes. | HP-UX, AIX, Solaris, Linux, Windows |
readIops | It displays the number of read operations processed per second across all physical disks on the server. | HP-UX, AIX, Solaris, Linux, Windows |
writeIops | It displays the number of write operations processed per second across all physical disks on the server. | HP-UX, AIX, Solaris, Linux, Windows |
totalIops | It displays the number of read and write operations processed per second across all physical disks on the server. | HP-UX, AIX, Solaris, Linux, Windows |
infra_network
It is a metric related to the total network traffic of the server.
-
Collection interval: 2 seconds
-
Statistical data: 5 minutes, 1 hour
Fields
Field name | Description | OS Name |
---|---|---|
rxbps | It displays the amount of data received (Rx) via all NICs (excluding loopback) on the server in bits per second (bps). | HP-UX, AIX, Solaris, Linux, Windows |
txbps | It displays the amount of data transmitted (Tx) via all NICs (excluding loopback) on the server in bits per second (bps). | HP-UX, AIX, Solaris, Linux, Windows |
totalbps | It displays the total amount of data received and transmitted (Tx) via all NICs (excluding loopback) on the server in bits per second (bps). | HP-UX, AIX, Solaris, Linux, Windows |
rxpps | It displays the packets per second received (Rx) via all NICs (excluding loopback) on the server. | HP-UX, AIX, Solaris, Linux, Windows |
txpps | It displays packets per second transmitted (Tx) via all NICs (excluding loopback) on the server. | HP-UX, AIX, Solaris, Linux, Windows |
totalpps | It displays packets per second received (Rx) and transmitted (Tx) via all NICs (excluding loopback) on the server. | HP-UX, AIX, Solaris, Linux, Windows |
The loopback interface (lo) traffic is excluded from aggregation.
infra_process
It is a metric related to the status of active processes and resources on the server.
-
Collection interval: 2 seconds
-
Statistical data: 5 minutes, 1 hour
Fields
Field name | Description | OS Name |
---|---|---|
procTotal | This value displays the total number of processes running on the server. | HP-UX, AIX, Solaris, Linux, Windows |
threadTotal | This value displays the total number of threads running on the server. | HP-UX, AIX, Solaris, Linux, Windows |
handleTotal | This value displays the total number of handles running on the Windows OS. | Windows |
defunctTotal | This value indicates the number of defunct processes. | HP-UX, AIX, Solaris, Linux |
It displays the numbers of processes, threads, handles, and defunct processes running on the server.