Observability dashboards (Beta) for Jitterbit private agents 11.37 or later

Introduction

The following sections show example private agent observability dashboard charts, what each chart illustrates, and which metrics each chart is based on.

The images shown are from the Datadog dashboard. Elasticsearch dashboards contain the same charts and metrics.

Agent Capability (section)

The charts in this section show the agent's capability, that is, its ability to process requests. The values are:

  • Capable: Value 1 (shown in green). A capable agent can accept more operations for processing.

  • Incapable: Value 0 (shown in red). An incapable agent has reached its maximum processing capacity and cannot accept more operations.

Tip

Availability and capability are separate concepts; an agent can be both available (registered as online) and incapable (busy processing its maximum workload).

API Capability Over Time

How the agent's capability to execute APIs has changed over time.

Metric Description
com.jitterbit.agent.capability.api.capable Agent can handle APIs.

Agent Capable

At-a-glance status indication of each agent's capability across these metrics:

Metric Description
com.jitterbit.agent.capability.system.resource.capable Agent can process operations.
com.jitterbit.agent.capability.runop.capable Agent can run operations.
com.jitterbit.agent.capability.api.capable Agent can handle APIs.
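
As a rough illustration of how the three metrics above could be combined into a single at-a-glance status, here is a minimal Python sketch. It assumes the latest value of each metric has already been fetched into a dictionary. The `overall_status` helper and the data layout are hypothetical and not part of any Jitterbit or Datadog API. Treating an agent as capable only when all three metrics report 1 is also an assumption made for this example.

```python
# Metric names are taken from the table above. Everything else is illustrative.
CAPABILITY_METRICS = (
    "com.jitterbit.agent.capability.system.resource.capable",
    "com.jitterbit.agent.capability.runop.capable",
    "com.jitterbit.agent.capability.api.capable",
)

def overall_status(latest):
    """Return (status, failing_metrics) for one agent.

    Assumption: the agent counts as capable only if every metric is 1.
    A metric that is missing from `latest` is treated as not capable.
    """
    failing = [name for name in CAPABILITY_METRICS if latest.get(name) != 1]
    status = "incapable" if failing else "capable"
    return status, failing

latest = {
    "com.jitterbit.agent.capability.system.resource.capable": 1,
    "com.jitterbit.agent.capability.runop.capable": 0,
    "com.jitterbit.agent.capability.api.capable": 1,
}
print(overall_status(latest))
# ('incapable', ['com.jitterbit.agent.capability.runop.capable'])
```

Returning the list of failing metrics alongside the status makes it easier to see which capability is the limiting one.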

CPU Capability

The agent's CPU usage and CPU threshold values over time.

Metric Description
com.jitterbit.agent.capability.cpu.usage Agent host's CPU usage.
com.jitterbit.agent.capability.cpu.max.threshold CPU threshold.
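
To show how the usage and threshold series in this chart relate, the following Python sketch lists the sample positions where usage reaches the threshold. It assumes both series were exported as equal-length lists sampled at the same timestamps and expressed in the same unit. The `threshold_breaches` helper is hypothetical, and counting "usage equal to the threshold" as a breach is an assumption, since the exact rule the agent applies is not described here.

```python
def threshold_breaches(usage, threshold):
    """Return the indexes of samples where usage >= threshold.

    Assumption: both lists cover the same timestamps in the same order.
    """
    if len(usage) != len(threshold):
        raise ValueError("usage and threshold must have the same length")
    return [i for i, (u, t) in enumerate(zip(usage, threshold)) if u >= t]

# Made-up sample values for illustration only.
cpu_usage = [35.0, 62.5, 91.0, 88.0, 40.0]
cpu_threshold = [85.0] * 5
print(threshold_breaches(cpu_usage, cpu_threshold))  # [2, 3]
```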

System Resource Capable over time

The agent host's resource capability over time.

Metric Description
com.jitterbit.agent.capability.system.resource.capable Agent host's resource capability (1 = capable, 0 = not capable).

Memory Capability

The agent host's memory usage and memory threshold values over time.

Metric Description
com.jitterbit.agent.capability.memory.usage Agent host's memory usage.
com.jitterbit.agent.capability.memory.max.threshold Memory threshold.

Run Operation Capable over time

The agent's capability to run operations over time.

Metric Description
com.jitterbit.agent.capability.runop.capable Agent's operation execution capability (1 = capable, 0 = not capable).

Apache Thread Capability

The number of threads and the thread threshold for the agent's Apache web server.

Metric Description
com.jitterbit.agent.capability.apache.usage Number of Apache threads used.
com.jitterbit.agent.capability.apache.total Number of allowed Apache threads.

Queue Thread Capability

The number of queue threads and the queue thread threshold for the agent.

Metric Description
com.jitterbit.agent.capability.queue.usage Number of queue threads used.
com.jitterbit.agent.capability.queue.max Number of allowed queue threads.
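
The thread charts pair a "used" metric with an "allowed" metric. One way to read such a pair is as a utilization percentage, sketched below in Python. The `utilization_percent` helper is hypothetical and the sample numbers are made up. The dashboards plot the two series directly and may not compute this ratio themselves.

```python
def utilization_percent(used, allowed):
    """Percentage of the allowed threads that are currently in use."""
    if allowed <= 0:
        raise ValueError("allowed must be positive")
    return 100.0 * used / allowed

# For example, queue.usage = 8 against queue.max = 10:
print(utilization_percent(8, 10))    # 80.0
# Or apache.usage = 45 against apache.total = 150:
print(utilization_percent(45, 150))  # 30.0
```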

Agent Operations (section)

Operations over time

The number of successful, failed, and processing operations over time.

Metric Description
metric.operation.count.by.status Number of operations per tagged state (successful, failed, processing).
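
Because a single metric carries every state as a tag, the per-state series are obtained by grouping on that tag. The Python sketch below shows the idea on already-exported data points. The `status` and `value` field names are assumptions about the export format, not a documented schema, and `count_by_status` is a hypothetical helper.

```python
from collections import Counter

def count_by_status(points):
    """Sum operation counts per status tag.

    Assumption: each point is a dict with a 'status' tag and a numeric 'value'.
    """
    totals = Counter()
    for point in points:
        totals[point["status"]] += point["value"]
    return dict(totals)

# Made-up data points for illustration only.
points = [
    {"status": "successful", "value": 12},
    {"status": "failed", "value": 1},
    {"status": "successful", "value": 8},
    {"status": "processing", "value": 3},
]
print(count_by_status(points))
# {'successful': 20, 'failed': 1, 'processing': 3}
```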

Operation statuses over time

The number of running, successful (with information), submitted, and error operations over time.

Metric Description
metric.operation.count.by.status Number of operations per tagged state (running, success_with_info, submitted, error).

Top 10 Operations

Metric Description
metric.operation.count.by.status Number of operations per tagged state.

Top 10 Successful

Metric Description
metric.operation.count.by.status Number of operations per tagged state.

Top 10 Failed

Metric Description
metric.operation.count.by.status Number of operations per tagged state.

System Load (Linux only)

The 5-minute average system load per agent group.

Metric Description
system.load.5 5-minute average system load.

System Process Queue (Windows only)

The average number of threads in the system process queue.

Metric Description
system.proc.queue_length The number of threads that are observed as delayed in the processor ready queue and are waiting to be executed.

Unsynced Operation Logs

The number of unsynchronized agent operation logs, plotted by time.

Metric Description
private_agent.unsynced_logs Number of unsynchronized agent operation logs.

Incoming message queue by agent group

The number of operations in the queue waiting to be processed per agent group.

Metric Description
private_agent.incomingmsgtab_count Number of queue-bound operations.

Apache idle workers

The number of idle Apache worker threads per agent host, plotted by time.

Metric Description
apache.performance.idle_workers Number of idle worker threads in the agent's Apache web server.

Apache busy workers

The number of busy Apache worker threads per agent host, plotted by time.

Metric Description
apache.performance.busy_workers Number of busy worker threads in the agent's Apache web server.

Postgres Connections

The number of active PostgreSQL database connections per agent host, plotted by time.

Metric Description
postgresql.connections The number of active connections to this database.

Postgres Rows fetched / returned / inserted / updated (per sec)

The number of rows fetched, returned, inserted, or updated per second by the agent's PostgreSQL database, plotted by time.

Metric Description
postgresql.rows_fetched The number of rows fetched by queries in this database.
postgresql.rows_returned The number of rows returned by queries in this database.
postgresql.rows_inserted The number of rows inserted by queries in this database.
postgresql.rows_updated The number of rows updated by queries in this database.

Postgres - Inserts / updates / deletes (per sec)

The number of rows inserted, updated, or deleted per second by the agent's PostgreSQL database, plotted by time.

Metric Description
postgresql.rows_inserted The number of rows inserted by queries in this database.
postgresql.rows_updated The number of rows updated by queries in this database.
postgresql.rows_deleted The number of rows deleted by queries in this database.

Postgres - Deadlocks per database

The number of PostgreSQL database deadlocks per agent host, plotted by time.

Metric Description
postgresql.deadlocks The rate of deadlocks detected in this database.

Agent CPU Average

The percentage of time the CPU spends on average in user and kernel space processes, per agent host, plotted by time.

Metric Description
system.cpu.user The percent of time the CPU spent running user space processes.
system.cpu.system The percent of time the CPU spent running the kernel.

Agent CPU Max

The maximum percentage of time the CPU spends in user and kernel space processes, per agent host, plotted by time.

Metric Description
system.cpu.user The percent of time the CPU spent running user space processes.
system.cpu.system The percent of time the CPU spent running the kernel.

Percentage of memory used

Used memory as a percentage of total memory, per agent host, plotted by time.

Metric Description
system.mem.total The total amount of physical RAM.
system.mem.usable Value of MemAvailable from /proc/meminfo if present, but falls back to adding free + buffered + cached memory if not.
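
Neither metric is a percentage on its own, so the chart value has to be derived from the two. A plausible derivation, sketched in Python below, treats everything that is not usable as used. The dashboard's actual query is not shown here, so this formula is an assumption, and `memory_used_percent` is a hypothetical helper.

```python
def memory_used_percent(total, usable):
    """Used memory as a percentage of total memory.

    Assumption: used = total - usable, with both values in the same unit.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * (total - usable) / total

# A host with 16 GB of RAM, 4 GB of which is usable:
print(memory_used_percent(16_000_000_000, 4_000_000_000))  # 75.0
```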

Network IO

The number of read and write requests, per agent host, plotted by time.

Metric Description
system.io.r_s The number of read requests issued to the device per second.
system.io.w_s The number of write requests issued to the device per second.

Network traffic received (per sec)

The number of kilobytes per second received, per agent host, plotted by time.

Metric Description
system.net.bytes_rcvd The number of bytes received on a device per second.

Network traffic sent (per sec)

The number of kilobytes per second sent, per agent host, plotted by time.

Metric Description
system.net.bytes_sent The number of bytes sent from a device per second.

Disk Space % used

The percentage of disk space used per device, plotted by time.

Metric Description
system.disk.used The amount of disk space in use.
system.disk.total The total amount of disk space.

Disk utilization (%) (Linux only)

The percentage of disk space utilization, per agent host, plotted by time.

Metric Description
system.io.util The percent of CPU time during which I/O requests were issued to the device.
system.disk.utilized The amount of disk space in use as a percentage of the total.

Disk read time

The percentage of time spent reading the disk, per agent host, plotted by time.

Metric Description
system.disk.read_time_pct Percent of time spent reading from disk.

Disk write time

The percentage of time spent writing to the disk, per agent host, plotted by time.

Metric Description
system.disk.write_time_pct Percent of time spent writing to disk.

Java Heap Memory

The total amount of Java heap memory, per agent host, plotted by time.

Metric Description
jvm.heap_memory The total Java heap memory used.

JVM thread count

Number of JVM threads, per agent host, plotted by time.

Metric Description
jvm.thread_count The number of live threads.

Incoming MessageTab by Group

Number of items in the agent's PostgreSQL messagestab table, by agent host, plotted by time.

Metric Description
private_agent.incomingmsgtab_count Number of rows in TranDb.messagestab.

OperationsEngine Processes

Number of operations engine processes, plotted by time.

Metric Description
proc.operationsengine.cpu.num_threads Number of CPU threads used by the operations engine.

Operations by agent (top 25)

The top 25 agents by number of operations.

Metric Description
proc.openginebyname.cpu.num_threads Number of CPU threads used by the operations engine per agent.

Agent Services Client Active Connections

Number of active client connections per agent, plotted by time.

Metric Description
com.jitterbit.agent.services.client.active.connections Number of active client connections.

Agent Services Client Max Connections

Maximum number of allowed client connections per agent, plotted by time.

Metric Description
com.jitterbit.agent.services.client.max.connections Maximum number of allowed active client connections.

Agent Services Client Available Connections

Number of available client connections per agent, plotted by time.

Metric Description
com.jitterbit.agent.services.client.available.connections Number of available client connections.

Rest Client Active Connections

Number of active client REST connections per agent, plotted by time.

Metric Description
com.jitterbit.rest.client.active.connections Number of active REST client connections.

Rest Client Max Connections

Maximum number of allowed client REST connections per agent, plotted by time.

Metric Description
com.jitterbit.rest.client.max.connections Maximum number of allowed REST client connections.

Rest Client Idle Connections

Number of idle client REST connections per agent, plotted by time.

Metric Description
com.jitterbit.rest.client.available.connections Number of available REST client connections.

Rest Client Local Active Connections

Number of active local client REST connections per agent, plotted by time.

Metric Description
com.jitterbit.rest.client.local.active.connections Number of active local REST client connections.

Rest Client Local Max Connections

Maximum number of allowed local client REST connections per agent, plotted by time.

Metric Description
com.jitterbit.rest.client.local.max.connections Maximum number of allowed local REST client connections.

Rest Client Local Idle Connections

Number of available local client REST connections per agent, plotted by time.

Metric Description
com.jitterbit.rest.client.local.available.connections Number of available local REST client connections.

Operations Running Over Schedule

A list of operations that have run over schedule, and how many times each has done so.

Metric Description
metric.operation.runing.over.scheduled.interval Names of operations running over schedule.

Submitted Operations Waiting to Run

Operations submitted but not yet started. (The Pending Operations column shows how many operation instances are pending in Submitted status.)

Metric Description
metric.operation.count.by.status Number of operations per status.

Operation Duration

Operation completion duration, and the current interval of an operation instance run, by operation ID, plotted by time.

This chart shows if an operation instance is taking more than the average amount of time the operation takes to finish.

Note

If there are changes to the operation that increase the running time, the average operation duration will be initially skewed, but should correct itself over time.

Metric Description
private_agent.operation_run_time_seconds How long the operation has been running for.
metric.operation.duration.seconds The average operation run time.
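
The comparison this chart supports, current run time against average duration, can be sketched in Python as follows. The sketch assumes the latest value of each metric has been exported into a dictionary keyed by operation ID. The operation IDs, the sample values, and the `running_longer_than_average` helper are hypothetical.

```python
def running_longer_than_average(run_time_seconds, average_seconds):
    """Map operation ID to (current run time / average duration).

    Only operations whose current run exceeds their average are returned.
    Operations with no average, or an average of zero, are skipped.
    """
    slow = {}
    for op_id, elapsed in run_time_seconds.items():
        average = average_seconds.get(op_id)
        if average and elapsed > average:
            slow[op_id] = round(elapsed / average, 2)
    return slow

# Made-up values standing in for private_agent.operation_run_time_seconds
# and metric.operation.duration.seconds.
current = {"op-101": 340.0, "op-102": 12.0, "op-103": 95.0}
average = {"op-101": 120.0, "op-102": 15.0}
print(running_longer_than_average(current, average))  # {'op-101': 2.83}
```

As the note above says, the average can be skewed right after an operation changes, so a ratio slightly above 1 is not necessarily a problem.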