Documentation in progress. New content is added regularly.

Uptime Monitoring

Track agent uptime and availability

Knowing your agent is deployed is not the same as knowing it is reachable. Monitoring tracks whether your agents are online and responding, surfaces daily and historical uptime, and alerts the right people when something breaks. Open Observatory > Monitoring to see the status of every agent at a glance.

How Monitoring Works

Monitoring records the result of every health check and presents it as a week-by-week view of uptime. Over time, this builds a history you can use to spot reliability issues, verify that deployments went smoothly, and hold your production agents to an availability standard.

The platform sends test requests to each monitored agent multiple times throughout the day. If the agent responds correctly, the check passes and the agent is marked as healthy. If it fails to respond or returns an error, the agent is marked as down, meaning it was unreachable or unable to answer during that check.

Automated Health Checks

The platform tests each agent multiple times a day, verifying it is reachable and able to respond

Uptime History

Daily health status and uptime trends across four time windows: 24 hours, 7 days, 30 days, and 90 days

Email Alerts

Configure per-agent notifications so the right team members are alerted the moment an agent goes down

Enabling Monitoring for an Agent

Agents that support monitoring appear in the table with their toggle off by default. To start tracking an agent:

1

Locate the Agent

Find the agent you want to monitor in the table. Use the Agents filter or sort by Agent Name to narrow down the list.

2

Enable Monitoring

Turn on the Enable Agent Monitoring toggle at the far right of the agent's row.

3

Add Notification Recipients

Select the dropdown arrow next to the toggle to open the Email notification panel. Search for and select the console users who should receive email alerts when this agent's status changes. Confirm with OK.

Notifications are configured per agent, so different team members can be responsible for different agents.
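
To illustrate what per-agent routing means in practice, here is a minimal Python sketch of alerting on a status change. It is not the platform's implementation: the function name, the status strings, the message format, and the `recipients_by_agent` mapping are all hypothetical and exist only for this example.

```python
from typing import Dict, List, Tuple


def alerts_for_transition(
    agent: str,
    previous: str,
    current: str,
    recipients_by_agent: Dict[str, List[str]],
) -> List[Tuple[str, str]]:
    """Return (email, message) pairs to send when an agent's status changes.

    Hypothetical helper: the statuses are plain strings such as "healthy"
    and "down", and recipients are looked up per agent.
    """
    # No change in status means nobody is notified.
    if previous == current:
        return []
    message = f"{agent} is now {current}"
    # Only the recipients configured for this specific agent are alerted.
    return [(email, message) for email in recipients_by_agent.get(agent, [])]


recipients = {"support-bot": ["ana@example.com", "li@example.com"]}
for email, message in alerts_for_transition("support-bot", "healthy", "down", recipients):
    print(email, "->", message)
```

An agent with no configured recipients produces no alerts, which is one reason to add recipients as soon as monitoring is enabled.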

Checking Agent Uptime

Monitoring shares the same filtering controls found across Observatory, with one difference: instead of selecting a custom date range, the date picker works week by week. Select any day on the calendar and the table updates to show the full Sunday-to-Saturday week that contains it.

Date Range

Select any day on the calendar to choose the Sunday-to-Saturday week you want to view

Agents

Filter by agent or narrow it down to a specific deployment
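
The week-by-week behavior of the date picker can be sketched in a few lines of Python. This only illustrates the Sunday-to-Saturday rule described above; `week_bounds` is a hypothetical helper, not part of the product.

```python
from datetime import date, timedelta
from typing import Tuple


def week_bounds(day: date) -> Tuple[date, date]:
    """Return the Sunday and Saturday that bracket the given day."""
    # date.weekday() counts Monday as 0 and Sunday as 6,
    # so shift by one to get the number of days since Sunday.
    days_since_sunday = (day.weekday() + 1) % 7
    sunday = day - timedelta(days=days_since_sunday)
    saturday = sunday + timedelta(days=6)
    return sunday, saturday


start, end = week_bounds(date(2024, 5, 15))  # a Wednesday
print(start, end)  # 2024-05-12 2024-05-18
```

Selecting any other day in the same week, including the Sunday or the Saturday itself, yields the same pair of dates.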

The uptime table shows the status of each agent for each day of the selected week. A green checkmark means the agent was healthy that day; a dash means no data was collected. Current Status shows whether the agent responded to the most recent check. Hover over any day cell to see the exact uptime percentage for that day.

Overall Uptime

Select the expand icon on any agent row to reveal its overall uptime across four time windows: last 24 hours, 7 days, 30 days, and 90 days. If monitoring has not collected enough data yet, the charts show "No data."
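
As a rough mental model for the four windows, the sketch below computes an uptime percentage per window from a list of check results. The exact formula the platform uses is not documented here, so this assumes uptime is simply passed checks divided by total checks in the window. `CheckResult`, `WINDOWS`, and `uptime_by_window` are hypothetical names.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Dict, List, Optional


@dataclass
class CheckResult:
    timestamp: datetime
    passed: bool


# The four windows shown in the expanded agent row.
WINDOWS = {
    "24h": timedelta(hours=24),
    "7d": timedelta(days=7),
    "30d": timedelta(days=30),
    "90d": timedelta(days=90),
}


def uptime_by_window(checks: List[CheckResult], now: datetime) -> Dict[str, Optional[float]]:
    """Return uptime percentage per window, or None when the window has no checks."""
    summary: Dict[str, Optional[float]] = {}
    for label, span in WINDOWS.items():
        recent = [c for c in checks if now - span <= c.timestamp <= now]
        if not recent:
            # Mirrors the "No data" state: nothing collected in this window.
            summary[label] = None
        else:
            passed = sum(1 for c in recent if c.passed)
            # Assumption: uptime = passed checks / total checks.
            summary[label] = 100.0 * passed / len(recent)
    return summary


now = datetime(2024, 5, 15, 12, 0)
checks = [
    CheckResult(now - timedelta(hours=1), True),
    CheckResult(now - timedelta(days=2), False),
    CheckResult(now - timedelta(days=10), True),
    CheckResult(now - timedelta(days=20), True),
]
print(uptime_by_window(checks, now))
# {'24h': 100.0, '7d': 50.0, '30d': 75.0, '90d': 75.0}
```

Note how a single failed check two days ago leaves the 24-hour figure untouched but halves the 7-day figure in this small sample, which is why the longer windows are useful context.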

Use these trends to catch gradual degradation before it becomes an outage. A steady 99% that drops to 92% over 30 days is worth investigating even if today's status looks healthy.
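
One simple way to reason about gradual degradation is to compare the start of a period with its end. The Python sketch below does this for a list of daily uptime percentages. It assumes you have those daily values on hand, for example noted from the day cells; the 7-day comparison window and the 2-point threshold are arbitrary illustrative choices, and both function names are hypothetical.

```python
from statistics import mean
from typing import List


def uptime_drop(daily_uptime: List[float], window: int = 7) -> float:
    """Average uptime of the earliest `window` days minus that of the latest `window` days."""
    if len(daily_uptime) < 2 * window:
        raise ValueError("need at least two full windows of daily values")
    return mean(daily_uptime[:window]) - mean(daily_uptime[-window:])


def is_degrading(daily_uptime: List[float], threshold: float = 2.0, window: int = 7) -> bool:
    """Flag a series whose uptime fell by at least `threshold` percentage points."""
    return uptime_drop(daily_uptime, window) >= threshold


# 30 days that start at 99% and end at 92%, as in the example above.
series = [99.0] * 15 + [92.0] * 15
print(uptime_drop(series))   # 7.0
print(is_degrading(series))  # True
```

Averaging over a window rather than comparing two single days keeps one bad day from triggering a false alarm.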

Best Practices

  • Monitor production agents only: Focus on agents serving real users to keep the table clean and alerts meaningful

  • Assign recipients per agent: Route alerts to the people responsible for each agent so notifications reach the right inbox

  • Watch the 30-day trend: A slow decline in uptime is easier to catch in the 30-day chart than in daily checkmarks
