Table of Contents

Share on:

Share on LinkedInShare on FacebookShare on Twitter
Playground Orkes

Ready to build reliable applications 10x faster?

ENGINEERING & TECHNOLOGY

Ensuring 99.999% Uptime: Why Service Uptime Monitoring is Crucial

Riza Farheen
Developer Advocate
February 25, 2025
7 min read

How often have you been frustrated by a website that’s down, taking forever to return online? If it happens too frequently, would you trust that platform again? Probably not.

With businesses running 24/7, downtime is more than just an inconvenience–it’s a direct hit to customer trust and revenue. Studies reveal that businesses can lose thousands of dollars for every hour of downtime. This is why businesses strive for “five-nines” (99.999%) uptime, with an average of less than 6 minutes of downtime annually.

Major outages can bring businesses to a standstill, leading to frustrated customers. This is where uptime monitoring becomes crucial to minimize potential losses. By proactively detecting downtime, businesses can prevent significant disruptions, keep things running smoothly, and build reliable services.

In this blog, we’ll explore why service uptime monitoring is essential for modern businesses and how Orkes Conductor can help implement application flows to monitor HTTP endpoint availability.

What is service uptime monitoring?

Service uptime monitoring is a process of monitoring and managing critical services, such as a website or application, to ensure they remain accessible 24/7.

With continuous monitoring, businesses can receive automated alerts when disruptions occur, allowing them to take immediate action. A robust system can notify relevant team members about any detected downtimes via multiple communication channels, ensuring the right team members can respond promptly.

Why use service uptime monitoring?

The demand for fast and reliable online services has always been higher, especially with the growing shift toward digital services. For instance, the increasing trend of digital payments is rapidly replacing traditional cash transactions.

According to Visa Perspective’s Top trends shaping 2025 payments:

Digital payment quote from Visa Perspective
Source: Visa Perspective

The payment industry is just one example of the growing shift toward online services. Industries like e-commerce and others are also witnessing increased consumer demand for digital platforms. As more businesses move toward online services, ensuring uptime monitoring becomes vital. By keeping track of service status, you can identify potential issues—such as outages, performance slowdowns, or errors—before they escalate into major problems.

Here are a few reasons why uptime monitoring is a must for businesses today:

1. Prevent reputation damage

Outages can severely damage your organization’s reputation. Frustrated users experiencing downtime may leave negative reviews, vent on social media, or decide to switch to a competitor.

2. Meet Service Level Agreements (SLAs)

Many businesses establish Service Level Agreements (SLAs) to meet specific performance standards. Failure to detect an outage can lead to missed SLAs, potentially resulting in contract violations and penalties.

3. Minimize loss of revenue

Downtime can directly impact revenue for businesses that rely on online sales and transactions. Customers may be unable to access services or complete purchases, leading to missed sales opportunities.

Consistent service uptime monitoring is essential to overcome these challenges. It ensures your services are quickly restored, minimizing disruptions and enhancing the user experience.

Building a service uptime monitoring with Orkes Conductor

Orkes Conductor, the leading workflow orchestration platform, can be leveraged to build a service uptime monitoring system that fits your business operations. Unlike off-the-shelf monitoring tools, Orkes Conductor gives you full control over uptime checks, allowing you to customize logic to fit your exact business needs. Let’s walk through building the service uptime monitoring flow.

To get started, simply create a free account in Orkes Developer Edition and follow along with the tutorial. This example is available as a ready-to-use template. For more details, check out the Availability Monitoring for HTTP Endpoints documentation to learn how to import and use it.

Service uptime monitoring flow

Let’s break down the logic for the service uptime monitoring system:

  • The system checks the service endpoint for availability.
  • It retrieves the server’s timestamp and status code.
  • If downtime is detected, a notification is sent via the preferred channel.

Now, let's build the workflows that will orchestrate this monitoring system. Here’s an overview of the service uptime monitoring system we are going to build:

  1. Create a workflow to monitor the endpoints.
  2. Create a separate workflow for sending downtime notifications and call it as a sub-workflow.
  3. Automate the monitoring workflow to run every 2 minutes.

Part 1: Availability monitoring

The first part of the service uptime monitoring system checks the endpoint’s availability. This process can be accomplished by creating a Monitor-HTTP-Endpoint-Availability workflow.

Step 1: Check the endpoint availability

The first step is to check if the HTTP(s) endpoints are available. To check the availability, create an HTTP task in your Conductor workflow. Configure the endpoint method, URL, and other parameters. When the workflow is executed, it will call the endpoint and retrieve the response.

HTTP task making a call to an endpoint for receiving the response
Use an HTTP task in Orkes Conductor to call an endpoint
Step 2: Retrieve the timestamp

If the server is down, it’s essential to know when the downtime was detected. To achieve this, use an Inline task that extracts the timestamp when the response was received.

Inline task extracting the timestamp when the response was received
Use an Inline task in Orkes Conductor to extract the timestamp
Step 3: Evaluate the server’s status code

Use the Switch task in your Conductor workflow to evaluate the response from the HTTP task. If the response indicates success, the workflow will terminate. If failure is detected, the workflow will proceed to the next part, which involves notifying the appropriate team.

Switch task evaluating the response from the HTTP task
Use the Switch task in Orkes Conductor to evaluate the response from the previous HTTP task

The sample workflow input includes the notification method (e.g., SMS), the endpoint URL to be monitored, and the sender/receiver information for notifications.

{
  "notification_type": "SMS",
  "endpoint_url": "https://example.com/",
  "notification_from": "14XXXXXXXXX",
  "notification_to": "12XXXXXXXX"
}

Here is the full Monitor-HTTP-Endpoint-Availability workflow.

Conductor workflow for monitoring HTTP endpoint availability
Monitor-HTTP-Endpoint-Availability workflow in Orkes Conductor

The notification step is incorporated as a Sub Workflow, which facilitates best practices in modularization.

Calling notification flow as a sub workflow
Using the Sub Workflow task to call the notification workflow

Sub Workflows in Conductor simplifies workflow management, making it easy to reuse common workflows across multiple processes.

The notification system is a typical flow within an organization when building applications. Creating a dedicated workflow for your organization’s payment system that integrates all notification methods allows you to easily adapt your notification flow to different application processes using the Sub Workflow functionality in Conductor.

Part 2: Notifying users

Once the endpoint is monitored and downtime is detected, the next part is to notify the corresponding team members to take action. This is where the second part of the application flow is to be implemented. For this second part, let’s build a Notify-Channels workflow that sends notifications via the preferred channel.

Step 1: Check the preferred notification method

Let's add a Switch task to a new workflow to assess the input and determine the preferred notification method.

Switch case evaluation on input parameters
Using the Switch task to evaluate the switch condition based on workflow inputs

The Switch task evaluates the workflow input and proceeds along the appropriate path.

Step 2: Send notification via the preferred channel

With the chosen notification type, the next step is to send the service downtime alert through the selected channel. In this workflow, we’ve defined four cases: SMS, email, Slack, and PagerDuty.

Here’s an example of a flow that sends notifications via SMS using Twilio as the provider.

Sample SMS notification flow via Twilio
The notification flow for alerting through SMS via Twilio in Orkes Conductor

The workflow is set up to send notifications via SMS using Twilio, emails through SendGrid, Slack channel alerts, and PagerDuty notifications.

Sample notification flow
The notification flow for alerting through multiple channels

The notification method here is not restricted to just four channels. Adding more switch cases allows you to incorporate other notification systems to accommodate the organization’s preferences.

Here is the full Notify-Channels workflow.

Conductor workflow for notifying users via multiple channels
Notify-Channels workflow in Orkes Conductor

Part 3: Automating the monitoring process

The final part is automating the monitoring process to check the endpoint availability every 2 minutes to achieve 99.999% uptime.

For this, create a Workflow Scheduler in Conductor, using cron expressions to set the workflow to run every 2 minutes. Once configured, Conductor will automatically trigger the workflow every 2 minutes without requiring any further action from the user.

Workflow scheduler for running every 2 mins
Monitor HTTP Endpoints Availability workflow scheduled to run every 2 mins

The ability to schedule workflows using cron expression greatly streamlines the automation of your workflows. Conductor further enhances this experience by supporting different time zones, avoiding the need for manual time conversions. Yet another outstanding feature of Scheduler is its user-friendly interface, which displays upcoming scheduled runs, allowing you to verify and confirm your schedules easily.

While this uptime monitoring system is a simple version, you can use Orkes Conductor to extend the flow further to suit your specific application needs.

This workflow is available as an out-of-box template, allowing you to receive alerts via PagerDuty, Slack, SMS, and email. For more details, check out the Availability Monitoring for HTTP Endpoints documentation to learn how to import and use the template.

Summary

Uptime monitoring of services is crucial for a business to thrive, as it directly impacts revenue. By leveraging tools like Orkes Conductor, you can build an automated monitoring system to track your services 24/7 to achieve 99.999% uptime. This, in turn, can decrease the customer churn rate and ultimately enhance customer satisfaction and loyalty.

Orkes Conductor is a powerful workflow orchestration platform that simplifies and automates all your application flows. Whether monitoring service uptime, creating AI-powered applications, or building a RAG system, Orkes Conductor offers the flexibility and scalability to meet your business needs. Try out Orkes Conductor for free using our Developer Edition sandbox.

Related Blogs

Best Practices for Production-Scale RAG Systems — An Implementation Guide

Feb 20, 2025

Best Practices for Production-Scale RAG Systems — An Implementation Guide