logoAlways On.

How Harbor Lab Reduced Incident Resolution Time from Days to Minutes with NOFire AI

How Harbor Lab Reduced Incident Resolution Time from Days to Minutes with NOFire AI

Avatar

“Our engineers now have access to the right insights, reducing stress and improving our response times. It’s made our on-call experience smoother and more efficient.”

CTO, Spyros Lamprinidis

Company Overview

Harbor Lab is a pioneering maritime tech company that streamlines port cost management for the global shipping industry. With a commitment to innovation and operational excellence, their engineering team manages over 20 production services and 300 monitored resources. As they scaled, ensuring smooth incident response and maintaining service reliability became a top priority, adding pressure to a small SRE team.

The Challenge

Frequent RDS Incidents Coinciding with Releases

Recurring AWS RDS incidents aligned with software releases, leading to downtime and creating a "prolonged panic mode" for engineers.

Incomplete Runbooks

New engineers struggled with on-call responsibilities due to incomplete documentation, making incident resolution inconsistent and slow.

Slow Root Cause Analysis

Without structured troubleshooting workflows, engineers spent excessive time pinpointing root causes—resulting in unnecessary stress.

Solution

NOFire AI provided Harbor Lab’s engineering team with real-time visibility transforming their approach to incident management. Instead of firefighting, engineers could now proactively resolve issues with confidence. NOFire AI is monitoring all the Alerts, providing classification visible to all engineers in order to react fast, directly to Slack. It provides information about critical incidents, explaining the evidence from different observability resources, tracing down the impact to any related services. When action is required, comprehensive steps are provided by NOFire to mitigate the issue.

How-HarborLab-Reduced-Incident-Resolution-Time-from-Days-to-Minutes-with-NOFire-AI

Results & Impact

Rapid Troubleshooting: Reduced root cause identification from multiple days to just 3 minutes.

Empowered On-Call Engineers: Provided clear steps and recommended actions for each investigation, enabling faster, more confident decision-making.

Enhanced Service Reliability: Improved SLA adherence and minimized downtime, keeping shipping operations on track.

Avatar

“Now, we can all pinpoint the root cause in minutes and fix it before it escalates. NOFire AI is a game-changer for our team.”

SRE Lead, Stelis Panagiotakis

About

  • Industry: Maritime Tech
  • Team size: 51-100

Tech Stack

  • AWS
  • AWS RDS
  • Kubernetes
  • Spring
  • Grafana Cloud

Setup

  • 20+ production services
  • 300 monitored resources
Book a demo

Join our vision

We want to turn down the noise for the folks running our digital world, so they achieve a fireless growth.


Stop firefighting, start building