2am. Somewhere in a data centre operations room.
You've been staring at this screen for twenty minutes. The monitoring system is doing exactly what it was configured to do — firing on every threshold breach, every latency spike, every service wobble. The problem is, it has no idea which three of those four hundred events actually mean something is broken. That's your job.
I've sat in that chair. More times than I can count, across more than two decades of building and running network operations centres — in telecoms, in enterprises, for managed services customers who trusted us to watch their infrastructure while they slept. The tools kept improving. The alert volume kept growing. And the gap between what the monitoring system knew and what a human engineer needed to know never really closed.
That gap is what Sanos is built to close.
"The tools kept improving. The alert volume kept growing. The gap between what the monitoring system knew and what a human engineer needed to know never really closed."
— Ayad Aftab, Founder, Sanos

Alert fatigue is not a staffing problem. It's a design failure, and throwing more engineers at a noisy monitoring stack doesn't fix it.
Root cause analysis shouldn't require a war room. If you need five engineers and three dashboards to answer "what broke?", the tooling has already failed you.
Most critical incidents announce themselves — minutes in advance, across multiple signals — to a system that isn't listening for the pattern.
The IT lead at a 200-person company carries the same operational risk as an enterprise NOC team. They just do it alone, without the tools or the headcount.
The intelligence that enterprise AIOps platforms sell for $50,000 a year can — and should — be available to every team running real infrastructure, regardless of size.
My career was built in network operations. I started on the floor of a telecoms NOC — watching screens, triaging alerts, learning the hard way which signals meant something and which were noise. Over the years I moved from engineer to architect to operator: designing monitoring infrastructure, deploying NMS platforms across enterprise environments, and eventually leading teams that delivered managed NOC services to large organisations.
I've implemented monitoring solutions at scale — across telcos, financial services, and critical infrastructure. I've worked with most of the major monitoring and network management platforms on the market. I've seen what works at enterprise scale, and I've seen why none of it translates cleanly to a smaller team with fewer resources and no dedicated ops staff.
By the time I decided to build Sanos, I had spent two decades watching smart engineers drown in alert noise — not because they weren't capable, but because the tools they were given weren't designed to help them think. They were designed to observe and fire. The thinking was always left to the human.
Every year, the monitoring platforms got more capable. More integrations, more dashboards, richer data, faster ingestion. And every year, the teams operating them got more overwhelmed. More alerts per engineer, not fewer. More context-switching, not less.
The enterprise AIOps platforms — Moogsoft, BigPanda, OpsRamp — proved that AI-driven correlation and triage genuinely works. Deployments showed 35–40% reductions in L1/L2 ticket volume. Root cause cycles that used to take hours collapsed to minutes. The technology was real.
But those platforms were built for — and priced for — organisations with 50-person IT ops teams, dedicated AIOps engineers, and six-figure tool budgets. The smaller teams, the ones managing just as much critical infrastructure with a fraction of the resources, were left out entirely.
That observation sat with me for years. It became the product.
I didn't set out to build another monitoring tool. The market has plenty. What I set out to build was the thinking layer that sits between the monitoring tool and the human — the thing that does what an experienced L1 engineer does, but automatically, at scale, and for a team that can't afford an L1 engineer.
Sanos ingests whatever you already have — Datadog, CloudWatch, Prometheus, Zabbix, PagerDuty, all of it — and applies AI correlation and triage to the full alert stream. It deduplicates. It groups related signals into single incidents. It re-classifies severity based on actual business impact, not tool defaults. And it surfaces the three incidents that actually need a human, with root cause already reasoned through.
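To make that pipeline concrete, here is a minimal sketch of the deduplicate-correlate-rerank shape described above. It is illustrative only: the Alert fields, the five-minute correlation window, and the business_critical tag are assumptions invented for this example, and a fixed time-window rule stands in for the AI-driven correlation Sanos actually applies.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta

@dataclass
class Alert:
    source: str         # originating tool, e.g. "prometheus" or "cloudwatch"
    resource: str       # host, service, or check the alert fired on
    message: str
    timestamp: datetime

def triage(alerts, window=timedelta(minutes=5), business_critical=frozenset()):
    """Collapse a raw alert stream into a short, impact-ranked incident list.

    Dedup: drop alerts repeating the same (source, resource, message).
    Correlate: fold alerts on one resource within `window` into one incident.
    Re-rank: business-critical resources outrank tool-reported defaults.
    """
    seen = set()
    incidents = []          # every incident, open or closed
    open_idx = {}           # resource -> index of its currently open incident
    for alert in sorted(alerts, key=lambda a: a.timestamp):
        key = (alert.source, alert.resource, alert.message)
        if key in seen:     # exact duplicate of an alert already processed
            continue
        seen.add(key)
        idx = open_idx.get(alert.resource)
        if idx is not None and alert.timestamp - incidents[idx]["last_seen"] <= window:
            # Related signal on the same resource inside the window: same incident.
            incidents[idx]["alerts"].append(alert)
            incidents[idx]["last_seen"] = alert.timestamp
        else:
            # First signal, or outside the window: open a new incident.
            open_idx[alert.resource] = len(incidents)
            incidents.append({"resource": alert.resource,
                              "alerts": [alert],
                              "last_seen": alert.timestamp})
    triaged = [{"resource": inc["resource"],
                "impact": ("critical" if inc["resource"] in business_critical
                           else "review"),
                "signals": len(inc["alerts"]),
                "summary": inc["alerts"][0].message}
               for inc in incidents]
    # Critical incidents first, then whichever gathered the most signals.
    return sorted(triaged, key=lambda i: (i["impact"] != "critical", -i["signals"]))
```

A real system replaces the fixed window and the hand-tagged critical set with learned correlation and impact models; that substitution is the part that does the thinking.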
It's the tool I would have wanted throughout my career. And it's designed to work for a team of one — not a 50-person ops organisation with a dedicated AIOps engineer.
No rip-and-replace. Sanos connects to your existing tools. You don't change your stack — you add the intelligence layer on top of it.
Live in hours, not months. No six-month implementation. No dedicated tuning team. AI-calibrated from day one, improving as it learns your environment.
Plain language, not dashboards. Root cause in a sentence. A suggested fix alongside it. Not another screen to interpret.
SMB-first pricing. The same intelligence that costs enterprises $50k/year, priced for a team that's watching its budget as carefully as it watches its infrastructure.
Built by someone who's run NOCs. Not a product invented by a VC-backed team that read a whitepaper. Every design decision comes from two decades of operational experience.
If you're reading this, you probably know the feeling. The 3am page. The wall of alerts on Monday morning. The thirty-minute triage session before you even understand what broke. The muted alert you know you shouldn't have muted but couldn't face one more night of it.
I built Sanos because I spent twenty years watching that experience repeat itself — across teams large and small, across industries, across every monitoring platform the market offered. The technology to solve it existed. The will to apply it to smaller teams didn't.
Sanos is early. We're working with a small group of IT leads to build it properly — not in a vacuum, but in the real environments where alert fatigue actually lives. If you've read this far, I suspect you're exactly the kind of person I want in that group.
The goal is simple: you should spend your time resolving incidents, not finding them.
If you manage infrastructure for a growing company and feel this in your bones, we'd like to hear from you.
No spam. No credit card. Just a conversation.