The Problem

Every Enterprise is a Snowflake

Same agent. Identical code. Three different stacks:

Datadog + PagerDuty + Grafana + Confluence
Splunk + Azure + ServiceNow
New Relic + PagerDuty + Jira + Notion + Wiki

You ship it. It works beautifully — for your test environment.
Then every customer connects completely different tools.

Team	Monitoring	Ticketing	Knowledge	Comms	Code/CI
Ops	Datadog	PagerDuty	Confluence	Slack	—
Support	—	Zendesk	Notion	Intercom	—
Eng	Splunk	Jira	Confluence	Slack	GitHub
Sales	—	Salesforce	Notion	Outlook	—

Every row is different. Every column is different.

5–15 SaaS tools per team, wired differently at every company.

N tools × M tenants × P auth = explosion
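As a rough illustration of that multiplication, here is a tiny sketch. The function name and the input numbers are made up for illustration and are not figures from the talk.

```python
def integration_surface(tools: int, tenants: int, auth_schemes: int) -> int:
    """Worst-case count of distinct (tool, tenant, auth) combinations."""
    return tools * tenants * auth_schemes


# Hypothetical inputs: 15 tools, 100 tenants, 3 auth schemes.
print(integration_surface(15, 100, 3))  # prints 4500
```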

Every new vendor = months of integration work

How do you build ONE agent that works across ALL tenants?

Agent

Discovered, not declared: tools found at runtime per tenant
Delegated, not embedded: auth handled by platform
Composable, not monolithic: domain logic adapts to tools
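A minimal sketch of what "discovered, not declared" could look like. The `ToolSpec` type, the `TENANT_CONNECTIONS` table, and both function names are hypothetical; a real platform would load connections from its own per-tenant store rather than from an in-memory dict.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToolSpec:
    name: str        # vendor tool, e.g. "datadog"
    capability: str  # what it provides, e.g. "monitoring"


# Hypothetical connection data; a real platform would load this per tenant.
TENANT_CONNECTIONS: dict[str, list[ToolSpec]] = {
    "tenant-a": [ToolSpec("datadog", "monitoring"), ToolSpec("pagerduty", "ticketing")],
    "tenant-b": [ToolSpec("splunk", "monitoring"), ToolSpec("servicenow", "ticketing")],
}


def discover_tools(tenant_id: str) -> list[ToolSpec]:
    """Return the tools this tenant has connected; nothing is hard-coded in the agent."""
    return list(TENANT_CONNECTIONS.get(tenant_id, []))


def tools_for(tenant_id: str, capability: str) -> list[str]:
    """Names of the tenant's tools that offer the requested capability."""
    return [t.name for t in discover_tools(tenant_id) if t.capability == capability]


print(tools_for("tenant-a", "monitoring"))  # ['datadog']
print(tools_for("tenant-b", "monitoring"))  # ['splunk']
```

The same agent code asks the same question for both tenants and gets a different vendor back each time.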

Production Agent for Enterprise Tenants

Skills: reusable domain logic that orchestrates tools without vendor coupling
Native Tools: platform capabilities every tenant gets for free
MCP: a protocol that lets the agent discover and invoke any tool at runtime

Three architectural ideas. Let's build it up.

Why is This Actually Hard?

The Four Constraints Nobody Warns You About

At a hackathon: one user, one set of tools, one API key in an .env file.
That agent does not work for 10,000 paying customers.

Customer A's agent must never see Customer B's data.
One leaked API response across tenants = game over.
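One way to make cross-tenant leaks structurally hard is to put the tenant ID into every storage key. The sketch below assumes a hypothetical `TenantScopedCache` for tool responses; it illustrates the idea and is not a complete isolation design.

```python
from typing import Any


class TenantScopedCache:
    """Cache whose every read and write is keyed by tenant ID."""

    def __init__(self) -> None:
        self._data: dict[tuple[str, str], Any] = {}

    def put(self, tenant_id: str, key: str, value: Any) -> None:
        self._data[(tenant_id, key)] = value

    def get(self, tenant_id: str, key: str, default: Any = None) -> Any:
        # The tenant ID is part of the lookup key, so a matching key stored
        # under another tenant can never be returned.
        return self._data.get((tenant_id, key), default)


cache = TenantScopedCache()
cache.put("customer-a", "incident:42", {"summary": "db failover"})
print(cache.get("customer-b", "incident:42"))  # None
```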

OAuth tokens expire mid-conversation. Some tools need re-consent.
The agent doesn't own credentials. The user does.
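A sketch of delegated credentials under these constraints: the agent asks a platform component for a token each time it needs one, and the platform either returns a valid token, refreshes it, or signals that the user must re-consent. `CredentialBroker`, `Token`, and `ReconsentRequired` are hypothetical names, and the refresh step is a stand-in for a real OAuth refresh-token exchange.

```python
import time
from dataclasses import dataclass
from typing import Callable


class ReconsentRequired(Exception):
    """Raised when the user must re-authorize the tool."""


@dataclass
class Token:
    value: str
    expires_at: float        # Unix timestamp
    refreshable: bool = True


class CredentialBroker:
    """Hypothetical platform component that holds tokens on the user's behalf."""

    def __init__(self, clock: Callable[[], float] = time.time) -> None:
        self._tokens: dict[tuple[str, str], Token] = {}
        self._clock = clock

    def store(self, user_id: str, tool: str, token: Token) -> None:
        self._tokens[(user_id, tool)] = token

    def get_token(self, user_id: str, tool: str) -> str:
        token = self._tokens.get((user_id, tool))
        if token is None:
            raise ReconsentRequired(f"{user_id} has not connected {tool}")
        if token.expires_at > self._clock():
            return token.value
        if not token.refreshable:
            raise ReconsentRequired(f"{tool} token expired; user must re-consent")
        # Stand-in for a real OAuth refresh-token exchange.
        refreshed = Token(token.value + "-refreshed", self._clock() + 3600)
        self._tokens[(user_id, tool)] = refreshed
        return refreshed.value
```

Because the agent calls `get_token` before each tool call instead of caching a token, a mid-conversation expiry surfaces as either a transparent refresh or an explicit re-consent prompt.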

You must design within these walls — not pretend they don't exist.

Now that we know the constraints, let's look at the architecture that satisfies all four.

Three layers. Three ideas. One protocol that ties them together.

Orchestrator → Skills → Tools

The Architecture

AI-Powered Ops Investigation Assistant

Two Types of Tools, One Skill

The Skill Doesn't Care Where the Data Comes From

The skill has a plan — an evidence-gathering loop. Some steps hit native tools. Some hit MCP apps.
The skill doesn't distinguish.

Native tools gave context (what changed, what's connected). MCP apps gave signals (metrics, logs, traces).
The skill fused them into a grounded diagnosis.
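The evidence-gathering loop described above can be sketched as follows. Both tool functions and their canned findings are hypothetical; the point is only that native and MCP-backed tools share one callable shape, so the loop never branches on where a tool comes from.

```python
from typing import Callable

# Every tool, native or MCP, is exposed to the skill as the same callable shape.
Tool = Callable[[str], dict]


def recent_changes(service: str) -> dict:
    """Hypothetical native tool: context about what changed."""
    return {"kind": "context", "finding": f"{service}: config change deployed"}


def error_metrics(service: str) -> dict:
    """Hypothetical MCP-backed tool: a signal from a monitoring vendor."""
    return {"kind": "signal", "finding": f"{service}: error rate elevated"}


def gather_evidence(service: str, plan: list[Tool]) -> list[dict]:
    """Run each planned step; the loop never checks a tool's origin."""
    return [step(service) for step in plan]


def summarize(evidence: list[dict]) -> str:
    """Fuse context and signals into one summary line."""
    return "; ".join(item["finding"] for item in evidence)


evidence = gather_evidence("checkout", [recent_changes, error_metrics])
print(summarize(evidence))
```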

MCP Deep Dive

How Does the Agent Talk to Tools It's Never Seen?

The agent never talks directly to Splunk or Datadog. It talks to the gateway, which multiplexes across all connected providers.
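A minimal sketch of the multiplexing idea, assuming a `provider.tool` naming scheme. The `Provider` and `Gateway` classes are illustrative only and do not reproduce the MCP wire protocol or any vendor SDK; the lambdas stand in for real provider calls.

```python
from typing import Any, Callable


class Provider:
    """A connected tool provider exposing named operations (all hypothetical)."""

    def __init__(self, name: str, tools: dict[str, Callable[..., Any]]) -> None:
        self.name = name
        self.tools = tools


class Gateway:
    """Single entry point that routes calls to whichever provider owns the tool."""

    def __init__(self, providers: list[Provider]) -> None:
        self._providers = {p.name: p for p in providers}

    def list_tools(self) -> list[str]:
        return sorted(
            f"{p.name}.{tool}" for p in self._providers.values() for tool in p.tools
        )

    def call(self, qualified_name: str, **kwargs: Any) -> Any:
        provider_name, _, tool_name = qualified_name.partition(".")
        provider = self._providers.get(provider_name)
        if provider is None or tool_name not in provider.tools:
            raise KeyError(f"unknown tool: {qualified_name}")
        return provider.tools[tool_name](**kwargs)


gateway = Gateway([
    Provider("splunk", {"search": lambda query: f"splunk results for {query!r}"}),
    Provider("datadog", {"query_metrics": lambda metric: f"datadog series for {metric!r}"}),
])
print(gateway.list_tools())  # ['datadog.query_metrics', 'splunk.search']
print(gateway.call("splunk.search", query="error"))
```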

Four problems. Four modules. This is the checklist for building multi-tenant tool access.

The system doesn't crash. The output notes the gaps. Confidence adjusts. An answer is still produced.
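This degradation behavior can be sketched as a loop that converts tool failures into recorded gaps. The function name is hypothetical, and scoring confidence as the fraction of successful steps is an assumption made for illustration, not the scoring used in the talk.

```python
from typing import Callable


def run_with_gaps(steps: dict[str, Callable[[], str]]) -> dict:
    """Run every step; failures become noted gaps instead of crashes."""
    findings: dict[str, str] = {}
    gaps: list[str] = []
    for name, step in steps.items():
        try:
            findings[name] = step()
        except Exception as exc:  # a failed tool is recorded, not re-raised
            gaps.append(f"{name}: {exc}")
    # Assumed heuristic: confidence is the fraction of steps that succeeded.
    confidence = len(findings) / len(steps) if steps else 0.0
    return {"findings": findings, "gaps": gaps, "confidence": confidence}


def logs_unavailable() -> str:
    raise TimeoutError("log provider timed out")


result = run_with_gaps({
    "metrics": lambda: "error rate elevated",
    "logs": logs_unavailable,
})
print(result["gaps"])        # ['logs: log provider timed out']
print(result["confidence"])  # 0.5
```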

Skills

The Strategy Layer Between Prompts and Tools

The skill is tool-agnostic. It says "I need metrics" — not "call Datadog". The gateway resolves the how. The skill owns the what and why.
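To illustrate the split between "what" and "how", the sketch below writes a skill against capability names only. The `investigate_latency` function and the two tenant bindings are hypothetical, and the lambdas stand in for whatever the gateway would actually resolve.

```python
from typing import Callable, Mapping

Tool = Callable[[str], str]


def investigate_latency(service: str, tools: Mapping[str, Tool]) -> str:
    """Skill logic: asks for the 'metrics' capability, never a vendor by name."""
    return f"{service}: {tools['metrics'](service)}"


# Hypothetical bindings a gateway might resolve for two different tenants.
tenant_a_tools = {"metrics": lambda service: "p99 latency up (via Datadog)"}
tenant_b_tools = {"metrics": lambda service: "p99 latency up (via Splunk)"}

print(investigate_latency("checkout", tenant_a_tools))
print(investigate_latency("checkout", tenant_b_tools))
```

The skill body is identical for both tenants; only the binding passed in changes.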

The ledger is the difference between "an LLM that sometimes calls tools" and "a disciplined investigator that builds a case."
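The talk text here does not specify what the ledger stores, so the following is a guess at a minimal shape: an append-only list of findings, each tagged with its step, source, and whether it supports the working hypothesis. All names and fields are assumptions.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Evidence:
    step: str       # which plan step produced this
    source: str     # which tool it came from
    finding: str
    supports: bool  # does it support the current hypothesis?


@dataclass
class Ledger:
    entries: list[Evidence] = field(default_factory=list)

    def record(self, step: str, source: str, finding: str, supports: bool) -> None:
        self.entries.append(Evidence(step, source, finding, supports))

    def case_for(self, hypothesis: str) -> str:
        """Summarize how much recorded evidence backs the hypothesis."""
        supporting = [e for e in self.entries if e.supports]
        return f"{hypothesis}: {len(supporting)} of {len(self.entries)} findings support it"


ledger = Ledger()
ledger.record("check deploys", "native", "config change 20 min before alert", True)
ledger.record("check metrics", "mcp", "error rate elevated after change", True)
ledger.record("check dependencies", "native", "downstream services healthy", False)
print(ledger.case_for("bad config change"))
```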

Production Lessons

What Broke, What Scaled, What Surprised Us

We shipped to real enterprise tenants. These are the scars.