Why Your AIOps Agents Are Only As Good As Your Infrastructure Data (+ OpsMill Infrahub Review)

TLDR: Most AIOps failures trace back to a data problem, not an algorithm problem. When AI agents act on stale CMDBs, disconnected spreadsheets, and ad-hoc scripts, errors cascade fast. Graph-based infrastructure data platforms like OpsMill’s Infrahub address this by modeling real infrastructure relationships rather than flattening everything into asset tables. Infrahub’s open-core model (free Community edition, licensed Enterprise) makes it accessible to evaluate, but the Enterprise edition’s pricing is opaque and requires a sales conversation. If your org is scaling AI-driven ops beyond a single domain, fix the data layer first or accept that your agents will keep flying blind.

Why This Matters Now

OpsMill announced a $14M Series A today (May 7, 2026), led by IRIS with BGV, Serena, and Partech participating. The round is notable less for the dollar amount than for the thesis behind it: the company’s bet is that the biggest bottleneck in enterprise AIOps adoption is not the AI layer but the infrastructure data feeding it.

The numbers support this framing. Gartner forecasts that 30% of enterprises will automate more than half their network activities by 2026, up from under 10% in mid-2023. ITIC’s 2024 downtime report pegs the average cost of an hour of unplanned outage at $300,000. Yet the data describing most enterprises’ infrastructure — what exists, how it connects, how it should be configured — remains scattered across tools that were never designed for agent consumption. ABI Research estimates the broader AI-driven IT operations market at $220 billion, and demand for enterprise network automation has tripled since 2023. The infrastructure data layer is the foundation nobody wants to talk about.

The Infrastructure Data Problem at a Glance

Dimension	Traditional CMDB	Spreadsheets / Scripts	Graph-Based Platform (e.g., Infrahub)
Data model	Flat tables, predefined fields	Unstructured, per-team conventions	Flexible schema, relationship-first
Relationship mapping	Limited; foreign keys at best	None	Native graph connections
Version control	Snapshot-based or none	Manual file versioning	Git-integrated branching, diff, merge
Change validation	Manual review or post-deploy audit	None	Built-in CI pipelines pre-merge
AI/agent readiness	Requires ETL middleware	Requires custom parsing	GraphQL/REST APIs, MCP server for LLM agents
Typical staleness	Days to weeks behind reality	Instantly stale once someone forgets to update	Near-real-time via sync connectors
Scale ceiling	Enterprise-grade but rigid	Breaks past a few hundred assets	Graph DB scales to hundreds of thousands of elements
Cost model	Per-node or per-user licensing (often $50-200+/node/year)	Free but unmaintainable	Open-source Community; Enterprise pricing undisclosed

The Hidden Reason AIOps Deployments Fail

Your AIOps agent just flagged a critical firewall misconfiguration — but the CMDB it pulled that data from hasn’t been updated in three weeks. That’s the actual failure mode most teams are dealing with, and it has nothing to do with the AI layer.

The typical enterprise infrastructure data stack looks like this: a CMDB that was configured once during a migration project and has drifted since, a collection of team-specific spreadsheets that contradict each other, and a set of scripts that scrape configuration from live devices but store it in formats only the author understands. When an AIOps agent queries this mess for the current state of a firewall rule or the dependency chain behind a microservice, it gets incomplete or conflicting answers. The agent doesn’t know the data is bad. It acts on it anyway.

In financial services, a misconfigured firewall rule surfaced by stale CMDB data can trigger a regulatory penalty. In manufacturing, an agent deploying a network change based on an outdated topology map can shut down a production line. The failure mode is not “AI made a bad prediction.” The failure mode is “AI confidently executed a correct action on incorrect data.”

Earned insight: In my experience, the single most common root cause of failed AIOps pilot projects is not model accuracy or integration complexity — it is that the infrastructure data the AI consumed was between two weeks and six months out of date. Teams spend months tuning correlation rules when they should have spent weeks auditing their data layer.

What a Graph-Based Approach Changes

Traditional CMDBs store infrastructure as rows in tables: one row per server, one row per switch, with foreign keys linking them when someone remembers to set them up. This flat model works for asset accounting but collapses when you need to answer questions like “what services depend on this switch, and what happens if it goes down at 2 AM?”

A graph database stores infrastructure elements as nodes and their relationships as edges. A server is connected to a VLAN, which is connected to a firewall policy, which is connected to a compliance requirement. The relationships are first-class data, not afterthoughts. When an AI agent needs to reason about blast radius or dependency chains, the answers are already in the topology — no multi-table joins, no ETL pipelines, no guessing.

This matters specifically for AIOps because modern incident response is relationship-dependent. An alert about high latency on a pod means nothing without the context of which service runs on that pod, which database it connects to, which network path it traverses, and whether any of those components changed recently. Graph-native platforms make that context queryable by default.

OpsMill Infrahub: What It Does

Infrahub is an open-source infrastructure data management platform built on Neo4j (graph database), Git, and Python. Founded in 2023 by Damien Garros (ex-Juniper, ex-Roblox, ex-Network to Code) and Karen Gallantry, the platform is designed as a single source of truth for infrastructure teams and the AI agents working alongside them.

What Works

Flexible, schema-first data modeling. Unlike CMDBs that force you into predefined fields, Infrahub lets you define your own data model — devices, services, topologies, business context — enforced with validation. This means the model matches your actual infrastructure, not a vendor’s assumption of what infrastructure looks like.

Git-integrated version control for data. Every change to schema, data, or generated configurations goes through a branch-diff-review-merge workflow identical to code review. This is a genuine differentiator: CMDBs track snapshots at best, and most track nothing. Infrahub gives you an immutable, queryable history of who changed what, when, and why — and that history is queryable by your agents, not just your humans.

Built-in CI validation. Proposed changes run through CI pipelines before merge. If a change would break a dependency or violate a constraint, it fails before reaching production. This is the kind of safety net that turns infrastructure data from “probably right” into “validated right.”

AI-ready integrations. Infrahub ships with a Model Context Protocol (MCP) server, making its validated data directly consumable by Claude, Cursor, and other LLM-based agents. It also includes GraphQL and REST APIs, plus sync connectors for NetBox, Nautobot, and IP Fabric.

Real customer traction. TikTok runs Infrahub in production. Eurofiber, a European cloud services provider, cut deployment times from five days to fifteen minutes after implementing it. The open-source community includes hyperscalers, global retailers, fintechs, and insurers.

Where It Struggles

Pricing opacity. The Enterprise edition pricing is entirely hidden behind a “contact sales” wall. The pricing page mentions “processing capabilities” as the pricing axis but discloses no numbers. For a startup competing against well-documented alternatives, that’s a friction point. Early-stage enterprise buyers want to model TCO before scheduling a demo.

Network/infrastructure focus, not full-stack. Infrahub is purpose-built for network and data center infrastructure. If you need a unified data platform that also covers application-layer services, SaaS dependencies, or business process mapping, you’ll need to supplement Infrahub with other tools or rely on its sync connectors.

Early-stage ecosystem. Founded in 2023, the company is still building out its partner ecosystem, integrations, and documentation. The Discord community is active but small compared to alternatives like NetBox (which has years of community momentum). Enterprise features like SSO validation, high-availability Neo4j, and hardened deployments are Enterprise-only — you won’t get them on the free tier.

Competitive positioning requires nuance. OpsMill positions against CMDBs and GitOps, but the more direct competition is NetBox Labs and Nautobot — tools that many network teams already know and use. Migrating from a working NetBox instance to Infrahub requires a clear ROI case, and OpsMill’s marketing doesn’t yet make that case sharply.

Warning: Infrahub’s Community edition uses Neo4j Community, which lacks high-availability clustering. For production deployments where infrastructure data availability is itself critical, you’ll need the Enterprise edition — and that means entering a sales cycle with undisclosed pricing. Factor this into your evaluation timeline.

Infrahub Strengths:

Graph-native data model maps real infrastructure relationships, not flat tables
Git-integrated version control for all data and schema changes
Built-in CI validation prevents bad data from reaching production
MCP server provides direct LLM/agent integration out of the box
Open-source Community edition under Apache 2.0 — no cost to evaluate
Real production traction at TikTok and Eurofiber

Infrahub Weaknesses:

Enterprise pricing completely undisclosed; no public TCO guidance
Focused on network/data center infrastructure — not a full-stack CMDB replacement
Smaller community and ecosystem than established alternatives like NetBox
High-availability deployment requires Enterprise edition
Early-stage company (Series A) — long-term viability not yet proven at scale

Who Else Plays in This Space

Infrahub doesn’t compete in a vacuum. The infrastructure data management landscape includes several alternatives worth evaluating depending on your starting point:

ServiceNow CMDB remains the enterprise default, deeply integrated with ITSM workflows and compliance reporting. It’s the most mature option but also the most rigid — adapting it for AI agent consumption typically requires middleware and significant configuration. Pricing runs $50-100+/user/month depending on the module stack.

NetBox Labs (NetBox) is the most widely adopted open-source network source of truth, with a large community and extensive plugin ecosystem. It uses a relational database, not a graph, which limits relationship querying. NetBox Cloud offers a hosted version for teams that don’t want to self-manage.

Nautobot forked from NetBox and adds a job/automation framework. It’s a strong choice for teams already embedded in the Ansible/Python automation ecosystem, though it shares NetBox’s relational database limitations.

Itential focuses on network automation orchestration — more of an execution layer than a data layer. It can consume data from CMDBs or source-of-truth tools but doesn’t replace them.

Port.io and Backstage serve as internal developer portals and service catalogs. They map application-layer context well but generally don’t have the deep infrastructure topology modeling that AIOps agents need.

Earned insight: The choice between a graph-based platform and a relational one often comes down to query complexity. If your agents only need “what IP is assigned to this server,” a relational source of truth is fine. If they need “show me every service impacted if this switch fails, including the compliance implications,” you need a graph. Most teams underestimate how quickly their query needs escalate once agents start running autonomously.

Pricing Reality

This is where transparency breaks down across the category:

Platform	Community / Free Tier	Enterprise Cost	Pricing Model	Hidden Costs
Infrahub	Full-featured Community (Apache 2.0)	Undisclosed — contact sales	”Processing capabilities” based	HA requires Enterprise; Neo4j Enterprise license bundled
NetBox	Open-source (Apache 2.0)	NetBox Cloud from ~$500/mo	Node count / hosted instance	Plugin maintenance, self-hosting infra
ServiceNow CMDB	None	$50-100+/user/month (ITSM bundle)	Per-user, per-module	Implementation services ($100K+ typical), ongoing admin
Nautobot	Open-source (Apache 2.0)	Network to Code consulting	Consulting-based support	Self-hosting, plugin compatibility
Port.io	Free tier (limited)	Custom pricing	Per-developer, per-service	Focused on app layer; infra modeling is secondary

Pricing verified May 7, 2026 from vendor websites and published documentation. Infrahub Enterprise pricing is not publicly available.

Tip: If you’re evaluating Infrahub, start with the Community edition and the hosted sandbox at sandbox.infrahub.app. Model your actual infrastructure — not a demo topology — and stress-test the graph queries your AIOps agents would need. This gives you real TCO evidence before entering an Enterprise sales cycle.

Who Should Use a Graph-Based Infrastructure Data Platform

Good fit:

Organizations running hybrid or multi-cloud infrastructure with complex interdependencies that CMDBs can’t model accurately
Teams deploying AIOps agents that need relationship-aware context (dependency mapping, blast radius analysis, change impact prediction)
Network and infrastructure teams managing 1,000+ devices across multiple sites or vendors
Companies where compliance or regulatory requirements demand an auditable, version-controlled record of infrastructure changes

Not a good fit:

Small teams with fewer than a few hundred infrastructure assets — a spreadsheet or NetBox instance is likely sufficient
Organizations whose AIOps needs are limited to log aggregation and alerting (Datadog, New Relic, and Dynatrace handle this without a separate data layer)
Teams that need an application-layer service catalog more than infrastructure topology mapping — Port.io or Backstage is a better fit
Companies not ready to invest in data modeling upfront — Infrahub’s schema-first approach requires deliberate design work before you’ll get value

Graph-Based Data Platform Strengths:

Enables relationship-aware AI reasoning across infrastructure
Version-controlled data prevents the “CMDB drift” problem
Scales to complex hybrid environments without flattening topology
Reduces mean time to resolution when agents can traverse dependency graphs

Graph-Based Data Platform Weaknesses:

Higher upfront investment in schema design and data migration
Requires team buy-in to treat infrastructure data as a first-class engineering artifact
Smaller vendor ecosystem than established CMDB incumbents
Graph database operations expertise is less common than relational DB skills

Bottom Line

The AIOps industry has a data problem it doesn’t talk about enough. The race to deploy AI agents for incident response, capacity planning, and automated remediation is outpacing the infrastructure data quality those agents require. If your CMDB was last audited during a migration project, your spreadsheets contradict each other, and your scripts parse configs into formats only their author understands — no amount of model tuning will fix the outputs. Bad data in, bad decisions out.

OpsMill’s Infrahub is one of the more credible attempts to solve this problem. The graph-native data model, Git-integrated version control, and built-in CI validation address the root causes of bad infrastructure data in ways that bolting AI features onto an existing CMDB doesn’t. The Eurofiber case (five days to fifteen minutes for service deployments) is the kind of concrete result that justifies the migration effort.

The caveats are real: Enterprise pricing is a black box, the ecosystem is early-stage, and the platform is network/infrastructure-focused rather than full-stack. But fixing the data layer isn’t optional — it’s the prerequisite everything else depends on.

This month: Deploy Infrahub Community in your lab environment (it’s free, Apache 2.0). Model one real infrastructure domain — your core switching layer or a single datacenter pod — not a demo topology. Run the GraphQL queries your AIOps agents would actually need. If the graph traversals return accurate, relationship-aware answers that your current CMDB can’t match, you have your ROI case for the Enterprise sales conversation.

Rating: 3.8 / 5 for enterprise infrastructure teams evaluating AI-ready data platforms. Strong technical foundations and real production traction, held back by pricing opacity and early-stage ecosystem maturity.

Why Your AIOps Agents Are Only As Good As Your Infrastructure Data (+ OpsMill Infrahub Review)

Why This Matters Now

The Infrastructure Data Problem at a Glance

The Hidden Reason AIOps Deployments Fail

What a Graph-Based Approach Changes

OpsMill Infrahub: What It Does

What Works

Where It Struggles

Who Else Plays in This Space

Pricing Reality

Who Should Use a Graph-Based Infrastructure Data Platform

Bottom Line

Get the weekly AI stack briefing

Discussion

Why Your AIOps Agents Are Only As Good As Your Infrastructure Data (+ OpsMill Infrahub Review)

Why This Matters Now

The Infrastructure Data Problem at a Glance

The Hidden Reason AIOps Deployments Fail

What a Graph-Based Approach Changes

OpsMill Infrahub: What It Does

What Works

Where It Struggles

Who Else Plays in This Space

Pricing Reality

Who Should Use a Graph-Based Infrastructure Data Platform

Bottom Line

Related Articles

Get the weekly AI stack briefing

Discussion