Upcoming Event nOps at FinOps X 2026 - Register to Join

Your AI Spend.
Every Agent.
Every Team.
Every Customer.
Fully Allocated.

nOps Inform attributes every dollar of AI Cloud spend to the model, account, team, customer and feature that drove it — hour by hour, with no unassigned bucket left behind. Understand per-customer COGS, analyze feature ROI, and allocate spend whether it flows through Bedrock natively or from tools like Cursor, Claude or OpenAI.

Allocate spend by agent, team and department

Map AI costs — from Bedrock models to developer agents and external platforms — to the product, team or environment behind them, down to the hour.

Real-time anomaly detection

Catch AI cost spikes as they happen — not days later when the invoice lands. Hourly comparison flags structural anomalies and sudden model cost changes the same hour they occur.

Integrate Cursor, Claude Code and OpenAI Codex

Track developer AI spend through nOps to attribute costs from Cursor, Claude Code and OpenAI Codex to the teams and projects that generated them — no tagging changes required.

Optimize AI with our recommendations

Turn visibility into action — nOps surfaces token efficiency opportunities, model substitution candidates and cache tuning recommendations alongside your spend data.

nOps manages $4B+ in annual cloud spend for innovative brands, from startups to enterprises, saving them 50%+ autonomously.

Hourly AI Cost Allocation & Bedrock Visibility

Hourly Spend Allocation by Model & Department

  • Full attribution chain: 
hour → account → model → department
  • Every account and model mapped to the product or team that owns it
  • 100% of spend allocated — no “unassigned” bucket left behind
  • Expandable allocation tree with week-over-week deltas, share bars and NEW / ANOMALOUS tags

Developer & Agent Cost Attribution

  • Attribute AI costs from Cursor, Claude Code and similar developer tools to the team that generated them — automatically, when routed through Bedrock, with no tagging changes required
  • See total cost per agent workflow, not just individual model calls — every step in the chain attributed back to the agent that triggered it
  • Track cost per agent run over time to detect runaway loops, growing chain lengths, and sudden price changes from model version upgrades
  • Cleanly separate developer AI spend from production workloads — know what’s engineering tooling and what’s customer-facing COGS

Completely Headless — Query AI Costs from Any Workflow

  • Ask for allocation, anomalies, recommendations or token breakdowns in natural language — from nOps Clara, Claude, Cursor or any AI harness your teams already use
  • Responses include rich visualizations generated inline — spend charts, attribution breakdowns and anomaly timelines, not just text
  • Fully headless API access — embed AI cost data and recommendations in your existing dashboards, Slack workflows or internal tools without opening the nOps UI
  • Same virtual tags, same department mapping and same allocation rules whether you open the dashboard or query it in chat

Dive Deeper Into Spikes & Anomalies

  • Hourly granularity surfaces overnight bursts and intra-day spikes that daily averages hide entirely
  • Every flagged hour ties the spike to the model, account and department that caused it — with the multiple vs the same hour last week
  • Structural anomalies caught automatically: overnight Sales activity, a model in the wrong account, or a brand-new model version

Turn Visibility Into Action — AI Optimization Recommendations

  • Token efficiency opportunities — prompt compression, context window sizing and input:output ratio analysis that surfaces over-retrieval and bloated system prompts in RAG workloads
  • Model substitution candidates — flag tasks running on larger models where a smaller one meets the quality bar, with estimated monthly savings per substitution shown before you commit
  • Cache tuning recommendations — hit rate, write/read ratio and TTL analysis; when cache writes dominate cost but hit rates are low, nOps quantifies the opportunity and shows the target
  • Provisioned throughput and batch candidates — consistent high-volume workloads that would be cheaper on committed throughput; async workloads that can move to batch pricing

Total Bedrock & Model Visibility

  • Every model in one place — Anthropic, OpenAI, Llama — with native Bedrock (Nova, Titan embeddings, Guardrails) shown separately, never blended into model spend
  • All the metrics you need: current vs prior 7 days, peak hour, week-over-week change and spike count
  • Benchmark spend and efficiency across accounts and products at a glance

Granular Allocation — Native Tags and Virtual Rules

  • If Bedrock cost allocation tags already exist in your CUR — model, session, or custom resource tags — nOps reads them directly, no configuration required
  • For spend without tags, virtual tag rules fill the gap: condition-based matching on service, account, region or any existing tag value → assign a team, product or cost category from a point-and-click UI
  • Priority ordering with first-match-wins: precise rules take precedence; a catch-all handles everything else — no code, no CUR schema knowledge required
  • Allocated + Unallocated = Total always visible — an explicit unallocated bucket shows exactly what still needs a rule

Named Views — Slice Your Costs Any Way You Need

  • Create named views — AI Platform, Customer-Facing AI, Engineering Costs — each with its own filter rules and breakdown dimension
  • Build a per-customer COGS view: see exactly how much AI infrastructure each revenue-generating customer or product feature costs — the foundation for margin analysis and ROI
  • Each view is a tab on the reporting page: switch lenses without leaving the screen or rebuilding a query
  • Combine filters across service, account, region and any real or virtual tag — AND / OR logic, any depth
  • Every view shows its own daily trend, breakdown chart and unallocated bucket — the same reconciliation invariant across all lenses

Token-Level Cost Breakdown

  • Spend split into input, output, cache-read and cache-write — under the same department mapping as your cost view
  • See the real driver: cache writes are a small share of volume but a large share of cost; cache reads are the opposite
  • Input : output ratio characterises the workload — high ratios signal retrieval / agentic use, low ratios signal pricier generation

Allocate Every Dollar

100% of Bedrock and AI spend mapped to a product, team or environment. No black box, no orphan costs.

Catch Spikes in the Hour

Hourly comparison vs the prior week flags anything 3× over its same-hour baseline, plus structural anomalies — automatically.

No AWS Tagging Required

Virtual tag rules allocate costs that AWS never tagged — condition-based, priority-ordered, with an unallocated bucket showing exactly what’s left.

Cursor, Claude Code & OpenAI Codex

Attribute developer AI spend from Cursor, Claude Code and OpenAI Codex to the teams and projects that generated it — no tagging changes required.

Optimize AI Recommendations

Turn visibility into action — model substitution candidates, cache tuning opportunities and provisioned throughput candidates surfaced alongside your spend.

Available Everywhere You Work

Delivered through nOps Inform and exposed to every leading AI harness — query your spend from the tools you already use.

How AI Cost
Allocation Works

See where every AI dollar is going in minutes — no agent and no changes to your applications.

Step 1

Connect Your Billing Data

nOps Inform reads your AWS Cost & Usage Report — no agent, no SDK, no code change. Bedrock and Claude line items are picked up automatically.

Step 2

Visibility & Attribution

Get hourly spend mapped to model, account, department and token type, with this-week-vs-last-week context and automatic spike detection.

Step 3

Define Your Allocation Rules

Build virtual tag rules in a point-and-click UI — match costs by service, account, region or any existing tag value and assign a team, product or cost category. An unallocated bucket shows exactly what still needs a rule. No code, no AWS tagging changes.

Step 4

Allocate & Monitor Continuously

Keep 100% of spend allocated with week-over-week tracking and real-time anomaly alerts. Act on AI optimization recommendations — model substitution, cache tuning and throughput candidates — surfaced alongside your spend. Queryable from nOps Inform or any leading AI harness.

How nOps Compares

See why leading teams choose nOps to allocate, attribute and control their AI spend.

Capability nOps Traditional Cost Tools
Hourly cost granularity Yes Daily or monthly only
Spend allocated by model Yes Service-level only
Department / product attribution Fully mapped Manual tagging required
Token-level cost breakdown Input / output / cache Not available
Cache-write cost visibility Yes Hidden in blended totals
Real-time anomaly detection Automated, same-hour alerts After the invoice lands
Structural anomaly detection Overnight / wrong-account / new model None
Customer-facing vs internal split Derived Not possible
100% of spend allocated Guaranteed Large "unassigned" bucket
Multi-account, multi-model Unified Siloed views
Works with your AI harnesses Native (nOps Inform) Dashboard-only
Developer tool attribution (Cursor, Claude Code, OpenAI Codex) Native via Bedrock BYOK Not available
AI cost optimization recommendations Model substitution, cache tuning, throughput Not available
Setup effort Connect CUR — no agent Tagging projects & manual mapping
Custom allocation rules (no AWS tagging needed) Built-in virtual tag rule builder Requires manual AWS tag changes or scripting
Named views with custom filter & breakdown logic Unlimited user-defined views Fixed reports only

The Result

Hourly

Cost granularity, not daily averages

100%

Of AI spend allocated to a team or product

Real-Time

Anomalies caught the hour they happen

Every Harness

Queryable wherever your teams work

Frequently Asked Questions

nOps Inform reads your AWS Cost & Usage Report and resolves each Bedrock / Claude line item down to the hour, then maps it to a model, AWS account and the department or product that owns it. The result is a full attribution chain — hour → account → model → department — with 100% of spend allocated and nothing left in an “unassigned” bucket.

All Claude models on Bedrock — Opus, Sonnet and Haiku across versions — plus native Amazon Bedrock usage such as Nova tokens, Titan embeddings and Guardrails. Native Bedrock infrastructure is shown separately and never blended into model-inference totals. New model versions are picked up automatically and flagged the first time they appear.

Hourly. Hourly granularity surfaces overnight bursts and intra-day spikes that daily or monthly averages mask entirely — the level you need for real anomaly detection.

Through a department mapping that pairs AWS accounts and models with the product or team behind them — for example, Opus spend to Sales, Sonnet 4.5 to one platform and Sonnet 4.6 to another, regardless of which account they appear in. The same mapping is applied consistently to both cost and token views.

A pattern that is suspicious regardless of dollar size — such as a sales workload running between midnight and dawn, a model showing up in an account where it shouldn’t, or a brand-new model version appearing for the first time. nOps flags these alongside cost spikes.

For each hour in the current week, nOps compares cost against the average of the same hour across the prior week. A spike is flagged when an hour runs several times over its same-hour baseline and clears a minimum dollar floor, plus any single hour above a high-water threshold — each tied to the model, account and department that caused it.

Yes. Spend is split into input, output, cache-read and cache-write tokens under the same department mapping as the cost view, so you can see not just how many tokens you burned but which kind drove the bill.

Cache writes are usually a small share of token volume but a large share of cost, while cache reads are the opposite — high volume, low cost. Showing tokens and cost side by side makes that divergence visible and points to opportunities to tune cache TTL and reuse.

Yes — this is one of the most common use cases. If your CUR data includes Bedrock cost allocation tags (session tags, model tags or custom resource tags) that identify which customer or feature drove a call, nOps reads them directly and a named view can group by that dimension immediately. If your spend isn’t tagged at that level, virtual tag rules let you derive the same breakdown using account, service, region or any tag combination — without touching your application code. The result is a per-customer COGS view and a per-feature view that you can compare against revenue or usage data to understand which parts of your AI product are profitable and which aren’t.

Yes, for spend that flows through AWS Bedrock or appears in your AWS Cost & Usage Report. Tools like Cursor support bring-your-own-key configurations that route model calls through your own Bedrock account — those costs appear in your CUR exactly like any other Bedrock inference and are attributed automatically. For AI API spend that lands outside AWS (direct Anthropic or OpenAI billing), that won’t appear in the CUR; you would track it through those providers’ billing exports separately.

Virtual tag rules let you assign a cost category — team, product, environment or any label — to any cost row that matches conditions you define in the nOps UI. Each rule tests a combination of service, AWS account, region or existing tag values. Rules are evaluated in priority order: the first rule whose conditions all match wins, and the assigned tag value is applied to that row. A catch-all rule (no conditions) placed last captures anything not yet matched. An unallocated bucket always shows what hasn’t been covered, so Allocated + Unallocated always equals your total spend — there’s no hidden remainder.

Named views are saved lenses on your AI cost data — each with its own filter rules and breakdown dimension. For example, an “AI Platform” view filters to Bedrock spend and groups by feature tag, while a “Customer-Facing AI” view filters to production accounts only and groups by product. Views appear as tabs on the reporting page so you can switch between them without rebuilding a query. Every view shows its own daily trend, breakdown chart and unallocated bucket with the same reconciliation guarantee.

No. Allocation is built entirely from your AWS Cost & Usage Report. There is nothing to install in your application and no SDK to integrate.

The AWS CUR typically lands one to two days behind, so the most recent calendar day of the current window is usually partial. nOps states this clearly so week-over-week numbers aren’t misread.

The feature is fully available in the nOps Inform platform and exposed to all leading AI harnesses, so your teams can request allocation, anomalies or token breakdowns from the assistants and copilots they already use — returning the same data and the same department mapping as the dashboard.

Yes. Spend across every connected account and model is unified into one view, with the mapping resolving each account to its owning team or product.

It is included in the analysis automatically, assigned to its account’s department, and flagged explicitly as new — version migrations can change the per-call price significantly, so nOps calls out the shift rather than silently absorbing it.

Allocation is derived from billing and usage metering in the CUR — model names, accounts, token counts and cost — not from prompt or response content. [PLACEHOLDER — confirm exact data-handling wording with security / legal before publishing.]

Yes. nOps Inform complements your existing cost tooling and adds the model-, department- and token-level allocation those tools typically can’t produce.

Trusted with $4B in spend by 500+ innovative brands,
from
startups to enterprise.

”Partnering with nOps made it easy and seamless to optimize our AWS costs. The platform's automated cost optimization and commitment management reduced our AWS spend significantly, allowing us the time to focus on building and innovating. The flexibility nOps offers means we can stay agile and efficient without the risk of long-term commitments.”
Matt Morgan
Head of Engineering & Product, US, CommentSold
”As a FinOps lead at Arlo, I view cost transparency as non-negotiable, and nOps has delivered exactly that. The platform gave us end-to-end visibility into our cloud spend, eliminating guesswork and surfacing the true cost drivers across our environment. What stands out is how quickly the insights translate into action. nOps doesn’t just present data, it highlights the specific optimizations, anomalies, and inefficiencies we can address immediately. Those recommendations have driven real cost reductions for us while strengthening accountability across engineering teams.”
Alex Kuan
FinOps Lead, Arlo
”nOps has been invaluable in helping manage the intricacies of optimizing AWS environments and costs in today’s dynamic environment. The nOps Commitment Manager program provides us with the agility to progress and innovate, without the typical constraints associated with long-term commitments. The platform also gives us a clear and transparent picture of our costs, so we can plan and rightsize our resources accordingly.”
Khaled BEN JANNET
Datacenter & Cloud Team Leader, Vermeg
”nOps has transformed how we manage our AWS infrastructure. We considered several solutions, including another Reserved Instance solution, and then decided to work with nOps because it was a more complete solution. It gave us automated commitment management and an effortless way to track and monitor our costs with Business Contexts. We have seen a significant reduction in costs and highly recommend nOps to any organization looking to optimize their cloud costs and improve operational efficiency.”
Herman Lotter
Technology Operations Manager, Kurtosys
”Working with nOps has been great! Their team really knows FinOps and we’ve learned a lot of new ways to analyze our AWS costs and identify ways to reduce our monthly spend. Now, it’s easy to quickly report on our costs and how it’s connected back to the business.”
Andy Kwan
Senior Director of Infrastructure, BENlabs

Get a tour of the nOps Platform

Schedule a demo with our AWS experts to see how to optimize your AWS cloud

A Recognized Leader in Cloud Management

Advanced technology partner AWS, G2 4.8 rating, FinOps Foundation member and many more