From Tokenmaxxing to Valuemaxxing

Allocate spend by developers and agents

Map AI costs — from Bedrock models to developer agents and external platforms — to the product, team or environment behind them, down to the hour. Track developer AI spend through nOps to attribute costs from Cursor, Claude Code and OpenAI Codex to the teams and projects that generate them.

Real-time anomaly detection

Catch AI cost spikes as they happen — not days later when the invoice lands. Hourly comparison flags structural anomalies and sudden model cost changes the same hour they occur.

Optimize AI with our recommendations

Turn visibility into action — nOps surfaces token efficiency opportunities, model substitution candidates and cache tuning recommendations alongside your spend data.

nOps manages $4B+ in annual cloud spend for innovative brands, from startups to enterprises, saving them 50%+ autonomously.

AI Visibility & Optimization (Tokenomics)

Hourly Spend Allocation by Model & Department

Full attribution chain: hour → account → model → department
Every account and model mapped to the product or team that owns it
100% of spend allocated — no “unassigned” bucket left behind
Expandable allocation tree with week-over-week deltas, share bars and NEW / ANOMALOUS tags

Developer & Agent Cost Attribution

Attribute AI costs from Cursor, Claude Code and similar developer tools to the team that generated them — automatically, when routed through Bedrock, with no tagging changes required
See total cost per agent workflow, not just individual model calls — every step in the chain attributed back to the agent that triggered it
Track cost per agent run over time to detect runaway loops, growing chain lengths, and sudden price changes from model version upgrades
Cleanly separate developer AI spend from production workloads — know what’s engineering tooling and what’s customer-facing COGS

Completely Headless — Query AI Costs from Any Workflow

Ask for allocation, anomalies, recommendations or token breakdowns in natural language — from nOps Clara, Claude, Cursor or any AI harness your teams already use
Responses include rich visualizations generated inline — spend charts, attribution breakdowns and anomaly timelines, not just text
Fully headless API access — embed AI cost data and recommendations in your existing dashboards, Slack workflows or internal tools without opening the nOps UI
Same virtual tags, same department mapping and same allocation rules whether you open the dashboard or query it in chat

Dive Deeper Into Spikes & Anomalies

Hourly granularity surfaces overnight bursts and intra-day spikes that daily averages hide entirely
Every flagged hour ties the spike to the model, account and department that caused it — with the multiple vs the same hour last week
Structural anomalies caught automatically: overnight Sales activity, a model in the wrong account, or a brand-new model version

Turn Visibility Into Action — AI Optimization Recommendations

Token efficiency opportunities — prompt compression, context window sizing and input:output ratio analysis that surfaces over-retrieval and bloated system prompts in RAG workloads
Model substitution candidates — flag tasks running on larger models where a smaller one meets the quality bar, with estimated monthly savings per substitution shown before you commit
Cache tuning recommendations — hit rate, write/read ratio and TTL analysis; when cache writes dominate cost but hit rates are low, nOps quantifies the opportunity and shows the target
Provisioned throughput and batch candidates — consistent high-volume workloads that would be cheaper on committed throughput; async workloads that can move to batch pricing

Total Bedrock & Model Visibility

Every model in one place — Anthropic, OpenAI, Llama — with native Bedrock (Nova, Titan embeddings, Guardrails) shown separately, never blended into model spend
All the metrics you need: current vs prior 7 days, peak hour, week-over-week change and spike count
Benchmark spend and efficiency across accounts and products at a glance

Granular Allocation — Native Tags and Virtual Rules

If Bedrock cost allocation tags already exist in your CUR — model, session, or custom resource tags — nOps reads them directly, no configuration required
For spend without tags, virtual tag rules fill the gap: condition-based matching on service, account, region or any existing tag value → assign a team, product or cost category from a point-and-click UI
Priority ordering with first-match-wins: precise rules take precedence; a catch-all handles everything else — no code, no CUR schema knowledge required
Allocated + Unallocated = Total always visible — an explicit unallocated bucket shows exactly what still needs a rule

Named Views — Slice Your Costs Any Way You Need

Create named views — AI Platform, Customer-Facing AI, Engineering Costs — each with its own filter rules and breakdown dimension
Build a per-customer COGS view: see exactly how much AI infrastructure each revenue-generating customer or product feature costs — the foundation for margin analysis and ROI
Each view is a tab on the reporting page: switch lenses without leaving the screen or rebuilding a query
Combine filters across service, account, region and any real or virtual tag — AND / OR logic, any depth
Every view shows its own daily trend, breakdown chart and unallocated bucket — the same reconciliation invariant across all lenses

Token-Level Cost Breakdown

Spend split into input, output, cache-read and cache-write — under the same department mapping as your cost view
See the real driver: cache writes are a small share of volume but a large share of cost; cache reads are the opposite
Input : output ratio characterises the workload — high ratios signal retrieval / agentic use, low ratios signal pricier generation

Allocate Every Dollar

100% of Bedrock and AI spend mapped to a product, team or environment. No black box, no orphan costs.

Catch Spikes in the Hour

Hourly comparison vs the prior week flags anything 3× over its same-hour baseline, plus structural anomalies — automatically.

No AWS Tagging Required

Virtual tag rules allocate costs that AWS never tagged — condition-based, priority-ordered, with an unallocated bucket showing exactly what’s left.

Cursor, Claude Code & OpenAI Codex

Attribute developer AI spend from Cursor, Claude Code and OpenAI Codex to the teams and projects that generated it — no tagging changes required.

Optimize AI Recommendations

Turn visibility into action — model substitution candidates, cache tuning opportunities and provisioned throughput candidates surfaced alongside your spend.

Available Everywhere You Work

Delivered through nOps Inform and exposed to every leading AI harness — query your spend from the tools you already use.

How AI Cost Allocation Works

See where every AI dollar is going in minutes — no agent and no changes to your applications.

STEP 1

Connect Your Billing Data

nOps Inform reads your AWS Cost & Usage Report — no agent, no SDK, no code change. Bedrock and Claude line items are picked up automatically.

STEP 2

Visibility & Attribution

Get hourly spend mapped to model, account, department and token type, with this-week-vs-last-week context and automatic spike detection.

STEP 3

Define Your Allocation Rules

Build virtual tag rules in a point-and-click UI — match costs by service, account, region or any existing tag value and assign a team, product or cost category. An unallocated bucket shows exactly what still needs a rule. No code, no AWS tagging changes.

STEP 4

Allocate & Monitor Continuously

Keep 100% of spend allocated with week-over-week tracking and real-time anomaly alerts. Act on AI optimization recommendations — model substitution, cache tuning and throughput candidates — surfaced alongside your spend. Queryable from nOps Inform or any leading AI harness.

How nOps Compares

See why leading teams choose nOps to allocate, attribute and control their AI spend.

Capability	nOps	Traditional Cost Tools
Hourly cost granularity	Yes	Daily or monthly only
Spend allocated by model	Yes	Service-level only
Department / product attribution	Fully mapped	Manual tagging required
Token-level cost breakdown	Input / output / cache	Not available
Cache-write cost visibility	Yes	Hidden in blended totals
Real-time anomaly detection	Automated, same-hour alerts	After the invoice lands
Structural anomaly detection	Overnight / wrong-account / new model	None
Customer-facing vs internal split	Derived	Not possible
100% of spend allocated	Guaranteed	Large 'unassigned' bucket
Multi-account, multi-model	Unified	Siloed views
Works with your AI harnesses	Native (nOps Inform)	Dashboard-only
Developer tool attribution	Native via Bedrock BYOK	Not available
AI cost optimization recommendations	Model substitution, cache tuning, throughput	Not available
Setup effort	Connect CUR — no agent	Tagging projects & manual mapping
Custom allocation rules (no AWS tagging needed)	Built-in virtual tag rule builder	Requires manual AWS tag changes or scripting
Named views with custom filter & breakdown logic	Unlimited user-defined views	Fixed reports only

The Result

Hourly

Cost granularity, not daily averages

100%

Of AI spend allocated to a team or product

Real-time

Anomalies caught the hour they happen

Every Harness

Queryable wherever your teams work

Frequently Asked Questions

How does nOps allocate AI spend?

nOps Inform reads your AWS Cost & Usage Report and resolves each Bedrock / Claude line item down to the hour, then maps it to a model, AWS account and the department or product that owns it. The result is a full attribution chain — hour → account → model → department — with 100% of spend allocated and nothing left in an "unassigned" bucket.

What models and services does it cover?

All Claude models on Bedrock — Opus, Sonnet and Haiku across versions — plus native Amazon Bedrock usage such as Nova tokens, Titan embeddings and Guardrails. Native Bedrock infrastructure is shown separately and never blended into model-inference totals. New model versions are picked up automatically and flagged the first time they appear.

How granular is the cost data?

Hourly. Hourly granularity surfaces overnight bursts and intra-day spikes that daily or monthly averages mask entirely — the level you need for real anomaly detection.

How does nOps map spend to teams and products?

Through a department mapping that pairs AWS accounts and models with the product or team behind them — for example, Opus spend to Sales, Sonnet 4.5 to one platform and Sonnet 4.6 to another, regardless of which account they appear in. The same mapping is applied consistently to both cost and token views.

What is a "structural anomaly"?

A pattern that is suspicious regardless of dollar size — such as a sales workload running between midnight and dawn, a model showing up in an account where it shouldn't, or a brand-new model version appearing for the first time. nOps flags these alongside cost spikes.

How does spike detection work?

For each hour in the current week, nOps compares cost against the average of the same hour across the prior week. A spike is flagged when an hour runs several times over its same-hour baseline and clears a minimum dollar floor, plus any single hour above a high-water threshold — each tied to the model, account and department that caused it.

Can I break spend down by token type?

Yes. Spend is split into input, output, cache-read and cache-write tokens under the same department mapping as the cost view, so you can see not just how many tokens you burned but which kind drove the bill.

Why do cache-write tokens matter?

Cache writes are usually a small share of token volume but a large share of cost, while cache reads are the opposite — high volume, low cost. Showing tokens and cost side by side makes that divergence visible and points to opportunities to tune cache TTL and reuse.

Can I allocate AI costs by customer, product or feature to analyze ROI?

Yes — this is one of the most common use cases. If your CUR data includes Bedrock cost allocation tags (session tags, model tags or custom resource tags) that identify which customer or feature drove a call, nOps reads them directly and a named view can group by that dimension immediately. If your spend isn't tagged at that level, virtual tag rules let you derive the same breakdown using account, service, region or any tag combination — without touching your application code. The result is a per-customer COGS view and a per-feature view that you can compare against revenue or usage data to understand which parts of your AI product are profitable and which aren't.

Does nOps surface AI spend from tools like Cursor, Claude or OpenAI?

Yes, for spend that flows through AWS Bedrock or appears in your AWS Cost & Usage Report. Tools like Cursor support bring-your-own-key configurations that route model calls through your own Bedrock account — those costs appear in your CUR exactly like any other Bedrock inference and are attributed automatically. For AI API spend that lands outside AWS (direct Anthropic or OpenAI billing), that won't appear in the CUR; you would track it through those providers' billing exports separately.

How do virtual tag rules work?

Virtual tag rules let you assign a cost category — team, product, environment or any label — to any cost row that matches conditions you define in the nOps UI. Each rule tests a combination of service, AWS account, region or existing tag values. Rules are evaluated in priority order: the first rule whose conditions all match wins, and the assigned tag value is applied to that row. A catch-all rule (no conditions) placed last captures anything not yet matched. An unallocated bucket always shows what hasn't been covered, so Allocated + Unallocated always equals your total spend — there's no hidden remainder.

What are named views and how are they different from the main allocation report?

Named views are saved lenses on your AI cost data — each with its own filter rules and breakdown dimension. For example, an "AI Platform" view filters to Bedrock spend and groups by feature tag, while a "Customer-Facing AI" view filters to production accounts only and groups by product. Views appear as tabs on the reporting page so you can switch between them without rebuilding a query. Every view shows its own daily trend, breakdown chart and unallocated bucket with the same reconciliation guarantee.

Does nOps require an agent or code changes?

No. Allocation is built entirely from your AWS Cost & Usage Report. There is nothing to install in your application and no SDK to integrate.

How fresh is the data?

The AWS CUR typically lands one to two days behind, so the most recent calendar day of the current window is usually partial. nOps states this clearly so week-over-week numbers aren't misread.

Which AI harnesses can I use this from?

The feature is fully available in the nOps Inform platform and exposed to all leading AI harnesses, so your teams can request allocation, anomalies or token breakdowns from the assistants and copilots they already use — returning the same data and the same department mapping as the dashboard.

Does it work across multiple AWS accounts?

Yes. Spend across every connected account and model is unified into one view, with the mapping resolving each account to its owning team or product.

What happens when a new model version appears?

It is included in the analysis automatically, assigned to its account's department, and flagged explicitly as new — version migrations can change the per-call price significantly, so nOps calls out the shift rather than silently absorbing it.

Does nOps store my prompts or workload data?

Allocation is derived from billing and usage metering in the CUR — model names, accounts, token counts and cost — not from prompt or response content.

Can I use this alongside my existing FinOps tools?

Yes. nOps Inform complements your existing cost tooling and adds the model-, department- and token-level allocation those tools typically can't produce.

Free 30-Minute Savings Analysis

Achieve a 55% effective savings rate with nOps.

Book a free savings analysis to:

Benchmark performance vs peers
See how much more you can save
Set up in 15 minutes and time-to-value in a few days

Customers who switch to nOps save 20% more on average versus competitors.

Trusted with $4B in spend by 500+ innovative brands, from startups to enterprise.

“Partnering with nOps made it easy and seamless to optimize our AWS costs. The platform's automated cost optimization and commitment management reduced our AWS spend significantly, allowing us the time to focus on building and innovating. The flexibility nOps offers means we can stay agile and efficient without the risk of long-term commitments.”

Matt Morgan

Head of Engineering & Product, US, CommentSold

“nOps has transformed how we manage our AWS infrastructure. We considered several solutions, including another Reserved Instance solution, and then decided to work with nOps because it was a more complete solution. It gave us automated commitment management and an effortless way to track and monitor our costs with Business Contexts. We have seen a significant reduction in costs and highly recommend nOps to any organization looking to optimize their cloud costs and improve operational efficiency.”

Herman Lotter

Technology Operations Manager, Kurtosys

“As a FinOps lead at Arlo, I view cost transparency as non-negotiable, and nOps has delivered exactly that. The platform gave us end-to-end visibility into our cloud spend, eliminating guesswork and surfacing the true cost drivers across our environment. What stands out is how quickly the insights translate into action. nOps doesn’t just present data, it highlights the specific optimizations, anomalies, and inefficiencies we can address immediately. Those recommendations have driven real cost reductions for us while strengthening accountability across engineering teams.”

Alex Kuan

FinOps Lead, Arlo

“nOps provided the visibility and control we needed to optimize our AWS costs without long-term commitments. I highly recommend nOps for cloud optimization and managing cloud costs effectively.”

Andy Kwan

Senior Director of Infrastructure, BENlabs

“nOps has been invaluable in helping manage the intricacies of optimizing AWS environments and costs in today’s dynamic environment. The nOps Commitment Manager program provides us with the agility to progress and innovate, without the typical constraints associated with long-term commitments. The platform also gives us a clear and transparent picture of our costs, so we can plan and rightsize our resources accordingly.”