writing

What Happens When Your AI Vendor Triples the Price?

Jul 3, 2026· 5 min read· Roger Stringer

Your product runs on a model you don't own, priced by a company that can change the number whenever it wants. That's not a partnership. That's a tenancy, and you don't hold the lease.

This is the part nobody warns founders about. You ship something that works, customers love it, your margins look healthy on the spreadsheet. Then the email arrives: new pricing, effective next quarter. Or worse, the model you built around is being retired and you have ninety days to move. Suddenly the thing that made your product possible is the thing that can break it.

And the money flowing into this space guarantees the turbulence keeps coming. AI-agent software spend is projected at roughly $206.5 billion in 2026, up about 139% from $86.4 billion in 2025. When a market more than doubles in a year, pricing is not settled. It's being invented in real time, and you're one of the people it's being invented on. The market is also shifting toward outcome-based pricing — paying per completed task instead of per seat — which sounds fairer until you realize it means your cost now scales directly with your usage, and the vendor controls the meter.

The trap is convenience

Here's how founders walk into it. Early on, you pick the best model available, wire your product straight into its API, and move fast because moving fast is the whole point. Every prompt, every workflow, every feature gets shaped around that one provider's quirks. It works, so you don't touch it.

What you've actually done is hand a third party a veto over your unit economics. The cost of switching keeps climbing the longer you wait, because more of your product assumes that one vendor exists exactly as it is today. By the time the price triples, untangling yourself is a rebuild, not a config change. The vendor knows this. That's the leverage.

I watched this hit a team that had built their whole support product on one provider's model. They'd wired everything to it because it was the best option the week they started. Then the provider deprecated that version with about two months' notice and moved pricing on the replacement. Nothing was "broken" in the bug sense — it just cost more to run overnight, and behaved differently enough that answers they'd spent months tuning started drifting. They burned the next six weeks on an emergency migration nobody had budgeted, while the rest of the roadmap stopped dead. The migration wasn't hard because the new model was bad. It was hard because they'd assumed the old one would always be there.

Own the parts that are actually yours

The fix isn't loyalty to a vendor and it isn't paranoia about every dependency. It's being deliberate about what you own versus what you rent.

You rent the model. Models are getting cheaper, faster, and more interchangeable every quarter, and there will always be a better one next year. Renting compute is fine. Welding your business to one supplier of it is not.

You own everything that makes your product yours. Your prompts, refined over months of real usage. Your data and the way it's structured. Your evals — the test suite that tells you whether a given model is actually good enough for your customers, instead of you guessing. And a thin layer between your product and whatever model sits behind it, so swapping providers is a decision you make in an afternoon, not a quarter.

That last piece is the whole game. When you can run your evals against three providers and pick the one that's best this month, a price hike stops being a crisis and becomes a negotiation. You move. The vendor knows you can move. The leverage flips.

"Cheapest model today" is the wrong target

Plenty of founders optimize for the lowest per-token price and call it discipline. It isn't. Per-token cost is the most volatile, least durable number in your entire stack. Building your architecture around it is like signing a lease because the rent is low this month and ignoring that the landlord can raise it whenever he likes.

The number that actually protects your margins isn't the price. It's how fast you can walk away from it. Optimize for that.

This is where the real work lives, and it's the part you can't hand off to a tool. Wiring up a model API is the mechanical 70% — any competent developer or a decent coding agent can do it in a day. Deciding what to abstract, how to structure your evals, which dependencies are safe to lean on and which ones quietly become a noose — that's the 30% that determines whether your product survives its own vendor. That judgment is the job. It's the difference between a system that bends when the market moves and one that snaps.

When I build this for a client, the first thing I put behind a thin interface is the model call itself — one place everything routes through, so swapping providers is a config change instead of a search-and-replace across the codebase. The prompts and the eval suite live in our repo, never locked inside a vendor's platform. What I deliberately don't over-engineer is everything else. Founders love to imagine they'll need to support five providers on day one and build some elaborate abstraction for it. You won't. You need one clean seam at the model boundary and the discipline to keep your prompts and tests yours. That's most of the protection for a fraction of the work.

Don't architect your company around a price you don't control. Own your prompts, your data, and your exit — and let the models compete for your business instead of the other way around.