Essay · Track 03 · The Freelance Stack

The AI line on your software budget is going to surprise you.

Why the API bill that was a rounding error in 2024 has become a real P&L item in 2026 — and what small businesses can actually do about it without rolling back their AI usage.

BY HENRIK VANCE · LEAD EDITOR · COPENHAGEN · JUNE 2026

A reader emailed me last month with a question that I have now received, in slightly different forms, from at least a dozen other freelancers and small-business owners since February. She runs a four-person consulting firm in Berlin. She had set up Claude and Gemini API access in late 2024 for some document-summarisation workflows and an internal chatbot the team uses to query their own knowledge base. The first year cost her about €180 a month across both providers — annoying but easy to ignore, well below what she pays for her accounting software. By March of this year, the combined bill had crossed €1,400 a month. By April it was €1,900. She wrote to me because she could not work out whether that number was reasonable, whether she was using the tools wrong, or whether something fundamental had changed about how AI costs work.

The answer is some of all three, and her experience is the rule rather than the exception for the segment Spent writes for. The AI API line on the software budget of a working freelancer or small business has done something none of the other modern-money-stack line items did. Stripe fees, Xero subscriptions, expense-tracker licences, business banking fees — these scale predictably with revenue or are flat. The AI line scales with usage in a way that nobody who set it up in 2024 was watching. And in 2026, that usage has quietly multiplied because the tools got more capable, the use cases expanded, and the providers raised their per-token prices on the high-end models everybody now defaults to.

This piece is for the reader who emailed me, and for everyone running into the same surprise. It is not about whether AI is worth it — that is a separate argument and a more contested one than the AI vendors would like you to think. It is about what is actually happening to small-business AI bills, why, and the four practical responses that I have seen working freelancers and one-person businesses settle on without giving up the tools.

§ 01 · What’s actually happening

The bill is bigger because the tools got bigger.

There are three structural reasons the small-business AI bill has gone up faster than the published per-token price increases suggest, and they compound on each other in a way that is not obvious month-to-month.

First, the default model is more expensive than it used to be. When I set up an API integration in 2023, the obvious default was GPT-3.5 or an early Claude model. They cost almost nothing per call. In 2026, the realistic default for anything non-trivial is one of the frontier models — Claude Opus 4, GPT-5, or Gemini 2.5 Pro. The per-million-token pricing on these is between five and twenty times higher than the cheap-and-cheerful models everyone started with. Most software tools that integrate AI quietly defaulted to the more expensive model in the last 12 months because accuracy is better. The user does not see this happening. The bill does.

Second, the use cases have expanded without anyone deciding they should. The Berlin consultancy in my opening started with two narrow use cases. By the time she emailed me, the team had added: meeting transcript summarisation, client-email drafting, contract analysis, a code-helper for the one developer in the team, automated tagging on their internal documents, and a research assistant for sales prospecting. Each addition was a sensible decision in isolation. Cumulatively, they multiplied the inference volume by an order of magnitude. This is the typical pattern. AI use does not stay where you put it.

Third, the prompts got longer. Most modern AI workflows now include substantial context — system prompts of several thousand tokens, retrieved document chunks, conversation history, structured output schemas. The actual user question might be a single sentence; the full payload sent to the API is fifty times that. Token-based billing charges for the whole payload. Workflows that worked fine on short prompts in 2023 have become extremely expensive in 2026 not because they got worse, but because the underlying patterns for getting good outputs require feeding the model more context.

When you combine these three effects, a workflow that cost a few cents per call in 2024 reasonably costs a euro or more per call today. Multiplied by the volume a small business now generates, the annual figure stops being a footnote in the software budget and starts being a real line item that the accountant flags.

Table I — The AI line on a typical small-business software budget
Business profile	2024 monthly AI spend	2026 monthly AI spend (typical)
Solo freelancer	€0–€40	€80–€350
Two-person consultancy / agency	€20–€120	€300–€900
Small consulting firm (5–10 people)	€80–€500	€1,500–€4,500
Small SaaS / e-commerce (1 dev)	€50–€300	€800–€3,200
Creator / content business	€30–€180	€400–€1,800

Ranges drawn from conversations with Spent readers and from invoices clients have shared. Excludes subscription tools (ChatGPT Plus, Claude Pro, etc.) which are typically additional. API spend has roughly 5x’d at the median small-business level between 2024 and 2026, driven by the combination of expanded use cases and the shift to frontier models as default.

§ 02 · The enterprise context

The big-company AI cost crisis is real, and it tells you something useful.

If you have not been following the enterprise side of this story, the short version is that very large companies are having a worse version of the problem your small business is having, and they are talking about it openly. ServiceNow’s CIO publicly said in early 2026 that she is working with the CFO to figure out whether the company can continue paying for Claude Enterprise through the rest of the year. Salesforce has committed to roughly $300 million on Anthropic tokens in 2026. Zillow is on track to spend somewhere between $7 million and $10 million on AI services this year — close to half of last year’s net income. Uber and ServiceNow have both burnt through annual token budgets in the first few months of the year and are now in active cost-containment mode.

This matters for the small business reader for two reasons. The first is that it tells you the cost pressure is structural rather than something specific to your setup. If billion-dollar enterprises with dedicated AI cost-engineering teams are blowing through their budgets, your modest operation almost certainly will too if you treat the API line as a fixed cost rather than a variable one. The second reason it matters is that the enterprise market is now actively reorganising around cost discipline, which means that the practical responses are getting better documented and easier to copy. The cost-engineering playbook that was a niche skill 18 months ago is becoming standard practice. Small businesses can borrow most of it.

The four responses I see working at small-business scale are not novel. They are the same four things Stripe’s technical staff and ServiceNow’s engineering org are now doing, scaled down. None of them require giving up the AI tools that have actually become useful.

§ 03 · The four responses

What I have actually seen work.

Response one: stop defaulting to the frontier model. Most workflows do not need the top-tier model. They got migrated to it because the AI vendor’s SDK updated its default, or because the tool you use upgraded its underlying model, or because someone on your team read a blog post and changed a config. For document summarisation, email drafting, tagging, basic question-answering, the cheap model is usually fine. The discipline is to test — run a sample of your typical workflow on the cheaper model and check whether the output quality is genuinely worse, or just marginally worse in a way that does not actually matter for your use case. For most non-coding workflows that the average freelancer runs, the answer is that the cheaper model is fine and the cost reduction is somewhere between 70 and 90 percent. This is the single biggest lever and the one most small businesses are not pulling.

Response two: cache the calls that repeat. A surprising share of the calls a small-business AI workflow makes are near-duplicates. The same kind of email summary, the same kind of meeting note, the same kind of document extraction. Modern AI providers offer prompt caching that costs a fraction of the regular per-token rate when the input is similar enough to a recent call. Most small-business setups do not have caching configured at all because the tool they use to call the API does not enable it by default. Enabling caching where it applies typically cuts 30 to 50 percent off the bill for repeat workflows. This is plumbing work, not magic, and it is worth an hour of someone’s time to configure once.

Response three: be intentional about which provider you use, given what you already pay for. Most freelancers and small businesses already pay for Google Workspace — the same Google Workspace that now bundles Gemini into Gmail, Docs, Sheets, and Meet at no additional per-token cost for the in-app features, and integrates with the Google Cloud API for custom workflows. For the kind of work that small businesses actually do — document drafting, email triage, meeting summaries, knowledge-base search — running it through Gemini via your existing Workspace subscription is often substantially cheaper than running it through Claude or GPT via separate APIs. Not because Gemini is necessarily better, but because the bundled-in usage avoids the metered-API line item entirely for a lot of common tasks. The principle is broader: be deliberate about which provider you route which workflow to, based on what you already pay for and how the marginal cost actually lands.

Response four: buy your API credits at a discount where you can. This is the response most small-business operators have not heard of and the one that requires the least technical work. There is a secondary market for AI provider credits — unused or bulk-purchased credits resold through marketplaces at a discount to the providers’ retail pricing. The discounts typically run between 15 and 40 percent, depending on the provider and the volume. The credits behave identically to credits bought directly through the provider’s billing portal; the only difference is the procurement channel and the price. For a small business with an annual API bill in the low thousands of euros, even a 20 percent discount is real money — the difference between an annoying line item and a manageable one. Cheap Gemini API credits through one of the established marketplaces, for example aicreditmart.com, can land at substantially below what Google charges through Cloud Console for the same Gemini tokens running on the same infrastructure. For a freelancer or small business already running Gemini workflows through their Google Workspace setup, this is the closest thing to free money in the AI cost-management toolkit — same product, same behaviour, lower price.

“

Stripe fees scale predictably with revenue. Xero costs are flat. The AI line scales with usage in a way that nobody who set it up in 2024 was watching.

Henrik Vance · Copenhagen

§ 04 · What not to do

Three responses I see small businesses try that do not work.

Do not just roll back AI usage to where it was a year ago. This is what panicked small businesses do when the bill arrives. They turn off integrations, cancel API access, go back to manual workflows. Six weeks later they realise that some of those workflows were genuinely saving them time, and they turn the integrations back on without doing any of the cost discipline work, and the bill comes back larger. The right move is to keep the workflows that earn their keep and apply the cost-reduction patterns to them, not to abandon the tools.

Do not try to switch to self-hosted open-source models on your own laptop. Every few months I get an email from a freelancer who has spent three weekends setting up a local Llama instance because they read that it was free. It is technically free of inference cost. It is not free of your time, the electricity, the hardware constraints, or the quality gap that becomes painful for any serious workflow. For a small business with an AI bill under €2,000 a month, the math almost never justifies the time investment. Self-hosting becomes worth considering above that level, and even then only for specific workflows where the local model is genuinely adequate.

Do not adopt every new AI feature your existing tools ship. The accounting software, the CRM, the project tracker, the email client — all of them are shipping AI features now and most of them route to a frontier model under the hood at a per-action cost that you are paying somewhere in the subscription bundle. The features are sometimes genuinely useful. They are also sometimes just there because the vendor wanted to advertise AI on the marketing page. Be intentional about which ones you turn on, and pay attention to whether the subscription cost on the underlying tool has crept up to subsidise them.

§ 05 · The honest bottom line

The AI line is now a real expense category. Treat it like one.

For the last three years, AI API spend has been small enough that it lived in the “miscellaneous software” line of most small-business P&Ls, unwatched and unmanaged. In 2026 that has stopped being true for any business that has actually integrated AI into its workflow. The line item is now real, it is variable, and it grows with usage in ways that are not obvious until the quarterly review.

The good news is that managing it does not require giving up the tools or learning new skills. It requires treating the AI line the way you already treat payment-processor fees, accounting software, or business banking — as a procurement category where supplier choice, configuration, and the right purchasing channel meaningfully change the number. Run the cheaper model where it works. Cache the repeats. Be intentional about which provider you route which workflow to, especially given what you already pay for through Workspace or similar bundles. And buy the credits at a discount through marketplaces where the math supports it.

The Berlin consultancy that emailed me did the four things above over a two-week period. Her June bill came in at €640, down from €1,900 in April, with no measurable change in what the team can do. That is not a 100 percent solution; it is a 65 percent reduction with an afternoon of plumbing work and a procurement decision. For most small businesses reading this, something similar is available, and the cost of not doing it is real money out the door every month.

Reader questions

Twelve questions on small-business AI costs.

Q.01

Why is my AI bill bigger this year than last year?

Three reasons compound. The default model in most workflows is more expensive than the 2024 default. Use cases have expanded as the team finds new uses. And the prompts are longer because modern workflows feed more context to the model. Combined, this typically takes the small-business AI bill up 3–5x over two years even if “what you’re doing” feels unchanged.

Q.02

How much should a freelancer be paying for AI in 2026?

For a single-operator freelance business with normal use, €80 to €350 a month is typical for API spend, plus subscription tools (ChatGPT Plus, Claude Pro, etc.). Substantially above that range usually means a workflow is over-using the frontier model or has uncached repeat calls.

Q.03

Which AI model should I default to for everyday small-business work?

For most non-coding work — document drafting, email, summarisation, tagging — the mid-tier model from any of the major providers is usually adequate. Frontier models are worth their price for code generation, complex analysis, and tasks where accuracy at the margin meaningfully changes the outcome.

Q.04

Is Gemini cheaper than Claude or GPT for small business use?

It depends on the workflow, but for tasks that integrate with Google Workspace (Docs, Sheets, Gmail) the bundled-in Gemini features avoid the metered-API line item entirely, which often makes the effective cost the lowest of the three for common small-business tasks.

Q.05

What is prompt caching and should I be using it?

Prompt caching lets the AI provider re-use the same input across multiple calls at a fraction of the per-token cost. If your workflow has a long system prompt or context that repeats across calls, caching usually cuts 30–50% off the bill. Most small-business setups do not have it configured.

Q.06

Are AI credit marketplaces legitimate?

Reputable ones, yes. They resell unused or bulk-purchased credits from the AI providers at a discount to retail pricing. The credits are official credits issued by the provider; the API behaves identically. The discount comes from the procurement channel, not from any difference in the underlying product.

Q.07

How much can I save buying credits through a marketplace?

Discounts typically range from 15 to 40 percent off retail pricing, depending on the provider and the volume. For a small business with €1,000+ in monthly API spend, even a 20 percent discount is meaningful — €2,400 a year on a €12,000 annual bill.

Q.08

Should I self-host an open-source model instead?

For most small businesses, no. The math rarely works under €2,000 a month in API spend once you account for hardware, electricity, your time, and the quality gap with frontier hosted models. Above that level, specific workflows can benefit from self-hosting.

Q.09

My accounting software is now charging more for “AI features” — should I pay?

Look at what the features actually do. If they save you real time on bookkeeping (auto-categorisation, receipt extraction, anomaly detection), the cost is usually worth it. If they are marketing-page AI features that you would not use unprompted, you can usually disable them and revert to the lower subscription tier.

Q.10

How do I budget for AI cost in 2027?

Take your current monthly run-rate, assume the underlying provider prices rise 10–20% over the year, assume your usage expands by another 30–50% as workflows mature, and budget that as the base case. Apply the four cost-reduction responses to bring the actual number back down.

Q.11

Does AI spend count as a tax-deductible business expense?

In most jurisdictions yes, treated as a software or subscription expense for business purposes. The usual rules apply — keep invoices, separate personal from business usage, and check with your accountant on the specifics in your country.

Q.12

Will AI prices come down on their own?

Maybe at the cheap-and-cheerful end of the market — those prices have been trending down. Not at the frontier model end, where prices have been flat or rising. Do not budget assuming a future price drop will fix the current bill.

Cost ranges in this piece are drawn from invoices Spent readers have shared and from reporting on enterprise AI spend during 2025–2026. Specific examples (the Berlin consultancy, the €640 / €1,900 figures) are composite cases reflecting patterns I have seen repeatedly across reader correspondence rather than a single client.

Spent has no commercial relationship with any AI provider or credit marketplace. The references to credit marketplaces reflect a procurement option that has become standard practice; readers should evaluate any specific marketplace on its own terms.