AI FinOps

Field note · Klarna

Klarna, AI customer service, and the resolutions that stayed

Klarna did something genuinely brave: it bet big on AI customer service, said so loudly, published the numbers — and then, when the picture got more complicated, said that loudly too. There's a lot to admire here, and a lot to learn from. Most of the lesson lives in the gap between two questions: "how many contacts did the AI handle?" and "how many problems did it actually solve and keep solved?"

What this is An illustrative analysis built on Klarna's own public statements and reputable reporting, all cited below. No inside information; Klarna hasn't endorsed it. The worked figures later are hypothetical, chosen to show the method — not Klarna's real numbers, which it has never published at this grain. This note treats cost per accepted change in its generalized form, "cost per accepted resolution," because the work here is customer support rather than code.

The story Klarna told, out loud

In February 2024, Klarna announced that its OpenAI-powered assistant had, in its first month, handled 2.3 million conversations — about two-thirds of its customer-service chats — doing the equivalent work of 700 full-time agents. (Worth being precise and fair: that "700 agents" is work-equivalent and hiring Klarna says it didn't need to do, not 700 people shown the door.) Average resolution time fell from ~11 minutes to under 2, repeat inquiries dropped about 25%, the assistant ran in 23 markets and 35-plus languages around the clock, and Klarna projected a $40 million profit improvement for the year — with customer satisfaction, it said, "on par" with human agents.[1]

Those are real, impressive numbers, and Klarna deserves credit for putting them on the record where the rest of us could learn from them. Over the same period the company's headcount fell substantially — its IPO filing later showed full-time employees dropping from around 5,500 at the end of 2022 to roughly 3,400 at the end of 2024 — and revenue per employee climbing from about $344,000 to $821,000.[6] The efficiency was not imaginary. Something powerful was happening.

What those numbers measured

Look at the headline metrics together — conversations handled, deflection rate, minutes-to-resolution, agents'-worth of work, cost avoided. They all answer the same kind of question: how much support load did the AI take on, and how cheaply? That's genuinely useful, and for a customer-service org it's the natural first thing to watch.

What that family of metrics can't quite see, by design, is the part that decides whether the saving is real: did each handled contact actually resolve the customer's problem, and did it stay resolved? A conversation that the AI "handled" but the customer had to re-open two days later, or escalate, or stew over before quietly churning, still counts once in the deflection numerator — even though, from the customer's point of view, it wasn't a resolution at all. That's not a flaw in deflection rate; it's just outside what it was built to watch.

What was harder to see — and what Klarna did about it

To Klarna's real credit, it noticed, and it said so plainly. In May 2025, Siemiatkowski reflected that cost had become "a too predominant evaluation factor" in how the support org was organized, and that "what you end up having is lower quality." The company began rehiring human agents — in a flexible, remote model — and framed a guaranteed human option as a brand promise: "investing in the quality of the human support is the way of the future for us."[3][4]

It's worth being careful here, because this story is often told too harshly. Klarna did not say AI customer service had failed. The assistant still handled the large majority of contacts; the company stayed AI-first; the efficiency gains were real and remain real. (Siemiatkowski also walked back a separate, more sweeping 2024 claim about "shutting down Salesforce," clarifying in 2025 that Klarna had consolidated data internally rather than replacing its SaaS with a single model.[5]) What changed wasn't a verdict on AI — it was the realization that optimizing the visible cost of a contact, without an equally clear read on whether the contact landed, had let quality drift further than anyone intended.

The honest version This isn't "company tries AI, AI fails, company retreats." It's the much more common and more useful story: a bold team measured the half of the picture that was easy to measure, moved fast on it, noticed the other half slipping, and adjusted — in public. The interesting question is whether a single number could have shown the drift sooner, while it was still small.

A friendlier dial: cost per accepted resolution

Generalize cost per accepted change to support work and you get cost per accepted resolution: the fully-loaded cost of producing customer resolutions that the customer actually accepted and that stayed accepted — divided by the count of those that did. A resolution "stays" if the customer doesn't re-contact about the same issue, escalate, or churn within a chosen window (say, 14 days). Re-contacts and escalations don't count in the denominator, and the cost of handling them — usually a human picking up the pieces — lands in the numerator as rework.

The difference from deflection rate is the whole point. Deflection counts contacts the AI took off the queue. Cost per accepted resolution counts problems that actually got solved and stayed solved, and prices the cleanup when they didn't. It's the same shift, in customer-support clothes, that the metric makes for code: stop celebrating volume produced, start measuring value kept.

A worked reading (illustrative)

Hypothetical numbers — not Klarna's — just to show how the dial behaves. Imagine 100,000 support conversations in a month, with AI handling 70% and humans the other 30%:

Deflection viewAccepted-resolution view
Conversations handled by AI70,00070,000
Conversations handled by humans30,00030,000
Direct handling cost (AI ~$0.25, human ~$4.00)$137,500$137,500
Re-contacts / escalations within 14 days (AI 25%, human 8%)not counted19,900
Rework cost (escalations cleaned up by humans)$70,000
Resolutions that stayed resolved80,100
Headline number$1.38 / conversation$2.59 / accepted resolution

Read on deflection, this looks close to free: $1.38 a conversation against an all-human baseline near $4. Read on accepted resolution, the real figure is $2.59 — still a clear win over all-human (which works out to roughly $4.35 per resolution that stays), but nearly double the headline, and the gap is the quality signal. That one number does two friendly things at once: it confirms the AI investment is genuinely paying off, and it points straight at where the value is leaking — the 25% re-contact rate — so the team knows exactly what to improve rather than discovering it months later through customer sentiment.

The takeaway

Klarna's arc isn't a cautionary tale; if anything it's a model of how to do this well — move boldly, measure something, share it openly, and correct course out loud when the fuller picture arrives. The only thing a cost-per-accepted-resolution lens would have added is time: a single, finance-legible dial that priced the re-contacts and escalations from month one, so the quality drift showed up as a gently rising number while it was still easy to steer, instead of as a story that had to be walked back later.

That's the whole pitch, really. Not "AI was a mistake" — it plainly wasn't — but "measure the resolutions that stay, not just the contacts that get handled, and you get to enjoy the savings and keep the quality." We're all still learning how to balance those two, and Klarna taught the rest of us a great deal by being willing to do it in the open.

Run your own numbers → How to use the metric → More field notes →

Sources

  1. Klarna, "Klarna AI assistant handles two-thirds of customer service chats in its first month" (Feb 27, 2024) — the source of the 2.3M-conversations, 700-agent-equivalent, 11-to-2-minute, and $40M figures. Every headline number traces to this Klarna release. https://www.klarna.com/international/press/klarna-ai-assistant-handles-two-thirds-of-customer-service-chats-in-its-first-month/
  2. CNBC, "Klarna CEO says AI helped company shrink workforce by 40%" (May 14, 2025). https://www.cnbc.com/2025/05/14/klarna-ceo-says-ai-helped-company-shrink-workforce-by-40percent.html
  3. Fortune, "Klarna turns back to humans as AI cost-cutting hurt quality" (May 9, 2025) — Siemiatkowski: cost had become "a too predominant evaluation factor," producing "lower quality," and "investing in the quality of the human support is the way of the future." https://fortune.com/2025/05/09/klarna-ai-humans-return-on-investment/
  4. Entrepreneur, "Klarna CEO Reverses Course By Hiring More Humans, Not AI" (May 2025) — Klarna pilots rehiring human agents in a flexible, remote model while keeping AI handling the bulk of contacts. https://www.entrepreneur.com/business-news/klarna-ceo-reverses-course-by-hiring-more-humans-not-ai/491396
  5. TechCrunch, "Klarna CEO doubts that other companies will replace Salesforce with AI" (Mar 4, 2025) — Siemiatkowski walking back the earlier "we shut down Salesforce" framing. https://techcrunch.com/2025/03/04/klarna-ceo-doubts-that-other-companies-will-replace-salesforce-with-ai/
  6. Klarna Group plc, SEC Form F-1 (filed Mar 14, 2025) — reports average revenue per employee rising from ~$344,000 (2022) to ~$821,000 (2024) alongside its AI-efficiency narrative. https://www.sec.gov/Archives/edgar/data/2003292/000162828025012824/klarnagroupplcf-1.htm

This is a field note — a friendly, illustrative reading of the public record, not a commissioned case study and not a scorecard. Klarna's willingness to publish both its wins and its rethink is exactly why there's enough here to learn from. Corrections and better public data are genuinely welcome via GitHub.