
We Tried Replacing Employees with AI. It Broke.
The most valuable AI lessons of the past three years have not come from the success stories. They have come from the companies that moved fast, cut humans, deployed AI at scale, and then faced the consequences in customer satisfaction scores, legal exposure, security breaches, operational failures, and in some cases, public retractions and re-hiring announcements. These cases are not fringe events. They are a documented pattern. And the pattern reveals something important: AI systems fail in specific, predictable ways when deployed to replace human judgment in contexts where the cost of error is high, the domain is unstructured, and the failure mode is not caught until it has already affected customers at scale.
Klarna boasted that AI was doing the work of 700 employees, then rehired humans within twelve months. McDonald's AI hiring platform exposed 64,000 applicants' data behind the password '123456'. Workday's hiring AI rejected the same Black applicant, a man over 40, more than 100 times. The failure pattern is documented, consistent, and instructive.
Klarna: The Most Public Reversal
In 2024, Klarna CEO Sebastian Siemiatkowski became one of the most-quoted AI optimists in the industry. His claim: AI was doing the work of 700 employees, and the company had stopped hiring 'largely due to AI.' The messaging was confident, and the press covered it extensively as a proof point for the AI transformation thesis. Within roughly twelve months, Siemiatkowski had publicly admitted the company 'went too far' and 'focused too much on cost.' Customer satisfaction scores had declined sharply, with users describing AI support responses as 'generic, repetitive, and insufficiently nuanced.' Klarna began rehiring humans.

The Klarna case is important not because it was unusual but because Siemiatkowski was unusually honest about what happened. Most companies that overdeployed AI and quietly reversed course did not issue public statements. Siemiatkowski's transparency made visible what is almost certainly a much more widespread phenomenon: organisations that announced AI-driven headcount reductions as strategic decisions and are now managing the customer experience consequences of those decisions without acknowledging them publicly.
McDonald's McHire: The Security Failure
In mid-2025, McDonald's AI-powered hiring platform McHire (built by Paradox.ai, since acquired by Workday) was found by security researchers to have a test administrator account secured with the default credentials '123456/123456' and no multi-factor authentication. The exposed data was linked to approximately 64,000 applicant records, including full chat transcripts from the 'Olivia' hiring chatbot and responses to personality assessment questions.

This incident illustrates a consistent pattern in enterprise AI deployment: AI systems are treated as products to be turned on, not infrastructure to be secured. The organisations deploying these systems inherit the security posture of the vendor's implementation defaults, which are often consumer-grade rather than enterprise-grade. The McHire breach was not sophisticated. It required no hacking skill. It required knowing that a publicly accessible admin panel existed and that the default credentials had never been changed.
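The defence against this failure class is mechanical, not clever. Below is a minimal sketch of a deployment-time guard, assuming a hypothetical admin service configured through environment variables; the variable names, credential list, and length threshold are illustrative and have nothing to do with McHire's or Paradox.ai's actual implementation.

```python
import os
import sys

# Hypothetical preflight guard: refuse to start an admin panel that still
# carries vendor defaults. Credential pairs and env var names are illustrative.
KNOWN_DEFAULT_CREDENTIALS = {("admin", "admin"), ("123456", "123456"), ("test", "test")}

def preflight_check(username: str, password: str, mfa_enabled: bool) -> list[str]:
    """Return a list of blocking findings; an empty list means safe to start."""
    findings = []
    if (username, password) in KNOWN_DEFAULT_CREDENTIALS:
        findings.append("admin account still uses vendor default credentials")
    if len(password) < 12:
        findings.append("admin password shorter than 12 characters")
    if not mfa_enabled:
        findings.append("multi-factor authentication is not enforced")
    return findings

if __name__ == "__main__":
    findings = preflight_check(
        username=os.environ.get("ADMIN_USER", "123456"),
        password=os.environ.get("ADMIN_PASS", "123456"),
        mfa_enabled=os.environ.get("MFA_ENABLED", "false") == "true",
    )
    if findings:
        for finding in findings:
            print(f"BLOCKED: {finding}", file=sys.stderr)
        sys.exit(1)  # fail closed: do not expose the panel at all
```

The design choice that matters is failing closed: if the check cannot confirm the defaults were changed and MFA is on, the panel never becomes reachable in the first place.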
Workday: Algorithmic Discrimination at Scale
A US federal court allowed a nationwide class action to proceed against Workday in May 2025, alleging that its AI-powered resume screening system discriminated against applicants over 40, and against Black and disabled applicants. The lead plaintiff, a Black man over 40, applied to more than 100 jobs through companies using Workday's AI screening and was automatically rejected every time. One rejection arrived at 1:50 AM, less than an hour after he applied. No human could have reviewed the application.

The legal theory: Workday's AI baked in existing bias by training on historical hiring data that reflected discriminatory human decisions. The AI did not invent new discrimination. It automated and scaled the discrimination that already existed in the training data. Thousands of companies use Workday's tools to screen candidates with no human in the loop. SafeRent settled a $2.2 million lawsuit in November 2024 over the same structural failure in its AI tenant scoring system's housing access decisions. The pattern is repeating across industries: AI trained on historically biased data, deployed at scale with no human review, producing discriminatory outcomes.
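Disparate impact of this kind is measurable before deployment. A minimal sketch using the four-fifths rule from US EEOC guidance, which flags any group whose selection rate falls below 80% of the highest group's rate; the applicant counts below are invented for illustration and are not drawn from the Workday case.

```python
# Minimal disparate-impact check (four-fifths rule): flag any group whose
# selection rate is below 80% of the best-performing group's rate.

def selection_rates(outcomes: dict[str, tuple[int, int]]) -> dict[str, float]:
    """outcomes maps group -> (advanced to human review, total applicants)."""
    return {group: advanced / total for group, (advanced, total) in outcomes.items()}

def adverse_impact(outcomes: dict[str, tuple[int, int]], threshold: float = 0.8) -> list[str]:
    """Return the groups whose rate falls below threshold * best rate."""
    rates = selection_rates(outcomes)
    best = max(rates.values())
    return [group for group, rate in rates.items() if rate / best < threshold]

# Invented counts for illustration only.
screening = {
    "under_40": (300, 1000),  # 30% advanced
    "over_40": (120, 1000),   # 12% advanced
}
print(adverse_impact(screening))  # ['over_40']: 0.12 / 0.30 = 0.4, well below 0.8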
Block: The 4,000-Person Layoff That Immediately Required Rehiring
Jack Dorsey cut 4,000 employees in February 2026 and attributed the decision entirely to AI capability advances. Within weeks, Block began quietly rehiring some of the same employees. At least four former workers returned to the company. Block leadership acknowledged one departure as a 'clerical error'; the company described another as an 'operational necessity.' The speed of the reversal, in some cases less than two weeks, suggests the cut was not as precisely calibrated to AI capability as the public framing implied. Dorsey's own internal memo from March 2025 had explicitly stated that the round of cuts then underway was 'not trying to hit a specific financial target, replacing folks with AI, or changing our headcount cap.' Eleven months later, AI replacement was the stated justification for a cut four times larger.
Air Canada: The Chatbot That Created Legal Liability
Air Canada's AI chatbot told a customer that bereavement fare discounts could be applied retroactively, a policy that did not exist. When the customer attempted to claim the discount, Air Canada refused, arguing the chatbot had made an error and the company was not bound by it. A Canadian civil tribunal ruled against Air Canada: the company was responsible for information provided by its own AI system, whether or not a human authorised the specific statement. The ruling established a legal precedent that AI chatbot outputs are the company's outputs. The liability follows the deployment, not the individual erroneous statement.
What the Failures Have in Common
- Every significant AI failure involves a high-stakes, judgment-dependent context (hiring decisions, customer service commitments, security of personal data) where the cost of individual errors is high and the volume of decisions means errors compound before anyone notices
- The failures occur when AI makes or communicates decisions without a human review step before consequences are delivered to the affected person (a gating sketch follows this list)
- Training data that encodes past discrimination reproduces that discrimination at scale, not because of malicious intent but because historical data reflects historical bias that the model has no mechanism to correct
- Security treated as an afterthought produces breaches that require no sophistication to execute: default credentials, missing MFA, no access reviews
- In AI deployments, the timeline from pilot to production affecting real people compresses in ways it does not with traditional software, and governance structures have not kept pace with that compression
- Companies that move fast on AI replacement and slow on AI governance produce the specific combination of conditions (high deployment scale, low oversight, high error cost) that makes failures severe rather than contained
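The second point in the list above is the one most directly fixable in code. Here is a minimal sketch of a human-in-the-loop gate, with every name, class, and threshold hypothetical: decisions are classified by consequence, and anything high-consequence or low-confidence is held in a review queue instead of executing automatically.

```python
from dataclasses import dataclass, field
from enum import Enum, auto

class Consequence(Enum):
    LOW = auto()   # e.g. suggesting a help article
    HIGH = auto()  # e.g. rejecting an applicant, quoting a refund policy

@dataclass
class Decision:
    subject: str
    action: str
    consequence: Consequence
    model_confidence: float

@dataclass
class ReviewQueue:
    pending: list[Decision] = field(default_factory=list)

    def route(self, decision: Decision) -> str:
        # High-consequence or low-confidence decisions wait for a human;
        # only low-stakes, high-confidence ones auto-execute.
        if decision.consequence is Consequence.HIGH or decision.model_confidence < 0.9:
            self.pending.append(decision)
            return "held for human review"
        return "auto-executed"

queue = ReviewQueue()
print(queue.route(Decision("applicant-4411", "reject", Consequence.HIGH, 0.97)))
# -> "held for human review": the consequence class, not the model's
#    confidence, is what decides whether a human sees it first.
```

The point of the design is that confidence alone never clears a high-consequence decision; a 97%-confident rejection still waits for a person, which is exactly the step the failures above skipped.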
The Lesson the Cases Teach
MIT's 2025 review of over 300 AI implementations found that the biggest failures were not technical. They were organisational: weak controls, unclear ownership, and misplaced trust in AI systems that had not been stress-tested in production conditions. The companies that have made AI replacement work are the ones that defined a specific problem, assigned a named owner, invested in data quality, maintained human review for high-consequence decisions, and measured outcomes against the baseline. The companies that failed are the ones that deployed first and governed later.

Given that AI systems can make thousands of decisions per minute, governing later means thousands of errors before the problem is detected. Klarna's customer satisfaction data degraded across hundreds of thousands of support interactions before the reversal decision was made. Workday's hiring AI screened out hundreds of qualified candidates before the class action was filed. The scale and speed at which AI systems operate is precisely what makes ungoverned deployment so damaging, and precisely what makes the governance investment non-optional rather than nice-to-have.
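The arithmetic behind 'governing later' is worth making concrete. A back-of-envelope sketch with illustrative numbers, none drawn from the cases above:

```python
# How many bad decisions accumulate before anyone notices?
# All three inputs are illustrative assumptions.
decisions_per_minute = 2_000
error_rate = 0.01        # 1% of decisions are wrong
detection_lag_days = 14  # time until the pattern surfaces in metrics

minutes = detection_lag_days * 24 * 60
errors_before_detection = decisions_per_minute * minutes * error_rate
print(f"{errors_before_detection:,.0f} erroneous decisions before detection")
# -> 403,200 erroneous decisions at a modest two-week detection lag
```

Even a 1% error rate produces six-figure harm counts when the detection loop runs on weeks and the decision loop runs on milliseconds. That gap between the two loops is the governance problem in one number.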