
Is AI Making Your Team Lazy?

[Image: a futuristic humanoid robot drives a car through a vibrant city at night, with illuminated buildings and blurred lights creating a dynamic atmosphere.]

Exploring the hidden cost of human disengagement from AI

We are rapidly entering an AI era defined by the “agentic” shift. These tools now write code, manage inboxes, conduct research, and execute multi-step workflows without a human lifting a finger. But when AI does more, what happens to the humans at the end of the line? Does the presence of a “perfect” partner actually make us better, or does it slowly erode the very skills and attention required to provide oversight? As we mark the renaming of D^3 as the HBS AI Institute this month, we’re taking a look back at some of our foundational research that defines the era. In “Falling Asleep at the Wheel: Human / AI Collaboration in a Field Experiment on HR Recruiters,” HBS AI Institute post-doctoral fellow Fabrizio Dell’Acqua designed a field experiment to test what happens as the quality of AI assistance advances. His findings have serious implications for anyone using AI, and for anyone in charge of systems where humans and AI share responsibility.

Key Insight: Falling Asleep at the Wheel

“If the AI appears too high quality, workers are at risk of ‘falling asleep at the wheel’ and mindlessly following its recommendations without deliberation.” [1]

The paper’s central hypothesis begins with a simple behavioral observation: as AI quality increases, the rational incentive to exert one’s own effort decreases. When a tool appears highly reliable, people may stop checking its work closely, stop gathering their own information, and stop exercising independent judgment. Dell’Acqua calls this “falling asleep at the wheel.” The result is a subtle but important distinction between AI performance in isolation and human-AI performance in practice. What matters is not only how good the model is, but how people behave when using it.
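To see the incentive at work, consider a quick back-of-the-envelope calculation in Python. The effort cost, the value of catching a mistake, and the human catch rate below are illustrative assumptions of ours, not figures from the paper; only the three AI accuracy levels mirror the experiment described next.

```python
# Back-of-the-envelope model of the incentive to double-check the AI.
# The effort cost, error value, and catch rate are our illustrative
# assumptions, not estimates from Dell'Acqua's paper.
EFFORT_COST = 0.15  # assumed cost of carefully verifying one recommendation
ERROR_VALUE = 1.0   # assumed value of catching one AI mistake
CATCH_RATE = 0.9    # assumed chance a vigilant human spots a mistake

for ai_accuracy in (0.75, 0.85, 0.99):
    # Only the recommendations the AI gets wrong are worth catching.
    expected_gain = (1 - ai_accuracy) * CATCH_RATE * ERROR_VALUE
    decision = "check" if expected_gain > EFFORT_COST else "defer to the AI"
    print(f"AI {ai_accuracy:.0%} accurate: expected gain from checking "
          f"{expected_gain:.3f} -> rational worker will {decision}")
```

With these assumed numbers, checking pays at 75% accuracy but not at 85% or 99%: exactly the point where a rational worker starts to drift off.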

Key Insight: The Counter-Intuitive Power of “Flawed” Predictions

“On average, HR recruiters receiving lower-quality AI were less likely to ‘fall asleep’ as they tended not to automatically select the AI-recommended candidate.” [2]

To test this theory, Dell’Acqua conducted a field experiment involving 181 professional HR recruiters who were tasked with reviewing 44 resumes each for a software engineering position. The recruiters were randomly assigned different levels of AI assistance: a “Perfect” AI with approximately 99% accuracy, a “Good” AI with approximately 85% accuracy, a “Bad” AI with roughly 75% accuracy, or no AI at all. Recruiters knew which tier of AI they were working with before starting. The results were clear and striking: recruiters who collaborated with the “Bad” AI actually outperformed those with the “Good” AI. Because the “Bad” AI was clearly imperfect, the recruiters remained vigilant, spending more time on each application and verifying the AI’s claims. This group effectively learned the AI’s weaknesses and improved their own performance to compensate. Those with better AI moved faster and delegated more. 
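A small sketch shows how that reversal can arise. Only the three accuracy tiers come from the experiment; the vigilance rates and the human catch rate are assumptions we chose to encode the “falling asleep” effect, not estimates from Dell’Acqua’s data.

```python
# Toy model of combined human-AI screening accuracy. Only the three AI
# accuracy tiers come from the experiment; the vigilance rates and the
# catch rate are our illustrative assumptions.
CATCH_RATE = 0.9  # assumed chance a vigilant human catches an AI error

# Assumed vigilance by tier: the better the AI looks, the less workers verify.
VIGILANCE = {0.75: 0.80, 0.85: 0.30, 0.99: 0.05}

for ai_accuracy, p_check in VIGILANCE.items():
    # A decision is right if the AI is right, or if the AI is wrong but
    # the human double-checks and catches the mistake.
    combined = ai_accuracy + (1 - ai_accuracy) * p_check * CATCH_RATE
    print(f"AI {ai_accuracy:.0%} -> combined human-AI accuracy {combined:.1%}")
```

Under these assumptions, the 75% AI paired with a vigilant human beats the 85% AI paired with a complacent one, echoing the experimental result, while the near-perfect AI wins simply because it leaves almost nothing to catch.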

Key Insight: The Design Implication

“Designing effective structures for human/machine collaboration requires careful consideration of the organization’s objectives and task features.” [3]

Dell’Acqua is careful not to recommend that organizations simply deploy older, worse AI models. The real prescription is more nuanced: design AI systems with human behavioral responses in mind, not just technical performance benchmarks. In settings where people can add value, the design of the interaction becomes a strategic variable. That might mean calibrating AI confidence displays, introducing deliberate uncertainty signals for borderline cases, or creating interfaces that prompt humans to engage before surfacing a recommendation. A system that nudges humans to stay attentive may perform better than one that invites passive approval.
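As one concrete illustration of that last idea, prompting humans to engage before surfacing a recommendation, here is a minimal judge-first review flow in Python. The structure and names are hypothetical, our own sketch rather than anything from the paper or a production hiring system.

```python
# Minimal sketch of a "judge first, then reveal" review flow. The names
# and structure here are hypothetical illustrations, not drawn from the
# paper or any production hiring system.
from dataclasses import dataclass

@dataclass
class Review:
    candidate_id: str
    human_call: str   # recorded before the AI's view is revealed
    ai_call: str
    final_call: str

def review_candidate(candidate_id: str, ai_recommendation: str, ask_human) -> Review:
    # Step 1: require an independent judgment up front, so the AI's
    # answer cannot anchor the reviewer's first read of the resume.
    human_call = ask_human(f"Your call on {candidate_id} (hire/reject)? ")

    # Step 2: only now reveal what the AI recommends.
    print(f"AI recommends: {ai_recommendation}")

    # Step 3: on disagreement, force a deliberate resolution rather than
    # letting the default silently fall to the AI.
    if human_call != ai_recommendation:
        final_call = ask_human("You and the AI disagree. Final call? ")
    else:
        final_call = human_call
    return Review(candidate_id, human_call, ai_recommendation, final_call)

if __name__ == "__main__":
    print(review_candidate("cand-042", "hire", ask_human=input))
```

The ordering is the design choice that matters: because the reviewer commits to a judgment before seeing the AI’s, the recommendation can confirm or challenge their thinking but never quietly replace it.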

Why This Matters

For executives and business leaders, the lesson here is that combined human-AI performance is its own optimization target, and it might not move in lockstep with AI accuracy improvements. Strategy in the age of AI still requires an understanding of human psychology and effort. If leaders want better outcomes, they need to look beyond technical benchmarks and design workflows that keep their employees wide awake at the wheel.

Bonus

This article shows that impressive AI performance can hide important weaknesses. Here, the issue hinges on over-reliance by human collaborators, but at other times it’s caused by the model itself. For example, even highly capable AI systems can still struggle with something as basic as multi-digit multiplication. For a closer look at this, check out When Giants Stumble: What Multiplication Reveals about AI’s Capabilities.

References

[1] Dell’Acqua, Fabrizio, “Falling Asleep at the Wheel: Human / AI Collaboration in a Field Experiment on HR Recruiters,” Working paper, Laboratory for Innovation Science, Harvard Business School (2022), 2.

[2] Dell’Acqua, “Falling Asleep at the Wheel,” 3.

[3] Dell’Acqua, “Falling Asleep at the Wheel,” 4.

Meet the Authors

Fabrizio Dell’Acqua is a postdoctoral researcher at Harvard Business School. His research explores how human/AI collaboration reshapes knowledge work: the impact of AI on knowledge workers, its effects on team dynamics and performance, and its broader organizational implications.

Watch a video version of the Insight Article here.
