AI Thinks Like Us: Flaws, Biases, and All, Study Sees

April 1, 2025

Summary: A new research finds that ChatGPT, while excellent at reasoning and arithmetic, exhibits many of the same mental prejudices as people when making personal choices. In tests for common wisdom errors, the AI showed optimism, risk aversion, and even the typical gambler’s fallacy, though it avoided other normal people mistakes like base-rate neglect.

Incidentally, newer versions of the AI were more mathematically precise but also displayed stronger judgment-based prejudices in some cases. These studies raise worries about relying on AI for high-stakes choices, as it may not reduce human error but otherwise manage it.

Important Information:

Judgment vs. Logic: AI excels at goal things but struggles with personal decision-making.
Oversight Needed: Professionals warn that AI may be monitored like a mortal decision-maker.

Origin: INFORMS

Can we really believe AI to make better choices than humans? A new research says… no often.

Researchers have discovered that OpenAI’s ChatGPT, one of the most advanced and popular AI models, makes the same kinds of decision-making mistakes as humans in some situations&nbsp, –&nbsp, showing biases like overconfidence of hot-hand ( gambler’s ) fallacy&nbsp, –&nbsp, yet acting inhuman in others ( e. g., not suffering from base-rate neglect or sunk cost fallacies ).

Published in the&nbsp, INFORMS book Manufacturing &amp, Service Operations Management, the study reveals that ChatGPT doesn’t really squeeze numbers&nbsp, –&nbsp, it” thinks” in ways oddly similar to humans, including cognitive shortcuts and blind spots.

These biases remain more stable across various business situations but may alter as Artificial evolves from one version to the next.

AI: A Smart Assistant with Human-Like Imperfections

The study, &nbsp,” A Manager and an AI Walk into a Bar: Does ChatGPT Make Biased Decisions Like We Do”? ,&nbsp, put ChatGPT through 18 different bias tests. The benefits?

AI is great at math, but struggles with view calls&nbsp, – It excels at reasonable and probability-based issues but falls when choices require personal argument.
Bias isn’t going away &nbsp, – Although the newer GPT-4 type is more mathematically precise than its forerunner, it often displayed&nbsp, stronger&nbsp, prejudices in judgment-based jobs.

Why This Matters

From job hiring to loan approvals, AI is already shaping major decisions in business and government. But if AI mimics human biases, could it be reinforcing bad decisions instead of fixing them?

” As AI learns from human data, it may also think like a human&nbsp, –&nbsp, biases and all”, says Yang Chen, lead author and assistant professor at Western University.

” Our research shows when AI is used to make judgment calls, it sometimes employs the same mental shortcuts as people”.

The study found that ChatGPT tends to:

Overestimate itself&nbsp, – ChatGPT assumes it’s more accurate than it really is.
Seek confirmation&nbsp, – AI favors information that supports existing assumptions, rather than challenging them.
Avoid ambiguity&nbsp, –&nbsp, AI prefers alternatives with more certain information and less ambiguity.

” When a decision has a clear right answer, AI nails it – it is better at finding the right formula than most people are”, says&nbsp, Anton Ovchinnikov&nbsp, of Queen’s University. ” But when judgment is involved, AI may fall into the same cognitive traps as people”.

So, Can We Trust AI to Make Big Decisions?

With governments worldwide working on AI regulations, the study raises an urgent question: Should we rely on AI to make important calls when it can be just as biased as humans?

” AI isn’t a neutral referee”, says Samuel Kirshner of UNSW Business School. ” If left unchecked, it might not fix decision-making problems&nbsp, –&nbsp, it could actually make them worse”.

The researchers say that’s why businesses and policymakers need to monitor AI’s decisions as closely as they would a human decision-maker.

” AI should be treated like an employee who makes important decisions&nbsp, –&nbsp, it needs oversight and ethical guidelines”, says Meena Andiappan of McMaster University. ” Otherwise, we risk automating flawed thinking instead of improving it”.

What’s Next?

The study’s authors recommend regular audits of AI-driven decisions and refining AI systems to reduce biases. With AI’s influence growing, making sure it improves decision-making&nbsp, –&nbsp, rather than just replicating human flaws&nbsp, –&nbsp, will be key.

” The evolution from GPT-3.5 to 4.0 suggests the latest models are becoming more human in some areas, yet less human but more accurate in others”, says Tracy Jenkin of Queen’s University.

” Managers must evaluate how different models perform on their decision-making use cases and regularly re-evaluate to avoid surprises. Some use cases will need significant model refinement”.

About this AI and cognition research news

Author: Ashley Smith
Source: INFORMS
Contact: Ashley Smith – INFORMS
Image: The image is credited to Neuroscience News

Original Research: Open access.
” A Manager and an AI Walk into a Bar: Does ChatGPT Make Biased Decisions Like We Do”? by Tracy Jenkin et al. Manufacturing &amp, Service Operations Management

Abstract

A Manager and an AI Walk into a Bar: Does ChatGPT Make Biased Decisions Like We Do?

Problem definition: Large language models ( LLMs) are being increasingly leveraged in business and consumer decision-making processes.

Because LLMs learn from human data and feedback, which can be biased, determining whether LLMs exhibit human-like behavioral decision biases ( e. g., base-rate neglect, risk aversion, confirmation bias, etc. ) is crucial prior to implementing LLMs into decision-making contexts and workflows.

To understand this, we examine 18 common human biases that are important in operations management ( OM) using the dominant LLM, ChatGPT. &nbsp,

Methodology/results: We perform experiments where GPT-3.5 and GPT-4 act as participants to test these biases using vignettes adapted from the literature ( “standard context” ) and variants reframed in inventory and general OM contexts.

In almost half of the experiments, Generative Pre-trained Transformer ( GPT ) mirrors human biases, diverging from prototypical human responses in the remaining experiments. We also observe that GPT models have a notable level of consistency between the standard and OM-specific experiments as well as across temporal versions of the GPT-3.5 model.

Our comparative analysis between GPT-3.5 and GPT-4 reveals a dual-edged progression of GPT’s decision making, wherein GPT-4 advances in decision-making accuracy for problems with well-defined mathematical solutions while simultaneously displaying increased behavioral biases for preference-based problems. &nbsp,

Managerial implications: First, our results highlight that managers will obtain the greatest benefits from deploying GPT to workflows leveraging established formulas.

Second, that GPT displayed a high level of response consistency across the standard, inventory, and non-inventory operational contexts provides optimism that LLMs can offer reliable support even when details of the decision and problem contexts change.

Third, although selecting between models, like GPT-3.5 and GPT-4, represents a trade-off in cost and performance, our results suggest that managers should invest in higher-performing models, particularly for solving problems with objective solutions.

Funding: &nbsp, This work was supported by the Social Sciences and Humanities Research Council of Canada]Grant SSHRC 430-2019-00505]. The authors also gratefully acknowledge the Smith School of Business at Queen’s University for providing funding to support Y. Chen’s postdoctoral appointment.

Share This Post

Subscribe To Our Newsletter

Get updates and learn from the best

More To Explore

Neuroscience Articles

AI Thinks Like Us: Flaws, Biases, and All, Study Sees

About this AI and cognition research news

Share This Post

Subscribe To Our Newsletter

Get updates and learn from the best

More To Explore

Listening Builds Trust, But Stories Change Minds

Study Shows Brain Transistors That Drive Political Passion and Intensity

Long-Term Mental Damage is a Result of Heavy Alcohol Use

Do You Want To Boost Your Business?

drop us a line and keep in touch

Get Started

Follow Us