Connect with us

News

AI Language Models Display Human-like Cognitive Limitations, New Study Reveals

Artificial intelligence cognitive comparison showing digital neural network overlaid on human brain model, illustrating AI cognitive decline study findings

Consequently, a groundbreaking study has revealed surprising limitations in artificial intelligence systems. Moreover, researchers have discovered that leading AI language models show cognitive patterns similar to human mental decline. As a result, this finding raises important questions about AI’s future in healthcare and decision-making applications.

Background and Context

Notably, OpenAI’s ChatGPT emerged just two years ago. Since then, it has transformed how people interact with AI, enabling collaboration on various tasks from poetry to homework. Furthermore, ChatGPT represents just one of many AI programs that now respond to queries with remarkable human-like abilities.

However, this human-like resemblance extends beyond basic interactions. Recently, Israeli researchers discovered that LLMs actually experience a form of cognitive decline over time, much like humans do.

Research Methodology and Findings

Research Team and Approach

Initially, the research team gathered extensive data through systematic testing. Specifically, neurologists Roy Dayan and Benjamin Uliel from Hadassah Medical Center partnered with Gal Koplewitz, a data scientist at Tel Aviv University. Together, they evaluated several AI models using standard neurological tests. In particular, they assessed ChatGPT versions 4 and 4o, two Gemini variants, and Claude 3.5.

Initial Results

Subsequently, the findings revealed concerning patterns of decline. Therefore, the researchers concluded that these patterns mirror “neurodegenerative processes in the human brain.” Here are the key results:

  • First, ChatGPT 4o led with 26/30 points
  • Meanwhile, ChatGPT 4 and Claude achieved 25/30 points
  • Finally, Gemini scored significantly lower at 16/30 points
Comparisons of five LLM MoCA scores. (Dayan et al., BMJ, 2025)

Technical Analysis

Understanding AI Limitations

Despite their sophisticated responses, LLMs fundamentally operate like advanced predictive text systems. Although they process information quickly, they often struggle to distinguish between meaningful content and nonsense. Additionally, their statistical approach to text generation differs significantly from human cognitive processes.

Cognitive Assessment Results

Visuospatial Performance

The research revealed significant weaknesses across all models. For instance, they struggled with basic tasks like:

  • Trail-making exercises
  • Cube design copying
  • Clock drawing activities
Attempts to draw a Necker cube (top left) by a human (top right) and ChatGPT versions 4 (bottom left) and 4o (bottom right). (Dayan et al., BMJ, 2025)

Emotional and Spatial Intelligence

Throughout testing, the AI models showed concerning limitations. For example, Claude responded to location questions with vague statements like “the specific place would depend on your location.” Similarly, all models displayed limited empathy during specialized tests, suggesting potential parallels with frontotemporal dementia symptoms.

Future Implications and Development

Generational Progress

Nevertheless, researchers noted consistent improvements across AI generations. While earlier versions showed more limitations, newer models demonstrate enhanced capabilities. Yet, the team emphasizes that these systems cannot receive actual dementia diagnoses.

Impact on Clinical Medicine

Therefore, these findings challenge current assumptions about AI in healthcare. Although AI shows promise, its limitations in visual interpretation and spatial reasoning require careful consideration.

Implementation Guidelines

Current Recommendations

Organizations implementing AI should follow these key guidelines:

  • First, regularly assess AI capabilities
  • Then, maintain clear documentation of limitations
  • Furthermore, develop robust usage protocols
  • Finally, ensure human oversight

Future Development Priorities

Moving forward, developers should focus on:

  • Primarily, enhancing visual processing
  • Additionally, improving contextual understanding
  • Moreover, developing better emotional intelligence
  • Lastly, strengthening decision-making capabilities

Looking Ahead

In conclusion, AI technology continues to advance rapidly. While future models may achieve perfect cognitive scores, current systems require careful oversight. Therefore, users should approach AI advice with appropriate skepticism.

Conclusion

Finally, this research offers crucial insights into AI capabilities and limitations. Although continued development promises improvements, understanding current constraints remains essential for proper implementation.


This research was published in BMJ and includes insights from neurologists Roy Dayan and Benjamin Uliel from Hadassah Medical Center, and data scientist Gal Koplewitz from Tel Aviv University.