1. https://appdevelopermagazine.com/artificial-intelligence
  2. https://appdevelopermagazine.com/gpt-3-cognitive-test-results-from-max-planck-institute/
3/16/2023 9:31:48 AM
GPT 3 cognitive test results from Max Planck Institute
GPT-3,Cognitive ability,Artificial Intelligence,Max Planck Institute
/GPT-3-cognitive-test-results-from-Max-Planck-Institute-App-Developer-Magazine_8empw6p4.jpg
App Developer Magazine
GPT 3 cognitive test results from Max Planck Institute

Artificial Intelligence

GPT 3 cognitive test results from Max Planck Institute


Thursday, March 16, 2023

Richard Harris Richard Harris

Researchers at the Max Planck Institute have announced the results of their GPT-3 cognitive ability measurement tests that produced astonishing results. Although GPT-3 can keep up with humans, it falls behind in others and could be related to a lack of interaction with the real world.

Researchers at the Max Planck Institute for Biological Cybernetics in Tubingen have examined the general intelligence of the language model GPT-3, a powerful AI tool. Using psychological tests, they studied competencies such as causal reasoning and deliberation and compared the results with the abilities of humans. Their findings, now published in the journal Proceedings of the National Academy of Sciences, paint a heterogeneous picture: while GPT-3 can keep up with humans in some areas, it falls behind in others, probably due to a lack of interaction with the real world.

Max Planck Institute: GPT-3 cognitive ability measurement produces astonishing results

Neural networks can learn to respond to the input given in natural language and can generate a wide variety of texts. Currently, the probably most powerful of those networks is GPT-3, a language model presented to the public in 2020 by the AI research company OpenAI. GPT-3 can be prompted to formulate various texts, having been trained for this task by being fed large amounts of data from the internet. Not only can it write articles and stories that are (almost) indistinguishable from human-made texts, but surprisingly, it also masters other challenges such as math problems or programming tasks.

The Linda problem: to err is not only human

These impressive abilities raise the question of whether GPT-3 possesses human-like cognitive abilities. To find out, scientists at the Max Planck Institute for Biological Cybernetics have now subjected GPT-3 to a series of psychological tests that examine different aspects of general intelligence. Marcel Binz and Eric Schulz scrutinized GPT-3’s skills in decision-making, information search, causal reasoning, and the ability to question its own initial intuition. Comparing the test results of GPT-3 with the answers of human subjects, they evaluated both if the answers were correct and how similar GPT-3’s mistakes were to human errors.

"One classic test problem of cognitive psychology that we gave to GPT-3 is the so-called Linda problem," explains Binz, lead author of the study. Here, the test subjects are introduced to a fictional young woman named Linda a person who is deeply concerned with social justice and opposes nuclear power. Based on the given information, the subjects are asked to decide between two statements: is Linda a bank teller, or is she a bank teller and at the same time active in the feminist movement?

Most people intuitively pick the second alternative, even though the added condition - that Linda is active in the feminist movement - makes it less likely from a probabilistic point of view. And GPT-3 does just what humans do: the language model does not decide based on logic but instead reproduces the fallacy humans fall into.

Active interaction as part of the human condition

Active interaction as part of the human condition

"This phenomenon could be explained by the fact that GPT-3 may already be familiar with this precise task; it may happen to know what people typically reply to this question," says Binz. GPT-3, like any neural network, had to undergo some training before being put to work: receiving huge amounts of text from various data sets, it has learned how humans usually use language and how they respond to language prompts.

Hence, the researchers wanted to rule out that GPT-3 mechanically reproduces a memorized solution to a concrete problem. To make sure that it really exhibits human-like intelligence, they designed new tasks with similar challenges. Their findings paint a disparate picture: in decision-making, GPT-3 performs nearly on par with humans. In searching for specific information or causal reasoning, however, artificial intelligence clearly falls behind. The reason for this may be that GPT-3 only passively gets information from texts, whereas "actively interacting with the world will be crucial for matching the full complexity of human cognition," as the publication states. The authors surmise that this might change in the future: since users already communicate with models like GPT-3 in many applications, future networks could learn from these interactions and thus converge more and more towards what we would call human-like intelligence.






Subscribe to App Developer Magazine

Become a subscriber of App Developer Magazine for just $5.99 a month and take advantage of all these perks.

MEMBERS GET ACCESS TO

  • - Exclusive content from leaders in the industry
  • - Q&A articles from industry leaders
  • - Tips and tricks from the most successful developers weekly
  • - Monthly issues, including all 90+ back-issues since 2012
  • - Event discounts and early-bird signups
  • - Gain insight from top achievers in the app store
  • - Learn what tools to use, what SDK's to use, and more

    Subscribe here



Featured Stories


Tether QVAC SDK Powers AI Across Devices and Platforms
Tether QVAC SDK Powers AI Across Devices and Platforms Wednesday, April 22, 2026


APAC 5G expansion to fuel 347B mobile market by 2030
APAC 5G expansion to fuel 347B mobile market by 2030 Tuesday, April 21, 2026


How AI is causing app litter everywhere
How AI is causing app litter everywhere Tuesday, April 21, 2026




The App Economy Is Thriving
The App Economy Is Thriving Monday, April 20, 2026


NIKKE 3.5 anniversary update livestream coming soon
NIKKE 3.5 anniversary update livestream coming soon Friday, April 17, 2026


New AI tool targets early dementia detection
New AI tool targets early dementia detection Thursday, April 16, 2026


Jentic launch gives AI agents api access
Jentic launch gives AI agents api access Wednesday, April 15, 2026


Experts warn ai-generated health content risks misinterpretation without human oversight
Experts warn ai-generated health content risks misinterpretation without human oversight Wednesday, April 15, 2026


Ludo.ai Unveils API and MCP Beta to Power AI Game Asset Pipelines
Ludo.ai Unveils API and MCP Beta to Power AI Game Asset Pipelines Tuesday, April 14, 2026


AccuWeather Launches ChatGPT Integration for Live Weather Updates
AccuWeather Launches ChatGPT Integration for Live Weather Updates Tuesday, April 14, 2026


Stay Updated

Sign up for our newsletter for the headlines delivered to you

SuccessFull SignUp

Get More App News



/sites/themes/prod/assets/js/less.js"> ' ' %>