The Scarlet Pimp
https://www.maximumtruth.org/p/ais-ranked-by-iq-ai-passes-100-iq
Takeaway #1: Claude-3 stuns — it represents a new leap in AI
I was already impressed by how ChatGPT-4 went from “unscoreable” to IQ 85, after I verbalized the questions. I was halfway through writing this post when Claude-3 came out, yesterday.
I’m amazed by its score.
Also, look at the consistent progression:
Claude-1 was hardly better than random. It got 6 answers right, giving it ~64 IQ.
Claude-2 scored 6 additional points per test (worth ~18 IQ points).
Claude-3 scored yet another 6.5 points, worth ~19 more IQ points, bringing it above the human average.
The symmetric increases make me wonder if Anthropic is releasing versions based on internal benchmarks that happen to closely correlate with this IQ measure.
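To make the progression concrete, here is a minimal sketch that adds up the figures quoted above. The numbers are the approximate ones from the post. The per-question rate is derived from them and is not something the author states, and the function and variable names are just for illustration.

```python
# Approximate figures quoted in the post (all IQ values are "~" estimates).
CLAUDE1_CORRECT = 6   # answers right for Claude-1
CLAUDE1_IQ = 64       # estimated IQ for Claude-1

# (model, extra correct answers per test, extra IQ points) for each later release
STEPS = [
    ("Claude-2", 6.0, 18),
    ("Claude-3", 6.5, 19),
]


def cumulative_scores(base_correct, base_iq, steps):
    """Return (model, total correct, estimated IQ, IQ points per extra answer)."""
    rows = [("Claude-1", base_correct, base_iq, None)]
    correct, iq = base_correct, base_iq
    for name, extra_correct, extra_iq in steps:
        correct += extra_correct
        iq += extra_iq
        # Derived rate: how many IQ points each additional correct answer was worth.
        rows.append((name, correct, iq, extra_iq / extra_correct))
    return rows


rows = cumulative_scores(CLAUDE1_CORRECT, CLAUDE1_IQ, STEPS)
for name, correct, iq, rate in rows:
    rate_text = "n/a" if rate is None else f"{rate:.2f}"
    print(f"{name}: ~{correct} correct, ~{iq} IQ, ~{rate_text} IQ points per extra answer")
```

On these rounded numbers, each extra correct answer is worth roughly 3 IQ points, and the running total lands at about 101 for Claude-3, which matches the "above the human average" claim.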
Claude 3 is one of the most capable and human-like artificial intelligence models I’ve ever encountered. It can reason to a certain extent, is aware of its own limitations, and can speculate on its potential.
I went beyond these initial prompts and explored deeper questions: how AI can be used to solve problems, whether Claude could maintain an academic style of writing for a silly subject like pineapple on pizza, and how it would handle the trolley problem.
In all encounters the responses were well reasoned, thoughtful and surprisingly natural. It wasn’t shy about weighing the consequences, and on the trolley problem it even suggested prioritizing the passenger of a driverless car, since the AI doing the driving is tasked specifically with protecting them.
Unlike previous iterations of Claude, the new version also has a broader view of the world: it can analyze images, graphs and other forms of data input, which I think has contributed to this more natural perspective.
I don’t think Claude is an Artificial General Intelligence (AGI); that is a much higher bar. But I do think it shows some early signs of general intelligence, and interacting with it is much like interacting with a human.