Anthropic claims Claude 3 better than competition

Anthropic, in case you have not heard of it, is a company established by several ex-OpenAI engineers. At a certain point, they saw things differently than their employer and decided to build a separate AI company. They’ve been quite successful so far, considering the more modest investments ($7.3 billion in 2023, compared to OpenAI’s $11.3 billion from various funds and $13 billion from Microsoft) and the lack of awareness-boosting scandals (Altman’s fired-hired maneuvers). Recently, Anthropic released the next generation of its Claude LLMs and stated that at least one of them beats the competition.

Anthropic’s Claude 3 AI family

In a move that looks like a trend-setting marketing gimmick, Anthropic calls the set of large language models it has released a family. The siblings are, smartest to fastest, Opus, Sonnet, and Haiku. 

All three models, according to the developer, can handle complex requests and give near-instant answers. In its post of March 4, 2024, Anthropic sells the family as a perfect solution for a customer service operation, pointing out that Haiku is the most cost-effective choice (yet somewhat lacking in more complicated situations involving multi-step instructions), and that Opus is the king of the hill, capable of delivering “similar speeds to Claude 2 and 2.1, but with much higher levels of intelligence.”

As expected, the developer put its creations through tests to see how they turned out, and, pleasantly surprised, discovered that Claude 3 models work better than competing AIs, as shown by their evaluation benchmark scores. All siblings, according to Anthropic, exhibit “near-human levels of comprehension and fluency on complex tasks, leading the frontier of general intelligence.”

Claude 3 benchmarking scorecard. Image from Anthropic

Anthropic compared its products to OpenAI’s GPT-4 and GPT-3.5, and Google’s Gemini 1.0 Ultra and Gemini 1.0 Pro. The scorecard published by the company shows that Opus, the smartest of the Claude 3 family, outperforms the competition in most disciplines.

AI race: what’s next?

The growing power of large language models isn’t unexpected. The rate of technological development today is unparalleled, and it’s gaining momentum rather than slowing down. What’s more interesting is that the developer suggests a rather specific use case for its product instead of marketing it as a Swiss army knife (although you won’t find this sort of precision on its webpage). Thus, we may be looking at a trend of purpose-designed AIs, or possibly forks of all-purpose LLMs, that would eventually cover every field where they can sensibly be applied, from answer-finding through creative authoring and recognition to homework checking and tutoring.
