Bitcoin

Bitcoin

$118,242.53

BTC 0.27%

Ethereum

Ethereum

$3,747.02

ETH 5.41%

  • Login
  • Register
Metaverse Media Group
  • Home
  • Crypto
  • NFTs
  • Artificial Intelligence
  • More
    • Technology
    • Business
    • Newsletter
No Result
View All Result
  • Home
  • Crypto
  • NFTs
  • Artificial Intelligence
  • More
    • Technology
    • Business
    • Newsletter
No Result
View All Result
Metaverse Media Group

OpenAI claims a breakthrough in LLM reasoning on complex math problems

OpenAI claims a breakthrough in LLM reasoning on complex math problems

The Decoderby The Decoder
19 July 2025
OpenAI says its experimental language model has solved International Mathematical Olympiad (IMO) problems at a gold medal level—a possible breakthrough for AI with general reasoning skills. The results have not yet been independently confirmed. The article OpenAI claims a breakthrough in LLM reasoning on complex math problems appeared first on THE DECODER….

summary
Summary

OpenAI says its experimental language model has solved International Mathematical Olympiad (IMO) problems at a gold medal level—a possible breakthrough for AI with general reasoning skills. The results have not yet been independently confirmed.

According to OpenAI researchers Alexander Wei and Noam Brown, the model tackled the IMO 2025 competition, solving the first five of the six official problems and earning 35 out of a possible 42 points.

The IMO is considered the most difficult math competition for high school students, requiring creativity and rigorous logical reasoning. Wei claims this is the first AI model that can “craft intricate, watertight arguments at the level of human mathematicians.”

A step-by-step solution generated by OpenAI’s model for an IMO problem. | Image: Screenshot via X
Recommend our article

The model generated its solutions under standard competition conditions: two 4.5-hour sessions, no outside help, all answers written in natural language, and no tool use. Former IMO medalists graded the responses anonymously. The full solutions are available on GitHub.

THE DECODER Newsletter
The most important AI news straight to your inbox.
✓ Weekly
✓ Cancel at any time

Still room to scale

Unlike DeepMind’s AlphaGeometry, which is built specifically for math, OpenAI’s model is a general-purpose reasoning language model. “We reach this capability level not via narrow, task-specific methodology, but by breaking new ground in general-purpose reinforcement learning and test-time compute scaling,” Wei explains.

Brown confirms that the model relies on “new experimental general-purpose techniques” and scales its compute at test time, though he doesn’t share the technical details.

“o1 thought for seconds. Deep Research for minutes. This one thinks for hours,” Brown notes, pointing out that the new model is more efficient and still has scaling potential. He argues that even a small advantage over human performance can be enough to drive major scientific progress.

Wei says OpenAI has no plans to release this model or a similar one in the coming months, stressing that it’s strictly a research project. He also clarified that while GPT-5 is planned “soon”, it is unrelated to the IMO model, which was developed by a small team led by Wei.

Brown points out that the technology could eventually become a product, and with progress moving so quickly, future versions may be even more advanced. He adds that the results surprised even people inside OpenAI, calling it “a milestone that many considered years away.”

Recommendation

Current models are far behind

The timing of OpenAI’s announcement seems intentional, coming just after current AI models delivered disappointing results at the same competition.

A recent evaluation by the MathArena.ai platform tested several leading models-including Gemini 2.5 Pro, Grok-4, DeepSeek-R1, and even OpenAI’s own o3 and o4-mini-on the IMO 2025 tasks. None of them managed to score the 19 points needed for a bronze medal. Gemini 2.5 Pro came out on top, but with only 13 out of 42 points, while the others performed even worse.

MathArena.ai’s chart shows major language models falling short on 2025 IMO problems. | Image: Screenshot via Matharena.ai

Even with extensive testing, which included a best-of-32 selection process and evaluations by IMO experts, the models showed serious flaws. The results were filled with logical errors, incomplete arguments, and even made-up theorems.

Viewed in this context, OpenAI’s announcement looks like a direct response to the limitations exposed by the MathArena test. While the achievement is significant, its true value will depend on whether the results can be independently reproduced and applied to real scientific problems.

Join our community
Join the DECODER community on Discord, Reddit or Twitter – we can’t wait to meet you.

Read the full article on The-Decoder.com
in AI
Reading Time: 4 mins read
0
0
22
VIEWS
Share on TwitterShare on Facebook

Subscribe to our newsletter

For the latest news & monthly prize giveaways
Join Now

Subscribe to our newsletter

For the latest news & monthly prize giveaways
Join Now
ADVERTISEMENT

Related Posts

Alibaba’s Qwen2.5 only excels at math thanks to memorized training data
AI

Alibaba’s Qwen2.5 only excels at math thanks to memorized training data

5 hours ago
21
FlexOlmo enables organizations to collaboratively train LLMs without data sharing
AI

FlexOlmo enables organizations to collaboratively train LLMs without data sharing

11 hours ago
21
An OpenAI AI model finished second in the AtCoder Heuristics World Finals
AI

An OpenAI AI model finished second in the AtCoder Heuristics World Finals

1 day ago
22

Comments

Please login to join discussion
ADVERTISEMENT

Latest News

  • All
  • Crypto
  • NFTs
  • Technology
  • Business
10 Leading AI Chatbots Predict Bitcoin’s Wild Ride to $1 Million
Crypto

10 Leading AI Chatbots Predict Bitcoin’s Wild Ride to $1 Million

Bitcoin.com News
by Bitcoin.com News
56 minutes ago
19
Ethereum Rockets Past $3.7K as Options Traders Eye $12K Moonshot Bets
Crypto

Ethereum Rockets Past $3.7K as Options Traders Eye $12K Moonshot Bets

Bitcoin.com News
by Bitcoin.com News
3 hours ago
22
Bettors Bet Big on Vance and Newsom: 2028 US Election Race Heats up Before It Even Starts
Crypto

Bettors Bet Big on Vance and Newsom: 2028 US Election Race Heats up Before It Even Starts

Bitcoin.com News
by Bitcoin.com News
4 hours ago
21
Alibaba’s Qwen2.5 only excels at math thanks to memorized training data
AI

Alibaba’s Qwen2.5 only excels at math thanks to memorized training data

The Decoder
by The Decoder
5 hours ago
21
Latam Insights: IMF Denies El Salvador’s Bitcoin Purchase Claims; US-Brazil Conflict Set to Escalate
Crypto

Latam Insights: IMF Denies El Salvador’s Bitcoin Purchase Claims; US-Brazil Conflict Set to Escalate

Bitcoin.com News
by Bitcoin.com News
6 hours ago
21
Bitcoin Price Watch: $117.5K to $118K Consolidation Signals Tension Before Breakout
Crypto

Bitcoin Price Watch: $117.5K to $118K Consolidation Signals Tension Before Breakout

Bitcoin.com News
by Bitcoin.com News
7 hours ago
21
Load More
Next Post
Bitcoin Price Watch: Momentum Cools but Uptrend Remains Intact

Bitcoin Price Watch: Momentum Cools but Uptrend Remains Intact

ADVERTISEMENT

Follow Us

Categories

  • Crypto
  • NFTs
  • AI
  • Technology
  • Business
  • Crypto
  • NFTs
  • AI
  • Technology
  • Business
Subscribe to our Newsletter

© 2022 Metaverse Media Group – The Metaverse Mecca

Privacy and Cookie Policy | Sitemap

Welcome Back!

Sign In with Google
OR

Login to your account below

Forgotten Password? Sign Up

Create New Account!

Sign Up with Google
OR

Fill the forms below to register

*By registering into our website, you agree to the Terms & Conditions and Privacy Policy.
All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In
No Result
View All Result
  • Home
  • Crypto
  • NFTs
  • Artificial Intelligence
  • More
    • Technology
    • Business
    • Newsletter
Bitcoin

Bitcoin

$118,242.53

BTC 0.27%

Ethereum

Ethereum

$3,747.02

ETH 5.41%

  • Login
  • Sign Up
This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.

Subscribe to our newsletter

Get the latest news & win monthly prizes

Subscribe to our newsletter

For the Latest News and Monthly Prize Giveaways

Join Now
Join Now