
Revolutionary AI Model Outshines ChatGPT in Reasoning Skills!
2025-08-27
Author: John Tan
A Game-Changer in AI Development
In a groundbreaking advancement, scientists have unveiled a new artificial intelligence (AI) model that surpasses renowned large language models (LLMs) like ChatGPT in reasoning capabilities.
Introducing the Hierarchical Reasoning Model (HRM)
Dubbed the Hierarchical Reasoning Model (HRM), this innovative AI draws inspiration from the intricate ways the human brain processes information across different timescales. Researchers at Singapore-based AI company Sapient designed it to be both efficient and effective, needing significantly fewer parameters than its competitors.
Impressive Stats!
The HRM operates with just 27 million parameters and requires a mere 1,000 training samples, starkly contrasting with other leading LLMs that often feature billions or even trillions of parameters, such as the upcoming GPT-5, rumored to have between 3 and 5 trillion parameters.
Tackling the Toughest Tests!
HRM’s capabilities were rigorously evaluated using the ARC-AGI benchmark, a formidable test aimed at gauging progress towards artificial general intelligence (AGI). In these tests, HRM achieved a remarkable score of 40.3% in ARC-AGI-1, leaving competitors like OpenAI’s o3-mini-high trailing at 34.5%, and even outperforming in the more challenging ARC-AGI-2, scoring 5% compared to o3-mini-high’s 3%.
Breaking Down Complexities!
While many advanced LLMs rely on chain-of-thought (CoT) reasoning—breaking complex problems into simpler, manageable steps—Sapient researchers point out crucial limitations of CoT, including its tendency for "brittle task decomposition" and considerable data demands. Instead, HRM adopts a novel approach, tackling reasoning tasks directly through a two-tiered process.
How HRM Works!
HRM elegantly divides its function into a high-level module for abstract planning and a low-level one for rapid computations, mimicking the brain's processing. This model employs a technique called iterative refinement, enhancing solution accuracy by fine-tuning initial estimates across multiple short "thinking" bursts. Each burst evaluates whether to keep processing or finalize the answer.
Unmatched Performance!
The HRM has set new benchmarks by excelling at challenging tasks like complex Sudoku puzzles, a feat that has left conventional LLMs struggling. Its exceptional ability to navigate optimal paths through mazes showcases its superior reasoning skills.
What's Next?
While the study supporting these findings awaits peer review, the ARC-AGI benchmark organizers have taken steps to replicate these results, underscoring the promising future of this revolutionary AI model.