Model Math Measer - Search News

MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math Data

Large language models (LLMs) have significantly advanced natural language understanding and demonstrated strong problem-solving abilities. Despite these successes, most LLMs still struggle with ...

TechRepublic

OpenAI Model Wins Gold at International Mathematical Olympiad – or Did It?

A Google DeepMind researcher and OpenAI’s former CTO are posing questions about the validity of OpenAI’s claim about its gold-medal score. OpenAI’s latest model has achieved a gold-level score at the ...

Neowin

DeepSeek launches new math-oriented model to solve secrets of the universe

DeepSeek made waves in early 2025, launching one of the world's first free-to-access thinking models. Now, the Chinese firm has just released DeepSeekMath-V2 with the objective of achieving ...

The Verge

Microsoft’s small math AI model does math better than the big boys.

Microsoft found that small language models can exceed the performance of much larger ones when trained to specialize in a single area. Researchers fine-tuned the Mistral 7B model to create Orca-Math, ...

Ars Technica

Telling AI model to “take a deep breath” causes math scores to soar in study

Google DeepMind researchers recently developed a technique to improve math ability in AI language models like ChatGPT by using other AI models to improve prompting—the written instructions that tell ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results