← Back

MATH-500

other 1 mention from 1 sources

A mathematical reasoning benchmark dataset used to evaluate AI model performance on solving math problems.

1

sources

Mentioned by

All mentions

Sebastian Raschka uses this ✓ High confidence
"I was training the Qwen 3 base model with RLVR on MATH-500. The base model had an accuracy of about 15%."

Attribution: Sebastian mentions using MATH-500 as a benchmark for training and evaluating the Qwen 3 model