The researchers started with the GSM8K's standardized set of 8,000 grade-school level mathematics word problems, a common benchmark for testing LLMs. Then they slightly altered the wording without ...
to make word problems more approachable, said Kevin Dykema, the immediate past president of the National Council of Teachers of Mathematics and an 8th grade math teacher in Mattawan, Michigan.