Solving Quantitative Reasoning Problems with Language Models

Aitor Lewkowycz

Anders Andreassen

David Martin Dohan

Ethan S Dyer

Henryk Michalewski

Vinay Ramasesh

Ambrose Slone

Cem Anil

Imanol Schlag

Theo Gutman-Solo

Yuhuai Wu

Behnam Neyshabur

Guy Gur-Ari

Vedant Misra

NeurIPS (2022)

Abstract

Language models have achieved remarkable performance on a wide range of tasks that require natural language understanding. Nevertheless, state-of-the-art models have generally struggled with tasks that require quantitative reasoning, such as solving mathematics, science, and engineering problems at the college level. To help close this gap, we introduce Minerva, a large language model pretrained on general natural language data and further trained on technical content. The model achieves state-of-the-art performance on technical benchmarks without the use of external tools. We also evaluate our model on over two hundred undergraduate-level problems in physics, biology, chemistry, economics, and other sciences that require quantitative reasoning, and find that the model can correctly answer nearly a third of them.
