LeanSolver: Solving theorems through Large Language Models and Search

Improving Theorem Proving with Proof Assistants and Sequential Monte Carlo in Large Language Models


Abstract

We consider a subset of simple proving exercises from the Lean 4 tutorials. Each exercise consists of a statement, and the task is to construct a proof term of the desired type using tactics.
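For illustration, such an exercise might look as follows; this is a minimal hypothetical example in Lean 4, not one drawn from the tutorials themselves:

    -- The statement is given; a proof term of type p ∧ q is
    -- constructed interactively through tactics.
    example (p q : Prop) (hp : p) (hq : q) : p ∧ q := by
      constructor    -- split the conjunction into two goals
      · exact hp     -- close the left goal
      · exact hq     -- close the right goal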
Large Language Models on their own are known to produce syntactically correct text, but they fail to produce semantically correct text when very similar solutions are absent from the data they were trained on.
This work reviews the Sequential Monte Carlo with Expectation-Maximization (SMX) algorithm for general reinforcement learning problems and applies it to theorem proving with LLMs.
This work shows that the SMX algorithm is applicable to tasks where the output can be formally verified, allowing the computed reward to steer the reasoning process.
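As a rough sketch of this idea, the loop below couples an LLM proposal step to a formal checker whose score acts as the reward. The names generate_step and lean_check are hypothetical placeholders, not the thesis's actual interfaces, and the snippet illustrates generic Sequential Monte Carlo with verifier-weighted resampling rather than the full SMX algorithm:

    import random

    def generate_step(statement, partial_proof):
        # Placeholder for an LLM call proposing the next tactic (hypothetical).
        return random.choice(["intro h", "constructor", "simp", "exact hp"])

    def lean_check(statement, tactics):
        # Placeholder for invoking the proof assistant; returns a nonnegative
        # reward, e.g. based on whether the partial proof still elaborates
        # (hypothetical scoring).
        return random.random()

    def smc_prove(statement, num_particles=8, max_steps=16):
        # Particles are partial tactic sequences.
        particles = [[] for _ in range(num_particles)]
        for _ in range(max_steps):
            # Proposal: extend each partial proof with a candidate tactic.
            particles = [p + [generate_step(statement, p)] for p in particles]
            # Weighting: the verifier's feedback defines the reward.
            weights = [lean_check(statement, p) for p in particles]
            if sum(weights) == 0:
                continue  # no particle made verifiable progress this round
            # Resampling: duplicate high-reward particles, drop low-reward ones.
            particles = random.choices(particles, weights=weights, k=num_particles)
        return particles

The random.choices call is the resampling step whose role is addressed next.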
We furthermore formally prove a theorem that is central to showing that the resampling step in SMX is necessary to mitigate sample impoverishment.
The theoretical part of this thesis builds on the theory of the SMX paper by verifying the derivation of the E-step, starting from the Evidence Lower Bound (ELBO); this derivation was missing from the original SMX paper.
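For reference, the bound in question takes the standard form below. The notation (optimality variable O, trajectory τ, variational distribution q) is generic RL-as-inference notation assumed here, and the thesis's own derivation may differ in detail. By Jensen's inequality,

    \log p(O) = \log \int p(O, \tau) \, d\tau
              = \log \mathbb{E}_{q(\tau)}\!\left[ \frac{p(O, \tau)}{q(\tau)} \right]
              \geq \mathbb{E}_{q(\tau)}\!\left[ \log p(O, \tau) - \log q(\tau) \right]
              =: \mathrm{ELBO}(q),

with equality exactly when q(τ) = p(τ | O), so maximizing the ELBO over q recovers the posterior, which is the E-step.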
