NeuroSAT

NeuroSAT is an experimental SAT solver that is learned using single-bit supervision only. We train it as a classifier to predict the satisfiability of random SAT problems, and it learns to search for satisfying assignments to explain that single bit of supervision. When it guesses sat, we can almost always decode the satisfying assignment it has found from its activations. It can often find solutions to problems that are bigger, harder, and from entirely different domains than those it saw during training.

Specifically, we train it as a classifier to predict satisfiability on random problems that look like this:

<p align="center"><img src="images/problems/satrand_n=40_pk2=0.30_pg=0.40_t=0_sat=1.dimacs.dot.svg"></p>

When making a prediction about a new problem, it guesses unsat with low confidence (light blue) until it finds a satisfying assignment, at which point it guesses sat with very high confidence (red) and converges:

<p align="center"><img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t1.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t2.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t3.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t4.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t5.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t6.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t7.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t8.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t9.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t10.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t11.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t12.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t13.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t14.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t15.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t16.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t17.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t18.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t19.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t20.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t21.png" width=30 padding=5px> <img 
src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t22.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t23.png" width=30 padding=5px> <img src="images/runs/run3022805014702275039_problem=data_dir=simple_n20_npb=0_nb=200_nr=40_rand=0_seed=0_t=1.pkl_v60_axis0_dpi10/round_t24.png" width=30 padding=5px></p> <p align="center">Iteration &rarr;</p>

At convergence, the literal embeddings cluster according to the solution it finds:

<p align="center"><img src="images/pca/run3022805014702275039_problem=data_dir=dimacs_to_visualize_npb=10000_nb=47_num=1_nr=26_rand=0_seed=0_size=400/pca_t25.png" width=300></p>

We can almost always recover the solution by clustering the literal embeddings, thus making NeuroSAT an end-to-end SAT solver.
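As a rough illustration of that decoding step, the sketch below clusters the final literal embeddings into two groups and tries both ways of mapping clusters to truth values. The array layout (positive literals in the first `n_vars` rows), the use of scikit-learn, and the function name are assumptions, not the repository's actual decoding code.

```python
# Hedged sketch of decoding a satisfying assignment from literal embeddings
# via 2-means clustering (illustrative; layout and library choice assumed).
from sklearn.cluster import KMeans

def decode_assignment(literal_embs, clauses, n_vars):
    # literal_embs: (2 * n_vars, d) array; rows 0..n_vars-1 are assumed to be
    # the positive literals x_1..x_n, rows n_vars..2*n_vars-1 their negations.
    # clauses: DIMACS-style lists of signed ints.
    labels = KMeans(n_clusters=2, n_init=10).fit_predict(literal_embs)
    for true_cluster in (0, 1):
        # Variable x_i is True iff its positive literal lies in the cluster
        # currently treated as "true".
        assignment = [bool(labels[i] == true_cluster) for i in range(n_vars)]
        satisfied = all(
            any((lit > 0) == assignment[abs(lit) - 1] for lit in clause)
            for clause in clauses
        )
        if satisfied:
            return assignment
    return None  # neither cluster-to-truth-value mapping satisfies the formula
```

Since both cluster-to-truth-value mappings are checked against the formula, any assignment that is returned has already been verified; a wrong guess costs nothing more than a failed check.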

At test time, it can often find solutions to random problems that are much bigger than those it trained on, as well as to SAT encodings of problems from entirely different domains:

Bigger random problems:

<p align="center"><img src="images/problems/satrand_n=200_pk2=0.30_pg=0.40_t=0_sat=1.dimacs.dot.svg"></p>

Graph coloring:

<p align="center"><img src="images/problems/kcolor_k5_graph=forest_fire_n10_p75_t10.gml.dimacs.dot.svg"></p>

Clique detection:

<p align="center"><img src="images/problems/kclique_k5_graph=forest_fire_n10_p75_t10.gml.dimacs.dot.svg"></p>

Dominating set:

<p align="center"><img src="images/problems/domset_k4_graph=forest_fire_n10_p75_t10.gml.dimacs.dot.svg"></p>

Vertex cover:

<p align="center"><img src="images/problems/kcover_k6_graph=forest_fire_n10_p75_t10.gml.dimacs.dot.svg"></p>

Caveats

Reproducibility

As many readers know too well, facilitating exact reproducibility in machine learning can require a lot of work. NeuroSAT is no exception. We regret that we do not currently provide a push-button way to retrain our exact model on the exact same training data we used in our experiments, though we may provide such functionality in the future depending on the level of interest. For now, we settle for providing our model code, a generator for the distribution of problems we trained on, and enough scaffolding to easily train and test it on small datasets. More utilities will be added in the coming weeks. We hope users will adapt our code to their own infrastructures, improve upon our model, and train it on a greater variety of problems.

Playing with NeuroSAT

The `scripts/` directory includes a few scripts to get started.

  1. `setup.sh` installs dependencies.
  2. `toy_gen_data.sh` generates toy train and test data.
  3. `toy_train.sh` trains a model for a few iterations on the toy training data.
  4. `toy_test.sh` evaluates the trained model on the toy test data.
  5. `toy_solve.sh` tries to solve the toy test problems.
  6. `toy_pipeline.sh` runs `toy_gen_data.sh`, `toy_train.sh`, `toy_test.sh`, and `toy_solve.sh` in sequence.

These scripts can be easily modified to train and test on larger datasets.

Resources

More information about NeuroSAT can be found in the paper [Learning a SAT Solver from Single-Bit Supervision](https://arxiv.org/abs/1802.03685).

Team

Acknowledgments

This work was supported by Future of Life Institute grant 2017-158712.