Proverbot9001

A bot for proving.

You can find the paper and talk video at our website.

Prerequisites

MacOS

Check your python version with python --version in the terminal. If your version is older than Python 3.7, or the python command isn't found, install python through the python website.
Make sure pip, the python package manager, is available, by running in your terminal: python -m ensurepip.
Install Homebrew from their website.
Install wget, git, opam, rustup, GNU awk, and graphviz through Homebrew: brew install wget git opam rustup-init gawk graphviz && rustup-init

Linux

Check your python version with python --version in the terminal. If your version is older than Python 3.7, or the python command isn't found, install python through your package manager.
1. On Ubuntu, that's:
```
sudo apt install python3 python3-dev python3-pip
```
Make sure pip, the python package manager, is available, by running in your terminal: python -m ensurepip.

Install git, opam, rustup, and graphviz using your package manager.

On Ubuntu, that's:

sudo apt install software-properties-common
sudo add-apt-repository ppa:avsm/ppa
sudo apt update
sudo apt install git opam rustup graphviz libgraphviz-dev

Windows

Windows support is more experimental, but you can try installing prereqs through:

https://gitforwindows.org/

https://fdopen.github.io/opam-repository-mingw/installation/

https://graphviz.gitlab.io/_pages/Download/Download_windows.html

https://www.python.org/downloads/windows/

or use Windows Subsystem for Linux

Setting up environments for distributed RL on Unity

modules load openmpi/4.1.3+cuda11.6.2
git clone --recursive git@github.com:pytorch/pytorch.git
If using conda: `conda install cmake ninja

pip install -r requirements.txt
pip install mkl mkl-include
pip install pytorch magma-cuda117
git submodule sync && git submodule update --init --recursive

If using conda: export CMAKE_PREFIX_PATH=${CONDA_PREFIX:-"$(dirname $(which conda))/../"}
CMAKE_C_COMPILER=$(which mpicc) CMAKE_CXX_COMPILER=$(which mpicxx) python setup.py develop
Important make sure you run module load opam/2.1.2 graphviz/2.49.0+py3.8.12 openmpi/4.1.3+cuda11.6.2 before each run.

Getting Started with RL4Proof

git submodule init CompCert && git submodule update
cd CompCert
./configure x86_64-linux
make
make data/compcert-scrape.txt && make data/scrape-test.txt

Note the make commands are to be ran under CompCert directory. If you are using a Unity cluster, srun -c8 and use -j8 arguments for make.

Troubleshooting

Make sure you are using Coq 8.10.2.
If anything went wrong and you need to re-make the CompCert, run make clean under CompCert directory and repeat the above steps again.

Running the script

Checklist

Make sure CompCert making is done.
Have data/polyarg-weights-develop.dat and data/term2vec-weights-59.dat included in your local repo.

Generating tasks

Run

python src/gen_rl_tasks.py --prelude=CompCert \
      --supervised-weights=data/polyarg-weights-develop.dat -o rl_train_jobs.json compcert_projs_splits.json

To generate training tasks. To generate test tasks, specify --data-partition='test'. To run faster, use src/gen_rl_tasks_cluster.py instead.

Filter data by length

By default, gen_rl_tasks.py extracts tasks by making 16 predictions at each state, and seeing if the solution tactic matches any of them. Also by default, rl.py only makes 5 predictions at each state to choose between. Therefore, if using both defaults, make sure to filter task length to at least 5 and prediction width to at most 5. You can use the jq tool to filter task length and width. For example, to filter as specified above, run the following command:

jq -c "select(.largest_prediction_idx <= 5 and .target_length >= 5)" rl_train_jobs.json > rl_train_jobs_len5_wid5.json

Fill in task curriculum

Run

python src/fill_in_task_curriculum.py $INPUT_FILE $OUTPUT_FILE

to fill in task currriculum. INPUT_FILE should be the output of the jq tool. The output file is used for the tasks-file parameter in the next step.

Run Reinforcement Learning Script

python src/rl.py --supervised-weights=data/polyarg-weights-develop.dat --coq2vec-weights=data/term2vec-weights-59.dat compcert_projs_splits.json \
         --tasks-file=rl_train_jobs.json --prelude=./CompCert --backend=serapi --allow-partial-batches \
         --learning-rate=0.0001 -n10 -o data/rl_weights-compcert-5.dat -s5

You may specify the number of episode to be ran to by passing that to -n.

Running Distributed RL

python src/distributed_rl.py --mem=8G --num-actors=2 --supervised-weights=data/polyarg-weights-develop.dat --coq2vec-weights=../coq2vec/term2vec-weights-59.dat compcert_projs_splits.json --prelude=./CompCert --backend=serapi --gamma=0.7 -s7 -p5 --learning-rate=0.000005 -n24 -o data/rl_weights_distributed_test.dat --tasks-file=single_task.json --resume=yes

Name		Name	Last commit message	Last commit date
Latest commit History 3,990 Commits
CompCert @ 76a4ff8		CompCert @ 76a4ff8
analysis		analysis
bench		bench
coq-projects		coq-projects
coq_serapy @ d1c9e61		coq_serapy @ d1c9e61
data		data
dataloader		dataloader
kube		kube
reports		reports
sample-files		sample-files
src		src
stubs		stubs
.gitignore		.gitignore
.gitmodules		.gitmodules
.test.sh		.test.sh
.travis.yml		.travis.yml
COPYING		COPYING
Coqgym task creation notes.txt		Coqgym task creation notes.txt
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
archive-reports.sh		archive-reports.sh
beta.txt		beta.txt
build_coq_projects.sh		build_coq_projects.sh
compcert_projs_splits.json		compcert_projs_splits.json
coqgym_projs_splits.json		coqgym_projs_splits.json
coqide		coqide
fix_proof_using_linearized.sh		fix_proof_using_linearized.sh
full-run.sh		full-run.sh
full-test.sh		full-test.sh
get-scrape-failures-coq-projects.sh		get-scrape-failures-coq-projects.sh
get-scrape-failures.sh		get-scrape-failures.sh
get_column.py		get_column.py
install_coqgym_deps.sh		install_coqgym_deps.sh
kill_csv_newlines.py		kill_csv_newlines.py
kill_gpu_users.sh		kill_gpu_users.sh
known-good-dependency-versions.md		known-good-dependency-versions.md
lemma_name_from_statement.py		lemma_name_from_statement.py
new_unifysl_coqgym_projs_splits.json		new_unifysl_coqgym_projs_splits.json
opam-make.sh		opam-make.sh
parallel-search-report.sh		parallel-search-report.sh
partial-test.sh		partial-test.sh
predictor.md		predictor.md
proverbot9001-logo-with-text.png		proverbot9001-logo-with-text.png
proverbotlogo-01.png		proverbotlogo-01.png
reinforce.sh		reinforce.sh
requirements.txt		requirements.txt
scrape_coq_projects.sh		scrape_coq_projects.sh
search-run.sh		search-run.sh
strip_proofs_over_512_steps.sh		strip_proofs_over_512_steps.sh
train-run.sh		train-run.sh
train_filtered_distributed_rl_coqgym.sh		train_filtered_distributed_rl_coqgym.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Proverbot9001

Prerequisites

MacOS

Linux

Windows

Setting up environments for distributed RL on Unity

Getting Started with RL4Proof

Troubleshooting

Running the script

Checklist

Generating tasks

Filter data by length

Fill in task curriculum

Run Reinforcement Learning Script

Running Distributed RL

About

Licenses found

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Proverbot9001

Prerequisites

MacOS

Linux

Windows

Setting up environments for distributed RL on Unity

Getting Started with RL4Proof

Troubleshooting

Running the script

Checklist

Generating tasks

Filter data by length

Fill in task curriculum

Run Reinforcement Learning Script

Running Distributed RL

About

Resources

License

Licenses found

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages