Program Synthesis with Reinforcement Learning of Structured Edits

Overview

This project uses reinforcement learning to mutate a partially correct or incomplete piece of coding homework into a complete, high-scoring submission (e.g. one where every test input gives the correct output). Our domain is code written in the functional programming language OCaml. The project extracts and preprocesses data from a large database of homework submissions, transforms each submission into an abstract syntax tree (AST), and passes it through a graph neural network (GNN).
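As a rough illustration of the AST-to-graph step, the sketch below flattens a toy tree into a list of node labels and a list of (parent, child) edges, which is the kind of input a GNN typically consumes. The Node class and the ast_to_graph function are hypothetical names chosen for this example; they are not taken from this repository, and the real preprocessing may differ.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Node:
    """Hypothetical AST node: a label plus ordered children."""
    label: str
    children: List["Node"] = field(default_factory=list)


def ast_to_graph(root: Node) -> Tuple[List[str], List[Tuple[int, int]]]:
    """Flatten a tree into node labels and (parent, child) index pairs."""
    labels: List[str] = []
    edges: List[Tuple[int, int]] = []
    stack = [(root, -1)]  # (node, parent index); -1 marks the root
    while stack:
        node, parent = stack.pop()
        index = len(labels)
        labels.append(node.label)
        if parent >= 0:
            edges.append((parent, index))
        # Push children in reverse so they are visited left to right.
        for child in reversed(node.children):
            stack.append((child, index))
    return labels, edges


# Toy tree standing in for the OCaml expression `fun x -> x + 1`.
tree = Node("fun", [Node("x"), Node("+", [Node("x"), Node("1")])])
labels, edges = ast_to_graph(tree)
print(labels)
print(edges)
```

Node indices follow a pre-order traversal, so a parent always receives a smaller index than its children.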

Build Instructions

To set up via Docker, follow these steps:

Install Docker.
Set up run-logger.
Configure your .env file so that the environment variable GRAPHQL_ENDPOINT points to the server you have set up, for example as shown in the line below. Then start direnv by running direnv allow.

GRAPHQL_ENDPOINT=http://server.com:1200/v1/graphql

Create a Docker volume called rl_checkpoint with the command

docker volume create rl_checkpoint

Now you can build the project with Docker by running the following command in the terminal:

bash run.sh <DOCKER_IMAGE_NAME> <DOCKER_VOLUME_MOUNT_DIR> <DESCRIPTION_ON_LOGGER>
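Before launching run.sh, it can help to confirm that GRAPHQL_ENDPOINT is actually set and looks like a URL. The check below is a minimal sketch using only the Python standard library; the function name check_endpoint is an assumption made for this example and is not part of the repository.

```python
import os
from urllib.parse import urlparse


def check_endpoint(env=os.environ) -> str:
    """Return GRAPHQL_ENDPOINT if it is a well-formed http(s) URL."""
    value = env.get("GRAPHQL_ENDPOINT", "")
    parsed = urlparse(value)
    # Require an explicit scheme and a host part.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"GRAPHQL_ENDPOINT is missing or malformed: {value!r}")
    return value


if __name__ == "__main__":
    try:
        print("Using endpoint:", check_endpoint())
    except ValueError as err:
        print(err)
```

Passing a plain dictionary instead of os.environ makes the check easy to try out without touching the real environment.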

Development Instructions

To work on this project on a local machine, you need to install Poetry and opam. You can then run make deps to install all required dependencies.

Visualization Instructions

To visualize the actions that your agent is taking, you can run visualize.sh. This requires a saved model in your Docker volume. If you already have one, run

bash visualize.sh <DOCKER_IMAGE_NAME> <DOCKER_VOLUME_MOUNT_DIR> <LOG_NAME> <RUN_ID>

Code Overview

The main directories serve the following purposes:

agent/: This directory includes the code for our reinforcement learning agent.
clib/: This directory includes the C code for our project. The C code is used for communicating between our Python and OCaml code.
envs/: This directory includes the Python code for our environment. The environment that we are using is in envs/ast_env.py.
ocamllib/: This directory includes the OCaml code for our environment.

Bug Fix Notes

If you suddenly get an error such as a child node not being found, check whether max_num_nodes is large enough for the problem.
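One way to sanity-check this is to count the nodes of the largest program you expect and compare the result with max_num_nodes. The sketch below assumes a toy tree encoded as (label, children) pairs; this encoding and the helper names count_nodes and fits are hypothetical and not taken from the repository.

```python
def count_nodes(tree) -> int:
    """Count nodes in a tree given as (label, [children]) pairs."""
    _label, children = tree
    return 1 + sum(count_nodes(child) for child in children)


def fits(tree, max_num_nodes: int) -> bool:
    """True if the tree has at most max_num_nodes nodes."""
    return count_nodes(tree) <= max_num_nodes


# Toy tree standing in for the OCaml expression `fun x -> x + 1`.
tree = ("fun", [("x", []), ("+", [("x", []), ("1", [])])])
print(count_nodes(tree), fits(tree, 4), fits(tree, 5))
```

Because edits can grow the tree during an episode, it is probably worth leaving some headroom above the size of the starting program.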

