Official implementation of Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models.
To run our code, you need an OpenAI API account. Generating a pseudocode prompt does not cost much, but running inference on all instances of a task costs about $10–$20. You also need the latest version of vLLM.
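A minimal setup sketch, assuming a pip-based environment (`OPENAI_API_KEY` is the standard environment variable read by the OpenAI client):

```bash
# Set your OpenAI API key (standard environment variable for the OpenAI client)
export OPENAI_API_KEY="sk-..."

# Install the latest vLLM (assumes a pip-based environment)
pip install -U vllm
```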
The goal of this phase is to generate a pseudocode prompt that can be applied to all instances of a given task. To do so, we conduct the following steps:
- Constructing a meta prompt.
- Generating an analysis from the example questions of the task.
- Generating a pseudocode based on the analysis.
We provide human-written analyses and pseudocodes in the `tasks/{task_name}/prompt` folder. Running `generate_analysis.py` generates an analysis for the selected task and places it in the `tasks/{task_name}/generated_prompts` folder.
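As a sketch, the invocation might look like the following; the flag name is an assumption, so check the script's argument parser for the actual interface:

```bash
# Hypothetical flag -- consult generate_analysis.py for the exact arguments.
python generate_analysis.py --task_name {task_name}
```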
Then, run `generate_code_prompt.py` to generate a pseudocode prompt for your task. You can check the generated pseudocode prompt in the `tasks/{task_name}/generated_prompts` folder.
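Similarly, a sketch of this step (again with an assumed flag name):

```bash
# Hypothetical flag -- consult generate_code_prompt.py for the exact arguments.
python generate_code_prompt.py --task_name {task_name}
```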
In this phase, we tailor the pseudocode prompt to each instance of the task and use it to conduct reasoning.
Run `scoring_single_prompt.py`, passing the path of the generated pseudocode prompt as an argument.
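A hypothetical invocation; whether the path is positional or a named flag depends on the script, and `{prompt_file}` is a placeholder:

```bash
# Sketch only -- check scoring_single_prompt.py for the exact argument format.
python scoring_single_prompt.py tasks/{task_name}/generated_prompts/{prompt_file}
```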
After the process finishes, you can find the result file in JSON format in the `tasks/{task_name}/results` folder.
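To take a quick look at a result file, you can pretty-print it with Python's built-in `json.tool` (the file name here is a placeholder):

```bash
# Pretty-print a result file; replace the placeholder with an actual file name.
python -m json.tool tasks/{task_name}/results/{result_file}.json
```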
After changing the working directory to `src`, run `bash run.sh` to execute the whole process for all tasks we experimented on.
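Concretely, from the repository root:

```bash
cd src
bash run.sh
```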
If you have any inquiries, please feel free to raise an issue or reach out to us via email at: mapoout@yonsei.ac.kr. We're here to assist you!
If you find this useful, please consider citing our paper:
```bibtex
@article{chae2024language,
  title={Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models},
  author={Chae, Hyungjoo and Kim, Yeonghyeon and Kim, Seungone and Ong, Kai Tzu-iunn and Kwak, Beong-woo and Kim, Moohyeon and Kim, Seonghwan and Kwon, Taeyoon and Chung, Jiwan and Yu, Youngjae and others},
  journal={arXiv preprint arXiv:2404.02575},
  year={2024}
}
```