GitHub - t-aoyam/predict-ih-review

This repository contains the code for the paper "Predicting the Emergence of Induction Heads in Language Model Pretraining".

Setting up the Environment

$ conda create -n env-name python=3.9
$ conda activate env-name
$ pip install -r requirements.txt

Repo Structure

this-repo/
│  README.md
│  .gitignore
│  requirements.txt
│
├─ src/                  
│   ├─ data/                    # code for data generation
│   │    ├─ generate_data.py    # main data generation code
│   │    └─ .../
│   │
│   ├─ models/                  # model classes
│   ├─ training/                # training loops, trainer classes, etc.
│   ├─ evaluation/              # metrics, analysis utilities
│   └─ utils.py
│
├─ notebooks/
│   └─ figures.ipynb            # all figures in the paper can be generated here
│
├─ scripts/                     # .sh scripts
│   ├─ data.sh
│   ├─ train.sh
│   └─ evaluate.sh
│
├─ models/                      # all pytorch models
│   └─ model-name
│        └─ checkpoint-xxx/
│
└─ data/                        # data for pretraining, evaluation
    ├─ configs/                 # .json configs for training
    └─ .../                     # all other result files

How to Run the Code

Generating the Transition Matrices and Training Data

See scripts/data.sh for how to generate the transition matrices and training data.
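For orientation only, here is a minimal sketch of what sampling training sequences from a transition matrix can look like. It assumes that "transition matrix" means a row-stochastic matrix of a first-order Markov chain over token ids, which this README does not confirm. The function names `make_transition_matrix` and `sample_sequence` are hypothetical and are not taken from `src/data/generate_data.py`; refer to scripts/data.sh for the actual procedure.

```python
import random
from typing import List


def make_transition_matrix(vocab_size: int, rng: random.Random) -> List[List[float]]:
    """Build a random row-stochastic matrix: row i holds P(next token | current token = i).

    Hypothetical helper for illustration; not the repo's implementation.
    """
    matrix = []
    for _ in range(vocab_size):
        # Small offset keeps every weight strictly positive.
        weights = [rng.random() + 1e-9 for _ in range(vocab_size)]
        total = sum(weights)
        matrix.append([w / total for w in weights])
    return matrix


def sample_sequence(matrix: List[List[float]], length: int, rng: random.Random) -> List[int]:
    """Sample a token sequence by walking the Markov chain defined by `matrix`."""
    if length <= 0:
        return []
    vocab = list(range(len(matrix)))
    seq = [rng.choice(vocab)]  # assumption: uniform initial token
    while len(seq) < length:
        current = seq[-1]
        seq.append(rng.choices(vocab, weights=matrix[current], k=1)[0])
    return seq


rng = random.Random(0)  # fixed seed so the sketch is reproducible
matrix = make_transition_matrix(vocab_size=5, rng=rng)
sequence = sample_sequence(matrix, length=20, rng=rng)
```

The sketch uses only the standard library so it runs without the packages in requirements.txt; a real pipeline would more likely use NumPy or PyTorch for speed.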

Training the Model

See scripts/train.sh for how to train a model.

Evaluating the Model

See scripts/evaluate.sh for how to evaluate a trained LM on four metrics: prefix-matching score, logit attribution, associative recall (accuracy), and associative recall (mean rank).
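To make the two associative recall metrics concrete, the sketch below computes accuracy and mean rank from per-example vocabulary scores. It assumes each example provides one score per vocabulary token (for instance, logits at the query position) and a single correct target token id. The function names `target_rank` and `associative_recall` are hypothetical, and the repo's own evaluation code may define these metrics differently.

```python
from typing import List, Sequence, Tuple


def target_rank(scores: Sequence[float], target: int) -> int:
    """1-based rank of the target token: 1 + number of tokens scored strictly higher."""
    return 1 + sum(1 for s in scores if s > scores[target])


def associative_recall(
    all_scores: List[Sequence[float]], targets: List[int]
) -> Tuple[float, float]:
    """Return (accuracy, mean rank) over all examples.

    Accuracy counts an example as correct when the target has rank 1.
    Hypothetical helper for illustration; not the repo's implementation.
    """
    ranks = [target_rank(scores, t) for scores, t in zip(all_scores, targets)]
    accuracy = sum(r == 1 for r in ranks) / len(ranks)
    mean_rank = sum(ranks) / len(ranks)
    return accuracy, mean_rank


# Toy example: 3 examples over a 3-token vocabulary.
toy_scores = [[0.1, 0.7, 0.2], [0.5, 0.3, 0.2], [0.2, 0.3, 0.5]]
toy_targets = [1, 1, 2]
accuracy, mean_rank = associative_recall(toy_scores, toy_targets)
```

In the toy example the target ranks are 1, 2, and 1, so accuracy is 2/3 and mean rank is 4/3. Mean rank is the more graded of the two: it still improves when the target moves from rank 10 to rank 2, while accuracy stays at zero.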
